
publications derived from a temporal knowledge graph
is needed to handle the problem: How to represent
and calculate the influence of a publication using
as much of its information as possible?
Current studies on temporal knowledge graphs try
to manage changes in two adjacent snapshots, assuming
that nodes should update smoothly or evolve dramati-
cally [7, 26]. However, these assumptions require one to
manually set a change rate to limit the evolution, which
is not flexible. Existing works still mainly focus on han-
dling structural and temporal features in separate steps,
which leads to a lack of expression to treat dynamic
graphs as a whole. Furthermore, accumulative citations
are usually modeled as log-normal or cumulative dis-
tribution functions [9, 2]. It is still worth trying some
alternatives to perform a further analysis. The poten-
tial enhancement mentioned above should be studied to
handle another problem: How can we improve the
expressiveness of the framework for prediction
tasks using temporal knowledge graphs?
With the observations above, we propose CT-
PIR (Citation Trajectory Prediction via Influence
Representation), a new framework to predict citation
trajectories with influence representation using tempo-
ral knowledge graphs. First, we optimize the R-GCN
mechanism [22] to automatically learn the gaps between
two adjacent snapshots. Second, we implement a fine-
grained influence (citation momentum) representation
module to make use of all historical information from a
publication’s attributes. Third, a learnable general lo-
gistic function is applied to fit the trajectories using the
influence representation from the previous module.
We experiment our framework with two real world
datasets. One is APS1, a public dataset of academic
papers. Another, named AIPatent, is a new dataset
that we construct with global patents in the field of
artificial intelligence. Compared to some baselines, the
results show that CTPIR outperforms those methods in
all cases.
Our key contributions are summarized in the fol-
lowing points:
•Novel framework: We propose a new framework,
named CTPIR, which implements a fine-grained in-
fluence representation approach using a more ex-
pressive temporal graph learning process and opti-
mizes existing methods to bring prediction results
much closer to observations.
•Improved evaluation: We construct a new tem-
poral knowledge graph dataset named AIPatent for
the task, which is also a strong supplement for the
1https://journals.aps.org/datasets
community to carry out various temporal graph
studies. We also design and implement multiple
subtasks to evaluate approaches from a more com-
prehensive view.
•Multifaceted analysis: We analyze the experi-
mental performance from multiple aspects. Expla-
nations on how CTPIR performs better compared
to other recent approaches are discussed. Some
weaknesses and further efforts are also mentioned
to guide future studies.
The dataset we use in this work, including our
AIPatent contributed dataset, and the code to re-
produce are available in our GitHub repository:
https://github.com/changzong/CTPIR
2 Related Work
2.1 Citation and Popularity Prediction. Mod-
ern approaches to citation count prediction (CCP) aim
to combine attribute information with temporal fea-
tures. GNNs are commonly used to capture topolog-
ical features of citation networks. The encoded nodes
are sent to RNNs or attention models for time-series
forecasting. A previous work [8] follows this simple
encoder-decoder architecture. Some previous studies
[30, 13, 25, 32] put emphasis on cascade graphs for pop-
ularity prediction, using a kernel method to estimate
structural similarities. These works are based on sim-
ply combining graph embedding with time-series meth-
ods. In contrast, we introduce a method to fully utilize
all past characteristics of a publication’s attributes. A
recent work called HINTS [9] adds an imputation mod-
ule to aggregate the information from each snapshot of
graphs. Another work proposes a heterogeneous dynam-
ical graph neural network (HDGNN) [29] to predict the
cumulative impact of articles and authors. The latest
work [24] uses an attention mechanism to represent the
sequence of content from citation relations. Although
these works can take advantage of richer information,
their lack of fine-grained design to represent the influ-
ence of a publication is not conducive to achieving good
prediction performance.
2.2 Temporal Graph Embedding. We focus on
deep learning-based temporal graph embedding ap-
proaches. Several previous works implement a straight-
forward way to combine GCN and RNN models to ex-
tract structural and temporal features [18, 23, 5]. RNN
variants are applied as the temporal module to perform
downstream tasks such as anomaly detection. Mean-
while, temporal attention models can be a substitute
for GCN to extract topological features [28, 15, 17]. A
recent paper [3] tries to represent global structural in-