
Table 1: Predefined meta-paths of real-world datasets. In this table, it can be noticed that most of Rare inter-type relations and
Ptarget on intra-type relations by setting the same type of nodes at both ends of P.
Dataset A R P
DBLP A, P, T, C A-P, P-T, P-C APA, APCPA, APTPA
IMDB M, D, A M-D, M-A MDM, MAM
ACM P, A, S P-A, P-S PAP, PSP
AMiner P, A, R P-A, P-R PAP, PRP
Freebase M, D, A, P M-D, M-A, M-P MAM, MDM, MPM
Last.FM U, A, T U-U, U-A, A-T UU, UAU, UATAU, AUA, AUUA, ATA
Yelp U, B, Co, Ci, Ca U-U, U-B, U-Co, B-Ci, B-Ca UBU, UCoU, UBCiBU, UBCaBU, BUB, BCiB, BCaB, BUCoUB
Douban U, M, G, L, D, A, T U-U, U-G, U-M, U-L, M-D, M-T, M-A MUM, MTM, MDM, MAM, UMU, UMAMU, UMDMU, UMTMU
relations (i.e., edges between different types of nodes). How-
ever, using only these inter-type relations is not enough to
extract useful knowledge from the intricate relations in the
data. To resolve this problem, most HGNNs rely on addi-
tional predefined relational information, and the most com-
monly used methods are meta-path (Sun et al. 2011) and
meta-graph (Fang et al. 2016; Huang et al. 2016), each of
which are a composition of different types of nodes and mul-
tiple meta-paths as shown in Figure 1 (c) and (d). As we will
show later, nearly all meta-paths implicitly derive intra-type
relations (i.e., relations between the same type of nodes) by
manipulating given inter-type relations.
However, there exist three major problems with using pre-
defined methods such as meta-paths for heterogeneous graph
learning. Firstly, there exist certain limitations on induc-
ing intra-type relations from predefined inter-type relations.
When the given inter-type relations are sparse or noisy, in-
duced intra-type relations can also be affected. Secondly,
the appropriate composition of nodes and edges (design-
ing meta-paths and meta-graphs) for representation learning
requires significant domain-specific knowledge. Thus, it is
extremely hard to know which combinations of nodes and
edges are suitable for learning useful representations, espe-
cially in unsupervised environments. Lastly, although there
exist attempts to learn appropriate meta-paths beyond given
ones (Yun et al. 2019), several multiplications of the adja-
cency matrix are required. Due to the high computational
cost of multiple matrix multiplications, their method is lim-
ited to very small datasets (Lv et al. 2021).
To circumvent the above limitations of current meth-
ods, we propose a novel concept of meta-node to construct
simple and powerful MPNNs for learning heterogeneous
graphs. Meta-nodes are virtual nodes in which one meta-
node is added to the graph for each type of node in the het-
erogeneous graph. Each meta-node is connected to all nodes
of each type as illustrated in Figure 1 (b). By introducing
meta-nodes, message passing is no longer limited to sparse
inter-type relations, and every node can directly perform
message passing with other nodes of the same type via meta-
nodes. To do so, we can enrich the information on the rela-
tionship by adding explicit intra-type relations to the given
inter-type relations. After introducing the concept of meta-
nodes, we propose a message passing scheme via meta-node
to learn both intra- and inter-type relations effectively.
Unsupervised representation learning on heterogeneous
graphs has become one of the major challenges in graph-
structured data learning, as it can pave the way to make
use of large amounts of unlabeled multi-modal data. Thus,
we validate the proposed message passing scheme by ap-
plying it to unsupervised representation learning for graph-
structured data. To do so, we apply our meta-node mes-
sage passing layer to the encoder of Deep Graph Infomax
(Veliˇ
ckovi´
c et al. 2019) which is one of the most well-
known graph contrastive models. Through downstream tasks
on four real-world heterogeneous graph datasets, we vali-
date the proposed message passing scheme. We confirm that
our meta-node message passing layer learns rich relational
information and shows competitive performance compared
to existing state-of-the-art HGNNs even without any meta-
paths.
Related Work
Meta-path. A meta-path (Sun et al. 2011) Pis defined as
a path that has a form of A1
R1
−−→ A2
R2
−−→ · · · Rl
−→ Al+1 (ab-
breviated as A1A2· · · Al+1) which describes relations be-
tween A1and Al+1 ∈ A with a composition of relations
R1, R2, . . . , Rl∈ R, where Aand Rdenote sets of node
types and edge types of heterogeneous graphs, respectively.
Each meta-path can describe a semantic relation between
nodes at both ends of the meta-path. For instance, in Figure
1 (c), the meta-path of movie-director-movie can describe
the relationship between two movies by which the director
filmed them. Nearly all meta-paths of the real-world datasets
(Wang et al. 2019; Fu et al. 2020; Wang et al. 2020, 2021)
are implicitly composed for intra-type relations by setting
the same type of nodes at both ends of Pusing given inter-
type relations Ras shown in Table 1.
Representation Learning for Heterogeneous Graphs.
For several past years, there have been many efforts to learn
representations of heterogeneous graphs based on random-
walk-based methods (Dong, Chawla, and Swami 2017; Fu,
Lee, and Lei 2017; Jeong et al. 2020; He et al. 2019) or
GNNs methods (Schlichtkrull et al. 2018; Shi et al. 2018;
Zhang et al. 2019; Yun et al. 2019; Wang et al. 2019; Zhao
et al. 2020; Fu et al. 2020; Hu et al. 2020; Zhao et al. 2021).
Nowadays, HGNNs leveraging the power of GNNs show a
remarkable ability to learn intricate relations between multi-
ple types of nodes and edges both in semi-supervised and
unsupervised conditions. For instance, in semi-supervised
learning, HAN (Zhang et al. 2019) proposed attention-based
MPNNs using meta-paths to take each semantic meaning of