
2
Hasegawa et al [16] examined the role of clustering
and assortativity in the percolating component of tree-
triangle networks. Recently, Mann et al investigated the
inter-subgraph correlation properties that arise naturally
in the percolating component of GCM networks that are
composed of arbitrary sized cliques that have neutral
mixing patterns in the substrate graph [22].
In this paper, we extend that work to examine the
inter-subgraph properties of GCM networks that have
arbitrary clique clustering and subgraph correlations in
the substrate network. To do this, we develop an analyt-
ical method based on generating functions and introduce
an MCMC rewiring algorithm to simulate such networks.
These steps allow the synthesis of random graphs drawn
from an ensemble of graphs with a given clustering coeffi-
cient Cand correlation structure among the joint degree
tuples. This model can be used to probe the role that
both clustering and assortativity have on the properties
of the bond percolation process and will likely lead to the
detailed study of higher-order structure for other kinds
of dynamical processes on networks beyond the percola-
tion model. Additionally, our model allows the cluster-
ing and correlation structure from an empirical network
to be gathered through means of an edge-disjoint clique
cover [22, 23]. Through our MCMC algorithm, synthetic
networks, possibly containing additional vertices to the
empirical network, can then be created. In this way, our
model allows ensemble representations with fixed statis-
tics of empirical networks to be created and studied. We
remark, that the information in our model can readily be
compressed from inter-subgraph correlations to correla-
tions between overall degrees; however, there is a corre-
sponding information loss following this process.
II. THEORETICAL
Correlations can arise via three distinct processes in
GCM networks. Firstly, correlation between the joint
degrees of neighbouring vertices that belong to the same
clique type. For instance, three vertices in a triangle must
all have t6= 0. In this manner, vertices with membership
of a given 3-clique are somewhat assorted together; they
will never connect to vertices that don’t belong to trian-
gles. However, this correlation does not extend beyond
the scope of the motif to which the edge belongs and so,
we refer to it as the trivial correlation. In other words,
the excess joint degrees (the remaining joint degree tu-
ples excluding the motif to which the edge belongs) of two
vertices at the ends of a randomly selected edge are not
correlated in the default GCM construction. And, ex-
cluding the trivial assortativity due to belonging to the
same motif, there are no specific correlations between
vertices at play.
We can also discuss assortativity between the overall
degrees of vertices in the GCM. The overall degree of
a vertex that belongs to s2-cliques, t3-cliques, w4-
cliques (and so on) is given by k=s+ 2t+ 3c+. . . .
This assortativity often arises in an uncontrolled man-
ner. For example, a GCM network composed of 2-cliques
and 10-cliques may be absent of any correlation structure
among the distribution in the numbers of cliques a vertex
belongs to; however, by virtue of the increase in overall
degree, vertices that belong to 10-cliques will likely be of
higher-degree than vertices that belong to just 2-cliques,
and as a result, will exhibit positive correlations among
the overall degrees if this is the case [10, 16].
Finally, we can inject correlation into GCM networks
deliberately, in a detailed and controlled fashion, by spec-
ifying how likely it is that a vertex with a given joint de-
gree tuple (s0, t0, . . . , c0) has a neighbour with a certain
joint degree tuple (s0, t0, . . . , c0). In general, this probabil-
ity is different depending on the topology of the edge (to
which clique it belongs to). This kind of correlation is the
most general of the three types that arise in the GCM;
since the other correlation types are subsets of this; and
so, it is the focus of this paper.
We restrict our attention to GCM networks that are
composed of 2- and 3-cliques, symbolised by ~τ ={⊥,∆},
respectively. Let eτ,s0t0,s0t0be the probability that a ran-
domly chosen τ-edge leads to vertices with excess (re-
maining) joint degrees of (s0, t0) and (s0, t0) where sand
tare the numbers of 2- and 3-cliques, respectively, that
each vertex belongs to, see Fig 2. The joint excess joint
degree distribution is the data input to the model; other
familiar quantities such as excess joint degree distribu-
tion, qτ,s,t and the joint degree distribution, ps,t can be
derived from these information-rich function.
FIG. 2. The joint excess joint degree distribution of topology
τis created by selecting all τ-edges and measuring the joint
excess degrees of the terminal vertices, excluding the motif
the selected edge belongs to. In the example, the selected
edge (yellow dashed) has vertices of excess joint degrees (2,1)
and (1,1).
There is an eτ,s0t0,s0t0function for each τ∈~τ; since, we
can follow card(~τ) distinct edge-topologies. We assume
that
X
s0X
t0X
s0X
t0
eτ,s0t0,s0t0= 1,(1)
with all indices running from 0,...,∞unless otherwise
stated. We can recover the excess joint degree distribu-
tion, which is the probability that an edge of topology τ
is chosen at random and leads to a vertex of remaining