
EFFICIENT DIRECTED GRAPH SAMPLING VIA GERSHGORIN DISC ALIGNMENT
Yuejiang Li?, H. Vicky Zhao?, Gene Cheung†
?Dept. of Automation, Tsinghua University, Beijing, China
†York University, Toronto, Canada
ABSTRACT
Graph sampling is the problem of choosing a node subset via sam-
pling matrix H∈ {0,1}K×Nto collect samples y=Hx ∈RK,
K < N, so that the target signal x∈RNcan be reconstructed
in high fidelity. While sampling on undirected graphs is well stud-
ied, we propose the first sampling scheme tailored specifically for
directed graphs, leveraging a previous undirected graph sampling
method based on Gershgorin disc alignment (GDAS). Concretely,
given a directed positive graph Gdspecified by random-walk graph
Laplacian matrix Łrw , we first define reconstruction of a smooth sig-
nal x∗from samples yusing graph shift variation (GSV) kŁrwxk2
2
as a signal prior. To minimize worst-case reconstruction error of
the linear system solution x∗=C−1H>ywith symmetric coef-
ficient matrix C=H>H+µŁ>
rw Łrw, the sampling objective is
to choose Hto maximize the smallest eigenvalue λmin(C)of C.
To circumvent eigen-decomposition entirely, we maximize instead a
lower bound λ−
min(SCS−1)of λmin(C)—smallest Gershgorin disc
left-end of a similarity transform of C—via a variant of GDAS based
on Gershgorin circle theorem (GCT). Experimental results show that
our sampling method yields smaller signal reconstruction errors at a
faster speed compared to competing schemes.
Index Terms—Graph signal processing, signal sampling, Ger-
shgorin circle theorem
1. INTRODUCTION
Graph signal processing (GSP) extends traditional signal processing
tools to analyze signals on irregular data kernels described by finite
graphs [1, 2]. Most existing GSP works consider undirected graph
structures, where each edge connecting two nodes is bidirectional.
However, directionality plays an important role in many practical
information dissemination scenarios [3]. For example, on Twitter, a
celebrity often has a large following but personally follows very few
users [4]. Thus, in these scenarios it is critical to factor directionality
into the network model, resulting in a directed graph.
Graph sampling selects a node subset to collect samples, so that
the target signal can be recovered in high fidelity [5]. Existing graph
sampling works can be classified into two categories based on prior
assumptions: i) an assumption on strict bandlimitedness of target
signals with a cutoff frequency, and ii) a more general assumption
assuming target signals are “smooth” with respect to (w.r.t.) the un-
derlying graph (e.g., more energy in low frequencies than high fre-
quencies). Bandlimited assumption in the first category means that a
target signal lies strictly inside a linear subspace spanned by the first
eigenvectors (Fourier modes) of a graph variation operator, such as
the graph Laplacian matrix Ł or the adjacency matrix W[6–10]. As-
suming that the observed signal samples contain noise, [6] proposed
Gene Cheung acknowledges the support of the NSERC grants RGPIN-
2019-06271, RGPAS-2019-00110.
a greedy algorithm to select sample nodes under the E-optimality
criterion [11], and [7] designed a greedy algorithm to minimize the
reconstruction MSE. To lower complexity, [9] and [12] used graph
spectral proxies and localization operators, respectively, to mitigate
the computation burden of eigen-decomposition.
However, the strict bandlimited assumption of target signals is a
strong one that many practical graph signals do not satisfy. To relax
this assumption, works in the second category assume that a target
signal is generally smooth over a given graph [13, 14]. For example,
graph Laplacian regularizer (GLR) [15], i.e.,x>Łx, is often used to
quantify smoothness of signal xover a graph specified by Laplacian
Ł [13–17]. GLR is often used to regularize under-determined signal
reconstruction problems, such as denoising, dequantization, and in-
terpolation [15, 18, 19]. Using GLR as signal prior, graph sampling
based on Gershgorin disk alignment (GDAS) was proposed to effi-
ciently select sample nodes on undirected (signed) graphs under the
E-optimality criterion [16, 17]. A key feature of GDAS is that it cir-
cumvents eigen-decomposition entirely and executes in linear time,
and thus is scalable to large graphs.
Although the above sampling algorithms are efficient and ef-
fective, they are all designed for undirected graphs, and cannot be
easily applied to directed graphs. One main challenge in directed
graph sampling is the inherent difficulty in defining graph frequen-
cies, due to the asymmetric nature of the directed graphs’ variation
operators, e.g., adjacency and Laplacian matrices, Wand Ł. Asym-
metry means that the graph operator matrix may not be diagonaliz-
able (and thus eigenvectors cannot be easily obtained), and even if
it is, its eigenvalues can be complex, which are difficult to interpret
(e.g., ordering of eigenvectors into frequencies from high to low is
not obvious). Though [6, 9] discussed in passing how their methods
can be adapted to directed graphs, frequency and bandlimitedness
notions are still not well understood on directed graphs.
To circumvent the above challenge, in this work, we formulate
a novel directed graph sampling problem using graph shift variation
(GSV) [20] with a solution that completely avoids matrix asymme-
try, and in so doing enable a variant of GDAS for fast sampling.
Specifically, we first define GSV kŁrwxk2
2as a smoothness prior
for directed graph signal x∈RN, where Łrw is a random-walk
graph Laplacian for directed graph Gd. Using GSV as regularizer
to reconstruct signal xfrom samples y=Hx ∈RK, where H∈
{0,1}K×Nis a sampling matrix, the solution is x∗=C−1H>y,
with symmetric coefficient matrix C=H>H+µŁ>
rw Łrw. To min-
imize the worst-case reconstruction error (E-optimality), the sam-
pling objective is to choose Hto maximize the smallest eigenvalue
λmin(C)of C. To mitigate eigen-decomposition entirely, we devise
a variant of previous GDAS to efficiently choose Hto maximize a
lower bound λ−
min(SCS−1)—smallest Gershgorin disc left-end of
a similarity transform of C—based on Gershgorin circle theorem
(GCT) [21]. Experimental results show that our sampling method
yields smaller signal reconstruction errors at a faster speed compared
arXiv:2210.14263v1 [eess.SP] 25 Oct 2022