HyperEF Spectral Hypergraph Coarsening by Effective-Resistance Clustering Ali Aghdaei

2025-05-08 0 0 1.12MB 9 页 10玖币
侵权投诉
HyperEF: Spectral Hypergraph Coarsening by
Effective-Resistance Clustering
Ali Aghdaei
Stevens Institute of Technology
aaghdae1@stevens.edu
Zhuo Feng
Stevens Institute of Technology
zhuo.feng@stevens.edu
Abstract—This paper introduces a scalable algorithmic frame-
work (HyperEF) for spectral coarsening (decomposition) of large-
scale hypergraphs by exploiting hyperedge effective resistances.
Motivated by the latest theoretical framework for low-resistance-
diameter decomposition of simple graphs, HyperEF aims at
decomposing large hypergraphs into multiple node clusters with
only a few inter-cluster hyperedges. The key component in
HyperEF is a nearly-linear time algorithm for estimating hyper-
edge effective resistances, which allows incorporating the latest
diffusion-based non-linear quadratic operators defined on hyper-
graphs. To achieve good runtime scalability, HyperEF searches
within the Krylov subspace (or approximate eigensubspace) for
identifying the nearly-optimal vectors for approximating the hy-
peredge effective resistances. In addition, a node weight propaga-
tion scheme for multilevel spectral hypergraph decomposition has
been introduced for achieving even greater node coarsening ratios.
When compared with state-of-the-art hypergraph partitioning
(clustering) methods, extensive experiment results on real-world
VLSI designs show that HyperEF can more effectively coarsen
(decompose) hypergraphs without losing key structural (spectral)
properties of the original hypergraphs, while achieving over 70×
runtime speedups over hMetis and 20×speedups over HyperSF.
Index Terms—hypergraph coarsening, effective resistance,
spectral graph theory, graph clustering
I. INTRODUCTION
Recent years have witnessed a surge of interest in graph
learning techniques for various applications such as vertex
(data) classification [11], [25], link prediction (recommendation
systems) [28], [37], drug discovery [21], [26], solving partial
differential equations (PDEs) [13], [20], and electronic design
automation (EDA) [17], [23], [35], [36]. The ever-increasing
complexity of modern graphs (networks) inevitably demands
the development of efficient techniques to reduce the size of
the input datasets while preserving the essential properties. To
this end, many research works on graph partitioning, graph
embedding, and graph neural networks (GNNs) exploit graph
coarsening techniques to improve the algorithm scalability, and
accuracy [6], [7], [27], [40].
Hypergraphs are more general than simple graphs since they
allow modeling higher-order relationships among the entities
[24]. The state-of-the-art hypergraph coarsening techniques are
all based on relatively simple heuristics, such as the vertex
similarity or hyperedge similarity [3], [8], [16], [29], [34]. For
example, the hyperedge similarity based coarsening techniques
contract similar hyperedges with large sizes into smaller ones,
which can be easily implemented but may impact the original
hypergraph structural properties; the vertex-similarity based
algorithms rely on checking the distances between vertices
for discovering strongly-coupled (correlated) node clusters,
which can be achieved leveraging hypergraph embedding
that maps each vertex into a low-dimensional vector such
that the Euclidean distance (coupling) between the vertices
can be easily computed in constant time. However, these
simple metrics often fail to capture higher-order global
(structural) relationships in hypergraphs.
The latest theoretical breakthroughs in spectral graph theory
have led to the development of nearly-linear time spectral graph
sparsification (edge reduction) [9], [10], [14], [15], [19], [33],
[38] and coarsening (node reduction) algorithms [22], [39].
However,
existing spectral methods are only applicable to
simple graphs
but not hypergraphs. For example, the effective-
resistance based spectral sparsification method [31] exploits
an effective-resistance based edge sampling scheme, while the
latest practically-efficient sparsification algorithm [10] leverages
generalized eigenvectors for recovering the most influential off-
tree edges for mitigating the mismatches between the subgraph
and the original graph.
On the other hand, spectral theory for hypergraphs has
been less developed due to the more complicated structure
of hypergraphs. For example, a classical spectral method has
been proposed for hypergraphs by converting each hyperedge
into undirected edges using star or clique expansions [12].
However, such a naive hyperedge conversion scheme may
result in lower performance due to ignoring the multi-way
high-order relationship between the entities. A more rigorous
approach by Tasuku and Yuichi [30] generalized spectral
graph sparsification for hypergraph setting by sampling each
hyperedge according to a probability determined based on the
ratio of the hyperedge weight over the minimum degree of
two vertices inside the hyperedge. Another family of spectral
methods for hypergraphs explicitly builds the Laplacian matrix
to analyze the spectral properties of hypergraphs: Zhou et
al. proposes a method to create the Laplacian matrix of
a hypergraph and generalize graph learning algorithms for
hypergraph applications [41]. A more mathematically rigorous
approach by Chan et al. introduces a nonlinear diffusion process
for defining the hypergraph Laplacian operator by measuring
the flow distribution within each hyperedge [4], [5]; moreover,
arXiv:2210.14813v2 [cs.LG] 3 Dec 2022
the Cheeger’s inequality has been proved for hypergraphs
under the diffusion-based nonlinear Laplacian operator [5].
However, these theoretical results do not immediately allow
for practically-efficient implementations.
This paper presents a scalable spectral hypergraph coarsening
algorithm via effective-resistance clustering to generate much
smaller hypergraphs that can still well preserve the original
hypergraph structural (global) properties. The key contribution
of this work is summarized below:
1)
To the best of our knowledge, for the first time, we propose
to extend the simple-graph effective resistance formulation
to the hypergraph setting by exploiting approximate eigen-
vectors (Krylov subspace) associated with the hypergraph.
2)
Our approach (HyperEF) allows incorporating the recently
developed diffusion-based non-linear Laplacian operator
defined on hypergraphs [5] for highly-efficient estimation
of hyperedge effective resistances.
3)
A scalable spectral hyperedge contraction (clustering)
scheme and a node weight propagation scheme have
been proposed for constructing coarse-level hypergraphs
that can well preserve the original spectral (structural)
properties.
4)
Our extensive experiment results on real-world VLSI
designs show that HyperEF is over
70×
faster than hMetis,
while achieving comparable or better average conductance
values for most hypergraph decomposition tasks.
The rest of the paper is organized as follows. In Section II, we
provide a background introduction to the basic concepts related
to the spectral hypergraph theory. In Section III, we introduce
an optimization-based formulation for effective resistance
estimation in simple graphs. In section IV, we introduce the
technical details of HyperEF and its algorithm flow. In section
V, we introduce a multilevel spectral hypergraph decompo-
sition framework. In section VI, we provide the complexity
analysis of HyperEF. In Section VII, we demonstrate extensive
experimental results to evaluate the performance of HyperEF
using a variety of real-word VLSI design benchmarks, which
is followed by the conclusion of this work in Section VIII.
II. BACKGROUND
Classical spectral graph theory shows that the structure
of a simple graph is closely related to the graph’s spectral
properties. Specifically, the Cheeger’s inequality shows the
close connection between expansion or conductance and the
first few eigenvalues of graph Laplacians [18]. Moreover, the
Laplacian quadratic form computed with the Fiedler vector (the
eigenvector corresponding to the smallest nonzero Laplacian
eigenvalue) has been exploited to find the minimum boundary
size or cut for graph partitioning tasks [32]. In recent years,
spectral theories for hypergraphs have been extensively studied
by theoretical computer scientists [5]. To allow a better
understanding of our work, the following important definitions
for hypergraphs are introduced.
Definition II.1.
Let
H= (V, E, w)
denote a weighted
hypergraph, where
w:ER+
is a weight function over
hyperedges,
du:= Σue,eEwe
, and
vol(S) := ΣSV:uSdu
.
The conductance of a given node set SVis defined as
CH(S) := cut(S, ¯
S)
min{vol(S), vol(¯
S)},(1)
where the cut is the sum of the weights of the hyperedges
containing the nodes from both
S
and
¯
S
. The hypergraph
conductance is defined as CH:= min
*SV
CH(S).
Definition II.2.
The non-linear quadratic form of a hypergraph
H= (V, E, w)
for any input vector
χRV
is defined as [5]
QH(χ) := X
eE
wemax
u,ve(χuχv)2.(2)
III. EFFECTIVE RESISTANCES IN SIMPLE GRAPHS
Definition III.1.
Let
G= (V,E, z)
denote a weighted and
connected undirected graph with weights
zRV
0
,
bpRV
denote the standard basis vector with all zero entries except
for the
p
-th entry being
1
, and
bpq =bpbq
, respectively. The
effective resistance between nodes (p, q)∈ V is defined as
Reff (p, q) = b>
pqL
Gbpq =
|V|
X
i=2
(u>
ibpq)2
λi
=
|V|
X
i=2
(u>
ibpq)2
u>
iLGui
,
(3)
where
L
G
denotes the Moore-Penrose pseudo-inverse of the
graph Laplacian matrix
LG
, and
uiRV
for
i= 1, ..., |V|
denote the unit-length, mutually-orthogonal eigenvectors cor-
responding to Laplacian eigenvalues λifor i= 1, ..., |V|.
Theorem 1.
The effective resistance between
p
and
q
can be
computed by solving the following optimization problem:
Reff (p, q) = max
xRV
(x>bpq)2
x>LGx.(4)
Proof.
The original problem in (4) can be alternatively solved
by tackling the following convex optimization problem:
min
xRVx>LGx
s.t. x>bpq = 1.
(5)
After introducing a Lagrangian multiplier
λ
, the following
Lagrangian function needs to be minimized
F(x, λ) = x>LGxλ(x>bpq 1),(6)
which will lead to the following optimal solution:
λ=1
Reff (p, q), x=L
Gbpq
Reff (p, q).(7)
Substituting xinto (4) will complete the proof.
IV. SPECTRAL COARSENING VIA RESISTANCE CLUSTERING
A. Overview of HyperEF
A recent theoretical work proves that it is possible to
decompose a simple graph into multiple node clusters with
effective-resistance diameter at most the inverse of average
node degree (up to constant losses) by removing only a
摘要:

HyperEF:SpectralHypergraphCoarseningbyEffective-ResistanceClusteringAliAghdaeiStevensInstituteofTechnologyaaghdae1@stevens.eduZhuoFengStevensInstituteofTechnologyzhuo.feng@stevens.eduAbstract—Thispaperintroducesascalablealgorithmicframe-work(HyperEF)forspectralcoarsening(decomposition)oflarge-scaleh...

展开>> 收起<<
HyperEF Spectral Hypergraph Coarsening by Effective-Resistance Clustering Ali Aghdaei.pdf

共9页,预览2页

还剩页未读, 继续阅读

声明:本站为文档C2C交易模式,即用户上传的文档直接被用户下载,本站只是中间服务平台,本站所有文档下载所得的收益归上传人(含作者)所有。玖贝云文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。若文档所含内容侵犯了您的版权或隐私,请立即通知玖贝云文库,我们立即给予删除!
分类:图书资源 价格:10玖币 属性:9 页 大小:1.12MB 格式:PDF 时间:2025-05-08

开通VIP享超值会员特权

  • 多端同步记录
  • 高速下载文档
  • 免费文档工具
  • 分享文档赚钱
  • 每日登录抽奖
  • 优质衍生服务
/ 9
客服
关注