On the Resilience of Traffic Networks under Non-Equilibrium Learning Yunian Pan Tao Li and Quanyan Zhu Abstract We investigate the resilience of learning-based

2025-05-02 0 0 2.19MB 8 页 10玖币
侵权投诉
On the Resilience of Traffic Networks under Non-Equilibrium Learning
Yunian Pan, Tao Li, and Quanyan Zhu
Abstract We investigate the resilience of learning-based
Intelligent Navigation Systems (INS) to informational flow at-
tacks, which exploit the vulnerabilities of IT infrastructure
and manipulate traffic condition data. To this end, we propose
the notion of Wardrop Non-Equilibrium Solution (WANES),
which captures the finite-time behavior of dynamic traffic
flow adaptation under a learning process. The proposed non-
equilibrium solution, characterized by target sets and mea-
surement functions, evaluates the outcome of learning under
a bounded number of rounds of interactions, and it pertains
to and generalizes the concept of approximate equilibrium.
Leveraging finite-time analysis methods, we discover that under
the mirror descent (MD) online-learning framework, the traffic
flow trajectory is capable of restoring to the Wardrop non-
equilibrium solution after a bounded INS attack. The resulting
performance loss is of order ˜
O(Tβ)(1
2β < 0)), with a
constant dependent on the size of the traffic network, indicating
the resilience of the MD-based INS. We corroborate the results
using an evacuation case study on a Sioux-Fall transportation
network.
I. INTRODUCTION
The past decades have witnessed significant growth in the
Internet-based traffic routing demand, along with the rapid
development of modern Intelligent Navigation Systems (INS).
The INS infrastructures, which consists of Online Navigation
Platforms (ONP) such as Google Maps and Waze, together
with the widely adopted Internet-of-Things (IoT), including
smart road sensors and toll gates, are designed to make real-
time, efficient routing recommendations for their users. The
best-effort routing of the individuals leads to macroscopic
traffic conditions, which is encapsulated by the notion of
Wardrop equilibrium (WE) [1] in congestion games.
As transportation networks become increasingly inter-
connected, the number of attack vectors against the entire
transportation system is also on the rise. Consequently, the
well-being of the traffic networks is vulnerable to emerging
cyber-physical threats. For example, attacks on individual
GPS devices and road sensors can lead to the unavailability
of critical information and cause wide disruptions to the
infrastructure. As discussed in [
2
], a strategic data poisoning
attack on the ONP can lead to significant traffic congestion
and service breakdown.
In this work, we focus on a class of man-in-the-middle
(MITM) attacks on ONP systems that aim to mislead the
users to choose routes that are favored by the attackers. A
quintessential case was demonstrated in 2014, Israel, where
two students hacked the Waze GPS app and used bots to
crowdsource false location information, which misled the
The authors are with the Department of Electrical and Computer
Engineering, Tandon School of Engineering, New York University, Brooklyn,
NY, 11201 USA; E-mail: {yp1170,tl2636,qz494}@nyu.edu
Fig. 1. Intelligent Navigation Systems (INS) are vulnerable to informational
attacks when data transmission is intercepted, yielding deteriorated traffic
flows. Mirror descent, as a non-equilibrium learning scheme, is capable of
restoring the path flow to the proposed Wardrop non-equilibrium solution.
users and caused congestion [
3
]. As reported in [
4
], the
existing real-time traffic systems are intrinsically vulnerable
to malicious attacks such as modified cookie replays and
simulated delusional traffic flows.
This class of attacks can be further categorized as infor-
mational attacks on INS. They intercept the communication
channel between individual users and the information infras-
tructure and exploit the vulnerabilities of data transmission
to misguide users and achieve an adversarial traffic condition.
While there have been efforts on preventing and detecting
attacks, it is indispensable to create resilient mechanisms that
can allow users to adapt and recover after the attack since
perfect protection is either cost-prohibitive or impractical
[
5
], [
6
]. To achieve this self-healing property, a dynamic
feedback-driven learning-based approach is essential [
7
], and
a non-equilibrium solution concept in contrast to the classical
Wardrop equilibrium is needed to capture the non-stationary
nature of the ante impetum and post impetum behaviors as
well as enable the time-critical performance assessment and
design for resiliency.
To this end, we investigate the notion of Non-Equilibrium
Solution (NonES) in the context of repeated congestion games.
It measures the probability of the traffic flows “enveloping”
given target sets, with the envelop volume defined by a
measurement function. The Non-Equilibrium learning does not
necessarily yield an equilibrium solution but a trajectory that
falls into the envelope with high probability. Based on NonES,
we define Wardrop Non-Equilibrium Solution (WANES),
which specifies mean Wardrop equilibrium (MWE) as its
arXiv:2210.03214v1 [eess.SY] 6 Oct 2022
target set, and the weighted potential loss as the measurement.
In this work, we focus on a class of Mirror Descent
Non-Equilibrium learning algorithms and elaborate on its
role in the resilience of traffic networks under adversarial
environments. We first establish a high probability bound
on the distance between the output and MWE for generic
Mirror Descent (MD) algorithms without assumptions on the
boundedness of the latency function. This high probability
bound can be transformed into the resilience metrics, showing
that after the attack, a WANES with weighted potential loss
that is sublinear in time can be recovered through learning.
Next, we develop a learning-based resiliency mechanism
based on an MD algorithm as the and two classes of
flow disturbance attacks. We demonstrate the performance
resilience under MD using an evacuation case study to
illustrate the process of learning-based recovery. A schematic
illustration of our non-equilibrium learning approach is
provided in Figure 1.
Outline of this paper.
We briefly discuss the related
works in Section II. In Section III, we introduce the repeated
stochastic congestion game and MWE as the solution concept,
based on which we introduce the notion of Non-Equilibrium
learning and the formalism of resilience. In Section IV, we
establish several finite-time results for the Non-Equilibrium
learning dynamics and elaborate on the numerical experiments
to illustrate the attack and resilience in Section V.
II. RELATED WORK
Our work bridges the gap between the online-learning and
resilience in traffic assignment. Traffic assignment naturally
fits in the online-learning framework, see [
8
], [
9
], [
10
], where
the convergence in Ces
´
aro sense is shown as interests. Vu
et al. in [
11
] improved the rate of Ces
´
aro convergence to
O(1/T 2)and provided a last-iterate convergence guarantee.
The existent studies on the resilience of traffic assignment
have targeted on event-based disruptions, (see, e.g., [
12
]),
the system impedance to misinformation disturbance has
been rarely studied. To the best of our knowledge, the first
informational equilibrium-poisoning concept was proposed by
Pan et al. [
2
], such a phenomenon occurs when the sensors,
GPS devices in the INS are under attacks [13].
III. PROBLEM FORMULATION
A. Preliminary: Mean Wardrop Equilirbium
We are given a traffic network represented as a directed,
finite, and connected graph
G= (V,E)
without self-loops.
The vertices
V
represent road junctions, and the edges
E
represent road segments. The set of distinct origin-destination
(OD) pairs is
W V ×V
, indexed by
w
, with cardinality
W
.
Let
P:= Sw∈W Pw
be the set of all directed paths between
origins and destinations, where
Pw⊆ P(E)
is the path set
between the pair w.
We assume that there is a set of infinitesimal players
over
G
, denoted by a measurable space
(X,M, m)
. The
players are non-atomic, i.e.,
m(x) = 0 x∈ X
; they
are split into distinct populations indexed by the OD pairs,
i.e.,
X=Sw∈W Xw
and
XwTXw0=,w, w0∈ W
.
For each OD pair
w∈ W
, let
mw=m(Xw)
represent
the traffic demand. Let
¯
M=Pw∈W mw
For each player
x∈ Xw
, we assume that their travel path
a∈ Pw
is fixed
right after the path selection. The action profile of all the
players
X
induces an edge flow vector
qR|E|
0
, where
qe:= RX1{ea}m(dx), e ∈ E
, and a path flow vector
µ∆ := {(µp)p∈∪w∈W Pw|µp:= RXw
1{a=p}m(dx)}.
We define the edge-path incident matrix of graph
G
as
Λ =
1|,...,|Λ|W|]R|E|×|P|
such that
Λw
e,p =1{ep},e
E, w ∈ W, p ∈ Pw
. Hence the compact form of edge-path
flow relation is q= Λµ.
Let
(Ω,F,P)
be the probability space for the formalism,
le:R0×7→ R+
be the cost/latency functions, measuring
the travel delay of the edge
e∈ E
determined by its edge flow
qe
and a state variable
ω
that is universal for the entire
traffic network, e.g.,
ω
can represent the weather condition,
road incidents or anything that affects the congestion level. Let
l:R|E|
0×7→ R|E|
+
denote the vector-valued latency function.
For an instance
ω
, the latency of path
p
is defined as
`p:= Peple(qe, ω)=Λ>
plµ, ω)
, which can be seen as
a function of
µ
and
ω
, written as
`p=`p(µ, ω)
. We write
the vector-valued path latency function as
`: ×7→ R|P|
+
.
Each instance
ω
determines a congestion game, captured by
the tuple
Gω
c= (G,W,X,P, `(·, ω))
. Each path flow profile
µ
induces a probability measure associated with the
positive random vector `(µ, ·):Ω7→ R|P|
+.
Assumption 1.
For all
e∈ E
, the latency functions
le
are
ω
-measurable, for all
ω
,
le
are
L0
-Lipschitz continuous
and differentiable in qewith le(qe, ω)
qe
>0for all qe0.
Remark.
The continuity assumption reflects the fact that
adding a small amount of traffic does not drastically affect
the travel latency; the monotonicity implies the increments
of traffic does not decrease the latency.
We adopt a stochastic alternative definition of Wardrop
equilibrium. In doing so, we consider a “meta” version of the
congestion game,
Gc= (G,W,X,P,Eω[`(·, ω)])
, with the
utility functions replaced by the expected latency function.
This “meta” congestion game gives rise to a solution concept
corresponding to Definition 1.
Definition 1
(Mean Wardrop Equilibrium [
1
])
.
A path flow
µ
is said to be a Mean Wardrop Equilibrium (MWE) if
w∈ W
,
µp>0
indicates
E[`p]E[`p0]
for all
p0∈ Pw
.
The set of all MWE is denoted by
µ
. Equivalently,
µµ
if and only if the following variational inequality is satisfied:
Eω[hµµ0, `(µ, ω)i]0,µ0
, where
Eω[·]
is the
expectation operator with respect to ω.
The MWE
µ
is in general not a singleton, but a convex set
given the strict monotonicity of latency in Assumption 1. The
seeking of MWE can be cast as a minimization problem of
the expectation of the Stochastic Beckmann Potential (SBP)
[
14
], defined as
φ(µ, ω) = Pe∈E Rµ)e
0le(z, ω)dz
, where
µ)e
is the
e
th element of
Λµ
. We refer to the expectation
摘要:

OntheResilienceofTrafcNetworksunderNon-EquilibriumLearningYunianPan,TaoLi,andQuanyanZhuAbstract—Weinvestigatetheresilienceoflearning-basedIntelligentNavigationSystems(INS)toinformationalowat-tacks,whichexploitthevulnerabilitiesofITinfrastructureandmanipulatetrafcconditiondata.Tothisend,wepropose...

展开>> 收起<<
On the Resilience of Traffic Networks under Non-Equilibrium Learning Yunian Pan Tao Li and Quanyan Zhu Abstract We investigate the resilience of learning-based.pdf

共8页,预览2页

还剩页未读, 继续阅读

声明:本站为文档C2C交易模式,即用户上传的文档直接被用户下载,本站只是中间服务平台,本站所有文档下载所得的收益归上传人(含作者)所有。玖贝云文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。若文档所含内容侵犯了您的版权或隐私,请立即通知玖贝云文库,我们立即给予删除!
分类:图书资源 价格:10玖币 属性:8 页 大小:2.19MB 格式:PDF 时间:2025-05-02

开通VIP享超值会员特权

  • 多端同步记录
  • 高速下载文档
  • 免费文档工具
  • 分享文档赚钱
  • 每日登录抽奖
  • 优质衍生服务
/ 8
客服
关注