DEEPKOOPMAN LEARNING OF NONLINEAR TIME-VARYING SYSTEMS Wenjian Hao

2025-04-26 0 0 626.02KB 17 页 10玖币
侵权投诉
DEEP KOOPMAN LEARNING OF NONLINEAR TIME-VARYING
SYSTEMS
Wenjian Hao
School of Aeronautics and Astronautics Engineering
Purdue University, IN, USA
hao93@purdue.edu
Bowen Huang
Pacific Northwest National Laboratory
Richland, USA
bowen.h@pnnl.gov
Wei Pan
Department of Computer Science
University of Manchester,UK
wei.pan@tudelft.nl
Di Wu
Pacific Northwest National Laboratory
Richland, USA
di.wu@pnnl.gov
Shaoshuai Mou
School of Aeronautics and Astronautics Engineering
Purdue University, IN, USA
mous@purdue.edu
June 22, 2023
ABSTRACT
This paper presents a data-driven approach to approximate the dynamics of a nonlinear time-varying
system (NTVS) by a linear time-varying system (LTVS), which is resulted from the Koopman op-
erator and deep neural networks. Analysis of the approximation error between states of the NTVS
and the resulting LTVS is presented. Simulations on a representative NTVS show that the proposed
method achieves small approximation errors, even when the system changes rapidly. Furthermore,
simulations in an example of quadcopters demonstrate the computational efficiency of the proposed
approach.
1 Introduction
In recent years, data-driven methods have received a significant amount of research attention due to the increasing
complexity of the autonomous systems in both dynamics [14] and mission objectives [57]. In the direction of
learning system dynamics, the Koopman operator has recently been proven to be an effective method to approximate a
nonlinear system by a linear time-varying system based on state-control pairs [24]. Along this direction, two popular
methods dynamic mode decomposition (DMD) and extending dynamic mode decomposition (EDMD) are used to lift
the state space to a higher-dimensional space, where the evolution is approximately linear [8]. However, choosing
the proper observable functions for the lifting transformation is still an open question, and the potentially large lifted
dimension may hinder real-time applications.
Recent work has proposed several methods for choosing proper observable functions of Koopman-based methods for
time-invariant systems. Lusch et al. [9] proposed applying deep learning methods to discover the eigenfunctions of
the approximated Koopman operator. Yeung et al. [1013] introduced deep neural networks (DNN) as observable
functions of the Koopman operator, which are tuned based on collected state-control pairs by minimizing a properly
defined loss function. While some work, such as [14], has extended the DMD method to approximate nonlinear time-
varying systems (NTVS) that change sufficiently slowly by linear time-varying systems (LTVS), this method is not
directly applicable to approximate nonlinear systems with rapidly changing dynamics.
arXiv:2210.06272v3 [eess.SY] 21 Jun 2023
APREPRINT - JUNE 22, 2023
In this paper, we propose a deep Koopman learning method to approximate NTVS, which employs DNN as the
observable function of the Koopman operator and adjusts both the DNN and the approximated dynamical system
simultaneously. This is achieved by tuning the DNN parameters based on the latest state-control data pairs to track the
unknown NTVS.
Compared to existing results in [14], the proposed method is able to approximate an NTVS which does not necessarily
change slowly. Contributions of this paper are summarized as follows:
We propose a deep Koopman representation formulation for the NTVS and provide a practical online algo-
rithm for implementation.
We investigate the error bound of the system state estimation of the proposed method.
We perform a convergence analysis of the proposed method concerning the observable function of DNN.
This paper is organized as follows. In Section 2, we state the problem. Section 3presents the main results. The
numerical simulations are exhibited in Section 4. Finally, Section 5concludes the paper.
Notations. Let ∥·∥denote the Euclidean norm. For a matrix ARn×m,AFdenotes its Frobenius norm; AT
denotes its transpose; Adenotes its Moore-Penrose pseudoinverse. For positive integers nand m,Indenotes the
n×nidentity matrix; 0nRndenotes a vector with all value 0;0n×mdenotes a n×mmatrix with all value 0.
sgn(·)denotes the sign function. ⌈·⌉ denotes the ceiling function, i.e., given real numbers y, integers kand the set of
integers Z,y= min{kZ|ky}.
2 Problem Formulation
Consider an NTVS, the dynamics of which is unknown. Let xtRnand utRmdenote its state and control input
at time t, respectively. t[0,)denotes the continuous-time index.
Suppose the states and control inputs can be obtained from unknown continuous NTVS at certain sampling time
instances tk[0,)with k= 0,1,2,· · · the index of sampled data points. For notation brevity, one denotes
xk:=xtkand uk:=utkas the k-th observed system state and control input, respectively, in the remainder of this
manuscript. Then one can partition the observed states-inputs pairs as the following series of data batches
Bτ={xk, uk:kKτ}, τ = 0,1,2,· · · ,(1)
where
Kτ={kτ, kτ+ 1, kτ+ 2,· · · , kτ+βτ}
denotes the ordered labels set of sampling instances for the τ-th data batch Bτwith βτpositive integers such that
kτ=
τ1
X
i=0
βi, τ 1, k0= 0.
It follows that
kτ+1 =kτ+βτ, τ = 0,1,2,· · · ,
which implies the last data in the τ-th data batch is the first data in (τ+ 1)-th data batch. For notation simplicity, one
defines Bx
τ:={xk:kKτ},Bu
τ:={uk:kKτ}in the rest of this manuscript. An illustration of the above
indexes is shown in Fig. 1.
Figure 1: Time indexes, where blue color denotes kτ.
This paper aims to develop an iterative method that approximates the dynamics of an unknown NTVS based on avail-
able data batches Bτby a linear time-varying discrete-time system. One way to achieve such a linear approximation is
ii
APREPRINT - JUNE 22, 2023
by employing the Koopman operator as in [1013]. Namely, based on the data batch Bτ, one finds a nonlinear mapping
g(·, θτ) : RnRrparameterized by θτRq1and constant matrices AτRr×r,BτRr×m,CτRn×rsuch
that for kKτ, k < kτ+βτ, the following holds approximately,
g(xk+1, θτ) = Aτg(xk, θτ) + Bτuk,(2)
xk+1 =Cτg(xk+1, θτ).(3)
Here, g(·, θτ),Aτ,Bτ,Cτachieved from Bτare put together in the following KBτ:
KBτ:={g(·, θτ), Aτ, Bτ, Cτ},(4)
which is called a deep Koopman representation (DKR) in this manuscript.
Based on the DKR in (4), one could introduce ˆxkRnfor kKτ, k < kτ+βτas follows:
g(ˆxk+1, θτ) = Aτg(ˆxk, θτ) + Bτˆuk,(5)
ˆxk+1 =Cτg(ˆxk+1, θτ),(6)
where
ˆuk=uk,kKτ, k < kτ+βτ,ˆxkτ=xkτ.(7)
This leads to the following linear system
ˆxk+1 =ˆ
Aτˆxk+ˆ
Bτˆuk, k Kτ, k < kτ+βτ,(8)
with ˆ
Aτ=CτAτC
τ,ˆ
Bτ=CτBτand initial conditions in (7). The system (8) can be viewed as a linear approximation
to the NTVS based on the data batch Bτ. Note that when for any data batch Bτ, one has Aτ,Bτ,Cτremain constant
for kKτ. It follows that (8) is a linear time-invariant system for kKτ.
To sum up, the problem of interest is to develop an iterative update rule to achieve a DKR in (4) based on data batch
Bτin (1) such that the linear system (8) is a nice approximation of the unknown NTVS, i.e. ˆxk(8) is close to xk
observed in Bτfrom the unknown NTVS in the sense that for any given accuracy ε0,ˆuk=ukand ˆxkτ=xkτ, the
estimation error ˆxkxk∥≤ ε.
3 Main Results
This section proposes an algorithm to achieve a deep Koopman representation (DKR) that can approximate an un-
known NTVS. We then investigate the estimation error between the state obtained from this DKR, as given in (8), and
the observed state of the unknown NTVS.
3.1 Key Idea
Motivated by deep Koopman operator-based methods developed in [1013], an optimal θτfor the deep Koopman
representation (DKR), denoted by θ
τ, can be obtained by solving the following optimization problem based on the
data batch Bτ:
θ
τ= arg min
θτRq{wL1(Aτ, Bτ, θτ) + (1 w)L2(Cτ, θτ)},(9)
where
L1(Aτ, Bτ, θτ) = 1
βτ
kτ+βτ1
X
k=kτ
g(xk+1, θτ)(Aτg(xk, θτ) + Bτuk)2(10)
and
L2(Cτ, θτ) = 1
βτ
kτ+βτ1
X
k=kτ
xkCτg(xk, θτ)2.(11)
The objectives of (10) and (11) are to approximate (2) and (3), respectively. Here, 0< w < 1is a constant that
combines the objective of minimizing L1and L2. In simple terms, L1and L2measure the simulation errors in the
lifted and original coordinates, respectively.
1Here, g(·, θτ)is usually represented by a DNN with a known structure gand an adjustable parameter θτRq.
iii
APREPRINT - JUNE 22, 2023
To solve (9), one needs to rewrite the available data batch and objective functions L1and L2in compact forms. Toward
this end, the following notation is introduced:
Xτ= [xkτ, xkτ+1,· · · , xkτ+βτ1]Rn×βτ,
¯
Xτ= [xkτ+1, xkτ+2,· · · , xkτ+βτ]Rn×βτ,
Uτ= [ukτ, ukτ+1,· · · , ukτ+βτ1]Rm×βτ.
Then L1in (10) and L2in (11) can be rewritten as
L1=1
βτ
¯
Gτ(AτGτ+BτUτ)2
F(12)
and
L2=1
βτ
XτCτGτ2
F,(13)
where
Gτ= [g(xkτ, θτ),· · · , g(xkτ+βτ1, θτ)] Rr×βτ,
¯
Gτ= [g(xkτ+1, θτ),· · · , g(xkτ+βτ, θτ)] Rr×βτ.(14)
By minimizing L1with respect to Aτ, Bτin (12) and minimizing L2regarding Cτin (13), Aτ, Bτ, Cτcan be deter-
mined by θτas follows:
[Aθ
τ, Bθ
τ] = ¯
GτGτ
Uτ
,(15)
Cθ
τ=XτG
τ.(16)
Replacing Aτand Bτin (10) by (15) and Cτin (11) by (16), the objective function in (9) can be reformulated as
L(θτ) = 1
βτ
kτ+βτ1
X
k=kτ
g(xk+1, θτ)
xkKθ
τg(xk, θτ)
uk2,(17)
with
Kθ
τ=Aθ
τBθ
τ
Cθ
τ0n×m.
Applying the existing deep Koopman operator methods developed in [1013] to achieve the DKR by solving (9) based
on each Bτavailable has two shortcomings. First, computing the pseudo-inverse in (15) and (16) repeatedly while
solving (9) becomes computationally expensive as τincreases. Second, θτmust be initialized for each Bτ, which
can be challenging in time-varying systems applications. To overcome these two limitations, one can apply the deep
Koopman operator method to approximate the unknown NTVS efficiently, and we propose the following method.
3.2 Algorithm
Before proceeding, we need the following assumption.
Assumption 1 The matrix GτRr×βτin (14)and Gτ
UτR(r+m)×βτare of full row rank.
Remark 1 Assumption 1is to ensure the matrices GτRr×βτand Gτ
UτR(r+m)×βτinvertible and it naturally
requires βτr+m.
Lemma 1 Given KBτin (4), if Assumption 1holds, then the matrices Aθ
τ+1, Bθ
τ+1,Cθ
τ+1 can be achieved by
[Aθ
τ+1, Bθ
τ+1]=(¯
Gτ+1 [Aθ
τ, Bθ
τ]χτ+1)λτχT
τ+1(χτχT
τ)1+ [Aθ
τ, Bθ
τ],(18)
Cθ
τ+1 = (Xτ+1 Cθ
τGτ+1)¯
λτGT
τ+1(GτGT
τ)1+Cθ
τ,(19)
where χτ=Gτ
UτR(r+m)×βτ,λτ= (Iβτ+1 +χT
τ+1(χτχT
τ)1χτ+1)1Rβτ+1 ×βτ+1 ,¯
λτ= (Iβτ+1 +
GT
τ+1(GτGT
τ)1Gτ+1)1Rβτ+1 ×βτ+1 with Gτ+1 Rr×βτ+1 ,¯
Gτ+1 Rr×βτ+1 defined in (14).
iv
摘要:

DEEPKOOPMANLEARNINGOFNONLINEARTIME-VARYINGSYSTEMSWenjianHaoSchoolofAeronauticsandAstronauticsEngineeringPurdueUniversity,IN,USAhao93@purdue.eduBowenHuangPacificNorthwestNationalLaboratoryRichland,USAbowen.h@pnnl.govWeiPanDepartmentofComputerScienceUniversityofManchester,UKwei.pan@tudelft.nlDiWuPacif...

展开>> 收起<<
DEEPKOOPMAN LEARNING OF NONLINEAR TIME-VARYING SYSTEMS Wenjian Hao.pdf

共17页,预览4页

还剩页未读, 继续阅读

声明:本站为文档C2C交易模式,即用户上传的文档直接被用户下载,本站只是中间服务平台,本站所有文档下载所得的收益归上传人(含作者)所有。玖贝云文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。若文档所含内容侵犯了您的版权或隐私,请立即通知玖贝云文库,我们立即给予删除!
分类:图书资源 价格:10玖币 属性:17 页 大小:626.02KB 格式:PDF 时间:2025-04-26

开通VIP享超值会员特权

  • 多端同步记录
  • 高速下载文档
  • 免费文档工具
  • 分享文档赚钱
  • 每日登录抽奖
  • 优质衍生服务
/ 17
客服
关注