Online Probabilistic Model Identification Using Adaptive Recursive MCMC Pedram Agand

2025-05-02 0 0 2.26MB 9 页 10玖币
侵权投诉
Online Probabilistic Model Identification Using
Adaptive Recursive MCMC
Pedram Agand
Department of Computer Science,
Simon Fraser University
Burnaby, Canada.
pagand@sfu.ca
Mo Chen
Department of Computer Science,
Simon Fraser University
Burnaby, Canada.
0000-0001-8506-3665
Hamid D. Taghirad
Department of Electrical Engineering,
K. N. Toosi University of Technology
Tehran, Iran.
0000-0002-0615-6730
Abstract—Although the Bayesian paradigm offers a formal
framework for estimating the entire probability distribution
over uncertain parameters, its online implementation can be
challenging due to high computational costs. We suggest the
Adaptive Recursive Markov Chain Monte Carlo (ARMCMC)
method, which eliminates the shortcomings of conventional online
techniques while computing the entire probability density func-
tion of model parameters. The limitations to Gaussian noise, the
application to only linear in the parameters (LIP) systems, and
the persistent excitation (PE) needs are some of these drawbacks.
In ARMCMC, a temporal forgetting factor (TFF)-based variable
jump distribution is proposed. The forgetting factor can be
presented adaptively using the TFF in many dynamical systems
as an alternative to a constant hyperparameter. By offering
a trade-off between exploitation and exploration, the specific
jump distribution has been optimised towards hybrid/multi-
modal systems that permit inferences among modes. These trade-
off are adjusted based on parameter evolution rate. We demon-
strate that ARMCMC requires fewer samples than conventional
MCMC methods to achieve the same precision and reliability.
We demonstrate our approach using parameter estimation in
a soft bending actuator and the Hunt-Crossley dynamic model,
two challenging hybrid/multi-modal benchmarks. Additionally,
we compare our method with recursive least squares and the
particle filter, and show that our technique has significantly more
accurate point estimates as well as a decrease in tracking error
of the value of interest.
Index Terms—MCMC, Bayesian optimization, hybrid/multi-
modal systems, temporal forgetting factor.
I. INTRODUCTION
Bayesian methods are powerful tools to not only obtain a
numerical estimate of a parameter but also to give a measure of
confidence [1]. This is obtained by calculating the probability
distribution of parameters rather than a point estimate, which
is prevalent in frequentist paradigms [2], [3]. One of the main
advantages of probabilistic frameworks is that they enable
decision making under uncertainty. In addition, knowledge
fusion is significantly facilitated in probabilistic frameworks;
different sources of data or observations can be combined
Disclaimer: This work has been accepted for publication in the International
Joint Conference on Neural Networks (IJCNN). © 2023 IEEE. Personal use
of this material is permitted. Permission from IEEE must be obtained for all
other uses, in any current or future media, including reprinting/republishing
this material for advertising or promotional purposes, creating new collective
works, for resale or redistribution to servers or lists, or reuse of any
copyrighted component of this work in other works.
according to their level of certainty in a principled manner [4].
Using credible intervals instead of confidence intervals [5], an
absence of over parameterized phenomena [1], and evaluation
in the presence of limited number of observed data [6] are
other distinct features of this framework.
Nonetheless, Bayesian inference requires high computa-
tional effort for obtaining the whole probability distribution
and requires prior general knowledge about the noise dis-
tribution. Consequently, strong simplifying assumptions were
often made (e.g. calculating specific features of the model
parameters distribution rather than the whole distribution).
One of the most effective methods for Bayesian inferences is
Markov Chain Monte Carlo (MCMC). In the field of system
identification, MCMC variants such as the one proposed by
[7] are mostly focused on offline system identification. This is
partly due to computational challenges which prevent its real-
time use. There are extensive research in the literature which
investigate how to increase sample efficiency (e.g. [8], [9]).
The authors in [10] first introduced reversible jump Markov
chain Monte Carlo (RJMCMC) as a method to address the
model selection problem. In this method, an extra pseudo-
random variable is defined to address dimension mismatch.
There are further extensions of MCMC in the literature;
however, there is a lack of variants suitable for online esti-
mation. One example which claims to have often much faster
convergence is No-U-Turn Sampler (NUTS), which can adapt
proposals “on the fly” [11].
If the general form of the relation between inputs and
outputs is known either by physical relation or knowing the
basis function, parametric identification techniques are an
effective way to model a system. To identify the parameters in
a known model, researchers propose frequency ( [12]) or time
domain ( [13]) approaches. Noisy measurements, inaccuracy,
inaccessibility, and costs are typical challenges that limit direct
measurement of unknown parameters in a physical/practical
system [14]. Parameter identification techniques have been
extensively used in different subject areas including but not
limited to chemistry, robotics, fractional models and health
sectors [15]–[19]. For instance, motion filtering and force
prediction in robotics applications are important fields of study
with interesting challenges which makes them suitable test
cases for Bayesian inferences [20].
arXiv:2210.12595v2 [cs.LG] 19 Oct 2023
Different parametric identification methods have been pro-
posed in the literature for linear and Gaussian noise [21];
however, in cases of nonlinear hybrid/multi-modal systems
(e.g. Hunt-Crossley or fluid soft bending actuator) or systems
with non-Gaussian noise (e.g. impulsive disturbance), there is
no optimal solution for the identification problem. In addition,
in cases that the assumed model for the system is not com-
pletely valid, MCMC implementation of Bayesian approach
can provide reasonable inferences. Authors in [22] utilize the
least square methods for nonlinear physical modeling of air
dynamics in pneumatic soft actuator. A method to determine
the damping term in the Hunt-Crossley model was proposed
in [23]. A single-stage method for the estimation of the Hunt-
Crossley model is proposed by [24] which requires some
restrictive conditions to calculate the parameters. Moreover,
the method does not offer any solution for discontinuity in
the dynamic model, which is common in the transition phase
from contact to free motion and vice versa [25].
This paper proposes a new technique, Adaptive Recursive
Markov Chain Monte Carlo (ARMCMC), to address several
weaknesses of traditional online identification methods, such
as solely being applicable to systems Linear in Parameters
(LIP), having Persistent Excitation (PE) requirements, and
assuming Gaussian noise. ARMCMC is an online method that
takes advantage of the previous posterior distribution whenever
there is no abrupt change in the parameter distribution. To
achieve this, we define a new variable jump distribution that
accounts for the degree of model mismatch using a temporal
forgetting factor (TFF). The TFF is computed from a model
mismatch index and determines whether ARMCMC employs
modification or reinforcement to either restart or refine the
estimated parameter distribution. As this factor is a function of
the observed data rather than a simple user-defined constant, it
can effectively adapt to the underlying dynamics of the system.
We demonstrate our method using two different examples:
a soft bending actuator and the Hunt-Crossley model. We
show favorable performance compared to the state-of-the-art
baselines.
II. PRELIMINARIES
A. Problem statement
In the Bayesian paradigm, parameter estimations are given
in the form of the posterior probability density function (pdf);
this pdf can be continuously updated as new data points are
received. Consider the following general model:
Y=Fj(X, θj) + νj,j∈ {1,2, . . . , m},(1)
where Y,X,θ, and νare concurrent output, input, model
parameters set and noise vectors, respectively. For a hybrid
system, there can be mdifferent nonlinear functions (Fj)
with different parameters set (θj) for each mode. For multi-
modal systems, there can be mdifferent noise distribution νj
for each mode. To calculate the posterior pdf, the observed
data (input/output pairs) along with a prior distribution are
time
Data
Interval
Algorithm
Interval
𝑡!𝑡"𝑡"+ 𝑁#+ 1
𝑡 − 1 𝑡
Phase: A
𝑡 + 1
BC
Fig. 1: Data timeline and different phases of ARMCMC
algorithm. For algorithm at time t: Phase (A) Data collection
[Nsdata points are packed for the next algorithm time
step], Phase (B) Adjustment [the method is applying to the
most recent pack], (C) Execution [after the min evaluation of
algorithm, the results will be updated on posterior distribution
and any byproduct value of interest].
combined via Bayes’ rule [26]. The Bayesian rule is defined
as follows [1].
P(θ|D) = P(D|θ)P(θ)
P(D),(2)
where, P(θ)is the prior probability of parameters, P(D) =
RP(D|θ)P(θ), is called as evidence, P(D|θ)is the like-
lihood function, and P(θ|D)is posterior probability of pa-
rameters given the data. We will be applying updates to the
posterior pdf using batches of data points; hence, it will be
convenient to partition the data as follows:
Dt={(X, Y )tm+1,(X, Y )tm+2,··· ,(X, Y )tm+Ns+1},(3)
where Ns=Ts/T is the number of data points in each
data pack with T, Tsbeing the data and algorithm sampling
times, respectively. This partitioning is convenient for online
applications, as Dt1should have been previously collected
so that the algorithm can be executed from tmto tm+Ns+1,
an interval which we will define as algorithm time step t.
Ultimately, inferences are completed at tm+Ns+ 2. Fig.
1 illustrates the timeline for the data and the algorithm.
It is worth mentioning that the computation can be done
simultaneously with the task in the adjacent algorithm step
(e.g. phase A of algorithm t, phase B of algorithm t1and
phase C of algorithm t2can all be done simultaneously).
According to Bayes’ rule (2) and assuming data points are
independent and identically distributed ( i.i.d) in Eq. (1), we
have
P(θt|[Dt1, Dt]) = PDt|θt, Dt1P(θt|Dt1)
RPD1|τt, Dt1P(τt|Dt1)t,
(4)
where θtdenotes the parameters at current algorithm time
steps. P(θt|Dt1)is the prior distribution over parameters,
which is also the posterior distribution at the previous algo-
rithm time step. Probability PDt|θt, Dt1is the likelihood
function which is obtained by sampling from the one-step-
ahead prediction:
ˆ
Yt|t1=F(Dt1, θt),(5)
摘要:

OnlineProbabilisticModelIdentificationUsingAdaptiveRecursiveMCMCPedramAgandDepartmentofComputerScience,SimonFraserUniversityBurnaby,Canada.pagand@sfu.caMoChenDepartmentofComputerScience,SimonFraserUniversityBurnaby,Canada.0000-0001-8506-3665HamidD.TaghiradDepartmentofElectricalEngineering,K.N.ToosiU...

展开>> 收起<<
Online Probabilistic Model Identification Using Adaptive Recursive MCMC Pedram Agand.pdf

共9页,预览2页

还剩页未读, 继续阅读

声明:本站为文档C2C交易模式,即用户上传的文档直接被用户下载,本站只是中间服务平台,本站所有文档下载所得的收益归上传人(含作者)所有。玖贝云文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。若文档所含内容侵犯了您的版权或隐私,请立即通知玖贝云文库,我们立即给予删除!
分类:图书资源 价格:10玖币 属性:9 页 大小:2.26MB 格式:PDF 时间:2025-05-02

开通VIP享超值会员特权

  • 多端同步记录
  • 高速下载文档
  • 免费文档工具
  • 分享文档赚钱
  • 每日登录抽奖
  • 优质衍生服务
/ 9
客服
关注