Online Probabilistic Model Identification Using
Adaptive Recursive MCMC
Pedram Agand
Department of Computer Science,
Simon Fraser University
Burnaby, Canada.
pagand@sfu.ca
Mo Chen
Department of Computer Science,
Simon Fraser University
Burnaby, Canada.
0000-0001-8506-3665
Hamid D. Taghirad
Department of Electrical Engineering,
K. N. Toosi University of Technology
Tehran, Iran.
0000-0002-0615-6730
Abstract—Although the Bayesian paradigm offers a formal
framework for estimating the entire probability distribution
over uncertain parameters, its online implementation can be
challenging due to high computational costs. We suggest the
Adaptive Recursive Markov Chain Monte Carlo (ARMCMC)
method, which eliminates the shortcomings of conventional online
techniques while computing the entire probability density func-
tion of model parameters. The limitations to Gaussian noise, the
application to only linear in the parameters (LIP) systems, and
the persistent excitation (PE) needs are some of these drawbacks.
In ARMCMC, a temporal forgetting factor (TFF)-based variable
jump distribution is proposed. The forgetting factor can be
presented adaptively using the TFF in many dynamical systems
as an alternative to a constant hyperparameter. By offering
a trade-off between exploitation and exploration, the specific
jump distribution has been optimised towards hybrid/multi-
modal systems that permit inferences among modes. These trade-
off are adjusted based on parameter evolution rate. We demon-
strate that ARMCMC requires fewer samples than conventional
MCMC methods to achieve the same precision and reliability.
We demonstrate our approach using parameter estimation in
a soft bending actuator and the Hunt-Crossley dynamic model,
two challenging hybrid/multi-modal benchmarks. Additionally,
we compare our method with recursive least squares and the
particle filter, and show that our technique has significantly more
accurate point estimates as well as a decrease in tracking error
of the value of interest.
Index Terms—MCMC, Bayesian optimization, hybrid/multi-
modal systems, temporal forgetting factor.
I. INTRODUCTION
Bayesian methods are powerful tools to not only obtain a
numerical estimate of a parameter but also to give a measure of
confidence [1]. This is obtained by calculating the probability
distribution of parameters rather than a point estimate, which
is prevalent in frequentist paradigms [2], [3]. One of the main
advantages of probabilistic frameworks is that they enable
decision making under uncertainty. In addition, knowledge
fusion is significantly facilitated in probabilistic frameworks;
different sources of data or observations can be combined
Disclaimer: This work has been accepted for publication in the International
Joint Conference on Neural Networks (IJCNN). © 2023 IEEE. Personal use
of this material is permitted. Permission from IEEE must be obtained for all
other uses, in any current or future media, including reprinting/republishing
this material for advertising or promotional purposes, creating new collective
works, for resale or redistribution to servers or lists, or reuse of any
copyrighted component of this work in other works.
according to their level of certainty in a principled manner [4].
Using credible intervals instead of confidence intervals [5], an
absence of over parameterized phenomena [1], and evaluation
in the presence of limited number of observed data [6] are
other distinct features of this framework.
Nonetheless, Bayesian inference requires high computa-
tional effort for obtaining the whole probability distribution
and requires prior general knowledge about the noise dis-
tribution. Consequently, strong simplifying assumptions were
often made (e.g. calculating specific features of the model
parameters distribution rather than the whole distribution).
One of the most effective methods for Bayesian inferences is
Markov Chain Monte Carlo (MCMC). In the field of system
identification, MCMC variants such as the one proposed by
[7] are mostly focused on offline system identification. This is
partly due to computational challenges which prevent its real-
time use. There are extensive research in the literature which
investigate how to increase sample efficiency (e.g. [8], [9]).
The authors in [10] first introduced reversible jump Markov
chain Monte Carlo (RJMCMC) as a method to address the
model selection problem. In this method, an extra pseudo-
random variable is defined to address dimension mismatch.
There are further extensions of MCMC in the literature;
however, there is a lack of variants suitable for online esti-
mation. One example which claims to have often much faster
convergence is No-U-Turn Sampler (NUTS), which can adapt
proposals “on the fly” [11].
If the general form of the relation between inputs and
outputs is known either by physical relation or knowing the
basis function, parametric identification techniques are an
effective way to model a system. To identify the parameters in
a known model, researchers propose frequency ( [12]) or time
domain ( [13]) approaches. Noisy measurements, inaccuracy,
inaccessibility, and costs are typical challenges that limit direct
measurement of unknown parameters in a physical/practical
system [14]. Parameter identification techniques have been
extensively used in different subject areas including but not
limited to chemistry, robotics, fractional models and health
sectors [15]–[19]. For instance, motion filtering and force
prediction in robotics applications are important fields of study
with interesting challenges which makes them suitable test
cases for Bayesian inferences [20].
arXiv:2210.12595v2 [cs.LG] 19 Oct 2023