also of interest to be able to make decisions based on accurate information to best attenuate
the spread of disease. Moreover, understanding specific attributes of the disease, such as the
incubation time, the number of unreported cases, and how certain we are about this knowledge
are also crucial.
These types of applications are examples of so-called dynamic systems, which are the focus
of this article. Dynamic systems have the property that the future system response depends on
the past system response [1]. Capturing these types of dynamic phenomena can be achieved
using mathematical models, which offer a concrete mechanism for making predictions and
supporting decisions. The extreme flexibility and versatility of mathematics affords modeling
of highly disparate dynamic behavior. However, it also creates a challenge, since it is not
always obvious how to choose an appropriate model. This diversity is perhaps best illustrated
by contrasting examples.
Consider the modeling of rigid-body vehicle dynamics, such as the motion of a car or a
plane. In this case, it is possible to exploit prior knowledge of the system and adopt a classical
mechanics approach to derive Newton-Euler equations of motion for each application [2]. The
mathematical model structure is largely determined by knowledge of the physical system, and
the model will depend on certain parameters such as mass and inertia terms, and damping and
friction coefficients. In many cases, these parameter values can be difficult to obtain based on
first principles approaches alone. It is important to also note that some parameters may have
feasible ranges, such as mass terms being nonnegative, which is also a form of prior knowledge.
Contrasting this type of model, it is also possible to employ highly flexible and general
model structures to describe dynamic systems, such as deep neural networks (DNNs) or Gaus-
sian processes (GPs) [3, 4, 5]. The flexibility of the DNN model class stems from the general
construction of the model, which involves potentially many layers of interacting nonlinear func-
tions. Importantly, these interactions are allowed to adapt for each new application, since they
rely on coefficients/parameters that are free to change values. In the case of GP, the model
structure is also highly flexible, nonparametric and adapted based on available data. For ei-
ther of these flexible model classes, it is more challenging to impose prior system knowledge.
However, some progress is being made along these lines [6, 7].
Irrespective of the type of model, there are unknown quantities that must be determined,
which are often inferred from observations from the system that is being modeled. There are
many different approaches for extracting or estimating these unknown values from observed
system data [8, 9, 10, 11, 12]. Among the many possibilities, this article concentrates on two
commonly used and complimentary approaches. In particular, the presented inference methods
are grouped according to two main attributes: 1) the assumptions made about how to model
unknown parameters, and, 2) what should be estimated in addition to the parameters.
More specifically, if the model parameters are assumed to be deterministic variables, then
this results in a frequentist inference perspective, where the so-called maximum likelihood
(ML) approach has proven to be highly successful in providing accurate point estimates of
the parameters [9, 10]. Alternately, if uncertainty about the parameter values is incorporated
by treating them as random variables, then this results in the so-called Bayesian perspective,
where the posterior distribution of the parameters is the object of interest [11]. An attractive
attribute of the Bayesian approach is that it provides quantification of uncertainty, which is
essential when making decisions based on the associated models. Otherwise, decisions may be
executed based on misplaced confidence. It is also worth mentioning that there is a connection
between these two approaches by considering so-called maximum a posteriori methods [13].
2