1
Bayesian Calibration of the Intelligent Driver Model
Chengyuan Zhang, Graduate Student Member, IEEE, and Lijun Sun, Senior Member, IEEE
Abstract—Accurate calibration of car-following models is essen-
tial for understanding human driving behaviors and implementing
high-fidelity microscopic simulations. This work proposes a
memory-augmented Bayesian calibration technique to capture
both uncertainty in the model parameters and the temporally
correlated behavior discrepancy between model predictions
and observed data. Specifically, we characterize the parameter
uncertainty using a hierarchical Bayesian framework and model
the temporally correlated errors using Gaussian processes. We
apply the Bayesian calibration technique to the intelligent driver
model (IDM) and develop a novel stochastic car-following model
named memory-augmented IDM (MA-IDM). To evaluate the
effectiveness of MA-IDM, we compare the proposed MA-IDM
with Bayesian IDM in which errors are assumed to be i.i.d.,
and our simulation results based on the HighD dataset show
that MA-IDM can generate more realistic driving behaviors and
provide better uncertainty quantification than Bayesian IDM.
By analyzing the lengthscale parameter of the Gaussian process,
we also show that taking the driving actions from the past five
seconds into account can be helpful in modeling and simulating
the human driver’s car-following behaviors.
Index Terms—car-following model, serial correlation, Bayesian
inference, hierarchical model, Gaussian processes
I. INTRODUCTION
Microscopic car-following models are powerful tools to
study and simulate human driving behaviors in traffic flows at
the trajectory level. They reveal the mechanisms of complex
interactions between a subject vehicle and its leading vehicle.
These interactions are the essential factors that affect the
dynamics of traffic flow and create diverse macroscopic traffic
phenomena [1]. Although experiences indicate that we cannot
calibrate a zero error model that perfectly fits the data [2],
some car-following models are still valid with which one
can adopt various calibration (or parameter identification)
methods to reproduce reality to different extents. Given that the
performance and fidelity of microscopic car-following models
are heavily dependent on accurate calibration, improving the
quality of calibration methods is a critical research question.
In recent studies, probabilistic calibration methods have
become an emerging and promising approach with a solid
statistical foundation. There are essentially two probabilistic
approaches for model calibration. The first is maximum likeli-
hood estimation (MLE): we often solve MLE as an optimization
problem and finally obtain a point estimation for model
(Corresponding author: Lijun Sun)
C. Zhang and L. Sun are with the Department of Civil Engineering, McGill
University, Montreal, QC H3A 0C3, Canada. E-mail: enzozcy@gmail.com (C.
Zhang), lijun.sun@mcgill.ca (L. Sun).
The authors would like to thank the McGill Engineering Doctoral Awards
(MEDA), the Institute for Data Valorisation (IVADO), the Interuniversity
Research Centre on Enterprise Networks, Logistics and Transportation
(CIRRELT), Fonds de recherche du Qu
´
ebec – Nature et technologies (FRQNT),
and the Natural Sciences and Engineering Research Council (NSERC) of
Canada for providing scholarships and funding to support this study.
parameters; see, e.g., [3]. The second is Bayesian inference,
which allows us to exploit the full posterior distribution for
model parameters through Markov chain Monte Carlo (MCMC)
or variational inference (VI); see, e.g., [4], [5]. In general, the
Bayesian inference approach is advantageous in two aspects:
(1) we can perform uncertainty quantification based on the
full posterior distribution. This is particularly important for a
simulation model as we are often interested in the distribution
and uncertainty of the simulation results; (2) Bayesian inference
offers a hierarchical modeling scheme that allows us to learn
parameter distributions at both population and individual
levels, where the population distribution prevents overfitting
by imposing certain dependencies on the parameters. This
is particularly important for calibrating car-following models
since samples are often short trajectories collected from a large
number of drivers with diverse vehicle configurations/dynamics.
Considering substantial differences in personal driving styles
and vehicle configurations/dynamics, it is critical to learn
the population distribution that can generalize the parameter
distribution from short trajectory data. In addition, the learned
population posterior distribution can help create diverse driving
behaviors to create more realistic simulations with driver
heterogeneity. Note that in the following of this paper, we
refer to “driver behavior” as the characteristics resulting from
both the driver and the vehicle.
A fundamental question in performing probabilistic cali-
bration is how to define the probabilistic model and data
generation process. Most existing car-following models seek
parsimonious structures by simply taking observations from
the most recent (i.e., only one) step as input to generate
acceleration/speed as output for the current step. However,
given physical inertia, delay in reaction, and missing important
covariates (e.g., car follower models, in general, do not take
the status of the breaking light of the leading vehicle as
input variable), we should expect unexplained behavior (ie,
the discrepancy between predicted acceleration/speed and
observed acceleration/speed) to be temporally correlated [6],
[7]. For instance, Wang et al. [7] show that the best-performed
deep learning model essentially takes states/observations from
the most recent
∼
10 sec as input. However, most existing
probabilistic calibration methods are developed based on a
simple assumption—the errors are independent and identically
distributed (i.i.d.), and as a result, ignoring the autocorrelation
in the residuals will lead to biased calibration [8], [9]. For
example, Fig. 1 shows the residual process in acceleration
(
a
), speed (
v
), and gap (
s
) when calibrating an IDM model
with i.i.d. noise, and we can see that the residuals have strong
serial correlation (i.e., autocorrelation). Essentially, two classes
of methods are developed in the literature to model serial
correlation. One approach is to directly process the time series
data to eliminate serial correlations. For instance, Hoogendoorn
arXiv:2210.03571v2 [stat.AP] 12 Jan 2024