1
Data-Driven Quickest Change Detection in (Hidden)
Markov Models
Qi Zhang Zhongchang Sun Luis C. Herrera Shaofeng Zou
Abstract—The paper investigates the problems of quickest
change detection in Markov models and hidden Markov models
(HMMs). Sequential observations are taken from a (hidden)
Markov model. At some unknown time, an event occurs in the
system and changes the transition kernel of the Markov model
and/or the emission probability of the HMM. The objective is to
detect the change quickly, while controlling the average running
length (ARL) to false alarm. The data-driven setting is studied,
where no knowledge of the pre-, post-change distributions is
available. Kernel-based data-driven algorithms are developed,
which can be applied in the setting with continuous state, can be
updated in a recursive fashion, and are computationally efficient.
Lower bounds on the ARL and upper bound on the worst-case
average detection delay (WADD) are derived. The WADD is at
most of the order of the logarithm of the ARL. The algorithms
are further numerically validated on two practical problems of
fault detection in DC microgrid and photovoltaic systems.
Index Terms—Maximum Mean Discrepancy, Kernel Method,
Fault Detection, Non-i.i.d..
I. INTRODUCTION
In the problem of quickest change detection (QCD), se-
quential observations are generated from a stochastic process.
At some unknown time (change-point), an event occurs and
causes the data-generating distribution to undergo a change.
The objective is to quickly detect the change, while controlling
the false alarm rate. This QCD problem has been widely
studied in the literature [3]–[7]. It models a wide range
of applications, e.g., fault detection in DC microgrids [8],
quality control in online manufacturing systems [9], spectrum
monitoring in wireless communications [10], early detection
of epidemics [11] and signal processing in genetic area [12].
Most existing studies investigate the setting where pre-
and post-change samples are independent and identically dis-
tributed (i.i.d.), respectively. In [13]–[15], the authors prove
that the CuSum algorithm is optimal when the samples are
i.i.d.. However, the i.i.d. assumption may not hold in practice,
e.g., in photovoltaic systems [16], samples are not independent
over time. In [17]–[20], the general QCD problem for the non-
i.i.d. setting is studied, where the approaches are model-based
with known pre- and post-change distributions. In [21]–[23],
the case where samples are generated from a Markov model is
investigated. In these literature, both the pre- and post-change
distributions are known exactly, and therefore those algorithms
This paper was presented in part at the 2023 IEEE International Confer-
ence on Acoustics, Speech, and Signal Processing [1] and the 2023 IEEE
International Symposium on Information Theory [2].
The authors are with the Department of Electrical Engineering, Univer-
sity at Buffalo, Buffalo, NY 14228 USA (e-mail: qzhang48@buffalo.edu,
zhongcha@buffalo.edu, lcherrer@buffalo.edu, szou3@buffalo.edu).
may not be applicable if such information is unavailable or
the performance may degrade significantly if such information
is inaccurate. In this paper, we first study QCD problem in
Markov models under the data-driven setting, where neither
the pre- nor the post-change transition kernel is known. We
then focus on the more challenging hidden Markov models
(HMMs), where the pre- and post-change transition kernels
and emission probability distributions are unknown.
A. Related Works
For Markov models, the data-driven setting where the pre-
and the post-change transition kernels are unknown is studied
in, e.g., [24], [25]. QCD in Markov models with known
pre-change and unknown post-change transition kernels is
investigated in [24]. The maximum likelihood estimate (MLE)
of the unknown post-change transition kernel is employed
to construct a generalized CuSum-type test. However, this
method only works for finite state Markov chains or param-
eterized Markov transition kernels, which is not applicable
to general problems with continuous state. QCD in Markov
models with both pre- and post-change transition kernels
unknown is studied in [25], where a kernel-based method is
developed. A kernel method of maximum mean discrepancy
(MMD) [26] is employed to construct a CuSum-type test. The
kernel MMD method for data-driven QCD under the i.i.d.
setting is also investigated in e.g., [27], [28]. However, they
focus on the case with i.i.d. samples, and thus their approach
cannot be directly applied to our Markov models.
In [29]–[33], QCD in HMMs is studied. All these studies
investigate the model-based setting with both pre- and post-
change information known. However, the information may
not be accurate or available in practice, especially when the
change is due to some unexpected fault. In [29], the model-
based setting with known pre- and post-change distributions
for QCD in HMMs with a finite state space is investigated.
The L1-norm of products of Markov random matrices are
employed to construct a CuSum test. This approach however
only applies to HMMs with a finite state space. In [30], the
same problem is investigated. The ratio of the L1-norms of
products of Markov random matrices is employed to construct
the log-likelihood in their Shiryayev–Roberts–Pollak test. In
[32], a recursive algorithm is designed for the same problem.
A quasi-generalized likelihood ratio is employed to update
the algorithm in a recursive fashion. The proposed approach
however only applies to HMMs with two states and does not
generalize to problems with a continuous state space. QCD
in HMMs under Bayesian setting is studied in [31], which
arXiv:2210.11988v5 [eess.SP] 8 Nov 2023