et al., 2012; Peel and Clauset, 2015), medical image processing (Malladi et al., 2013), cyber-
security (Tartakovsky et al., 2012b), video surveillance (Lee and Kriegman, 2005), COVID-19
intervention (Dehning et al., 2020), and so on. Recently, there has been much interest in
out-of-distribution detection (Ren et al., 2019; Magesh et al., 2022), which aims to detect
the shift of the underlying distribution of the test data from training data, which is related
to sequential change-point detection.
This tutorial considers sequential (also known as the online or quickest, in some context)
change point detection problems, where the goal is to monitor a sequence of data or time
series for any change and raise an alarm as quickly as possible the change has occurred
to prevent any potential loss. The above problem is different from another major type of
change-point detection problem, which is offline and aims to detect and localize (possibly
multiple) change-points from sequential data in retrospect (see Truong et al. (2020) for a
review). This tutorial focuses on sequential change-point detection.
Shewhart (1925) introduced a control chart that computes a statistic from every sam-
ple, where a low statistic value represents the process still within the desired control. Later
it is generalized to compute a statistic for several consecutive samples. Page (1954) pro-
posed the CUSUM procedure based on the log-likelihood ratio between the two exactly
known distributions, one for the control and one for the anomaly. Moustakides (1986); Lor-
den (1971) proved that CUSUM has strong optimality properties. Shiryaev-Roberts (SR)
procedure (Shiryaev, 1963; Roberts, 1966) is similar to CUSUM inspired by the Bayesian
setting, which is also optimal in several senses (Pollak, 1985; Pollak and Tartakovsky, 2009;
Polunchenko and Tartakovsky, 2010; Tartakovsky et al., 2012a). More recent works on
change point detection focus on relaxing the strong assumptions of parametric and known
distributions. The generalized likelihood ratio (GLR) procedure (Lorden, 1971; Siegmund
and Venkatraman, 1995) aims to detect the change to an unknown distribution assuming
a parametric form by searching for the most probable post-change scenario. These classic
procedures and their variants can also be found in Basseville et al. (1993); Tartakovsky et al.
(2014). Sparks (2000) proposed the first adaptive CUSUM, which detects an unknown mean
shift by inserting an adaptive estimate of new mean to the CUSUM recursion, followed by
Lorden and Pollak (2005); Abbasi and Haq (2019); Cao et al. (2018); Xie et al. (2022). Such
procedures allow an unknown post-change distribution while also having the computational
benefit of CUSUM. More recently, Romano et al. (2023); Ward et al. (2022) present a novel
computationally-efficient approach that implements the GLR test statistic for the Gaussian
mean shift and the Poisson parameter shift detection, respectively. The above works belong
to parametric change-point detection, i.e., assuming the pre- and post-change distributions
belong to the parametric family and detect a certain type of change (for instance, the mean
shift and covariance change). There have also been many non-parametric and distribution-
2