A primary advantage of deep learning is its data-driven
nature. When trained on diverse and comprehensive datasets,
it can yield superior accuracy and robustness. The efficacy
of these models is heavily influenced by the quality and
diversity of the training data, highlighting the importance of
meticulously curated datasets. However, these models also
pose challenges. A significant constraint is their reliance on ag-
gregated patient datasets. When trained with an mean-squared
error (MSE) criterion, deep networks often exhibit a strong
bias towards the mean. Consequently, in situations with noise
where multiple plausible waveforms might have produced the
observed data, these MSE-trained networks often output the
posterior mean. This bias complicates the task of tailoring the
model to an individual patient’s unique ECG characteristics.
Furthermore, the complexity of these architectures requires
substantial computational resources. Additionally, their ”black
box” nature introduces challenges in model interpretability.
Among the arsenal of effective ECG denoising techniques,
non-parametric methods have attracted considerable atten-
tion [24]. empirical mode decomposition (EMD) methods [25],
such as [26]–[28], stand out for their ability to decompose a
signal into intrinsic mode functions (IMFs), with each IMF
representing a specific oscillatory mode within the original
ECG. While EMD’s decomposition provides a clear repre-
sentation of the ECG’s frequency components and facili-
tates effective noise separation, it can sometimes be sensitive
to fluctuations, leading to mode mixing or the emergence
of spurious modes. Additionally, wavelet transforms offer
a robust multi-resolution analysis, enabling a precise time-
frequency representation of signals [29]–[31]. Methods, such
as [32]–[36], distinguish essential ECG components from
high-frequency noise and facilitate selective noise removal,
preserving the integrity of the original ECG waveform. While
their computational efficiency makes them suitable for real-
time clinical applications, their success heavily relies on the
correct choice of wavelet and its parameters, and incorrect
tuning can lead to ECG distortion. Moreover, in the face of
heavy noise, wavelets might not be fully effective.
Model-based techniques have emerged as a notable al-
ternative for ECG denoising. Central to this approach is
the utilization of state space (SS) models, complemented
by various variations of the Kalman filter (KF) [37], as
in [38]–[42]. Local approximations of the ECG waveforms
have been explored through windowed SS models in [43]–
[45], and via autoregressive models [46], [47]. Nevertheless,
these approximations can sometimes fall short of capturing
the intricate intra-heartbeat dynamics and the inherent quasi-
periodicity between consecutive heartbeats, posing challenges
for optimal denoising.
A fundamental principle in numerous ECG denoising
methodologies is to model the signal’s complex evolution
by capitalizing on its quasi-periodicity. This principle is
subsequently utilized as a prior belief in Bayesian filtering
techniques [48], such as the KF [38]. For instance, [40]
omits intra-heartbeat variations and chooses instead to rep-
resent the evolution of consecutive heartbeats with an identity
function. This approach is grounded in the premise that, in
the absence of arrhythmia, two successive heartbeats, when
centered around the R-peak, closely resemble each other. In
essence, this method is equivalent to a weighted average of
multiple heartbeats. While this strategy enhances the Signal
to Noise Ratio (SNR), it also runs the risk of obscuring
vital physiological dynamics. The research presented in [41]
incorporates the expectation-maximization (EM) algorithm to
determine the evolution function. When combined with a bank
of KFs, this method efficiently targets both high and low-
frequency noise. However, its application is limited by its
reliance on a linear prior function, its constraint to filter signals
of a fixed length, and its omission of the ECG’s periodic
information. In contrast, [49] introduces non-linear priors us-
ing partial differential equation models [38]. While innovative,
these models frequently face challenges in capturing patient-
specific variations. To address this issue, [42] attempts to
automatically fit model parameters using a least squares (LS)
optimizer, based on several pre-recorded heartbeats.
In this work, we propose a hierarchical Kalman filter
(HKF) for ECG denoising, which enhances signal quality
without obscuring dynamic changes potentially linked to
pertinent pathophysiology. Our HKF is designed based on
our innovative hierarchical SS model, which describes the
ECG signal dynamics both within individual heartbeats and
across consecutive heartbeats. Specifically, HKF consists of an
online learned structured evolution prior for a single heartbeat;
a Rauch-Tung-Striebel (RTS) intra-heartbeat smoother [50]
that harnesses this prior; and an inter-heartbeat KF [37] for
denoising spanning multiple heartbeats. The online warm-up
phase is meticulously designed to tackle challenges such as the
highly patient-specific heartbeat shape and substantial noise
variation resulting from myriad factors, ranging from equip-
ment intricacies to room temperature variations. While one
might conceptualize a typical ECG signal shape, the reality is
that there’s considerable inter-variability among patients. Not
only can the signal shape vary significantly between patients,
but there’s also intra-patient variability; the placement and ori-
entation of electrodes can introduce alterations in the observed
waveform. This renders the task of crafting a universal prior
quite challenging. Crucially, HKF doesn’t require supervised
pre-training and is inherently patient-adaptive, due to its online
covariance estimation and its learned structured evolution
prior. Yet, it preserves the transparent and interpretable nature
of the KF.
Our experimental study shows that the proposed HKF effec-
tively denoises ECG signals, even in challenging setups, while
retaining the subtle, clinically valuable structures within the
signals. These attributes make it especially suited for medical
and healthcare applications where a high degree of confidence
and reliability is essential.
The remainder of this paper is organized as follows: Sec-
tion II formulates the task and introduces the hierarchical SS
model. Section III delves into the details of the proposed
HKF. Section IV elaborates on parameter estimation. Finally,
Section Vpresents our empirical study, demonstrating that the
HKF surpasses both model-based and data-driven benchmarks.
2