2. Combining within the same CNN model the dual tasks of predicting parts of the ECG signal (re-
gression) and detection of cardiac arrhythmia (classification) —which use unlabeled and labeled
data, respectively, for training— may improve model performance without increasing model com-
plexity inordinately.
The paper is structured as follows. Section 1.1 presents the related work. Section 2 introduces materials
and methods, and Section 3 summarizes and discusses the results of the conducted experiments. Section 4
concludes the paper.
1.1 Related work
Classification of ECG signals The process of classification of ECG signals is traditionally split into
feature extraction and classification steps. The feature vectors obtained in the feature extraction stage
are fed into classifiers. In automated ECG classification, features such as morphology and intervals of
specific waves on the ECG signal have been widely used in the past. In [57], QRS wave width, amplitude,
and offset, T-wave slope, and prematurity features were obtained for each beat, and the classification was
made using a neural network. In [43], the authors proposed a detection approach using morphologic fea-
tures, such as QRS wave onset/offset and T-wave offset, heartbeat intervals, e.g., RR intervals, features for
automated ECG beat classification using linear discriminant-based classifier models. Features may also
be transformed using methods like Fourier and wavelet transformation prior to classification. In [18], the
authors obtained distinctive features for ECG signals using the Fourier and wavelet transform approach.
With a hybrid neural network model, they reported 96% accuracy in ten-class ECG beat recognition us-
ing these transformed features. In another study [60] that used discrete wavelet transform, the features
obtained from the coefficients of different wavelet levels were classified using extreme learning machines
architecture. In [20], the authors used auto-regressive model coefficients, third-order cumulants, and dis-
crete wavelet transform approaches to extract ECG signal feature vectors, which were classified using
fuzzy-hybrid neural networks. Various statistical methods can be used for feature extraction, including
principal component analysis, linear discriminant analysis, independent component analysis, higher order
statistic, and transformation methods. Martis et al. [36] used principal component, linear discriminant,
and independent component analyses in the feature extraction step of their ECG beat classification model.
The authors in [42] performed ECG beat classification by feeding second, third and fourth order features
to a hybrid fuzzy neural network with higher order statistic. Acharya et al. [2] extracted higher order
statistic bispectrum and cumulant features from ECG signals to classify coronary artery disease using a
k-nearest neighbor (KNN) classifier. As some feature extraction methods can generate large-sized feature
vectors, various approaches can be used to select the most distinctive features to reduce the dimension-
ality. In [45], extracted features were selected using genetic algorithm, which were then fed to a support
vector machine (SVM)-based classifier. In [35], a sequential forward floating search was used to select
features, which were then classified using the multilayer perceptron approach.
Several authors have used deep models for analysis of physiological signals, including for ECG ar-
rhythmia classification [21, 40, 25], and the number of publications on deep learning models has in-
creased significantly in the last few years [21]. In [1], the authors developed a CNN to classify normal
and arrhythmia ECG segments that did not require QRS detection. The 12-layer CNN model was also
used by [55] to classify the five micro-classes of heartbeat types. In [61], the authors proposed a one-
dimensional (1D)-CNN to process long-duration ECG signal fragments, which was computationally effi-
cient and highly accurate. The authors of [6] used a CNN to classify myocardial infarction using 12-lead
ECG signals, and achieved 99% accuracy. Long short-term memory (LSTM) network was used by [22]
to detect atrial fibrillation based on heart rate signals. The authors partitioned the data with a sliding
window of 100 beats and used the resulting signal segments to train and evaluate the network. In [58],
the authors used a convolutional auto-encoder to reduce the ECG signal size and a LSTM network to
3