Abstract
Arrhythmia classification on Electrocardiogram (ECG) signals is an important process for the diagnosis of cardiac disease and arrhythmia disease. The existing researches in arrhythmia classification have limitations of imbalance data problem and overfitting in classification. This research applies Fuzzy C-Means (FCM) – Enhanced Tolerance-based Intuitionistic Fuzzy Rough Set Theory (ETIFRST) for feature selection in arrhythmia classification. The selected features from FCM-ETIFRST were applied to the Multi-class Support Vector Machine (MSVM) for arrhythmia classification. The ResNet18 – Convolution Neural Network (CNN) was applied for feature extraction in input signal to overcome imbalance data problem. Conventional feature extraction along with CNN features are applied for FCM-ETIFRST feature selection process. The FCM-ETIFRST method in arrhythmia classification is evaluated on MIT-BIH and CPCS 2018 dataset. The FCM-ETIFRST has 98.95% accuracy and Focal loss-CNN has 98.66% accuracy on MIT-BIH dataset. The FCM-ETIFRST method has 98.45% accuracy and Explainable Deep learning Model (XDM) method have 93.6% accuracy on CPCS 2018 dataset.
Keywords
Introduction
Arrhythmia classification using Electrocardiogram (ECG) is an interesting research topic for researchers and this plays a vital role in the diagnosis and early prevention of cardiovascular diseases. Feature extraction and selection are critical processes in ECG signal classification [1]. ECG is a low cost and commonly used diagnostic tool in healthcare institutes for heart electrical signals screening. Arrhythmia can be defined as abnormal heart signals and Cardiac arrhythmia, which can be dangerous or in most cases, it can cause death. Arrhythmia consists of different types and this can be detected using an ECG test. Various researchers develops the model of automatic screening of arrhythmia classification using ECG signals. Automatic classification tools of arrhythmia using ECG signal benefits patients and also for a doctor to assist in the decision of treatment [2]. Early and accurate classification of cardiac arrhythmia in ECG signal prevent many premature deaths. Cardiac arrhythmia causes due to electrical impulses of improper conduction throughout the heart [3]. Multi-channel ECG signal continuous recording and analysis are necessary for accurate diagnosis of arrhythmia. Large amount of data is required to be processed and analyzed for arrhythmia classification. Several medications are prescribed based on the arrhythmia type of the patient. Rapid treatment is not required for some mild arrhythmia and others can be treated with medication. Treatments such as surgery, cardiac defibrillation, and pacemakers are required if a case is serious [4, 5].
Daily cardiac health monitoring is carried out based on a single-lead wearable ECG device including Atrial Fibrillation (AF) for automatic abnormal heart rhythms detection. Recently, end-to-end deep learning techniques such as Recurrent Neural Networks (RNNs), Convolutional Neural Networks (CNNs) are widely applied to meet the needs of automated feature selection and ECG signals classification [6]. Automatic cardiac disease classification is carried out based on denoised ECG signals. Feature extraction methods such as Hermit function coefficients and temporal features, deep learning networks, Wavelet-based kernel Principle Component Analysis, Wavelet Packet Decomposition, Principle Component Analysis, temporal and morphological features, and Discrete Wavelet Transform were used in existing methods [7]. Various methods of ECG signal classification were carried out based on extracted features. Some of the classifiers are Optimized block-based Neural Network, Naïve Bayes, Support Vector Machine-Radial Basis Function, Support Vector Machine (SVM), Probabilistic Neural Network, Random Forest (RF), and Back Propagation Neural Network (BPNN) were applied for arrhythmia classification [8, 9, 10]. Existing methods in arrhythmia classification have limitations of overfitting problem and imbalance data problem that degrades the efficiency of the model. The objectives and contribution of this research are discussed as follows:
The FCM-ETIFRST method is proposed for the feature selection process in arrhythmia classification to avoid overfitting and imbalance data problems. The FCM-ETIFRST method selects unique features related to the class that helps to reduce the correlation of features between class and solves imbalance data problem. The Non-membership function and hesitance degree are added in fuzzy technique to control the feature selection for classification. The Non-membership function provides the probability of the features that are not selected and hesitance degree provides the similarity of features between classes. This process helps to select unique feature sets from the signal for effective classification. Tolerance Intuitionistic in the proposed method measures the similarity in Fuzzy Rough Set and helps to select the relevant features. Tolerance Intuitionistic helps to adaptively select the features from input signal and solves imbalance data problem. The ability of the FCM-ETIFRST method helps to reduce the number of features to solve the overfitting problem in the network. The ResNet18-CNN model helps to select more relevant features and the proposed FCM-ETIFRST model helps to select unique features for classification. The FCM-ETIFRST method provides higher efficiency than existing methods in arrhythmia classification. The FCM-ETIFRST method is compared with existing methods in two datasets of MIT-BIH and CPCS 2018 dataset. The FCM-ETIFRST-MSVM has 98.95% accuracy, 99.49% sensitivity, and Focal Loss-CNN model has 98.66% accuracy and 76.71% sensitivity.
The organization of the paper is given as follows: Recent research in Arrhythmia classification was reviewed in Section 2 and ResNet18 and ETIFRST models are explained in Section 3. The implementation details are given in Section 4 and the Result is discussed in Section 5. The conclusion of this research work is given in Section 6.
Arrhythmia is a cardiovascular disease that is a serious threat to human health and ECG signal were used to classify arrhythmia disease. Arrhythmia classification performed in recent research was reviewed in this section.
The MIT-BIH and CPSC 2018 datasets were common datasets used by existing research for the evaluation of the models. One part of the literature review is related to the MIT-BIH dataset of existing methods and the other part is CPSC 2018 dataset.
Gnecchi et al. [11] applied quadratic wavelets of wavelet transform and Probabilistic Neural Network (PNN) for arrhythmia classification. The 17 ECG recordings from MIT-BIH were used for the classification process and test the performance. The resulting patterns were extracted from peaks and wavelets, then applied to PNN for arrhythmia classification. Mathunjwa et al. [12] applied VGG-16, VGG-19, and AlexNet models with ReLU activation function for arrhythmia classification. The model performs a selection of the optimal number of parameters in the deep learning models. The models were evaluated with training and validation data in classification. Yang and Wei [13] extracted the morphological features of ECG signals such as duration, interval, and amplitude for arrhythmia classification. The clustering based feature extraction is applied based on the QRS complex morphology. The KNN, SVM and Neural Network were used as a classifier for automatic diagnosis of arrhythmia classification.
Lu et al. [14] proposed the Focal Loss with CNN model to solve the imbalance data problem in arrhythmia classification. Focal loss can train on a small dataset and performs on imbalanced dataset. Focal loss reduces the number of parameters in the CNN for arrhythmia classification. Weight value was assigned to the minority class in the CNN model for solving the imbalance data problem. Meng et al. [15] applied the CNN model with convolutional, pooling and fully connected layers for the classification of arrhythmia. The MIT-BIH dataset was used to evaluate the model performance in arrhythmia. Ullah et al. [16] applied a 1D-CNN model with a fully connected layer, two down sampling layers, two convolutional layers for arrhythmia classification. The 1D-CNN model was applied for feature extraction and the 2D-CNN model was applied for the classification process. The wavelet threshold method was applied to eliminate the noise from the signal. A gradient descent-based optimizer algorithm with a learning rate was applied for the classification process.
Essa et al. [17] applied the CNN-LSTM model to capture the temporal dynamics and local features in ECG data for arrhythmia classification. The higher-order statistics and RR intervals integrate the classical features to highlight abnormal heartbeats classes. Different sub-sampling datasets of highly imbalance distribution were used to create a bagging model for arrhythmia classification. Weighted loss function of each model was used to provide high weight for the model. The bagging model predictions of a feedforward fully connected network were used as meta-classifier. Yan and Zhang [18] applied a hybrid model to learn the deep-seated essential features and temporal correlation for arrhythmia classification. The bandstop filter and median filter were used to pre-process the ECG signal. Feature extraction and classification were integrated into one to avoid bias in the feature extraction process.
The recent research on arrhythmia classification uses the CPCS 2018 dataset for evaluation was reviewed below.
Tutuko et al. [19] applied CNN based end-to-end implementation for arrhythmia classification. A single learning system was used in the model that didn’t consider frequency sampling and signals lengths. Cloud-based deep learning model was used to perform classification of Atrial Fibrillation detection. The 1-D CNN model was used to perform the classification of arrhythmia in the input signal. Jo et al. [20] applied six deep learning models for feature extraction and an ensemble model for the classification of arrhythmia. The explainable Deep learning Model (XDM) was an ensemble tree Neural Network based model applied for the classification process. The developed model shows considerable performance in the classification of arrhythmia. Yoo et al. [21] applied CNN end-to-end model and the attention map was fine tuned to resemble Ground Truth label attention map. A regularization loss function was measured between the averaged map and attention map to fine tune based on objective function. Multi-loss optimization method was applied to explainability and multi-label subset accuracy to find unique features of the model. Li et al. [22] applied multi-label feature selection method and kernelized fuzzy rough sets to optimize ECG feature space and optimal feature subset. The model explores the correlation between the feature map and arrhythmia disease based on sparity constraints.
From the review of existing techniques, common limitations in existing arrhythmia classification techniques are imbalance data problem and overfitting problem. The imbalance data problem is some classes consists of less data instances and model requires more training data for effective classification. The overfitting problem is caused mostly in CNN based models due to generation of more features in the feature extraction process.
Proposed method
The MIT-BIH and CPCS 2018 dataset signals were used for the evaluation of the FCM-ETIFRST method in arrhythmia classification. Sliding window technique is used to segment the signal into 5 s window. The conventional feature extraction method and ResNet18-CNN feature extraction were applied. The FCM-ETIFRST model is used to select the features from extracted features from the model. The selected features are applied to MSVM to perform arrhythmia classification. The flow of FCM-ETIFRST-MSVM model in arrhythmia classification is shown in Fig. 1.
Fuzzy C-Means (FCM) – Enhanced Tolerance-based Intuitionistic Fuzzy Rough Set Theory (ETIFRST) technique in arrhythmia classification.
CNN model applied with extra layers instead of feature selectin stage involves in creating overfitting problem due to the generation of more features in the feature extraction. Dropout technique can be applied to reduce overfitting problem and dropout technique randomly discard the features that doesn’t guarantee higher efficiency in classification. Feature Selection technique adaptively learns the features and select the relevant features for the classification that helps to improve the efficiency of the classification. This is better to apply the feature selection instead of applying extract layers in CNN for feature extraction due to adaptive learning of features.
Two common deep learning techniques for feature extraction are CNN based models and LSTM based models. CNN based models apply convolution filters in the signal to exploit spatial correlation and this is much suitable for audio and images than LSTM models. LSTM based models are effective in analysis the sequence of data and lower efficiency in exploit the spatial correlation compared to CNN.
CNN based and conventional feature extraction methods were applied to extract the features from input signals.
Conventional feature extraction method
where the signal is denoted using
where epoch last sample is denoted as
where epoch last sample is denoted as
where indicator function is denoted as
Two-dimensional signals are handled using Convolutional Neural Network (CNN) which is based on deep learning techniques and a recent neural network. Fully Connected Layers (FCLs), Pooling Layers (PLs), and Convolutional Layers (CLs) are present in the CNN model. CNN model shows significant performance improvement than traditional AI methods such as naive Bayesian classifier, Decision Tree (DT), and Support Vector Machine (SVM). CNNs have the advantage of learning features from data during training and significantly reduces the time required for feature design engineering i.e., to distinguish features/biomarkers [23, 24, 25].
CNN most important procedure is convolution and CL is the most important layer that consists of 2D convolution process of input and forward pass the kernels. The weights of each CL kernels are randomly initialized and updated at each iteration using network training of loss function. The kernel final learnt detect some types of patterns in input signals.
Three steps of CNN models are convolution, stack and Non-Linear Activation Function (NLAF). Consider
where convolution operation is denoted as
where total number of filters are denoted as
where activation map channels, width, and height size are denoted as three variables of (
Padding is denoted as
where floor function is denoted as
where activation map
The ReLU main advantage is its improved gradient propagation, i.e., compared to
Convolutional neural network architecture.
The FCM-ETIFRST feature selection method is applied to extracted features to select relevant features for arrhythmia classification.
Fuzzy C-Means
Feature selection is based on Fuzzy C-Means clustering technique [26, 27, 28]. The FCM method has the advantage of producing better results for overlapped datasets than the k-means method. The data point is assigned membership to every cluster to form the appropriate cluster. The data point is applied with membership based on the distance of the data point and cluster centre
The estimated membership is increased for every iteration. An enhanced fuzzy C-means algorithm is implemented in the proposed method. The algorithm is given as follows:
Step 1: Clusters number
Step 2: Signal
where
Step 3: Signal matrix
The
Step 4: Prototype
Convergence criteria are measured as
Enhanced fuzzy rough set feature selection involves in applying non-membership function and hesitance degree to select unique set of features from the signal. The non-membership function helps to select the features to distinguish between the classes and hesitance degree helps to differentiate between more similar classes. This technique helps to solve imbalance data problem and overfitting problem by selecting unique features for classification.
The FCM has a wide range of applications and is significantly applied in numerous fuzzy clustering methods. Every sample point of membership degree to the class centre is used as objective function optimization.
Euclidean distance is applied for the objective function of sample point and clustering centre. Solving non-similarity index value function is the minimum value of every clustering centre. The generalization is given in Eq. (18).
where weighted index number is denoted as
The FCM output are centres
Fuzzy rough set of lower approximation is given in Eq. (21).
where object
Each pixel membership degree is normalized and initialized as in Eq. (22).
where signal value of minimum and maximum intensity is denoted as
Non-membership value is present in the new method in uncertainty presence. A high grade of certainty is measured based on observations when membership value is near to 0 or 1 and membership value is near to 0.5 for high grade of uncertainty. Non-membership value is measured in Eq. (23).
where (
Every feature of fuzzy clustering gets every object membership degree, as given in Eq. (24).
Model equality is based on equivalence relations; approximate similarity or equality is measured based on fuzzy equivalence. Fuzzy equivalence relation
The size of the opening on one side is denoted as value
Support Vector Machine (SVM) is a supervised learning machine for the classification process [29, 30]. SVM is viewed as binary classifier and data of decision boundary is used to classify patterns of two alike classes. A distance between optimal separating hyperplane is denoted as margin
SVM method is based on input data that is linearly non-separable projection to high-dimensional feature space and select an optimal hyperplane in feature space of maximum possible margin. Training set labelled as
where class label
where Lagrangian multiplier is denoted as
Linear Kernel: Polynomial: Gaussian Radial Basis function: Sigmoid:
SVM classifier solution for an unknown feature vector
An SVM model is successfully applied for multi-class classification based on binary SVMs group. The One against one method performs pairwise classification based on
A
Dataset used to evaluate FCM-ETIFRST-MSVM model for arrhythmia classification and parameter settings of the model were described in this section.
Sample signal from MIT-BIH dataset.
Lead I sample signal.
Lead II sample signal.
The FCM-ETIFRST method is compared with various classifiers and feature selection techniques on MIT-BIH dataset. The FCM-ETIFRST also evaluated with various k-fold validation for arrhythmia classification.
Fuzzy C-Means (FCM) – Enhanced Tolerance-based Intuitionistic Fuzzy Rough Set Theory (ETIFRST) performance on various classifiers
Fuzzy C-Means (FCM) – Enhanced Tolerance-based Intuitionistic Fuzzy Rough Set Theory (ETIFRST) performance on various classifiers
Accuracy and sensitivity for various classifiers.
Figure 6 and Table 1 show the performance of the FCM-ETIFRST method for various classifiers such as Multi-Class Support Vector Machine (MSVM), Deep Neural Network (DNN), Decision Tree (DT), Random Forest (RF), and K-Nearest Neighbours (KNN). The DNN and RF have lower efficiency due to overfitting problem in arrhythmia classification. The DE has imbalance data problem and KNN has outlier sensitivity that degrades the performance. MSVM has the advantage of handling high-dimensional data and CNN extracted features helps to overcome imbalance data problem. The FCM-ETIFRST method selects the unique features to represents the class that solves the problem of imbalance and overfitting. Modified Non-membership function helps to provide exploitation and modified membership function performs exploration. Similarity measure helps to maintain the exploration and exploitation of the feature selection process. MSVM shows a higher sensitivity than other classifiers due to its efficiency in handling the features.
Feature selection method comparison
Accuracy and sensitivity of various feature selection methods.
Figure 7 and Table 2 show various feature selection models such as Whale Optimization Algorithm (WOA), Particle Swarm Optimization (PSO), Ant Colony Optimization (ACO), and Grey Wolf Optimization (GWO) were applied for arrhythmia classification. Modified Non-membership function increases the exploration and Modified membership function increases the exploitation in feature selection. Similarity measure in FCM-ETIFRST method balance the exploration-exploitation and overcome trap into local optima. The existing feature selection methods have either struck into local or global features in the feature selection process. The existing feature selection methods have lower sensitivity due to their poor convergence in feature selection. The FCM-ETIFRST method has a sensitivity of 99.49% and ACO method has a 92.03% sensitivity. The FCM-ETIFRST shows significant improvement in the sensitivity measure.
K-fold validation performance analysis
K-fold validation comparison.
The FCM-ETIFRST-MSVM model is evaluated for various k-fold validation in arrhythmia classification, as given in Fig. 8 and Table 3. The FCM-ETIFRST method shows higher performance in 4-fold to 10-fold cross validation. The performance reduction in 7-fold to 10-fold validation is due to reduces in the number of training data. The FCM-ETIFRST-MSVM model has 98.97% accuracy in 5-fold cross validation and 98.67% accuracy in 10-fold cross validation.
Feature selection and classifier performance on MIT-BIH dataset
The classifier with various feature selection performance is evaluated and compared with other techniques, as shown in Table 4. This shows that FCM-ETIFRST-MSVM feature selection and MSVM classifier shows higher performance in the arrhythmia classification than existing methods. The FCM-ETIFRST-MSVM model has advantage of selects unique set of features due to non-membership function and hesitance degree. The DNN model has overfitting problem and RF has instable performance in the arrhythmia classification.
Feature Extraction performance on MIT-BIH dataset for arrhythmia classification
Comparative analysis on MIT-BIH dataset
Area Under the Curve of proposed model on MIT-BIH dataset.
Comparative analysis of MIT-BIH dataset.
Various deep learning techniques are applied as feature extraction and compared with proposed ResNet18 model, as shown in Table 5. The FCM-ETIFRST-MSVM and ResNet18 model has higher performance compared with existing methods. The ResNet18 model architecture extracts the relevant features from the input signal. Existing deep learning models have overfitting problem that degrades the performance of the classification.
The Area Under the Curve (AUC) of FCM-ETIFRST-MSVM and ResNet18 for arrhythmia classification on MIT-BIH dataset is shown in Fig. 9. The point on the Receiver Operating Characteristic (ROC) curve corresponds to have an equal probability of miss-classifying a positive or negative sample. From Fig. 9, it clearlyt shows that the FCM-ETIFRST-MSVM and ResNet18 model shows 0.9895 AUC for the classification which indicates that TPR of the model significantly increases for the arrhythmia classification.
The FCM-ETIFRST-MSVM method is compared with existing research models in arrhythmia classification in MIT-BIH and CPCS 2018 dataset. The FCM-ETIFRST-MSVM method and existing methods were compared on the MIT-BIH dataset in Fig. 10 and Table 6. This shows FCM-ETIFRST-MSVM model has higher efficiency due to its modified non-membership and membership function providing efficient exploration-exploitation in feature selection. The existing models have imbalance data problems and overfitting problems in the classification performance.
Comparative analysis on CPCS 2018 dataset
Comparative analysis on CPCS 2018 dataset
The FCM-ETIFRST-MSVM and existing models were evaluated on CPCS 2018 dataset, as given in Fig. 11 and Table 7. The existing models have limitations of overfitting that degrades the performance of arrhythmia classification. The FCM-ETIFRST-MSVM model effectively balances the exploration-exploitation that improves the efficiency of classification. The Proposed method has achieved the accuracy of 98.45%, CNN [21] model has 68.9% accuracy, and XDM [20] model has 93.6% accuracy.
Comparative analysis of CPCS 2018 dataset.
Diagnosis of arrhythmia is mainly based on ECG signals due to its capacity to analyze the abnormal activity in the heartbeat. The existing researches in arrhythmia classification have limitations of imbalance and overfitting problem. This research applies FCM-ETIFRST-MSVM model to improve arrhythmia classification efficiency and overcome the overfitting problem. The modified Non-membership function improves the exploitation efficiency and the modified membership function improves exploration efficiency in the feature selection process. The MSVM model has higher efficiency in classification than DNN due to its capacity to handle high-dimensional data. The FCM-ETIFRST model has higher efficiency in feature selection due to its advantage in balancing the exploration-exploitation in feature selection which helps to improve convergence. The FCM-ETIFRST-MSVM model has higher efficiency in arrhythmia classification on MIT-BIH and CPCS 2018 dataset. The FCM-ETIFRST-MSVM model has 99.49% sensitivity and CNN has 98.83% sensitivity on the MIT-BIH dataset. The FCM-ETIFRST-MSVM has 99.36% sensitivity and XDM has 79.3% sensitivity on arrhythmia classification on CPCS 2018 dataset. The future work of the FCM-ETIFRST-MSVM model involves in improving the model to train in small training dataset.
Funding
This research received no external funding.
Data availability
The datasets generated during and/or analysed during the current study are available in the MIT-BIH/CPSC 2018 repository, https://archive.physionet.org/cgi-bin/atm/ATM, http://2018.icbeb.org/Challenge.html.
Footnotes
Conflict of interest
The authors declare that they have no conflict of interest.
Author’s Bios
