Abstract
Background
Deep neural networks (DNNs) have recently been significantly applied to automatic arrhythmia classification. However, their classification accuracy still has room for improvement.
Objectives
The aim of this study is to address the existing limitations in current models by developing a more effective approach for automatic arrhythmia classification. The specific objectives include enhancing the receptive field sizes to capture more detailed information across various temporal scales, and incorporating inter-channel correlations to improve the feature extraction process.
Methods
This study proposes a pyramidal dense connectivity layer and bidirectional long short-term memory network (PDC-BiLSTM) to effectively extract waveform features across various temporal scales, which can capture the intricate details and the broader global information in the signals through a wide range of sensory fields. The efficient channel attention (ECA) is additionally introduced to dynamically allocate weights to each feature channel, assisting the model inefficiently prioritizing essential characteristics during the training process.
Results
The experimental results on the MIT-BIH arrhythmia database showed that the overall classification accuracy of the proposed method under the intra-patient paradigm reached 99.82%, and the positive predictive value, sensitivity and F1 Score were 99.64%, 97.61% and 98.60% respectively; under the inter-patient paradigm, the overall accuracy was 96.30%.
Conclusion
Compared with the latest research results in this field, the proposed model is also better than the existing models in terms of accuracy, which has the potential value of being applied to devices that assist in diagnosing cardiovascular diseases.
Keywords
Introduction
According to the World Health Organization statistics, more than 17 million people die yearly from cardiovascular diseases. 1 Arrhythmia, a pervasive cardiovascular disorder, is characterized by its nuanced nature, intricate mechanisms, and abrupt onset. This ailment manifests as irregular heartbeats or alterations in rhythm, and in severe instances, can lead to cardiac arrest, stroke, or other consequential complications. The timely detection of cardiac illnesses is of utmost importance, as it is crucial for individuals to promptly identify abnormal heart rhythms in order to gain insight into their cardiac health and maybe prevent fatalities among patients.2, 3
The Electrocardiogram (ECG) is the most potent tool for non-invasive diagnosis of cardiovascular conditions. It carries a wealth of data, reflecting the intricacies of heart activity and providing practical, detailed insights. Conventionally, the interpretation and classification of heartbeats have depended on individual clinical expertise and theoretical understanding, a process that is labor-intensive and places a substantial burden on healthcare resources. 4 Moreover, this approach can be influenced by the subjective judgment of cardiologists. Consequently, developing a precise, real-time, AI-powered method for heartbeat classification, capable of autonomous learning from historical data, has emerged as a crucial area of research. 5
With the advancement of artificial intelligence technology, machine learning and deep learning algorithms in arrhythmia classification are becoming increasingly widespread. Machine learning-based arrhythmia classification typically involves two main steps: feature extraction and classifier training. For example, Mondéjar et al. 6 presented an arrhythmia classification model combining multiple support vector machines to extract features from ECG signals. Jung et al. 7 enhanced the representative features of the original ECG signal by combining the coefficients obtained from principal component analysis (PCA) and linear discriminant analysis (LDA). They applied the weighted k-nearest neighbor (WKNN) algorithm alongside a fitness rule to classify the ECG data. Rahul et al. 8 proposed an improved arrhythmia classification method based on RR interval. Wan et al. 9 devised a method for atrial fibrillation detection that combines ensemble learning and multi-feature discrimination. This approach integrates multi-domain features and constructs a robust BSK-Model classifier by optimizing multiple single machine learning classifiers. The method underwent testing on the AFDB dataset, achieving high specificity and accuracy rates of up to 99%. Automatic classification using machine learning enhances diagnostic accuracy compared to manual classification. However, these methods still rely on researchers manually selecting features and have limited self-learning capabilities, thus exhibiting a high dependence on expert knowledge.
Deep learning methods have recently been successfully applied in extracting electrocardiogram information for arrhythmia classification and have demonstrated strong capabilities. For example, Hannun A et al. 10 constructed a DNN with 33 convolutional layers to categorize 12 rhythms. They used a dataset of 91,232 single-lead electrocardiograms from 53,549 individuals collected using a single-lead ambulatory ECG monitoring device. Yang et al. 11 combined convolutional neural networks (CNN) and long short-term memory (LSTM) networks to automatically identify normal beats, ventricular premature beats, atrial premature beats, and unidentified beats using a multi-input structure for feature extraction and merging of large and small-scale heartbeats. Gao et al. 12 introduced the attention mechanism to optimize the effect of neural networks in processing time-series signals and, through the comparative analysis of the experimental results, demonstrated that the attention mechanism could significantly enhance the classification performance of neural networks. Zhan et al. 13 proposed an arrhythmia classification method based on a densely connected convolutional network (DenseNet) and an efficient channel attention (ECA) mechanism, which successfully classified six different types of beats. Zhao et al. 14 merged ResNet's frequency feature extraction capability with TCN's time domain analysis to introduce a TCN-ResNet model for efficiently classifying atrial fibrillation in single-lead electrocardiograms. The experiments demonstrated a high accuracy of 97% and an F1 score of 87%.
The aforementioned deep learning methods have demonstrated promising results in arrhythmia detection, yet certain challenges persist. These include the need for intricate convolutional structures in algorithms and the limitations of standard CNNs with fixed receptive field sizes, which can hinder effective feature capture. Additionally, traditional convolution techniques often overlook inter-channel connections, leading to potential redundancy and inefficiencies in feature extraction. To address these challenges, this study presents a pyramidal dense connectivity layer and bidirectional long short-term memory network (PDC-BiLSTM) model for accurate classification of arrhythmias, which combines a densely connected network based on the pyramidal convolutional layer (PC-DenseNet) and a bidirectional long short-term memory network (BiLSTM). The key innovation lies in using PC-DenseNet to enhance the receptive field and extract multi-scale deep features from ECG data. Additionally, the model incorporates ECA for adaptive channel weighting, reducing redundant information transmission. Furthermore, by integrating the BiLSTM model, temporal characteristics of ECG data are effectively captured.
In summary, the primary contributions of this study can be outlined as follows:
We innovatively constructed the PC-DenseNet, where the PC layer expands the scope of sensory fields. This architectural design enables the extraction of multi-scale features directly from raw ECG data while facilitating feature transfer and reuse through dense connectivity. Such characteristics endow our work with a unique advantage in ECG signal detection. Another primary strength is our implementation of the ECA method, which dynamically assigns weights to each channel. This approach effectively suppresses the propagation of redundant information, thereby enhancing our model's discriminative capability. Lastly, a notable aspect of our research is the superior performance exhibited by our model compared to state-of-the-art methods. The experimental results confirm the effectiveness and efficiency of the proposed method, which outperformed prior research findings across various metrics, including accuracy, sensitivity, positive predictive value, and F1 Score.
The paper is structured as follows: Section 2 explains the method, Section 3 presents the experiment and results, Section 4 describes the discussion, and Section 5 concludes the paper.
Method
The structure of the arrhythmia classification algorithm proposed in this paper is shown in Figure 1. The initial step involves performing wavelet denoising and heartbeat segmentation on the ECG signals affected by noise. Subsequently, the heartbeats are normalized, and the Borderline-SMOTE algorithm is employed to balance the data. Lastly, the PDC-BiLSTM network is developed to enable the automatic classification of arrhythmia using ECG signals.

Flow chart of arrhythmia classification process.
The MIT-BIH arrhythmia database, a collaboration between the Massachusetts Institute of Technology and Beth Israel Hospital, 15 is currently recognized as the most extensively utilized in evaluating and validating arrhythmia classification algorithms. It comprises 48 dual-channel half-hour dynamic ECG recordings obtained from 47 patients(25 men aged 32 to 89 years and 22 women aged 23 to 89 years). The recordings comprised MLII (Modified Lead II) and V5 lead signals, both of which were sampled at a fixed frequency of 360 Hz. The data from the MIT-BIH arrhythmia database were categorized into five groups, as shown in Table 1, following the guidelines outlined in the ANSI/AAMI EC57: 2012 (AAMI) standard. 16 These categories include normal beat (N), supraventricular premature beat (S or SVEB), ventricular ectopic beat (V or VEB), fusion beat (F), and unknown beat (Q). Notably, the four data points generated by pacemakers (102, 104, 107, 217) were excluded from the analysis, and category Q should have been excluded during the experimental phase in this work 17 since it has no application value. This paper explores experiments conducted under two paradigms: intra-patient and inter-patient. Under the intra-patient paradigm, the dataset is randomly divided into training and test sets at a ratio of 7:3. Conversely, under the inter-patient paradigm, reflecting the variability between different patients, all records are partitioned into training set DS1 and test set DS2, as illustrated in Table 2.
Classification of heartbeats according to AAMI.
Classification of heartbeats according to AAMI.
Segmentation of the dataset under the inter-patient.
Wavelet denoising and heartbeat segmentation
In this paper, the discrete wavelet transform (DWT) is employed due to its effectiveness in analyzing non-stationary signals. The db6 wavelet basis function is utilized for denoising the data, involving the decomposition of ECG signals into nine levels. The denoised and smoothed ECG signal is obtained through the inverse wavelet transform from the third to the ninth level detail sub-bands. 18 Figure 2(a) and Figure 2(b) represent ECG signals before and after denoising, respectively. Secondly, each consecutive ECG signal was segmented into heartbeats based on the annotated R peak position in the MIT-BIH arrhythmia database, and each processed heartbeat was standardized into a data segment consisting of 300 sample points to maintain input uniformity. 19 Finally, the segmented heartbeats were Z-Score normalized to unify the signal scale and eliminate the offset effect. According to the AAMI standard, the corresponding label was annotated for each heartbeat, and Table 3 displays the number of heartbeats for each category in both intra-patient and inter-patient paradigms.

ECG signals before and after denoising(109 record): (a) ECG signal before denoising, (b) ECG signal after denoising.
The number of samples in both intra-patient and inter-patient paradigms.
The traditional SMOTE
20
algorithm enhances the intra-class aggregation of minority class samples by generating them based on nearest neighbor characteristics. However, it overlooks the distribution characteristics of majority class samples, potentially leading to duplication between classes and decreased recognition accuracy. In contrast, the Borderline-SMOTE
21
algorithm addresses these limitations by selecting the most representative boundary samples to generate new samples. This approach helps overcome overfitting and intra-class confusion, thereby improving the accuracy of the classification results.
22
To amplify the rare S and F classes in the database, the new samples are synthesized by first calculating the Euclidean distance among the K samples closest to the minority class sample Xi:
The proposed model PDC-BiLSTM in this study is divided into three main parts: the feature extractor PC-DenseNet, the ECA, and the temporal information extractor BiLSTM. Initially, it extracts and fuses features from diverse perspectives, enhancing the learning process with multiple DenseNet layers. Subsequently, the ECA mechanism is utilized to reduce redundancy and refine the feature selection. Furthermore, the temporal characteristics of the ECG data are comprehensively captured using the BiLSTM network. Finally, Softmax is employed for classification, facilitating more straightforward output interpretation. The specific structure is shown in Figure 3.

Arrhythmia classification model based on PDC-BiLSTM.
Traditional convolutional neural networks are limited by the inherent constraint of the receptive field in each convolutional layer, thereby restricting them to only a tiny portion of the input sequence and yielding limited feature information. To address this limitation and enhance the comprehensive extraction of features from ECG data, this paper introduces a novel architectural design, PC-DenseNet, as illustrated in Figure 4. The architecture comprises two distinct types of convolutional layers: the pyramidal convolutional layer (PC) and the DenseNet layer. The raw ECG signals undergo data preprocessing to obtain heartbeat segments of size 1 × 300, which are then fed as input to the PC layer. This layer consists of several parallel convolution branches with different kernel sizes, specifically set to 1 × 16, 1 × 8, and 1 × 4. Subsequently, features of different scales are concatenated to obtain cascaded features. Finally, a convolution kernel of size 1 × 1 is used to realize cross-channel feature fusion, thereby capturing global information. The structure of the PC layer is shown in Figure 4, and the mathematical expression is provided by Equation (3).

The structure of PC.
Where x represents the input ECG signal, Conv() is the convolution operation, X is the feature map obtained from one convolution branch, Xn represents the feature map obtained from the nth convolution branch, Concat() is the splicing operation, Y represents the multiscale feature after the n convolution branch feature maps have been spliced. R is the fusion feature obtained after passing through the convolution kernel of size 1 × 1.
In order to augment the network's capabilities of local feature extraction and feature reuse, the features fused post the PC layer are directed to two DenseNet 23 layers for further processing. As depicted in Figure 5, DenseNet primarily comprises of the dense block and the transition layer. The dense block, composed of a batch normalization layer (BN), Relu function, and Conv, facilitates improved information interchange between layers and appends dimensions to the output of each layer, thereby bolstering the transmission of ECG signal waveform features. Located between two densely connected blocks, the transition layer amalgamates the high-dimensional signals produced by the preceding dense block. This layer addresses the issue of network width and resolves the inability of the densely connected block to decrease the size of the feature map via pooling operations.

The structure of DenseNet.
In the process of arrhythmia classification, accurately capturing key features is crucial for correct classification. In the feature extraction process, the feature maps generated after PC-DenseNet often contain a large amount of redundant or similar information across different channels, which affects the performance of the model. Therefore, this paper considers embedding the efficient channel attention (ECA) 24 module after the PC-DenseNet network to adaptively allocate feature attention and assign differentiated weights to each extracted feature, enhancing the network's ability to represent important features in ECG signals. The structure of ECA, as illustrated in Figure 6, involves a series of steps to process the input matrix F0 with dimensions (C, H, W). Initially, the feature of unreduced dimensionality denoted as (C, 1, 1) is acquired through global average pooling (GAP). Subsequently, 1D convolution is applied to capture information specific to the channel domain. Lastly, the Sigmoid activation function normalized the feature weights to a range of [0, 1]. These weights are then utilized to perform a weighted combination with the input matrix F0, generating the feature matrix F1.

The structure of ECA.
A long short-term memory network (LSTM) is a special RNN that can effectively tackle the problem of gradient disappearance and gradient explosion and record long-term dependencies, improving the network's performance while dealing with sequential data. However, LSTM models are limited to capturing solely past contextual information, hence needing more ability to incorporate future contextual information. In order to tackle this issue, a bidirectional long short-term memory network (BiLSTM) is implemented. As shown in Figure 7, this network consists of two LSTM units operating in opposite directions, which enhances the capacity to capture time series features. This paper uses the bidirectional temporal information processing capability of BiLSTM to fuse and process features extracted from PC-DenseNet at various scales. This enables the model to identify valuable patterns and regularities within a sequence of continuous ECG waveforms, thereby enhancing the classification accuracy of arrhythmias.

The network structure of BiLSTM.
To optimize the algorithm in this study, the Adam optimization method was utilized. During model training, the step size for one-dimensional maximum pooling (MaxPool1D) was set to 2, and the one-dimensional convolution (Conv1D) was configured as 1. The batch size was 128, the number of epochs was 50, and the learning rate was specified as 0.0005. The network model was trained using a focal loss function
25
(FL) to enhance classification performance. By adjusting the weights of the classification samples, the FL function guided the model's focus towards the challenging data from the S and F classes. The FL function is formally defined as follows:
The parameters of the proposed model.
Experimental platform and evaluation indicators
The hardware configuration utilized to train the model in this study includes an Intel Xeon E5-2620 processor, 32GB of operating memory, an NVIDIA GeForce GTX 1080Ti graphics card, Keras for deep learning, and TensorFlow for the back-end. Windows Server 2016 Datacenter 64 was the operating system used to train the model. The four indicators of accuracy (Acc), sensitivity (Sen), specificity (Spe), positive predictive value (Ppv), and F1 Score (F1) are used to evaluate the performance of this model, and the calculation formulas are provided below:
The experimental steps are as follows:
Under the intra-patient paradigm, the total number of sparse heartbeats, categorized as either S or F, is increased to 3810. Under the inter-patient paradigm, the quantity of sparse cardiac beats, S and F, inside the training set DS1 is expanded to 7400. The datasets under the two paradigms are inputted into the network model for training.
The confusion matrices of the classification results of the model before and after data balancing are shown in Figure 8. Under the intra-patient paradigm, the neural network demonstrates significant accuracy in correctly classifying heartbeats N, V, and S when trained with the original dataset. However, it fails to identify heartbeats belonging to class F, misclassifying them as class N. After using data balancing techniques, the identification rate for class F significantly improves, reaching an impressive 93%. Under the inter-patient paradigm, the application of Borderline-SMOTE for data balancing resulted in significant enhancements in the recognition accuracies of classes V, S, and F. Prior to the use of data balancing techniques, a considerable quantity of S and F heartbeats were erroneously categorized as N, rendering them nearly indistinguishable. However, after applying data balancing methods, the accurate recognition rates for S and F heartbeats showed notable improvement, surpassing 60%. Integrating the above analysis into the two paradigms highlights the necessity of achieving a balanced dataset.

Confusion matrix before and after balancing the dataset: (a) before data balancing under the intra-patient, (b) after data balancing under the intra-patient, (c) before data balancing under the inter-patient, (d) after data balancing under the inter-patient.
The performance of the model
By employing the experimental procedures outlined in Section 3.2 under the intra-patient paradigm, Table 5 presents the specific experimental results. Under the intra-patient paradigm, the model achieves an overall accuracy of 99.82%, Ppv of 99.64%, Sen of 97.61%, F1 of 98.60%, and Spe of 99.99%. Figure 9 and Figure 10 describe the trends of accuracy and loss values over time under the intra-patient paradigm, and the model did not suffer from severe overfitting.

Change curves of accuracy under the intra-patient paradigm.

Change curves of loss values under the intra-patient paradigm.
Experimental results under the intra-patient paradigm.
Under the intra-patient paradigm, ablation experiments were used to analyze the impact of the PC layer and ECA strategies on arrhythmia classification performance. As shown in Table 6, the Acc, Ppv, Sen, and F1 scores of model A2 increase by 0.94%, 7.17%, 8.63%, and 8.51%, respectively, compared to model A1. Notably, there is an improvement in the ability to classify limited samples into S and F categories. Both Sen and F1 scores for Class S and Class F increased by at least 15%. These results indicate that incorporating the PC layer into the model can improve its classification performance. Upon comparing A2 and A3 in the provided table, it is evident that the implementation of ECA results in an overall improvement of 0.53% in classification accuracy. More specifically, Sen values for the difficult-to-identify S and F categories exhibit 13.97% and 1.72% enhancements. The results above highlight the effectiveness of integrating ECA into the system.
Comparison of model classification performance results under the intra-patient.
Comparison of model classification performance results under the intra-patient.
A1: DenseNet + BiLSTM, A2: A1 + PC, A3: A2 + ECA.
The performance of the model
By applying the experimental procedure described in Section 3.2 to the inter-patient paradigm, Table 7 shows the specific performance of the proposed model. he model is effective in classifying categories N and V, with Sen reaching 97.95% and 95.75%, respectively. The relatively low Sen values for categories S and F are mainly due to the small number of samples and the significant differences in the shape of the beats of different patients when arrhythmia classification is performed under the inter-patient paradigm, which can lead to a reduction in classification performance. However, The overall accuracy of the model reaches 96.30%, which can still accurately classify important arrhythmia-related diseases.
Experimental results under the inter-patient paradigm.
Experimental results under the inter-patient paradigm.
As shown in Table 8, an inter-patient performance comparison experiment is devised to examine the effect of PC layer and ECA strategies on the classification performance of arrhythmia across various classification models. Upon comparing A1 and A2 in the provided table, it is observed that including the PC layer substantially improves overall accuracy, increasing it from 92.60% to 94.68%. Furthermore, significant improvements are observed in two key metrics, the Sen and the F1 Score of the S class. Specifically, the Sen of the S class demonstrates a notable increase of 32.01%, while the F1 Score exhibits a substantial increase of 48.37%. These results presented illustrate the effectiveness of integrating the PC layer. After comparing A2 and A3 in the provided table, it is observed that the implementation of ECA led to an overall accuracy improvement of 1.62%. Furthermore, the Sen, Ppv, and F1 Score for the sparse sample categories S and F increased. This improvement signifies an improved capability to recognize the S and F categories to a certain extent, thereby indicating the effectiveness of incorporating ECA.
Comparison of model classification performance results under the inter-patient.
Comparison of model classification performance results under the inter-patient.
A1: DenseNet + BiLSTM, A2: A1 + PC, A3: A2 + ECA.
Classification of arrhythmias typically involves the utilization of machine learning techniques, two-dimensional images, and one-dimensional ECG signals. However, traditional machine learning methods occasionally require the manual generation and selection of features, which can be laborious and time-consuming. Furthermore, most approaches fail to adequately capture the temporal information and fully exploit the time-dependent characteristics of the ECG signal.27–30 Numerous existing approaches to arrhythmia classification that employ 1D CNN concentrate predominantly on local feature extraction from the ECG signal. Nevertheless, these algorithms fail to sufficiently capture the feature information across various frequency ranges and scales.31–34 Spatial information is extracted from ECG signals by classifying arrhythmias based on two-dimensional images. These techniques employ image operations, including scaling, translation, and rotation, to improve the data quality and the classification's performance. However, it is critical to acknowledge that this methodology is computationally intensive, necessitating substantial computational resources. Additionally, it is susceptible to information loss and requires more storage space to convert 1D signals to 2D images. This study presents an innovative approach by combining PC-DenseNet and BiLSTM as feature extractors for ECG data and temporal features, respectively. The initial step involves using the PC layer to extract the signals at various scales. This enables the capture of various aspects of ECG signals across multiple frequency ranges. As shown in Tables 6 and 8, the overall accuracy of ECG signals passing through the PC layer increased by 0.94% under the intra-patient paradigm and 2.05% under the inter-patient paradigm. Subsequently, dense connectivity is employed to minimize the loss of information transmission. Finally, the temporal dependency of ECG signals is modeled using BiLSTM.
Additionally, different channels contain different ECG signal features in the arrhythmia classification task. These features demonstrate a certain degree of connection among themselves. Meanwhile, ECG signals often encompass a substantial amount of redundant information, including noise and interference. In order to enhance the model's ability to build channel relationships and mitigate information redundancy, this study proposes integrating an efficient channel attention mechanism into the PC-DenseNet module. The channel attention mechanism is a method that effectively trains and directs attention toward the most pertinent channels or features within an input sequence. Tables 6 and 8 show that with the inclusion of ECA, the model collects key feature information more efficiently and performs better in both paradigms.
Furthermore, a notable disparity exists in the distribution of arrhythmia samples among various kinds in the arrhythmia classification task. This study employs the Borderline-SMOTE algorithm for data balancing to mitigate the issue of network training disproportionately emphasizing a more significant number of samples while inadequately learning from a few categories. Figure 8 shows the effectiveness of the Borderline-SMOTE, under the intra-patient paradigm, the number of correct classifications of difficult-to-categorize samples with algorithmic treatment grew from nearly unrecognizable to 217 samples for class F. Under the inter-patient paradigm, the number of correct classifications of difficult-to-categorize samples with classes S and F was also much higher than before the treatment, with a correct classification rate of more than 93%.
As shown in Table 9 and Figure 11, this paper compares the classical arrhythmia classification models in recent years. The focus of the literature35,36 aligns with the aim of this paper in that both consider the model's performance in both intra-patient and inter-patient paradigms. Literature 35 and this study utilize DenseNet as the backbone network to extract features from ECG signals. However, it overlooks the correlation between each signal channel, resulting in insufficient recognition accuracy for a limited number of challenging samples that are difficult to identify. Literature 36 transforms one-dimensional signals into two-dimensional images for arrhythmia classification, effectively enhancing the accuracy of arrhythmia classification. However, the use of image-based input in the network significantly lengthens the processing time, posing a challenge to real-time arrhythmia classification. Literature 37 successfully combined the excellent feature extraction capability of CNN with the high sensitivity of gated recurrent units (GRU) to temporal signals, which performed well on the MIT-BIH arrhythmia database. However, this method needed to fully account for feature changes at different time scales; thus, its feature extraction capability could have been improved. Literature 38 uses the AdaBoost method to classify arrhythmias, but this method requires manual feature selection during the data preprocessing stage, which is time-consuming and labor-intensive. Literatures 39 and 40 both build networks from the perspective of extracting multi-scale features. Literature 39 proposed a multi-scale ResNet-based heartbeat classification method. This method first extracts shallow features from the convolutional layer. Then, it sends the feature map to three ResNet branches with different kernel sizes to combine receptive fields of different sizes. The method in this paper aims to expand the receptive field through the PC-DenseNet layer. The DenseNet layer is more concise and effective than three parallel complex ResNet branches. Its dense connection method allows the network to directly access the feature maps of all previous layers, reducing information loss and improving feature utilization. Literature 40 proposed a method for arrhythmia classification based on multi-scale convolution. This method adds convolution modules of different sizes to the front layer of CNN to extract richer feature information. This scheme considers the idea of multiple receptive fields, but its classification performance is not as good as the method proposed in this paper. The above analysis shows that the model proposed in this paper obtains high Acc, Spe, F1 Score, Sen, Spe and Ppv values for arrhythmia classification.

Comparison with existing methods: (a) histogram comparing multiple methods under intra-patient paradigm, (b) histogram comparing multiple methods under inter-patient paradigm.
Comparison with existing methods.
This paper proposes a PDC-BiLSTM model to categorize four types of arrhythmias, N, V, S, and F, in both intra-patient and inter-patient paradigms. Multi-scale features are created using distinct convolutional branches in different feature channel dimensions, based on which the transfer and reuse of features are promoted using a dense connection. Meanwhile, ECA is introduced to focus on crucial aspects and eliminate information duplication. The model developed in this research boosts the capacity to describe multi-scale feature information and substantially improves classification resilience and accuracy. Subsequent studies will attempt to apply the proposed model on portable devices to enable doctors to conduct electrocardiogram analysis promptly and offer improved, personalized medical care for enhancing arrhythmia diagnosis and ensuring timely treatment.
Footnotes
Acknowledgments
The authors have no acknowledgments.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Wuhan Knowledge Innovation Project (No. 2022020801010258) and Natural Science Foundation of Hubei Province (No. 2022CFA007).
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
