Abstract
Many cardiac disorders were diagnosed by analyzing an electrocardiogram signal, in particular, atrial fibrillation. We join the SDCST method with the Detrended Fluctuation Analysis (DFA) and the backpropagation net to identify atrial fibrillation in one hundred ECG signals obtained from Physionet Challenge 2017 database. The accuracy of the proposed classifier parameter is 97% for the training set and 95% for the test set.
Keywords
Introduction
The electrocardiogram signal (ECG) can diagnose several cardiac disorders (Andreotti et al., 2017), one of these diseases is a particular case of cardiac arrhythmia called atrial fibrillation. Estimates indicate that two percent of the general population may present episodes of atrial fibrillation and, there is a strong correlation between this type of anomaly with mortality risks (Behar et al., 2017).
In this context, there is paroxysmal atrial fibrillation that not always occurs during a conventional ECG test. Then the Holter Monitor appears like an efficient tool (Fan et al., 2018). In possession of these data, the cardiologists must search for anomalies. However, atrial fibrillation can occur in a short time, which makes this work exhausting and subject to errors. For this reason, the proposition of methods that identify atrial fibrillation on short-term ECG signals, thirty or sixty seconds, is of extreme importance.
Aligned with this objective, the Physionet has proposed a challenge (Clifford et al., 2017) that had as target classifies whether a short ECG lead recording shows atrial fibrillation. Among the papers that emerged from this challenge, we can cite Datta et al. (2017) that incorporate to their methodology the multi-layer cascade binary classification approach and, Teijeiro et al. (2017) that uses a set of high-level and clinically meaningful features provided by the abductive interpretation of the records.
The signal denoising by clustering and soft thresholding (SDCST) (Vargas and Veiga, 2018) clusters the detail wavelet coefficients in two sets and, respectively, shrink them according to estimated thresholds. Here we join the SDCST method with the Detrended Fluctuation Analysis (DFA) and the backpropagation net to identify atrial fibrillation in two hundred ECG signals obtained from Physionet Challenge 2017 database.
The novelty of our method is the union of these three theories. For example, Teijeiro et al. (2017) and Datta et al. (2017) use as input parameters the general features of the ECG signal. Here, we use the Hurst Exponent value (calculated by the DFA method) of the time series determined by the SDCST method.
The DFA is a method that detects the self-affinity in a time series. We can see applications of this method in ECG signals (Linhares, 2016, 2018), in econophysics (Kutner et al., 2019) and, in seismic processing (Linhares, 2017).
Our work also differs from the method introduced by Linhares (2016), because Linhares uses directly the Hurst Exponent value obtained from the RR-interval of the ECG series submitted to her SDFA method. We apply the DFA directly in two time series and fit a backpropagation net to classify healthy and unhealthy signals. Beyond that, Linhares has identified a general type of arrhythmia, processing a long time ECG signals. Here we detect the specific atrial fibrillation arrhythmia, processing short-time ECG signals.
In Section 2, we review some basic properties about the SDCST method, DFA and backpropagation net. In Section 3, we present the proposed SDCST Atrial Fibrillation Classifier (SDCST-ACF) and in Section 4 we show the application. In Section 5 we summarize the major conclusions of this study.
Basic concepts
In this section, we introduce the basic concepts about the SDCST method, DFA, and backpropagation net. We begin with the SDCST method in Section 2.1.
SDCST method
The Signal Denoising by Clustering and Soft-thresholding (SDCST) method uses the discrete wavelet transform, Hidden Markov Models, and the Viterbi algorithm to cluster the detail wavelet coefficients in groups of signal and noise.
After that, we submit each of these sets to thresholding processes and sum these two pre-processed sets.
Over the result of this sum, we apply the Inverse Discrete Wavelet Transform and obtain the clean signal estimate. For details about the SDCST method see Vargas and Veiga (2018).
Detrended fluctuation analysis
The Detrended Fluctuation Analysis (DFA) method, proposed by Peng et al. (1994), is a method useful for analysing time series that appear to be long-memory processes. Given a time series
In the second step we divide the time series
where
If we apply the natural logarithm to each
The backpropagation net is a method of artificial neural network training that calculates a gradient. This gradient allows us to estimate the network weights. Here we use a neural network with two hidden layers, the first contains five neurons and the second one contains two neurons. The input layer contains two neurons and, the output layer contains one neuron. The Fig. 3 represents this neural network.
The neural net is fitted to the data by the backpropagation of the error, see Fausett et al. (1994) for details. Which these estimated weights we can apply the backpropagation algorithm by the following steps:
Each input unit
Each hidden unit
applies activation function to compute output signal
and sends its output signal to the units in the second hidden layer.
Each hidden unit
applies activation function to compute output signal
and sends its output signal to output units.
Each output unit
and applies activation function to compute its output signal
Atrial fibrillation classifier
In this section we present the SDCST Atrial fibrillation classifier (SDCST-ACF). Given a sequence of ECG signals
The second step consists in to calculate the Hurst exponents
Let:
In the third step, we obtain the normalized Hurst exponents
|
|
|
|
|
|---|---|---|---|
| 00001 | 1 | 0.712 | 0.487 |
| 00002 | 2 | 0.736 | 0.250 |
| 00006 | 3 | 0.718 | 0.092 |
| 00007 | 4 | 0.670 | 0.073 |
| 00011 | 5 | 0.769 | 0.519 |
| 00012 | 6 | 0.708 | 0.281 |
| 00014 | 7 | 0.770 | 0.484 |
| 00016 | 8 | 0.685 | 0.218 |
| 00018 | 9 | 0.766 | 0.483 |
| 00019 | 10 | 0.684 | 0.077 |
| 00021 | 11 | 0.672 | 0.453 |
| 00025 | 12 | 0.768 | 0.536 |
| 00028 | 13 | 0.722 | 0.475 |
| 00031 | 14 | 0.791 | 0.555 |
| 00033 | 15 | 0.787 | 0.572 |
| 00034 | 16 | 0.782 | 0.586 |
| 00035 | 17 | 0.698 | 0.352 |
| 00037 | 18 | 0.718 | 0.539 |
| 00040 | 19 | 0.738 | 0.265 |
| 00042 | 20 | 0.753 | 0.544 |
| 00045 | 21 | 0.730 | 0.458 |
| 00046 | 22 | 0.756 | 0.565 |
| 00048 | 23 | 0.767 | 0.545 |
| 00049 | 24 | 0.775 | 0.329 |
| 00050 | 25 | 0.745 | 0.555 |
| 00051 | 26 | 0.683 | 0.489 |
| 00052 | 27 | 0.790 | 0.545 |
| 00053 | 28 | 0.756 | 0.363 |
| 00057 | 29 | 0.883 | 0.529 |
| 00059 | 30 | 0.735 | 0.524 |
| 00060 | 31 | 0.737 | 0.233 |
| 00062 | 32 | 0.732 | 0.249 |
| 00063 | 33 | 0.689 | 0.053 |
| 00064 | 34 | 0.782 | 0.542 |
| 00068 | 35 | 0.777 | 0.559 |
| 00072 | 36 | 0.784 | 0.567 |
| 00073 | 37 | 0.752 | 0.553 |
| 00076 | 38 | 0.724 | 0.042 |
| 00080 | 39 | 0.623 | 0.208 |
| 00081 | 40 | 0.674 | 0.421 |
| 00084 | 41 | 0.711 | 0.243 |
| 00085 | 42 | 0.787 | 0.568 |
| 00089 | 43 | 0.734 | 0.531 |
| 00091 | 44 | 0.680 | 0.056 |
| 00093 | 45 | 0.751 | 0.529 |
| 00094 | 46 | 0.777 | 0.548 |
| 00097 | 47 | 0.728 | 0.040 |
| 00098 | 48 | 0.671 | 0.292 |
| 00099 | 49 | 0.732 | 0.540 |
| 00104 | 50 | 0.753 | 0.304 |
Hurst exponent values, see the second step of Section 3, unhealthy ECG signals
Denote
In the fifth step we fit a backpropagation net considering the
In both the training and predict stage, the output values need to be denormalized. We do this by multiplying the normalized output by 0.5 and add to this result the value 0.5. Then, we obtain the output
Then, in the sixth step, we use the values previously estimated for the weights to apply the algorithm described in Section 2.3. The value of the atrial fibrillation classifier parameter
We consider one hundred ECG signals for training, fifty healthy and fifty unhealthy, from Physionet Challenge 2017 database (Clifford et al., 2017). The ECG names start with the letter “A” followed by five digits.
Then we apply the steps given by the Section 3. The Tables 2 and 2 show the values of the Hurst exponent given in the second step of the Section 3.
Output values
Output values

|
|
|
|
|
|---|---|---|---|
| 00435 | 1 | 0.687 | 0.250 |
| 00436 | 2 | 0.726 | 0.053 |
| 00437 | 3 | 0.743 | 0.571 |
| 00440 | 4 | 0.692 | 0.294 |
| 00447 | 5 | 0.672 | 0.071 |
| 00448 | 6 | 0.692 | 0.290 |
| 00449 | 7 | 0.689 | 0.162 |
| 00452 | 8 | 0.768 | 0.480 |
| 00454 | 9 | 0.686 | 0.226 |
| 00455 | 10 | 0.730 | 0.275 |
| 00457 | 11 | 0.763 | 0.536 |
| 00458 | 12 | 0.763 | 0.524 |
| 00462 | 13 | 0.610 | 0.226 |
| 00463 | 14 | 0.757 | 0.520 |
| 00467 | 15 | 0.718 | 0.075 |
| 00469 | 16 | 0.677 | 0.264 |
| 00472 | 17 | 0.707 | 0.560 |
| 00475 | 18 | 0.658 | 0.208 |
| 00481 | 19 | 0.666 | 0.053 |
| 00483 | 20 | 0.731 | 0.521 |
| 00487 | 21 | 0.752 | 0.547 |
| 00488 | 22 | 0.746 | 0.544 |
| 00491 | 23 | 0.732 | 0.536 |
| 00492 | 24 | 0.770 | 0.509 |
| 00496 | 25 | 0.755 | 0.573 |
| 00501 | 26 | 0.733 | 0.497 |
| 00505 | 27 | 0.736 | 0.525 |
| 00508 | 28 | 0.783 | 0.566 |
| 00510 | 29 | 0.761 | 0.543 |
| 00516 | 30 | 0.779 | 0.536 |
| 00517 | 31 | 0.728 | 0.500 |
| 00518 | 32 | 0.726 | 0.528 |
| 00528 | 33 | 0.770 | 0.562 |
| 00532 | 34 | 0.677 | 0.068 |
| 00538 | 35 | 0.698 | 0.287 |
| 00540 | 36 | 0.737 | 0.557 |
| 00543 | 37 | 0.769 | 0.528 |
| 00552 | 38 | 0.728 | 0.506 |
| 00553 | 39 | 0.720 | 0.541 |
| 00555 | 40 | 0.731 | 0.529 |
| 00560 | 41 | 0.780 | 0.323 |
| 00564 | 42 | 0.725 | 0.542 |
| 00569 | 43 | 0.680 | 0.475 |
| 00571 | 44 | 0.722 | 0.204 |
| 00574 | 45 | 0.640 | 0.305 |
| 00575 | 46 | 0.730 | 0.494 |
| 00576 | 47 | 0.744 | 0.556 |
| 00580 | 48 | 0.663 | 0.074 |
| 00581 | 49 | 0.711 | 0.538 |
| 00582 | 50 | 0.663 | 0.047 |
Hurst exponent values, see the second step of Section 3, test set, unhealthy ECG signals
Then, we apply the fifth step of the Section 3 to obtain the output values
To illustrate the generalizability of the method, we present to the classifier a test set consisting of 100 ECG signals whose the neural network did not have access to. In this case, the results are presented in the Tables 5–6. Figure 2 shows a graphical representation of these results. The classifier accuracy for the test set is 95%.
Output values, test set,

Graphical representation of the backpropagation net.
In this paper, we presented a new classifier parameter for the detection of atrial fibrillation of short-time ECG signals (between 30 and 60 seconds). This classifier parameter, based on the Detrended Fluctuation Analysis (DFA), SDCST method, and backpropagation net presented an accuracy of 97% and 95% for the training set and test set, respectively.
This article does not contain any studies with human participants or animals performed by any of the authors.
