Abstract
Background and objective:
Epilepsy is one of the most common neurological disorders caused by recurrent seizures. Electroencephalograms (EEGs) record neural activity and can detect epilepsy. Visual inspection of an EEG signal for epileptic seizure detection is a time-consuming process and may lead to human error; therefore, recently, a number of automated seizure detection frameworks were proposed to replace these traditional methods. Feature extraction and classification are two important steps in these procedures. Feature extraction focuses on finding the informative features that could be used for classification and correct decision-making. Therefore, proposing effective feature extraction techniques for seizure detection is of great significance.
Methods:
Principal Component Analysis (PCA) is a dimensionality reduction technique used in different fields of pattern recognition including EEG signal classification. Global modular PCA (GModPCA) is a variation of PCA. In this paper, an effective framework with GModPCA and Support Vector Machine (SVM) is presented for epileptic seizure detection in EEG signals. The feature extraction is performed with GModPCA, whereas SVM trained with radial basis function kernel performed the classification between seizure and nonseizure EEG signals. Seven different experimental cases were conducted on the benchmark epilepsy EEG dataset. The system performance was evaluated using 10-fold cross-validation. In addition, we prove analytically that GModPCA has less time and space complexities as compared to PCA.
Results:
The experimental results show that EEG signals have strong inter-sub-pattern correlations. GModPCA and SVM have been able to achieve 100% accuracy for the classification between normal and epileptic signals. Along with this, seven different experimental cases were tested. The classification results of the proposed approach were better than were compared the results of some of the existing methods proposed in literature. It is also found that the time and space complexities of GModPCA are less as compared to PCA.
Conclusions:
This study suggests that GModPCA and SVM could be used for automated epileptic seizure detection in EEG signal.
Keywords
Introduction
Epilepsy is a neurological disorder which affects approximately 50 million people of world population as reported by World Health Organization [1]. Electroencephalogram (EEG) is a common measure of reading brain’s electrical activity [2]. EEG is popularly used in the medical application for the diagnosis of epileptic seizure [3,4]. EEG signals are recorded by placing electrode over the scalp. Manual analysis of these recorded signals by visual inspection is time consuming as well as it may lead to error. Hence, automated schemes with a high seizure detection rate is significantly required.
In the last few years, a number of methods have been proposed by researchers for seizure detection in EEG signal. The two basic steps involved in these methods are feature extraction and classification. Feature extraction reduced the dimension of the input patterns by keeping the most important attribute and constitute the feature vectors which are then given as input to a classifier to carry out the classification. Some of the methods suggested in the literature include the techniques like Fourier transform [5,6], wavelet transform [7–13], multi wavelet transform [14] and time frequency analysis [15]. Empirical mode decomposition (EMD) has also been used for the classification between normal and epileptic EEG signals [16]. Principal component analysis (PCA) is a dimensionality reduction technique and has been used for seizure detection and classification [17,18].
Recently, different techniques based on linear prediction error energy [19], fractional linear prediction [20], Hilbert Huang Transformation [21], wavelet based nonlinear feature with extreme machine learning [22] have been reported for epileptic seizure detection in EEG signal.
Local binary pattern (LBP) is a well known feature extraction technique used for face recognition [23]. Kaya et al. [24] applied 1-d LBP to the raw EEG signal for feature extraction. These extracted features were then trained with different classifier to perform the classification.
In this study, global modular PCA (GModPCA) and SVM have been employed for seizure detection. PCA focuses only on finding global variation. Modular PCA (MPCA) was introduced by Asari et al. [25] for face recognition. In MPCA, the input images are divided into subimages and PCA is performed on each subimage. It focuses on finding local variation in a small segment. Recently, Kadappa and Negi [26] introduced GModPCA. This technique focuses on both local and global variations of input patterns. This technique consists of two steps. The first step is done identically with the MPCA. In the second step, PCA is performed on the features extracted in first step to further reduce the dimensionality and take the advantage of inter-sub-pattern correlation [26]. These feature vectors obtained are given as input to SVM for the classification.
To evaluate the performance, the scheme has been tested on the benchmark EEG data set considering 10-fold cross validation. In addition, we have also explained the time and space complexities of PCA, MPCA and GModPCA.
The remaining content of this paper is organized as follows: Methodology and materials used are discussed in Section 2. Experimental results are shown in Section 3. Finally, Section 4 concludes the article with future direction.
Methodology and materials
PCA is a dimensionality reduction technique. MPCA and GModPCA are two variations of PCA. MPCA was introduced by Asari et al. [25] for face recognition, which focuses on local variations. GModPCA was proposed by Kadappa et al. [26] in order to take the advantage of both local and global variations. In this study, we have applied both the techniques for epileptic seizure detection. For the classification between seizure and nonseizure EEG signals SVM has been used. Even though MPCA was introduced for face recognition, we have also tested this technique for epileptic seizure detection in EEG signal. In both these techniques the input patterns are divided into S subpatterns. Figure 1 depicts the example of a subpattern.

An EEG signal is divided into S subpatterns, where
The steps involved in GModPCA are as follows:
Let
The mean pattern of
The covariance matrix is computed as,
Compute the eigenvalues (
Select r (
The local PCs for the subpattern set I is obtained by projecting it onto E. The local PCs set (Y) is obtained as,
Concatenate Y, in accordance with the partition sequence followed in step 1. Let Y be the set obtained after concatenation.
Once the feature set
Compute the covariance matrix,
Find eigenvalues (
Select w (
The final features set Z is obtained by projecting Y onto
GModPCA consists of two steps. The first step is constituted by MPCA (Fig. 2). The second step is the injection of PCA on features extracted in the previous step (Fig. 3).

Step 1 of GModPCA.

Step 2 of GModPCA.
The partition of patterns into equal size subpatterns set must be carried out such that, the loss of pattern is avoided or minimized. The subpattern formation can be done in a contagious manner or randomly. In this research, a contiguous partitioning approach has been followed (Fig. 1).
Selection of projection vectors (r, w)
In both the approaches, i.e., MPCA and GModPCA, projection vectors (PVs) are computed from the covariance matrix. The basic two approaches for selecting the number of PVs are as follows: (1) selecting a fixed number of eigenvectors for projection (2) setting a threshold (δ) on total variation.
MPCA
The set of operations performed in step 1 of GModPCA constitutes MPCA. In case of MPCA, the features set Y (Fig. 2) obtained is used for classification. However, in case of GModPCA the features set Z (Fig. 3) is used for classification.
Support vector machine (SVM)
Support Vector Machine (SVM) is a binary classifier [27]. It draws a maximum margin decision boundary to separate the classes. Figure 4 depicts the diagram of an SVM.

Support Vector Machine.
Consider a binary classification problem with a training data set T having a n number of samples. Let d be the dimension of each sample.
In order to find the best separating hyperplane, the following optimization condition needs to be solved, i.e.,
k-fold cross validation is well known technique for evaluating the system performance. k-fold cross validation is performed by partitioning the entire dataset to the k number of equal subparts. One out of the k subparts is taken as the testing set and the remaining
Time complexity of PCA, MPCA and GModPCA
Here we have discussed the time complexity (Tc) involved in all these three techniques. Suppose
Let
The Tc of PCA as described in equation (11) is
The Tc of GModPCA as described in equation (13) is
It should be noted that
Here we have discussed the space complexity (Spc) of the PCA, MPCA and GModPCA for the same input patterns set
For PCA the space complexity, including input patterns set
The Spc of PCA as described in equation (14) is
The Spc of MPCA as described in equation (16) is Case 1: Case 2:
We have used the publicly available EEG time series dataset of Department of Epileptology1
EEG time series dataset

Epilepsy Data Set.
In this section, the experimental results and analysis have been done.
Results
GModPCA begins by dividing the input patterns to S number of non overlapping subpatterns of equal sizes. For this study, the number of partitions,
The publicly available epilepsy EEG dataset has been used. The dataset consist of 5 subset (A to E). We have used all these five subsets for classification between seizure and nonseizure EEG signals. A set of seven different experimental cases has been tested, i.e., A vs E (case 1), B vs E (case 2), C vs E (case 3), D vs E (case 4), AB vs E (case 5), CD vs E (case 6), and ABCD vs E (case 7). For each experimental case, we have performed k fold cross validation with
Classification accuracy of MPCA and GModPCA with SVM for A vs E
Classification accuracy of MPCA and GModPCA with SVM for A vs E
Classification accuracy of MPCA and GModPCA with SVM for B vs E
Classification accuracy of MPCA and GModPCA with SVM for C vs E
Classification accuracy of MPCA and GModPCA with SVM for D vs E
Classification accuracy of MPCA and GModPCA with SVM for AB vs E
Classification accuracy of MPCA and GModPCA with SVM for CD vs E
Classification accuracy of MPCA and GModPCA with SVM for ABCD vs E
The classification accuracy of PCA, MPCA and GModPCA for each experimental case is shown in Fig. 6.

Classification Accuracy of GModPCA, MPCA, and PCA with SVM for different experimental cases, i.e., (a) A–E, (b) B–E, (c) C–E, (d) D–E, (e) AB–E, (f) CD–E, (g) ABCD–E.
The abnormality or disorder recorded in EEG signal posses certain unique patterns. It is very crucial to capture these hidden patterns for correct diagnosis. PCA focus on the extraction of global features and hence its capability for detecting these unique patterns becomes limited. On the other hand, MPCA and GModPCA begins by dividing the signal into subparts and extracted features from these subparts individually. As a result of which, both the techniques capture the hidden unique patterns and the chances for the correct diagnosis of a disorder is maximized. From the Fig. 6 it could be seen that the epileptic seizure detection rate of MPCA and GModPCA is high as compared to PCA. A feature extraction technique not only focus on extracting the informative features, but also it should be computationally simple. In this research, we not only show that the MPCA and GModPCA have a high capability for seizure detection, but also we proved analytically that the time and space complexities of both the methods are less as compared to PCA.
The following observations are made from the experimental results. With the same number of projection vectors, the classification accuracy obtained by MPCA for different experimental cases is more than the accuracy achieved through PCA. It proved that the features extracted from the modules are more informative than the features extracted directly from the EEG signals. MPCA focus only on finding the local features. However, GModPCA focus on both the local features and global features. The experimental results presented in Tables 1–7 proved that the GModPCA has been able to achieve better classification accuracy than MPCA with less number projection vectors. In most of the cases, it is found that GModPCA acieved the best classification accuracy with 24–35 features. As can been seen in Fig. 6 that for different experimental cases, GModPCA achieved higher classification accuracy than PCA and MPCA with less number of extracted features or principal components.
As mentioned earlier MPCA was introduced for face recognition. We have also tested its effectiveness for epileptic seizure detection along with GModPCA. From Lemmas 1–4, it is proved that MPCA and GModPCA have less time and space complexity than PCA.
Different methods have been proposed in literature for seizure detection on the same dataset used under in this study. A comparative study of these methods along with the our proposed approach is presented in Table 8.
Reference, year, methods and Classification accuracy obtained for some cases in literature
Reference, year, methods and Classification accuracy obtained for some cases in literature
For case 1, classification the maximum accuracy reported in the literature is 100% which was achieved by Srinivasan et al. [31] with the application of entropy and neural network. Similarly, the classification accuracy of 100% was achieved by Iscan et al. [32] through the combination of time frequency domain features. In this study, MPCA and GModPCA achieved the classification accuracy of 99.5% and 100% respectively, for case 1, which is better than the recent classification accuracy achieved by Lee et al. [35] and Chai et al. [21].
For cases 2–4, the best classification accuracy (%) achieved by MPCA and GModPCA are 98.90, 98.10, 93.80 and 99.20, 98.50, 94.20 respectively. Nicolaou et al. [34] reported the classification accuracy of 82.88, 88.00, and 78.98 respectively for these experimental cases.
For cases 5–7, MPCA achieved the best accuracy (%) of 99.33, 95.33, and 96.92 respectively. Similarly, with GModPCA the best accuracy (%) found to be 99.66, 95.80, and 97.17 respectively.
These results show that GModPCA has the tendency to acquire high seizure detection rate. As shown in Table 8. Even though a number of methods have been proposed in the literature, none of these methods addressed the issue of inter-sub-pattern correlation between the EEG signals. This research aims to strengthen the research in the direction of exploring the inter-sub-pattern correlation and showing the possibility of the effectiveness in the field of biomedical signal processing. Both the techniques work directly on the raw EEG signal.
In this paper, an effective approach with GModPCA and SVM have been proposed for automated seizure detection in EEG signal. Features are extracted using GModPCA. We have also tested the effectiveness of MPCA, which focuses on local variation, whereas the GModPCA focus on both local and global variations. After the feature extraction is performed, the extracted feature vectors are fed to the SVM to carry out the classification. Seven different experimental cases for classification have been conducted. By observing the classification accuracy it could be interpreted that GModPCA with SVM achieved a better classification accuracy as compared to some of the existing techniques proposed in literature. This shows that there exist a strong inter-sub-pattern correlation in EEG signals. In this paper, It is also proved analytically that MPCA and GModPCA have less time and space complexities as compared to PCA. GModPCA is an efficient dimensionality reduction technique which can be applied to other medical applications in future.
Conflict of interest
The authors have no conflict of interest to report.
