Abstract
The purpose of this study is to present a novel method which can objectively identify the subjective perception of tonic pain. To achieve this goal, scalp EEG data are recorded from 16 subjects under the cold stimuli condition. The proposed method is capable of classifying four classes of tonic pain states, which include No pain, Minor Pain, Moderate Pain, and Severe Pain. Due to multi-class problem of our research an extended Common Spatial Pattern (ECSP) method is first proposed for accurately extracting features of tonic pain from captured EEG data. Then, a single-hidden-layer feedforward network is used as a classifier for pain identification. With the aid of extreme learning machine (ELM) algorithm, the classifier is trained here. The advantages of ELM-based classifier can obtain an optimal and generalized solution for multi-class tonic cold pain. Experimental results demonstrate that the proposed method discriminates the tonic pain successfully. Additionally, to show the superiority for the ELM-based classifier, compared results with the well-known support vector machine (SVM) method show the ELM-based classifier outperform than the SVM-based classifier. These findings may pay the way for providing a direct and objective measure of the subjective perception of tonic pain.
Keywords
Introduction
Pain is a complex human experience and a symptom or sign of numerous medical conditions. Therefore, pain is very difficult to gauge and quantify [1, 2, 3]. To help physicians and caregivers guide treatment accurately, the intensity and pattern of pain are necessary to be assessed. So far, the most common measures for pain intensity are visual analog scales, numerical rating scales and verbal rating scales [4]. However, those conventional methods have intrinsic limitations. On the one hand, human intervention by physicians or caregivers is that patients need to be required to perform the rating form. On the other hand, those measures cannot achieve time-dependent dynamic changes in pain. In addition, according to [5], patient-reported pain is not always reliable, valid, and even meaningless, e.g. for demented patients and newborn patients. Furthermore, some patients who are addicted to drugs lie their pain intensity on purpose which is to get more medicines. Consequently, objective measures of pain have been expected for clinical pain assessment and pain management.
Significant efforts have been made to identify reliable and valid indicators of pain. Various medical imaging techniques are used to investigate the human pain, including single-photon emission tomography [6], positron emission tomography [7], and functional magnetic resonance imaging [8]. However, those methods are more expensive and used inconvenience for patients and physicians. Therefore, more and more experts are looking for other methods to achieve this purpose. The most comprehensive method is to adopt the facial expression identification for pain intensity analysis [9, 10]. However, the challenge of this method is how to identify characteristics of whether patients are in real pain, since some patients could have abilities to pretend themselves in pain condition as an actor. Fortunately, pain could be expressed through multiple modalities and efforts are needed to extend technologies beyond facial expression to other methods. A prime example is the “Emotion and Pain Projection (EmoPain)” initiated by Engineering and Physical Sciences Research Council (EPSRC) [11]. Its purpose is to design and develop an intelligent system that will enable ubiquitous monitoring and assessment of patients’ pain-related mood and movements in the clinical environment. The project involves multidisciplinary collaboration of computer vision and machine learning scientists, clinical psychologists, physiotherapists, experts in pain management and rehabilitation.
Electroencephalogram (EEG) plays an important role in neurophysiological studies, since it is an inexpensive and non-invasive measure of brain activity. Therefore, it has gained much interest from the scientific community in searching for brain measures to characterize the perception of pain. Especially, related dynamic changes in the brain activity are captured appropriately, efficient analysis of EEG data may effectively reflect dynamic changes of pain perception due to the high temporal resolution of EEG.
This study attempts to research and assess the tonic pain under the innoxious cold stimuli condition, which is built with iced water simulation experiment. In this experiment, we recruited twenty subjects and collected EEG data from each subject under the tonic cold pain stimuli condition. In order to identify tonic cold pain, this paper proposes a new method which is capable of classifying four classes of pain states. It consists of two modules: feature extraction and classification. Because this study investigated the multi-classification task, we propose an extended Common Spatial Pattern (ECSP) method which can extract the features of four types of pain states. Then, a single-hidden-layer feedforward network classifier is utilized as a pain classifier. The classifier is trained by extreme learning machine (ELM) algorithm. To show the superiority for the proposed pain classification, compared results with the well-known support vector machine method are reported. Experimental results demonstrate that the method can be regarded as an objective cortical measure associated with the subjective perception of tonic cold pain and a promising candidate for the development of an expert system for clinical pain application.
The rest of this paper is organized as follows. Some related works to pain classification are given in Section 2. The methods used in our research are described in Section 3. Section 4 presents the process of the experimental design and the dataset establishment. Experimental results are reported in Section 5. Finally, in Section 6, conclusions are given.
Related works
In early studies, many researchers took advantage of EEG to investigate the association and relationship between EEG data and the perception of pain [12, 13, 14].
Steriade et al. [15] and Silva [16] showed that a decrease of alpha frequency band and an increase of beta frequency band were performed by an experimental pain stimulus. Similar research results were reported by other researchers [17, 18, 19, 20, 21]. With the analysis of power spectral changes, Dowan et al. [22] successfully identified features that tonic pain-related cortical activity under the cold condition. Nir et al. [23] provided a research work focused on the association between subjective perception of tonic pain and peak frequency of alpha band during stimulation and at rest. Then, Nir et al. [24] carried out another research for the prediction of subjective pain perception on alpha-1 frequency band (8–10 Hz) and found that it was able to be served as a direct, objective and experimentally stable measure of subjective perception of tonic pain. Shao et al. [25] investigated electrocortical responses in five frequency bands, i.e., 1–4 Hz, 4–8 Hz, 8–12 Hz, 12–18 Hz, and 18–30 Hz. Then, the pain states were discriminated using frequency-domain analysis. Gram et al. [26] performed a research to investigate the reliability of tonic cold pain scores and EEG spectral indices. The results showed that the EEG assessed during tonic cold pain was a valid experimental pain model.
In above research works, pain-related EEG signals were only analyzed in frequency domain, which focused mainly on the power spectrum peaks and coherence values at specific EEG frequency bands. Other researchers considered to utilize machine learning methods to identify pain states from EEG data. Hajileontiadis [27] have successfully attempted to discriminate relax from tonic cold pain state using the wavelet higher order spectral as a feature extraction method. Kumar et al. [28] designed an method based on fuzzy model for pain estimation and observed that EEG parameters ‘Hjorth Activity’ and ‘Spectral Entropy’ could reflect the level of pain experienced by the surgical patient during the postoperative period. Similar research provided by [29] was to develop an adaptive neuro-fuzzy interface system based on Support Vector Machine (SVM) to perform pain classification.
Except for those methods, Becker et al. [30] and Granocsky et al. [31] made efforts in investigating the pain intensity and NRS using evoked potentials (EP). The EP component was related to phasic and short stimuli, since it could be measured as a distinct waveform up to several hundreds of milliseconds after the stimulus [32]. However, [24] reported that phasic and short stimuli was not able to well adequately simulate natural and clinical painful experiences. Compared with phasic pain modes, therefore, it is commonly believed that tonic cold pain could better simulate clinic pain [23]. Additionally, it is well known that human brain can perceive distinct cold (or hot) somatic stimulation and discriminate its intensity with different affective responses [33].
The proposed method
Figure 1 illustrates the framework of our proposed tonic cold pain recognition method. This method consists of two stages, i.e., training stage and testing stage. In each stage, it contains three successive modules: 1) preprocessing module, 2) feature extraction module, and 3) ELM module. All training and testing EEG data come from the experimental data (see Section 4.4). The preprocessing module aims to remove noise signals. In the feature extraction module, an extended CSP (ECSP) algorithm is proposed for extracting the pain features from EEG data. The ELM module is taken as a pain classifier in this method.
Framework of the proposed procedure for pain assessment.
Specifically, in the training stage, the ELM algorithm is used to estimate output weights of the SFNN with all training EEG data in a batch learning mode. The training inputs include a feature matrix
In the testing stage, the same pre-treatments of EEG data are carried out for noise signals. Then the ECSP with trained pair-wise project matrices extracts features of received EEG data. The features pass through the trained ELM module to output pain class labels.
Severe contamination of EEG activity by strong muscle, eye movements, eye blinks, and heart noise is serious problem for EEG interpretation and analysis. Three steps are used to preprocess EEG raw data. Firstly, EEG signals contaminated with strong muscle artifacts are manually rejected by visual inspection. Secondly, using an independent component analysis (ICA) with scalp topography, eye blinks, eye movements, and heart artifacts are removed from EEG data. Lastly, EEG data are filtered to five frequency bands, i.e., delta (1–4 Hz), theta (4–8 Hz), alpha (8–13 Hz), beta (13–30 Hz), and gamma (30–49 Hz), with a band-pass Butterworth filter. All 32 channels have been filtered and used for further processing.
Extended common spatial pattern
After preprocessing the acquired EEG data, next step is to extract the features from EEG data. In this paper, we investigate the classification of multi-class pain of EEG signals. For this purpose, an extended CSP algorithm is proposed for pain features extraction.
Common spatial pattern
The method of CSP is to decompose raw EEG signals into spatial patterns, which are extracted from two classes of single trial EEG [46]. Obtained patterns can maximize the difference between two classes. It is first applied to EEG for detection of abnormalities [34].
The EEG data of a single trail is represented as an
where
where
The whitening transformation is expressed as:
Then, using the whitening transformation
where
where
With the projection matrix (2), the single trail EEG
According to the formulation, the original EEG signals is reconstructed by multiplying the inverse of
The columns of
Using above formulas, the features used for pain classification are obtained by decomposing EEG data. A
Therefore, the features of two classes of pain are obtained by:
where
The CSP intends to find the directions in the pattern space to optimally distinguish the two classes. In our research, however, this mission is a four-classification problem. For this purpose, we propose an extended CSP algorithm. Firstly, three CSP algorithms are used to extract three pair-wise projection matrices with each one representing a pair of pain states. Then, the three pair-wise projection matrices are concatenated. The detailed information is shown as follows.
Using Eq. (8) the pair-wise projection matrix
The proposed ECSP scheme.
The preprocessed EEG data corresponding to four classes of pain states is projected into pair-wise projection matrices to obtain feature vectors in term of Eqs (11) and (12), respectively. Then feature vectors of all pairs are concatenated to final ECSP feature matrices, which are used to train the ELM classifier. The illustration of the proposed ECSP scheme is shown in Fig. 2.
Algorithm 1 gives the generation process of ECSP features matrix based on pair-wise projection matrices.
Extreme learning machine (ELM) proposed by [35] is used to train a SFNN as shown in Fig. 3. The method well overcome innate slow learning ability of traditional optimization techniques. It has been successfully applied many applications such as motor-imagery EEG classification for brain-computer interfaces [45], handwritten digit classification [40], human action recognition [41], traffic sign recognition [42], and ship detection [43]. In this paper, we consider known and unknown feature mapping functions applied into ELM, i.e., ELM-based classifier and kernel ELM-based classifier.
A diagram of a single-hidden-layer feedforward neural network (SFNN).
The ELM-based classifier consists of three layers: input layer, hidden layer and output layer. The input layer is the feature vector
where
According to [36, 37, 38], almost all nonlinear piecewise continuous functions can be used as the activation function. In this paper, the sigmoid function is chosen as the activation function, it is expressed as:
Through the activation function, a
where
In the output layer, the number of neurons is denoted as
According to Eq. (16), for the input sample
where
Above the description it can be learned that the structure of ELM by an input feature vector from the training pain EEG data. Then, the train process of ELM-based classifier is given as follows.
The proposed pain classification method belongs to the supervised learning problem. We have a training set with
The weights and biases, i.e.,
where
and
Based on the theory in [39], the ELM aims to achieve not only the smallest error but also the smallest norm of output weights. Therefore, its purpose is to minimize the training error
where
In our experimental data, the number of training samples are not huge, so
Based on the KKT theorem, the optimization problem of the ELM can be solved [39]. According to the above equations, we found that just two parameters need to be tuned, i.e., the number of hidden neurons
The kernel ELM-based classifier is also considered in this paper. If the feature mapping function
Then, based on Eq. (20), the output function of the ELM classifier can be represented compactly as:
where
and
where
This section presents the experimental apparatus and procedure for collecting and storing experimental data. Then the consistency verification of pain intensity ratings across the subjects is given. Lastly, the obtained data are used to train and test the proposed pain assessment method.
Experimental setup
The EEG data of subjects are recorded using the Enobio 32-electrode EEG wireless recording system (Neuroelectrics Inc., Barcelona, Spain). The captured data are streamed through the standard Bluetooth ISM band, operating distance range is 10 meters. The electrodes are inserted and arranged in the cap (Neuroelectrical cap) according to the international 10-10 system. The recoded scalp sites include P7, P4, Cz, Pz, P3, P8, O1, O2, T8, F8, C4, F4, Fp2, Fz, C3, F3, Fp1, T7, F7, Oz, PO4, FC6, FC2, AF4, CP6, CP2, CP1, CP5, FC1, FC5, AF3, and PO3 positions. The scalp electrodes are referenced to a pair of electrodes which are connected to CMS and DRL channels. The system is digitized at a sampling rate of 500 Hz.
Experimental procedure
Twenty subjects, nine female and eleven males, aged from 23–42 years, take part in the experiment. All are right-handed, not on any medications, and without any history of neurological and psychiatric disease. The whole experimental process is completed within five days. To be specific, in the first day, the participated subjects need to be familiar with the experimental procedure. From second day to fourth day, the tonic cold pain stimulation experiment is carried out once in each day. During these experiments, EEG data does not need to be captured. The purpose is to determine the tolerance time of hand submerged in iced water and further validate the pain level consistency on all subjects in three experiments. The last day, the EEG data are captured from each subject. The detailed process of the experiment is described as follows.
Illustration of the experimental procedure.
Each subject with eyes open is comfortably seated in an upright chair with a distance of 1 meter from a computer screen. The diagram of experimental process in a trail is shown in Fig. 4. Firstly, the subject is given 5 minutes to relax prior to the initiation of each trail. Then the subjects are asked to put his/her left-hand into a barrel with iced water mixture (1
After each green dot, 3-second break is given to each subject and a number will be displayed on the screen. Then the subject is required to rate the perceived pain intensity on 11-point (0–10) verbal rating scales (VRSs, 0: no pain, 1, barely noticeable pain, 5: mild pain and 10: worst pain ever). The experimental process is repeated until the subject feel the pain became unbearable.
Some subjects were from Northeastern University, Boston, USA, and the experimental procedure was approved by the Northeastern University Institutional Review Board (IRB). Left subjects were volunteers and gave written informed consent prior to the beginning of the experiments.
Through experiments in three days, the longest tolerance time with a hand submerged into iced water is 210 seconds. In other words, it covers nine verbal rating scores for pain perception. However, each subject probably rates different perceived pain intensity on same point-in-time due to nature of subjectivity and uncertainty. The results are shown in Fig. 5.
Pain intensity ratings across 20 subjects in three experimental phases from second day to fourth day. (red line: 2nd day; blue line: 3rd day; green line: 4th day).
In this paper, the proposed method is capable of classifying four classes of pain states, which include No Pain (0), Minor Pain (1–3), Moderate Pain (4–6), and Severe Pain (7–10), respectively. Hence, to ensure the reliability and availability of pain EEG data, the consistency of perceived pain scores on self-reported VRS needs to be validated with two criteria. One is that nine pain scores for each subject are required to fall into the range of four classes of pain states averagely, i.e., each pain contains at least two pain scores. Another is that two adjacent subjective scores do not exceed 2. Figure 5 shows that subjects 11, 13, 16, and 19 do not meet the criteria. Other 16 subjects with subjective pain intensity ratings have a good consistency and continue to complete the tonic cold stimulation experiment.
According to the experimental procedure described last subsection, selected 16 subjects participate the experiment in last day. Figure 6 shows subjective pain scores of 16 subjects. Each pain state has at least two 20 seconds EEG data. For no pain state, the EEG data are extracted from the relax phase prior to the initiation of each trail.
Pain intensity ratings across 16 subjects in formal experimental phase.
Then, EEG data of pain are partitioned for establishing pain dataset used for training and testing proposed classification method of pain states. From Fig. 6, we determine to extract two 20 seconds EEG data for each pain state. In our work, the size of one sample is set as 0.5 second EEG data. Hence 80 samples (Sampling Frequency: 500 Hz) are created for each pain state. The data preprocessing and proposed model were completed in MATLAB 2015.
To assess the generalization capability and choose optimal parameters of the proposed classifier, the N-fold cross-validation method is used in our research. Firstly, the EEG dataset corresponding to each pain state is divided into N subsets. Then, the N-1 subsets are used as the training sets and the remaining one subset is used as the testing set. The training data window is moved with one subset interval. The training process is performed iteratively for N times. Lastly, the classification accuracies of N folds are averaged, the optimal parameters are obtained from the best average validation results.
More formally, considering to the lower mean square and bias, we decide to perform ten rounds of 10-fold cross-validation on the all dataset. This process is also called the 10-fold stratified cross-validation. Firstly, the dataset corresponding to each pain state is divided into 10 subsets with equal data size. In each round, then, nine out of ten subsets are utilized for the 10-fold cross-validation procedure, and the left one is used for testing. This process is repeated 10 times until all EEG signals are included. Its aim is to enable a different subset used as the testing data in each time. The classification performance across all ten rounds of testing for each subject is averaged for the final experimental results.
Results
To test the performance of tonic cold pain identification, this section gives the experimental results obtained by our proposed method on EEG database. The procedure of parameters selection is first presented for the ELM model. Then, classification and performance evaluation results of the proposed method are reported. Lastly, to show the superiority of the ELM model, we present the comparison result with support vector machine on identifying the tonic cold pain.
Parameters selection
Because this paper investigates pain classification accuracy in five frequency bands of EEG data, i.e., delta (1–4 Hz), theta (4–8 Hz), alpha (8–13 Hz), beta (13–30 Hz), and gamma (30–49 Hz). Taking this into consideration, the procedure of parameters selection needs to be performed in each frequency band. For a more concise description of this procedure, this section will give results of parameters selection in alpha and theta bands as two examples.
Performance of (a) the ELM-based classifier and (b) the kernel ELM-based classifier to different combinations of (
In this study, the grid search approach is utilized to select the optimal parameters for the ELM model. There are four parameters in the ELM model: 1) The ELM-based classifier has two parameters including regularization factor
For the ELM-based classifier, the value of
Figures 7a and 8a show average classification rate curves with different combinations of
For the kernel ELM-based classifier, the Gaussian kernel spread
Figures 7b and 8b show average classification rate curves with different combinations of
Therefore, based on this procedure, we can obtain the optimal parameter combinations of the ELM-based and kernel ELM-based classifiers in each frequency band, as shown in Tables 1 and 2.
Optimal parameters of the ELM-based classifier
Optimal parameters of the kernel ELM-based classifier
Performance of (a) the ELM-based classifier and (b) the kernel ELM-based classifier to different combinations of (
Once optimal parameters are chosen, those are used to train the model over all the training data. In order to obtain lower mean square and bias, we performe ten rounds of 10-fold cross-validation on the all dataset. The procedure is described in subsection 4.5. Tables 3 and 4 show the classification results and average classification results which are evaluated across all 16 subjects with the ELM-based and kernel ELM-based classifier in five frequency bands of EEG data.
In Table 3, each column illustrates the classification performance comparison in five bands. As highlighted by red in this table, it can be seen that the ELM-based classifier achieves best classification results appeared in alpha and beta bands. Because of individual differences, we observe that the highest classification accuracy is 71.67% in beta band of subject 9 and the lowest classification accuracy is just 57.50% in beta band of subject 18. Across all 16 subjects, the average classification accuracy 66.77% obtained in alpha band is little higher than 66.17% in beta band.
Classification accuracy obtained by the ELM-based classifier for 16 subjects in five bands of EEG data
Classification accuracy obtained by the ELM-based classifier for 16 subjects in five bands of EEG data
Table 4 listes the classification accuracies obtained by the kernel ELM-based classifier in five bands of EEG data. It is also noted that best classification results appeared in alpha and beta bands. Compared with results obtained by the ELM-based classifier, the kernel ELM-based classifier achieves higher classification accuracies highlighted by red in five bands for each subject. For average classification results, it can be seen that the kernel ELM-based classifier achieved 68.90% of classification accuracy which is higher than the ELM-based classifier in alpha band.
Classification accuracy obtained by the kernel ELM-based classifier for 16 subjects in five bands of EEG data
In this subsection, two performance measures account for evaluating outcomes of our proposed method, being the ‘Sensitivity’ and ‘Specificity’ metrics. The process is performed in alpha and beta bands which obtain higher classification accuracy showed in last subsection. Here, four classes of pain states, which include No pain, Minor Pain, Moderate Pain, and Severe Pain, are shortly expressed as P1, P2, P3, and P4, respectively.
We first present the results of performance evaluation for pain classification using the ELM-based classifier. Figure 9 showes sensitivity and specificity metrics in each pain class. It can be seen that the method achieves higher sensitivity metrics 79.48% and 83.54% in P1 classification. For the sensitivity of P4 classification, the results are lower than ones of P1 classification obtained 55.10% and 52.81%. However, in P4 classification, the specificity metrics are up to 94.30% and 92.85%. Similarity, we observe that the sensitivity metrics of P3 classification are 62.29% and 52.50%. Otherwise, it obtains 94.76% and 90.04% in specificity metrics.
Based on above results, it is concluded that the method used for correctly identifying P1 and P2 classes is better than identifying P3 and P4 classes. However, the ability of the method used for correctly identifying P1 and P2 classes is weaker than identifying P3 and P4 classes.
Average sensitivity and specificity obtained by ELM-based classifier in (a) P1, (b) P2, (c) P3, and (d) P4 classification.
The results of performance evaluation for the kernel ELM-based classifier are given in Fig. 10. For the alpha band, it can be seen that sensitivity values of P1, P2, and P4 classification are 72.19%, 73.54%, and 72.54%, respectively. However, the sensitivity of P3 classification is 57.19%, its performance is weaker than other pain classes. For the beta band, the sensitivity values of P1 and P2 classification are better than ones of P3 and P4 classification. We observe clearly that the specificity value is over 85% in each pain class. Therefore, it can be easily concluded that the kernel ELM-based classifier obtains better performance on correctly identifying each pain class.
Performance comparison between ELM and SVM
Average sensitivity and specificity obtained by kernel ELM-based classifier in (a) P1, (b) P2, (c) P3, and (d) P4 classification.
To test the performance of the ELM-based and kernel ELM-based classifier, this subsection takes the SVM-based and kernel SVM-based model as two competing classifiers. The classification accuracies in alpha and beta bands, average training time, and average testing time are used as evaluation measures. All of classifiers are evaluated using the same database, same data preprocessing, same training process, same testing process, and same computation platform. In this paper, SVM and kernel SVM algorithms are implemented with LIBSVM toolkit.
From Table 5, it can be seen that the kernel ELM-based classifier achieves a slightly higher classification accuracies (alpha: 68.90
Based on the results obtained in this experiment, we found that the computation speeds of the kernel ELM-based and ELM-based classifier were much faster than the kernel SVM-based and SVM-based classifier during the model training. The reason is that input weights and hidden node biases randomly are chosen and output weights analytically are computed by the ELM model. Because the processes of complicated iterative operation are avoided well, the training speed of the ELM model is very fast. However, in the SVM model, the training procedure contains a convex quadratic programming problem. Hence, the computation time is quadratic to the number of training instances.
Conclusions
A new framework used for objectively identifying the subjective perception of tonic cold pain has been proposed in this paper. In particular, this method presents an extended Common Spatial Pattern (ECSP) algorithm as a feature extractor, which is capable of dealing with multi-class pain identification, to extract features of tonic pain EEG data. Then, the ELM algorithm is utilized for training the tonic pain classifier, which is a single-hidden-layer neural network. To verify this purpose of the study, 20 subjects are recruited and a procedure of cold stimuli simulation experiment is designed and performed.
The proposed method, in this paper, is capable of classifying four classes of pain states, which include No pain, Minor Pain, Moderate Pain, and Severe Pain. The promising experimental results reveal the potentiality of objectively identifying the subjective perception of tonic pain. Compared with the well-known SVM method, the proposed method costs much less computation time during the training and testing. Therefore, it is very promising for real-time applications. Overall, the proposed approach contributes with an alternative way to the endeavor toward object quantification of the subjective characterization of tonic pain. Future work will be focused on enhancing the recognition accuracy of tonic cold pain. All programs run on a laptop with 2.4 GHz i7-4700MQ CPU and 8 GB DDR3 RAM.
Footnotes
Acknowledgments
This work was supported by Program for Changjiang Scholars and Innovative Research Team in University [grant number IRT_16R07], National Natural Science Foundation of China [grant number 51705024], U.S. National Science Foundation (NSF) [grant number 1838796]. We thank all the participants who have participated in this work.
