Abstract
Neurodegenerative diseases affect a large part of the population in the world and also in Mexico, deteriorating gradually the quality of patients’ life. Therefore, it is important to diagnose them with a high degree of reliability. In order to solve it, various computational methods have been applied in the analysis of biomarkers of human gait. In this study, we propose employing the automatic model selection and hyperparameter optimization method that has not been addressed before for this problem. Our results showed highly competitive percentages of correctly classified instances when discriminating binary and multiclass sets of neurodegenerative diseases: Parkinson’s disease, Huntington’s disease, and Spinocerebellar ataxias.
Introduction
Neurodegenerative diseases are pathologies that affect the nervous system producing gradually different aspects of motor skills. Gait is one of the motor skills that gets severely affected. A major problem is that their diagnosis and categorization are difficult in the initial stages and they prevail in older adults [9, 30]. Their diagnosis in the early stages is an open problem in medicine. These diseases can be detected as patterns of behavior in the patient’s gait, since as shown in many studies: variations in gait are an early indicator [23, 32].
Parkinson’s disease (PD) is a progressive disorder of the nervous system that affects movements particularly [13]. PD develops gradually and sometimes starts with a barely noticeable tremor in one upper extremity. In Mexico, this disease represents the main neurodegenerative disease that affects adult patients, and it is more common in men than in women [27, 29]. Huntington’s disease (HD) and spinocerebellar ataxias (SCA) are hereditary neurodegenerative disorders that affect motor coordination, and consequently, the gait and also lead to progressive functional deterioration. HD and SCA patients have a definite molecular diagnosis as the responsible autosomal dominant mutation with trinucleotide repeats was identified in their genome. The most frequent SCAs in Mexico are SCA2 and SCA3 (or Machado-Joseph’s disease). We chose this group of progressive neurological disorders because they represent the most frequent diseases, which cause movement disorders. These disorders affect gait similarly. They alter the balance, slow the gait velocity (bradykinesia) which produces shortening steps, and among other parameters of gait are affected [8, 25]. These diseases can be so subtle that can go undetected by an observer.
From a computational perspective, Gait recognition methods are well-established to provide gait biomarkers that can serve in the categorization of diseases such as PD, Alzheimer’s disease (AD), HD, SCA, among others [1, 39]. In this work, we explore the use of these types of biomarkers for the classification of neurodegenerative diseases using a method of Machine Learning. The classification consists of predicting a label within a finite set of labels [28]. There are a wide variety of Machine learning classification algorithms that are based on: bayesian networks, artificial neural networks, fuzzy logic, decision trees, association rules, assembled methods, etc. [36]. In particular, in this study, we followed the method proposed in [20, 33]: Automatic model selection and hyperparameter optimization, to identify the appropriate classifier based on characteristics of the gait biomarkers. After exhaustive experimentation, we identified the Random Forest classification algorithm, as the one which better performs.
The rest of this paper is organized as follows: Section 2 depicts related works to this research, the materials and methods are detailed in Section 3, in Section 4 the experiments and results are shown and, finally, the results and possible future works are discussed in Section 5.
Related works
The categorization of neurodegenerative diseases based on gait recognition has had important advances. this section is divided into two subsections, the first focuses on studies of neurodegenerative diseases based on gait. The second subsection reviews works that are concerned with the gait classification.
Gait as an identifier of neurodegenerative diseases: a neurological approach
The following three studies suggest a relationship between gait disorders and neurodegenerative diseases:
In a group of 45 patients with SA and 15 control subjects, Nakamura et al. studied the relationship between gait and SA. Results revealed that as gait skills declined, the same happened with the mental abilities of patients. They established that the gait speed is significantly reduced as the severity of dementia symptoms increased [24].
Theill et al. studied the gait of 1,072 older adults. The tests revealed that as the mental decline known as mild cognitive impairment increased, the striding speed also decreased [32].
Mielke et al. performed evaluations on 1,478 patients. Tests of mental abilities and gait with each patient were carried out. The results showed that a higher gait velocity was associated with a lower cognitive decline [23].
These studies established a strong effect of neurodegenerative diseases on gait. In the next subsection, we review relevant works that focus on gait recognition with several purposes.
Classification of neurodegenerative diseases based on gait recognition
There are several works focused on PD. El Maachi et al. exposed that intelligent algorithms can reduce the subjectivity of gait analysis, so they applied a convolutional neural network to detect PD and predict the severity. Their results achieved 98.7% accuracy in detection and 85.3% in severity prediction [11]. Khan et al. proposed a method based on the analysis of movement in videos to recognize the gait in patients with PD. The model was based on the idea that a normal human body reaches equilibrium during gait by aligning the body’s posture with the gravity axis using feet as a base of support; In contrast, the posture of patients with PD seems to be leaning forward, as they are less able to align their body. To capture the gait data, 3 patients with PD and 4 control subjects were recorded while walking. Results showed a 100% recognition rate [18]. On the other hand, Barth et al. combined the analysis of the movement of hands and gait with a sensor-based system, for PD recognition. The experiments were carried out with 18 patients with PD and 17 control subjects. On the data, they did an exploration with four classifiers and obtained the highest percentage with AdaBoost, 97% [4]. Finally, Li et al. developed a gait recognition system based on a network of 16 sensors to identify gait patterns in patients with PD. They also developed an algorithm based on local linear embedding to extract and recognize gait characteristics. Results show that the proposed system has a recognition rate of around 95.57% [21]. In addition to the sensor networks, the images use has been proposed. Chien-Wen et al. proposed an image-based diagnostic system that uses an algorithm that combines principal component analysis with linear discriminant analysis. The study was done with 7 patients with PD and 7 control subjects. Results showed a recognition rate of 95.49% [7]. Finally, Barnes and Jafari designed a sensors network that was placed in such a way that apart from gait also recorded data from other movements of the body. The study showed that a sensor system quantitatively measures some of the factors involved in locomotion in real-time and also they proposes that it can be extended to detect SA [3].
Gait has also been related to HD. Grimbergen et al. using sensors, followed gait in 47 patients with HD, and 27 control subjects. The results revealed that concerning controls, patients with HD had a decrease in gait speed (1.15 m/s versus a 1.45 m/s, p < 0.001) and a decrease in stride length [14]. The clinical features included were lateral swaying, spontaneous knee flexion, variable cadence, and parkinsonian characteristics. The biomechanical analysis showed that gait oscillation varied in each foot with an average descent of speed, stride length, and cadence [19].
In ALS case also been proposed diagnostic methods have through gait. Sugavaneswaran et al. explored the concepts of Machine Learning-kernel functions by incorporating the ambiguity of time-frequency space. The proposed technique was evaluated with gait information (acquired by sensors) of 13 patients with ALS and 16 control subjects. A classification percentage of 93.1% was reached [31]. Similarly, Wu and Ng conducted a study on gait cadence (step intervals) in patients with ALS. Gait information was acquired by sensors. The probability density functions of step intervals were estimated with the non-parametric Parzen-window method. Results showed that stride patterns can be effectively distinguished between patients with ALS and healthy subjects with an accuracy rate of 82.8% [37].
So far, works have been directed to a single disease, but the discriminative power for different diseases has also been studied. Elden et al. presented a method for classifying PD, HD, ALS, and healthy control subjects. They implemented Fisher score in feature selection and Support Vector Machine (SVM) in classification. They obtained 95.31% of accuracy for discriminating neurodegenerative diseases versus healthy control [12]. Dutta et al. proposed an algorithm for the gait pathological classification of 15 patients with PD, 13 with ALS, 20 with HD, and 16 control subjects. Their technique consists of time extraction and frequency domain characteristics of correlograms obtained by cross-correlation of gait signals, subsequently, a previously trained Elman’s recurrent neuronal network is used. Results show a precision of 87.5% for PD, 88.9% for HD, and 83.3% for ALS in the problem of a set of neurodegenerative diseases (multi-class) [10]. Merory et al. did a statistical analysis of spatio-temporal motion characteristics in patients gait with dementia with Lewy body (LBD) and compared them with patients with AD and control subjects. In performed tests on a treadmill, it was observed that the stride length and velocity values were significantly reduced in both patients groups compared to the control group at different speeds. Significant correlations were found between gait speeds of patients with LBD and AD, but significantly different from the control group [22]. Verghese et al. present a broader study with gait statistical methods of 422 patients with different neurodegenerative pathologies. For the study, videos of walking patients were analyzed. As a result, it was found that the presence of gait alterations is a significant predictor of the risk of developing dementia, especially the called non-Alzheimer dementia [34]. Finally, Hausdorff et al. compared the gait pace of 11 patients with ALS versus control subjects and PD versus HD. The subjects walked for 5 minutes at their usual pace using sensors. In the study, it was found that the gait of patients with ALS is less stable and more disorganized temporarily compared with healthy people. Therefore, the step interval and gait fluctuations are apparently compromised with ALS [15].

a) Sensor network, b) sensor network topology.
Other studies have focused on identifying the best Machine Learning classifier. Iram et al. divided neurological diseases into three stages: retrogenesis, cognitive impairment, and gait disorder. For tests, they used information acquired from sensors of 15 patients with PD, 13 with ALS, 20 with HD, and 16 control subjects. They evaluated 11 classifiers employing confusion matrices to determine the exact degree of disease. Results show a better performance with the Bayes quadratic classifier, 90% for PD, 50% for HD, and 50% for ALS in multi-class classification [16]. Similarly, Banaie et al. proposed new attributes starting from statistical processes on a dataset with gait data. For tests, they used data from 15 patients with PD, 13 with ALS, 20 with HD, and 16 control subjects. They evaluated 17 classifiers and the best performance was obtained with the Bayes quadratic classifier with percentages of 100% in control subjects, 71.429% in HD, 80% in PD, and 100% in ALS, in the problem of four classes [2]. Yang et al. implemented an SVM classifier to examine four types of attribute selection. Tests were performed on data from 15 patients with PD, 13 with ALS, 20 with HD, and 16 control subjects. Results showed that with a set of four attributes it is easier to distinguish PD from the control subjects, the HD, and ALS with percentages of 86.43%, 79.04%, and 85.47% respectively; ALS is better distinguished than the control subjects and HD with 93.96% and 86.52%; likewise, HD is better distinguished from the control subjects with 84.17% [39]. Xia et al. applied the Leave-One-Out Cross-Validation method obtaining 96.83% of accuracy [38]. Ren et al. studied the Empirical Mode Decomposition method for decomposing the time series of gait rhythms into intrinsic mode functions. Their general values of AUC got a good performance in binary classification [26]. Bilgin through wavelet function “bior2.6” decomposed the compound force signal for determination of features and applying Näive Bayesian classifier achieved 90.93% of accuracy in distinguishing ALS from the control subjects [5].
Based on these studies, it is evident that there is no method that allows obtaining very acceptable results in both classification cases: binary and multiclass. On the other hand, the use of a sensor network allows obtaining biomarkers by means of sensors placed on the lower extremities. In addition, to our knowledge, there is no database with gait information of a group of patients with neurodegenerative diseases in Mexico, so this study is considered valuable, in the sense that other researchers can carry out experiments from the database. Finally, the automatic model selection and hyperparameter optimization method has not yet been used to address a medical problem such as the one proposed in this research.
Topology of sensor network
To design the sensor network five ADXL-335 3-axis accelerometers were used, wired on an Arduino MEGA-2560 card, these accelerometers cover the extremities of the knees (left and right), ankles (left and right), and the chest (Fig. 1). From the sensor network, the information of the cartesian axes x, y, and z were obtained. The sensor network is low-cost.
Dataset
An ethics committee of the National Institute of Neurology and Neurosurgery (NINN) approved the implementation of a gait laboratory inside NINN to obtain gait biomarkers of patients suffering neurodegenerative diseases. On the other hand, each patient with his family member signed consent reports authorizing the public availability of data for studies with scientific purposes, as long as the anonymity of patients was protected. The population was 82 patients suffering neurodegenerative diseases: 47 with PD, 13 affected with HD, and 22 with SA; of the total population were 48 men and 34 women. In addition, gait information from 19 control subjects was collected, 7 men and 12 women. The age and gender distributions of patients are shown in Table 1. The estimated time per person during the capture process was two minutes. The exclusion criteria were patients who used a cane, who was helped to walk by their caregivers, or who used wheelchairs.
Age and sex distribution in the NINN-Database
Age and sex distribution in the NINN-Database
The first step to build datasets was grouping the information by disease, i.e, having a file for each patient, we proceeded to unite all the files by disease (including control subjects files), which resulted in four datasets which to be processed in the classification phase to records level and with the raw data
1
. Having the data grouped, we proceeded to construct the datasets in two categories: Binary sets: {control, SA}, {control, HD}, {control, PD}, {SA, HD}, {SA, PD}, {PD, HD}, Multiclass sets: {SA, HD, PD}, {control, SA, HD, PD}.
The automatic model selection and hyperparameter optimization method is used to select the classification algorithm [20, 33]: Given a set of algorithms
Where
Based on the previous criterion, it was determined that the Random Forest algorithm was appropriate for classification. The algorithm, which belongs to the assembled methods, consists of a collection of classifiers structured by tree {h (x, Θ k ) , k = 1, . . .} where Θ k are independent random vectors distributed identically and each tree throws a unitary vote for the most popular class at the input x [6]. The algorithm is shown below:
In this technique, the inputs are the data (T) which are previous recordings (x) with their corresponding diagnosis (y), the number of decision trees to generate (m), and the maximum number of levels that each tree will have (k). For each tree, a new set (T′) of training data is created through bootstrapping (i.e., sampling with replacement). Once each data is selected a tree is selected by randomly selecting two features and choosing the best partition among both features, this is repeated until all the features have been analyzed or it reaches the maximum number of levels [6].
To assess the performance, confusion matrix (see Table 2), true positive rate or recall (Equation 2), false positive rate (Equation 3), precision (Equation 4), f-measure (Equation 5), and ROC area were used as evaluation metrics. In the confusion matrix in each column the number of predictions of each class is represented and each row represents the real values. The parameters used are true positive (TP), true negative (TN), false positive (FP), and false negative (FN). The confusion matrix allows calculating the true positive rate as the classification of correctly classified instances. On the other hand, the FP rate represents the rate of incorrectly classified negative instances. The precision gets the fraction of instances classified in the positive class that is in fact, positive class. F-measure captures the properties of TP rate and precision into a single measure. Finally, the ROC curve represents the true positive rate versus the false positive rate.
Confusion matrix
Confusion matrix
For binary sets, the percentages of correctly classified instances shown in Table 3 were obtained. On the other hand, for multiclass sets, the percentages of correctly classified instances are shown in Tables 4–6.
Classification results of binary sets using Random Forest classifier and their respective confusion matrix
Classification results of binary sets using Random Forest classifier and their respective confusion matrix
Classification results of multiclass sets using Random Forest classifier
Confusion matrix of multiclass set {SA,HD,PD}
Confusion matrix of multiclass set {control,SA,HD,PD}
It is observed in a general way (Table 3) that Random Forest generates classifications results highly competitive, around 99% . In particular: sick against healthy, confused with healthy 46 of 3300 with SA, also confused with healthy 38 of 1950 with HD and confused with healthy 36 of 4163 with PD; sick against sick, confused 65 of a total of 5250 with SA and PD, confused 30 out of a total of 7499 with SA and PD and confused 31 of a total of 6149 with HD and PD.
It is observed that Random Forest generates, again, highly competitive results, very close to 100% . When considering four classes we can see in the confusion matrix (Table 5) that numbers confused are very low when classifying the instances. In addition, the statistical metrics of Table 7 confirm in detail by class, highly competitive values.
Detailed metrics by class
The experiments were performed in Waikato Environment Knowledge Analysis v.3.8 [35], on a Lenovo ideapad 300 laptop, with Windows 10 of 64 Bits, Intel Celeron @ 1.60GHz processor, and RAM of 4.00 GB.
The automatic model selection and hyperparameter optimization method after exhaustive experimentation, it allowed determining that the Random Forest algorithm is appropriate for the binary ({control,SA}, {control,HD}, {control,PD}, {SA,HD}, {SA,PD}, {HD,PD}) and multiclass ({SA,HD,PD}, {control,SA,HD,PD}) classification of neurodegenerative diseases. Random Forest showed highly competitive results with a true positive rate, precision, f-measure, and ROC area around 99% . In contrast, the false positive rate was less than 0.005. The results obtained are highly reliable, which makes them suitable for implementation in a computer-aided diagnostic system.
Conclusions and future work
Our results showed the effectiveness of Random Forest algorithm for the classification of neurodegenerative diseases based on gait biomarkers. It is evident that when classifying binary and multiclass sets using the criterion proposed by [20, 33] to select the classification algorithm, results close to the optimal 100% are obtained, this is at least corroborated with the NINN-Database and considering the raw data.
Results of this study were obtained from patients suffering from neurodegenerative diseases in different stages, therefore, it is necessary to acquire gait information of patients in early stages.
As a future challenge of this research is considered to develop software so that neurologists can have a second opinion and sharpen the diagnosis as this may be biased due to external factors that directly affect the specialist, such as visual fatigue, emotional tiredness, or other factors of psychological type.
As future work, we propose: Implement deep learning algorithms in the NINN-Database, in binary and multiclass data sets, to compare and analyze the results. Use other sensors: gyroscopes or floor sensors, but maintain a low-cost solution. This is to have a greater information spectrum and be able to implement attribute selection algorithms. Since other diseases present gait disorders, we considered extending the study to others, for example, diabetic neuropathy.
