Abstract
Background:
Alzheimer’s disease (AD) is a progressive neurodegenerative disease that is not easily detected in the early stage. Handwriting and walking have been shown to be potential indicators of cognitive decline and are often affected by AD.
Objective:
This study proposes an assisted screening framework for AD based on multimodal analysis of handwriting and gait and explores whether using a combination of multiple modalities can improve the accuracy of single modality classification.
Methods:
We recruited 90 participants (38 AD patients and 52 healthy controls). The handwriting data was collected under four handwriting tasks using dot-matrix digital pens, and the gait data was collected using an electronic trail. The two kinds of features were fused as inputs for several different machine learning models (Logistic Regression, SVM, XGBoost, Adaboost, LightGBM), and the model performance was compared.
Results:
The accuracy of each model ranged from 71.95% to 96.17%. Among them, the model constructed by LightGBM had the best performance, with an accuracy of 96.17%, sensitivity of 95.32%, specificity of 96.78%, PPV of 95.94%, NPV of 96.74%, and AUC of 0.991. However, the highest accuracy of a single modality was 93.53%, which was achieved by XGBoost in gait features.
Conclusions:
The research results show that the combination of handwriting features and gait features can achieve better classification results than a single modality. In addition, the assisted screening model proposed in this study can achieve effective classification of AD, which has development and application prospects.
INTRODUCTION
With the acceleration of population aging around the world, the incidence rate of neurodegenerative disease (ND) in the elderly is increasing year by year. 1 Among them, AD is the most common ND, and its incidence increases exponentially with age. 2 Currently, approximately 50 million people worldwide suffer from AD. It is expected that the number of patients will double by 2050. 3 Due to the aging of society, the increase in AD patients is quite significant and will become an expensive public health burden in the coming years. 4 The characteristic of AD is the gradual loss of memory and other cognitive functions, which lasts for a long time. 5 Its early stage is not easily detected, and when patients show obvious symptoms, the disease usually progresses to the middle and late stages. Mild cognitive impairment (MCI) is a transitional stage between normal elderly individuals and AD, with approximately 80% of MCI patients transitioning to AD. 6 Although AD patients cannot be cured, identifying the early stage can lead to timely intervention measures, greatly delaying the progression of the disease and controlling symptoms. The initial and most common symptom of AD is short-term memory loss, where patients may forget the latest events and repeat the same problem. Afterward, the problem-solving ability and execution ability are compromised. Patients may also experience language disorders, impairment of visual-spatial skills, 7 and motor disorders such as slowed gait and reduced flexibility.
Currently, the diagnosis of AD requires a comprehensive clinical assessment, involving various cognitive assessment scales, imaging tests, fluid examinations, genetic testing, and eye-tracking technology. 8 However, existing diagnostic methods have certain limitations. Firstly, the scoring system of the scales is based on qualitative descriptions of symptoms, corresponding to some respective scores assigned by physicians, introducing a degree of subjectivity. The most widely used cognitive assessment scales currently include the Mini-Mental State Examination (MMSE) 9 and the Montreal Cognitive Assessment (MoCA). 10 Issues such as multicultural communication and subjective judgment can lead to inaccurate screening when using these scales. Secondly, assessing patients in a stressful medical environment may not accurately reflect their real-life behavioral abilities. Thirdly, medical imaging technologies are not only expensive but also pose some risks to patients’ bodies, making them difficult to access in regions with limited medical resources. In particular, in positron emission tomography (PET) scans, only a few hospitals have the capability for Aβ-PET and Tau-PET examinations, making this technology quite limited. Therefore, there is an urgent need to identify methods for early screening of AD to develop more effective intervention or treatment plans, thereby slowing disease progression and reducing the burden on families and society.
With the development of computer technology and artificial intelligence, utilizing computer technology to screen AD has become a promising approach, assisting in identifying early-stage AD patients. 11 Modality refers to a form in which things happen or exist. In the field of artificial intelligence, multimodality refers to the fusion of data from multiple sources to achieve information integration and complementarity between different modalities. In the realm of computer-assisted AD screening, each biomarker represents a modality. A single modality can reflect different impairments in patients. By merging data from different modalities and inputting them into machine learning or deep learning models, a comprehensive assessment of patients can be provided, enhancing the accuracy of classification. In the screening process for AD, handwriting and gait are two commonly used biomarkers.
Handwriting is a complex process involving cognitive domains such as fine motor skills and kinesiology and is often impaired by AD. 12 It is considered a crucial source of information for assessing neurological disorders. There is evidence that an individual’s health can be distinguished based on their performance in several simple handwriting tasks. 13 Numerous studies have explored the connection between AD, MCI patients, and handwriting characteristics. The results of these studies have shown that it is possible to effectively screen normal, AD, and MCI patients. Common abnormalities observed in AD patients include slow writing speed, reduced writing pressure, increased writing thinking time, more pen strokes, and writing errors.12,14,15, 12,14,15 Ghaderyan et al. 16 recruited 13 subjects with mild cognitive impairment, 15 with AD, and 15 healthy participants. Their handwriting features under the four writing tasks were input into the sparse non-negative least-square classifier, which achieved 95.3% accuracy in distinguishing AD patients from normal people. Nicole et al. 17 recruited 66 AD patients and 64 healthy participants to finish handwriting tasks. They used Random Forest and the Decision Tree to distinguish two groups of subjects. The highest accuracy achieved 72.30%. Giuseppe et al. 18 proposed a multi-classifier approach to distinguish AD patients from normal people. This multi-classifier framework consisted of 6 basic machine learning models, such as Random Forest, SVM, Logistic regression, and so on. The outputs of the classifiers were eventually combined by a majority vote. The experimental results proved that classification using a multi-classifier works better than a single classifier.
Gait refers to the posture and movements of the body when a person walks, involving coordinated movements of the legs and body. Normal gait is a complex process that requires the coordination of multiple body parts to maintain balance and propel the body forward. Gait disturbances affect many elderly individuals, and the relationship with cognitive function is closely intertwined. Both share the same physiopathological basis and have recently been emphasized as potential indicators of cognitive decline. 19 Gait changes not only serve as markers of ND but also as one of the assessment components for dementia. 20 Gait disturbances associated with AD are mainly increased variability due to executive function, attention impairments, and memory problems. Studies by Grande 21 and Castrillo et al. 22 suggest that AD patients may experience gait disturbances in the early stage. Therefore, assessing relevant gait indicators can aid in detecting the early stage of AD and cognitive impairment. The most common gait abnormalities in AD patients include slowed walking speed, shortened stride length, and increased gait variability.23–25 In recent years, many have begun to investigate the use of machine learning techniques to screen for AD through gait. Ghoraani et al. 26 proposed a diagnostic algorithm to detect MCI and AD health based only on gait and machine learning. They collected gait data from the subjects under single and dual-tasking and entered them into support vector machine (SVM), which achieved the highest accuracy of 86%. Chen et al. 27 proposed a method of using machine learning models to predict different types of MCI patients. The recruited subjects performed walking, timing, and jumping tests. The collected gait parameters were used in model based on SVM and principal component analysis (PCA), which achieved the highest accuracy of 91.67%. Mahmoud et al. 28 proposed the first application of Kinect V.2 camera and SVM to detect AD patients from normal people through the Timed Up and Go test. The model classified the two groups with an accuracy of 97.75%.
Thus, many studies have been conducted to prove that two types of behavioral data, handwriting, and gait, can achieve better classification performance with machine learning models. They can reflect different and complementary features of cognitive impairments associated with AD. 29 Combining features of these two modalities for analysis can provide higher accuracy in screening AD patients.
This paper proposes an assisted screening framework for AD based on multimodal analysis of handwriting and gait. The framework employs a feature fusion method and achieves effective classification of AD with an accuracy of 96.17%. The contributions of this paper are as follows: An assisted screening framework for AD based on multimodal analysis of handwriting and gait is proposed to achieve rapid, noninvasive screening for the early stage of AD. Study showed that the combination of these two kinds of behavioral data was better than the two alone for screening AD. This study can be applied to healthcare resource-poor areas or grassroots elderly institutions such as communities to achieve collective screening of AD and management in the elderly population.
MATERIALS AND METHODS
The framework of the early AD screening proposed in this paper is shown in Fig. 1, which consists of five stages: data acquisition, feature extraction, feature selection and fusion, classification and evaluation, and statistical analysis. The following subsections describe the details of these five stages.

The framework of the early AD screening.
Data acquisition
Participants
This study recruited participants from Huzhou Rehabilitation Hospital. The recruited participants were right-handed. Their native language was Chinese, and they were excluded from psychiatric disorders, history of substance abuse, and disorders of the hands, waist, and legs that would affect writing and walking. All participants met possible AD criteria 30 and were in the early stage of AD. All participants were examined on the MMSE. 9 After screening, the study included a total of 90 participants, consisting of 38 probable AD patients and 52 healthy controls (HC). This study was conducted under the Medical Ethics Committee of Huzhou University (20230702) and followed the ethical code stated by the Helsinki Declaration. Before the experiment, all participants signed informed consent forms, voluntarily agreeing to participate in this study. Table 1 provides the basic demographic information for the AD and HC groups.
Basic demographic information for the AD and HC groups
AD, Alzheimer’s disease; HC, healthy control; MMSE, Mini-Mental State Examination.
Equipment
For the collection of handwriting data, in previous studies, most researchers used digital tablets or tablet computers when screening AD patients.31,32, 31,32 To fit the usual writing habits of the elderly and to control other variables, the Tstudy dot-matrix digital pens and dot-matrix paper were chosen as the writing tools in this study. The dot-matrix can provide the coordinate parameter information for the dot-matrix digital pen to record the trajectory, which can be printed out on ordinary paper. When the pen tip is pressed, the pressure sensor and camera inside the pen are triggered. The dot coordinates, writing sequence, pressure data, movement speed, and other information are transferred to the inbuilt processor and finally output via Bluetooth. In this study, the subjects wrote with a dot-matrix digital pen and A4 paper covered with dot-matrix, which fit the subjects’ daily writing habits.
Regarding the formulation of handwriting tasks, Kjerstin et al.
33
proposed as early as 1996 that when screening cognitive impairment in the elderly, geometric replication and writing ability are commonly assessed. Among these tasks, those most sensitive to subtle cognitive changes include copying cubes and pentagons, which can be complemented by tasks such as copying. Based on this, considering the characteristics of writing tasks from previous research14,34,35, 14,34,35 and demographic features of the Chinese elderly, this study ultimately defined four writing tasks, including two graphic tasks and two text tasks: 1) Copying two crossed pentagons; 2) Copying a cube; 3) Picture naming. Subjects were shown three pictures: a flower, a bed, and a lamp. Subjects were asked to describe them with a single word and write it down; 4) Copying a sentence. The sentence is “
”, which means “Happy every second, joy every day.” The specific writing tasks are shown in Table 2. We merged four writing tasks into a complete handwriting dataset.
Four writing tasks
For the collection of gait data, this study employed the GAITRite walkway. GAITRite is a portable, single-layer pressure-sensitive walkway, measuring 6 meters in length and 0.6 meters in width. It is embedded with pressure-sensitive sensors that can measure temporal and spatial parameters and easily identify gait abnormalities. Subjects wore shoes and walked 2-3 rounds on the walkway at their usual walking speed and manner. The relevant data were uploaded to the GAITRite system on the computer. To minimize the impact of subjects’ nervousness on the results during the experiment, the GAITRite system automatically merged data from several rounds for each subject before exporting the final results.
Experimental procedure
The process of collecting data in this study was divided into two main stages. Firstly, the collection of subjects’ handwriting data. Subjects were invited to enter a quiet room and sit at a table where a dot-matrix digital pen and A4 paper with tasks and dots printed on it were placed. Before the start of the trial, the researcher explained the writing process and requirements to the subjects. After ensuring that the subject fully understood the writing requirements, the subject began to write at a normal speed.
The second stage was the collection of subjects’ gait data. After a short break from writing, subjects entered another corridor which was paved with GAITRite walkway. To prevent the effects of acceleration and deceleration, markers were placed 1 meter away from the beginning and 1 meter away from the end of the walkway to serve as the start and end of the gait measurement. 24 The researcher guided the subject to stand at the beginning marker. Before the start of the trial, the researcher explained the walking procedure and requirements to the subject. After the researcher said “begin", the subject started walking from the beginning marker at the same speed and posture as their usual walking pace, proceeding along the walkway until they reached the endpoint marker. To minimize the impact of subjects’ nervousness on the results during the experiment, this study requires subjects to walk continuously for 2-3 rounds.
Feature extraction
Based on the characteristics of AD patients and previous research, this study included a total of 30 handwriting features and 32 gait features.
Due to the characteristics of velocity and pressure in writing of AD patients, the extracted handwriting features include 1 item in the pressure category (average pressure), 2 items in the time category (time in air, time on paper), and 7 items in the writing form category (number of strokes, task score, single stroke length, x-axis/y-axis stroke length, tilt, repeated dots). Additionally, there are 10 items in the velocity category (average velocity, maximum/minimum velocity, maximum/minimum velocity on x-axis/y-axis direction, average velocity on x-axis/y-axis direction, number of times velocity slows down), and 10 items in the acceleration category (average acceleration, maximum/minimum acceleration, maximum/minimum acceleration on x-axis/y-axis direction, average acceleration on x-axis/y-axis direction, number of times acceleration slows down).
Due to the occurrence of some qualitative writing errors in AD patients during writing, and according to previous studies,14,35, 14,35 the task score is scored according to the following rules. Task 1 has a maximum score of 3 points, with 1 point each for meeting the following criteria: drawing a pentagon with 5 clear angles and sides; intersecting two pentagons to form a quadrilateral; each side’s length is roughly equal. Task 2 has a maximum score of 4 points, with 1 point each for meeting the following criteria: drawing in three dimensions; correct orientation of the figure; correct drawing of the figure’s lines; and basic parallelism of opposite sides. The remaining two text tasks are scored based on the correctness of each word, with 1 point awarded for each correctly written word. To standardize the full scores for each task, this study maps the scores for each task to percentages, setting the full scores at 10 points. The 30 handwriting features and their meanings are shown in Table 3.
30 handwriting features
Due to the characteristics of walking velocity and stride in AD patients, the extracted gait features include 3 items in the velocity category (velocity, stride velocity, stride velocity std dev), 20 items in the time category (step time differential, cycle time differential, step time, cycle time, swing time, stance time, single supp. time, double supp. time, swing % of cycle, stance % of cycle, single supp % cycle, double supp % cycle, heel-off-on time, step time std dev, stride time std dev, swing time std dev, stance time std dev, single supp time std dev, double supp time std dev, heel-off-on std dev), 7 items in the length category (step length differential, step length, stride length, HH base support, step length std dev, stride length std dev, supp base on std dev). Additionally, it also contains 2 other features (cadence, toe in / out). The 32 gait features and their meanings are shown in Table 4.
32 gait features
Feature selection and fusion
To preprocess the extracted data, missing values were imputed using the mean, and then feature selection was performed on the handwriting dataset and gait dataset. To reduce the dimensionality of the feature space, mitigate the curse of dimensionality associated with high-dimensional feature vectors, and identify the most useful features for screening AD patients, this study employed the Sequential Forward Selection (SFS) algorithm. The basic idea of the SFS algorithm is to start with the initial feature set, add one feature at a time, evaluate the accuracy of each classification model, and continue this process until further inclusion of features does not lead to accuracy improvement. After obtaining the optimal feature subsets for both handwriting and gait features separately, the handwriting and gait features were concatenated to create the fused feature vector.
Classification model and evaluation
The purpose of classification is to distinguish AD patients from normal subjects and obtain an AD assisted screening model. Therefore, this study utilized PyCharm Community Edition 2022.2×64 for programming. Classification models, including Logistic Regression (LR), Support Vector Machine (SVM), eXtreme Gradient Boosting (XGB), Adaptive Boosting (Adaboost), and LightGBM (LGB), were established. We chose to use the default values for the hyperparameters or to adjust them manually and empirically. The hyperparameters of each classification model are shown in Table 5. The performance of these classifiers was assessed using 10-fold cross-validation iterated 20 times. In each iteration, the dataset would be shuffled anew. We calculated the average of 20 iterations as the final result. Evaluation of the classification models was based on metrics such as accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), Receiver Operating Characteristic (ROC) curve, and Area Under the Curve (AUC) value. Additionally, to assess the stability of these models, the variance of these parameters was also computed.
The hyperparameters of each classification model
Furthermore, to verify whether the fused features had better classification performance, this paper used the same method to construct classification models for handwriting features and gait features respectively. Additionally, the accuracies of handwriting and gait, iterated 20 times, were subjected to Wilcoxon test when compared with the fused features, ensuring that the differences were statically valid.
Statistical analysis
To explore the different characteristics of AD patients in writing and walking compared to normal elderly people, this study employed independent sample Student’s t-test and Mann-Whitney U test using SPSS 26.0. The choice between the two tests was based on whether the samples satisfied the normality test and ANOVA. The difference was considered statistically significant at p < 0.05.
RESULTS
Statistical analysis
In the basic demographic information for the AD and HC groups shown in Table 1, there was a significant difference between the two groups in terms of MMSE scores (p < 0.001) as shown by the Mann-Whitney U test. There was no significant difference in gender (p = 0.572), age (p = 0.454), and education (p = 0.051).
Using the Mann-Whitney U test for handwriting data showed that there was a significant difference between the AD and HC groups in 28 features such as repeated dots, time in air, and average velocity (p < 0.05). The use of independent sample Student’s t-test and Mann-Whitney U-test for gait data showed that there was a significant difference between the AD and HC groups in 4 features, namely cycle time differential, step length std dev, stride length std dev, and supp base on std dev (p < 0.05). The handwriting and gait features with significant differences are shown in Table 6.
The handwriting and gait features with significant differences
AD, Alzheimer’s disease; HC, healthy control; IQR, interquartile range.
The results of statistical analysis indicated that there was no significant difference in gender, age, and education between AD and HC groups. However, there was a significant difference in MMSE scores (p < 0.001). The U-test results for handwriting data showed significant differences in most handwriting features between the two groups, especially in terms of writing time, writing velocity, and writing form. It can be concluded that AD patients tend to have more strokes, and an increased number of repeated dots for a particular stroke, possibly due to uncertainty about the correctness of their writing, leading to repetition and pauses in the middle of writing. Regarding time, the AD group exhibited longer time in air and time on paper, indicating increased thinking time during writing. In terms of velocity and acceleration, the AD group had lower average velocity, horizontal velocity, and vertical velocity compared to the HC group. Additionally, the AD group tended to make more writing errors. The U-test results for gait data indicated that only four features show significant differences. The AD group has a greater cycle time differential, step length standard deviation, stride length standard deviation, and support base on standard deviation compared to the HC group. It can be concluded that AD patients exhibit irregular walking rhythm, greater variability in step length during walking, and instability in the support base while walking.
Classification performance
To determine which features are most effective for screening AD patients, before training classification models, this study used the SFS algorithm to screen the optimal feature subsets for handwriting and gait. For LR, the optimal handwriting feature subset had 8 items (average pressure, time in air, time on paper, minimum velocity, average velocity on y-axis direction, minimum velocity on x-axis direction, minimum acceleration on x-axis direction, task score); The optimal gait feature subset had 5 items (cycle time differential, HH base support, swing % of cycle, stride velocity std dev, heel-off-on std dev). For SVM, the optimal handwriting feature subset had 5 items (repeated dots, time in air, average velocity, number of strokes, task score); The optimal gait feature subset had 3 items (velocity, step time, HH base support). For XGBoost, the optimal handwriting feature subset had 4 items (repeated dots, time in air, number of strokes, task score); The optimal gait feature subset had 3 items (velocity, step time differential, HH base support). For Adaboost, the optimal handwriting feature subset had 4 items (repeated dots, time in air, number of strokes, task score); The optimal gait feature subset had 4 items (velocity, step time differential, stride velocity, HH base support). For LGB, the optimal handwriting feature subset had 5 items (repeated dots, time in air, minimum velocity on x-axis direction, number of strokes, task score); The optimal gait feature subset had 3 items (velocity, step length differential, HH base support).
To explore whether combining handwriting and gait modalities has better screening effects, in this study, handwriting features, gait features, and fusion features were used to classify the two groups separately by 10-fold cross-validation iterated 20 times. The parameters used to evaluate the classification performance of models were accuracy (Acc.), sensitivity (Se.), specificity (Sp.), PPV, NPV, ROC curve, and AUC value. The results showed that among the classification accuracies obtained on the fusion features, LightGBM had the highest accuracy of 96.17%, sensitivity of 95.32%, specificity of 96.78%, PPV of 95.94%, NPV of 96.74%, and AUC of 0.991 as shown in Table 7. The ROC curve is shown in Fig. 2. It proves that our proposed scheme can effectively classify the AD and HC groups and has application value in assisting the screening of AD.

ROC curves were calculated to differentiate AD patients and HC under fused features iterated 20 times. The black dashed line represents the performance of a random classifier.
Classification performance of each classifier on fusion features. Values were obtained by 20 iterations of 10-fold cross validation
Acc., Accuracy; Se., Sensitivity; Sp., Specificity; PPV, positive predictive value; NPV, negative predictive value; Variance: the variance obtained from running classifiers 20 times.
In addition, as shown in Fig. 3, the highest accuracies of classification using handwriting features and gait features alone were 91.94% and 93.53% respectively. The classification accuracies of all five classifiers on the fusion features were higher than those of the separate modalities. The result of Wilcoxon tests also marked in Fig. 3. The experimental results showed significant differences among LR, Adaboost, and LGB models, whether in the handwriting modality or the gait modality, compared to the combined modality. In SVM, there were significant differences between the gait and combined modalities. In XGB, there were significant differences between the handwriting and combined modalities. It is indicated that the combination of the two modalities of handwriting and gait was better than the use of the two alone in the classification of AD. It also proved the superiority of our proposed scheme.

Comparison of classification accuracy of each classifier on different modalities. *p < 0.05; **p < 0.001.
DISCUSSION
This study combined handwriting and gait features to distinguish AD patients from normal elderly people, and compared the performance of different classifiers. Our first conclusion was that better classification results can be obtained by combining data from both modalities of the subjects, which can effectively classify the AD group from the HC group, and had good application value. The second conclusion was that combining data from both modalities can obtain better classification results than a single modality. In addition, the differences in handwriting and gait that we observed in the AD and HC groups were equally meaningful for screening.
It has been demonstrated that handwriting involves several cognitive domains, including language, visuospatial skills, memory, and attention, most of which are affected by AD 12 The experimental results of this study found that out of 30 handwriting features, 28 of them were significantly different. This study finds that people with AD wrote with an increase in the total number of strokes as well as in the number of repeated dots for each stroke, and with shorter stroke lengths. These findings are the same as previous studies.14,35, 14,35 In terms of time, this study can observe that longer time in air and time on paper for each stroke in the AD group. It can respond to the fact that AD patients spend more time thinking before and while writing than the HC group, and to some extent reflect the impaired cognitive ability of AD patients. This finding is also similar to previous studies.31,36, 31,36 In terms of writing velocity, this study can observe that the average velocity, maximum and minimum velocity, and the number of times the velocity slowed down were smaller in the AD group than in the HC group, both horizontally and vertically, as well as the same in terms of writing acceleration. However, there was no significant difference between the AD and HC groups in terms of writing pressure. But we can also observe that the average pressure of the AD group (523.90 level) was slightly greater than that of the HC group (521.51 level). This finding is consistent with the findings of Zhao et al.37,38, 37,38 However, it has also been shown that writing pressure decreases rather than increases in AD patients. 39 Overall, the main characteristics of AD patients in writing are increased thinking time, slower writing velocity, as well as writing errors and increased number of strokes. The present study obtained an accuracy of 91.94% on handwriting features, which was of high value as an aid to screening. In addition, to evaluate the effectiveness of our proposed method, we also conducted experiments using the DARWIN dataset. 40 The DARWIN dataset includes data from 174 participants. Each participant completed 25 writing tasks. After experiments, we achieved the highest accuracy of 83.99% on task 23, while the highest accuracy in the original paper was 82.00%, which also proved that our method is applicable on public datasets. The specific experimental results are shown in Table 8.
Comparative experiments results with DARWIN dataset
In addition, compared with neuropsychological tests, which are highly specialized in content and require a high level of screening personnel, the dot-matrix digital pens used in the present study to collect handwriting data are inexpensive and easy to use. So that the present technology can be applied to grass-roots geriatric institutions with scarce healthcare resources to achieve group screening and management of the elderly.
Gait involves the cognitive domains of complex attention, executive function, and perceptual-motor function. The experimental results of this study found significant differences in only 4 out of 32 gait features. This study can find that the AD group had a greater cycle time differential in walking than the HC group. Cycle time differential refers to the difference in time spent on two consecutive walking cycles of the subjects. Under normal conditions, the gait cycles of the left and right feet are close to each other, and if there is gait abnormality or asymmetry, it may lead to different gait cycles of the left and right feet. Therefore, it can conclude that AD patients show asymmetry of the right and left feet and gait rhythm disorder when walking, which may be due to cognitive impairment and imbalance of bilateral muscle strength. This finding is consistent with the previous study. 41 There were also significant differences in the two features of step length standard deviation and stride length standard deviation. It can observe that the AD group’s step length and stride length standard deviation were greater than the HC group. These two features refer to the standard deviation of the step length/stride length of the subjects during walking and are used to measure the degree of change in the step length/stride length of the subjects. Therefore, it can conclude that the AD patients show gait instability during walking. The large variation in stride length also reflects the asymmetry of stride length between the right and left feet of AD patients, which may be due to factors such as lesions in certain parts of the brain and neuronal damage.42,43, 42,43 In addition, there was a significant difference in the standard deviation of the base of support. We can find that the support base standard deviation in the AD group was larger than that in the HC group. The support base refers to the horizontal distance between the two feet and the center point, which is also known as the step width. Therefore, it can conclude that AD patients have greater variability in step width and experience problems such as gait instability and decreased movement coordination when walking. This conclusion is also consistent with previous study. 44
Many studies have reported that gait in AD patients is characterized by slowed gait velocity, shortened stride length, etc.23–25 However, in this study, the two features were not significant. But the overall mean velocity and mean stride length of the AD group were smaller than those of the HC group. Overall, this study found that cycle time differential, step length/stride length standard deviation, and support base standard deviation increased in AD patients during normal walking. Also, walking velocity and stride length were smaller in AD patients than in normal elderly.
Handwriting and gait cover different cognitive domains. Better classification performance can be obtained by analyzing these two modalities together. This study can be applied to the community, nursing homes, and other grassroots geriatric organizations to achieve group screening and management of the elderly. However, this study also has some limitations. First, this study only included two modalities: handwriting and gait. In the future, other information about the subjects can be collected, such as data that is commonly available on smart wristbands or watches. In addition, clinical data, eye movements, EEG, and other information can also be included in the study.45,46, 45,46 Second, this study divided the AD and HC groups controlling only for variables such as age, education, habitual hand, and diseases that can affect writing and walking. Future studies could control for variables such as writing habits and exercise habits of the elderly to improve the accuracy of the sample. In addition, many studies introduce dual tasks47–49 (e.g., counting down numbers while walking normally) when studying gait changes in AD patients. This method can better reflect the cognitive impairment of the patients, and then better classification results can be harvested. This practice can be adopted in the future, which will help the study of gait in AD patients.
In addition, in this study, the feature subsets with significant differences in Table 5 were input into each classification model, but the classification effect obtained was not as good as the optimal feature subset derived from the SFS algorithm. After analyzing, there may be several reasons. First, the statistical test may only consider the significance of individual features without considering the correlation or interaction between features, resulting in missing some important features. Second, the feature selection algorithm starts with the contribution of features to the model, which can find the optimal feature subset in a more comprehensive search space, not only relying on the significance of features.
Conclusion
This study proposed an assisted screening framework for AD based on multimodal analysis of handwriting and gait, 38 AD patients and 52 healthy controls were included. Handwriting data of the subjects were collected using dot-matrix digital pens and dot-matrix papers. Gait data were collected using the GAITRite walkway. Differences in handwriting and gait features between the AD and HC groups were analyzed using statistical methods. In addition, five classification models (LR, SVM, XGBoost, Adaboost, LightGBM) were trained on handwriting features, gait features, and fusion features using 10-fold cross-validation iterated 20 times. The experimental results showed that the fusion features obtained the highest test accuracy of 96.17% on LightGBM, with an AUC value of 0.991. It demonstrated that our proposed assisted screening framework can effectively classify AD patients and HC. This study also found that the classification effect of combining two modalities was higher than that of a single modality, proving the classification superiority of multimodal feature fusion. This study can be applied in communities, nursing homes, and other primary geriatric institutions to achieve group-based screening and management of the elderly and to provide ideas for solving the multimodal problem of geriatric diseases.
AUTHOR CONTRIBUTIONS
Hengnian Qi (Conceptualization; Project administration; Writing – review & editing); Xiaorong Zhu (Data curation; Methodology; Writing – original draft); Yinxia Ren (Data curation; Methodology); Xiaoya Zhang (Data curation); Qizhe Tang (Resources; Software); Chu Zhang (Writing – review & editing); Qing Lang (Funding acquisition; Project administration); Lina Wang (Conceptualization; Data curation; Resources).
Footnotes
ACKNOWLEDGMENTS
We are very grateful to the Huzhou Rehabilitation Hospital for its help in the data collection.
FUNDING
This work was supported by the Chaomi S & T Company Cooperation Project (No. HK16003), the National Natural Science Foundation of China (No. 72174061), the Scientific Research Fund of Zhejiang Provincial Education Department (No. Y202250185), China Scholarship Council foundation (No. 202308330251).
CONFLICT OF INTEREST
The authors have no conflict of interest to report.
DATA AVAILABILITY
The data supporting the findings of this study are available on request from the corresponding author. The data are not publicly available due to privacy or ethical restrictions.
