Abstract
Background
Huntington's disease is a rare, progressive, neurodegenerative disease. Capturing symptomatic progression and treatment effects reliably in clinical therapeutic trials has shown to be a challenging task, facing the problem of small cohorts and a variable phenotype. Hence, robust and sensitive outcome measures are needed, to assess efficacy and safety of novel therapies in clinical studies with small cohorts.
Objective
Objectives of this study were to assess feasibility, discriminative potential, and correlation to clinical and imaging endpoints of the Q-Motor isometric force matching task with visual feedback. Furthermore, a statistical comparison with the Q-Motor grasping and lifting task should be assessed.
Methods
220 huntingtin gene expansion carriers (HGEC) and 110 non-huntingtin gene expansion carrier (Non-HGEC) participants of the observational TRACK-HD study completed the Q-Motor force matching assessment, along with a standard battery of clinical tests (UHDRS) and MRI assessments. During the Q-Motor force matching assessments, patients were reproducing a target force in precision grip with help of visual feedback. Q-Motor utilizes a highly sensitive force transducer to record force feedback. HGEC participants were categorized into four groups by a CAG- and age-based survival score.
Results
Q-Motor force matching allowed for very good discrimination between HGEC and Non-HGEC participant groups (p < 0.001 for all but least affected HGEC group) and between different HGEC groups (all p < 0.05). Strong correlations with UHDRS scores (TMS = −0.708, TFC = −0.638), CAG-Age product scores (Survival Score = 0.626) and imaging outcomes (caudate volume = −0.609, striatum volume = −0.624) were observed. A statistical difference between correlation strength of Q-Motor force matching and Q-Motor grasping and lifting tasks could not be observed.
Conclusions
Cross-sectional analysis of Q-Motor force matching showed promising results, outperforming clinical rating scales in sensitivity. Further efforts are required to assess longitudinal robustness of the task, and to further explore its potential of capturing cognitive effects by increasing cognitive load during the task.
Keywords
Introduction
Huntington's disease (HD) is a progressive, neurodegenerative, autosomal dominant CAG-repeat disease. The disease pattern involves disruption and atrophy of white matter particularly in the vicinity of the basal ganglia and cortical regions. 1 The basal ganglia and the basal ganglia-cortical loops are suspected to play a central role in many processes that are known to be impaired in persons with HD (PwHD). They include reactive control and timing control, as well as sequential hand movements and movement planning, likely caused by reduction in somatosensory input to cortex, and decreased facilitation of cortical neurons.1–3
Currently, there is no effective therapy to slow down the progression of HD. However, there are some promising therapeutic approaches reducing the supposedly toxic mutant Huntingtin protein (mHTT) or targeting other neuroprotective effects.4,5 As recent studies have shown, there is a strong need for quantitative, objective measures, to reliably and sensitively assess the efficacy and safety of such approaches. This is concluded from repeated observations showing that clinical rating scales suffer from high inter and intra rater variability and appear to be vulnerable to patient and rater induced placebo effects, which mask potential treatment effects.6–8
One quantitative measure frequently used in HD clinical trials is the quantitative motor assessment battery Q-Motor. 9 The Grasping and Lifting task of the Q-Motor battery, revealed increased grip force variability that correlated with HD severity and progression. The measure allows for differentiation between gene-expansion carriers and Non-HGEC controls up to a decade before clinical disease onset in the clinically pre-symptomatic/premanifest stages. 10 The isometric force matching Q-Motor task investigated in this study was designed to specifically assess grip force control with visual feedback and under well-defined conditions, i.e., with the intention to reduce the impact of chorea and motor impersistence regularly observed in the Grasping and Lifting task investigated previously.11,12 Isometric force matching has already been shown to be a sensitive tool to measure motor performance in HD in a small pilot study, as well as in other movement disorders or diseases that involve motor impersistence.13–15
The data presented here was generated in the TRACK-HD study, a multicenter observational study, that was conducted in four clinical sites in London (UK), Paris (France), Leiden (Netherlands) and Vancouver (Canada) between 2008 and 2011. The study was designed as adoptive study with a baseline visit in year 1, and 3 years of annual follow-up visits. The Q-Motor isometric force matching tasks were introduced in the course of the study and conducted in 220 individuals that were huntingtin gene expansion carriers (HGEC) and 110 control participants that were non-huntingtin gene expansion carriers (Non-HGEC). The objective of this work was to investigate the feasibility, sensitivity and applicability of the Q-Motor force matching task in clinical studies in Huntington's disease and to explore its possible usefulness as quantitative outcome measures.
We specifically aimed to investigate the hypotheses whether isometric force matching with visual feedback (1) allows for discrimination between (Non-HGEC) and (HGEC) groups; (2) correlates with core HD clinical endpoints and/or imaging measures and (3) delivers more sensitive data about fine motor force control of participants than the Q-Motor Grasping and Lifting task.
Materials and methods
The TRACK-HD study
The study was approved by the local ethics committees; all enrolled participants gave written informed consent. Q-Motor tests were conducted by trained and certified Q-Motor raters. TRACK-HD was designed as an adoptive biomarker study; novel tests were introduced at annual visits following steering committee recommendations. During the visits 3 and 4 of TRACK-HD, the Q-Motor Force Matching task was introduced as part of the assessment battery driven by exploratory data, published later. 13 Therefore in this work, only TRACK-HD year 3 data was used for the cross-sectional analyses of Force Matching data. Basic demographics of the study population at year 3 are listed in Table 1.
Basic demographic data of the study population at baseline (year 3 of the TRACK-HD study), grouped by the traditional staging model of HD (cont = controls (non-HGEC), preHD = premanifest HD, HD = symptomatic HD).
Participant grouping
The TRACK-HD study enrolled 366 participants, with 332 participants attending the 24 moths visit (year 3) of the study. Two hundred and twenty individuals carrying the mutant huntingtin gene (mHTT) and 110 control participants without the gene mutation (Non-HGEC) completed the Q-Motor Force Matching assessment in year 3. HGEC groups were stratified by the Unified Huntington's disease Rating Scale (UHDRS) diagnostic confidence score into premanifest and manifest groups of matched population sizes. 10 The premanifest group was further divided into two subgroups preHD A and preHD B, by the median of the individuals’ predicted years to HD diagnosis, with preHD B being closer to a predicted clinical onset of HD than preHD A. The manifest group was divided into the subgroups HD stage 1 and HD stage 2, based on the UHDRS Total Functional Capacity score. HD stage 1 was less impaired than HD stage 2.
First assessment of isometric Force Matching started in year 3 of the TRACK-HD study. Considering this visit as baseline for our analyses, we found that the stratification at baseline no longer matched the original criteria, as the disease had progressed. Re-grouping the participants using the same stratification on the other hand resulted in rather unbalanced subgroup population sizes. Thus, we divided participants in subgroups solely based on the survival model, presented in Langbehn et al.; a parametric model that predicts the probability of surviving until at least the given age without developing neurological symptoms. 16 We applied the model to all HGEC participants, regardless whether they were clinically symptomatic or not. Since the model takes age and CAG repeat length as its sole input parameters, it makes the grouping strategy independent from clinical ratings and thus from subjective bias.
In lack of common cutoffs for this grouping strategy, and based on the originally used sub-groups pre-HD 1 and 2 and manifest HD stage 1 and 2, we chose cutoffs at the quartiles of the survival score to subdivide HGEC participants into four groups from
These grouping criteria are un-biased from the physicians’ assessments of diagnostic confidence and the UHDRS sub-scores, and independent from the population sample that is here analyzed. Furthermore, they result in a more homogeneous distribution across the sub-groups in our sample than the original grouping criteria used at baseline of TRACK-HD. Analogue to Table 1, Table 2 shows the demographic distribution per group as stratified by survival score.
Basic demographic data of the study population at baseline (year 3 of the TRACK-HD study), distributed in four HGEC groups (HGEC1, HGEC2, HGEC3, HGEC4) by survival score, plus non-HGEC control group.
In the analyses we included Q-Motor data correlations with the survival score and with the CAP (CAG-Age product) score, since CAP scores are frequently used tools in HD clinical research e.g., for patient stratification. The CAP score as used for our analyses is computed as
Q-Motor force matching
The quantitative motor (Q-Motor) battery was designed to supplement the UHDRS Total Motor Score (UHDRS-TMS), the clinical gold standard to assess motor performance and progression in HD, with objective data. 9 Q-Motor captures and tracks movement characteristics that are known to be affected in HD, including involuntary movements (chorea), bradykinesia, increased motor variability and impairments in motor persistence (i.e., motor impersistency). Movement characteristics of tapping and isometric force tasks are detected using a pre-calibrated and temperature-controlled six-axis force transducer (ATI Force/Torque SI-20-1, ATI Industrial Automation, Apex, NC) measures with a sampling frequency of ∼350 Hz at a precision of 0.01 N in z-direction. 17
Q-Motor Force Matching was an isometric force persistence task using a device specifically designed to assess motor impersistency (Figure 1). The participant was seated comfortably in front of the device. The force transducer was mounted to the device, which was firmly attached on the table in front of a monitor, that displayed a target force (orange line) and the current force applied on the force transducer (blue line).

A: Task set-up. The participant is seated in front of a screen; the grasping device is mounted fixed on the table. B: The device with attached force transducer is grasped in precision grip, using the thumb and index finger. C: The monitor displays the target force (orange line) and currently applied force (blue line).
When the examiner started the task, an acoustic signal was audible. The participant was instructed to grasp the force transducer in the precision grip with thumb and index finger of the assessed hand, and to match the applied force (blue line on the display) with the target force (orange line on the display) as good as possible. After 20 s another acoustic signal marked the end of the trial.
The task has been performed with two target forces, low (2 N) and high (10 N), and was recorded first with the right and then with the left hand. Each task comprised six consecutive trials including an initial practice trial that was not evaluated. All data was recorded using the software WinSC (Department of Physiology, University of Umeå, Sweden).
Definitions of Q-motor measures applied
Static phase
Participants grasp the device after they hear the start sound and first forces are recorded until the trial ends signaled by an audio cue played 20 s after the start. The “static phase” is defined as the 15 s interval from second 5–20 aiming to assess performance of participants after being able to adjust applied forces to the target force and reaching a fairly steady-state or stationary force level.
Force
The ATI-Mini40 force transducer applied is a six-axis force sensor measuring forces in three dimensions (x, y, z) as well as corresponding shearing forces. The z-force channel denotes the force applied perpendicular to the grasping surface of the sensor and is the force channel used and analysed in this study. It is referred to as “force” hereafter.
Force deviation
“Force deviation” is defined as the difference between measured force and displayed target force level.
Force variability
“Force variability” is defined as the coefficient of variation of the static phase force, calculated as standard deviation of the force divided by the mean force level measured during the static phase (see above).
Force deviation variability
Analogue to force variability, the “force deviation variability” is defined as coefficient of variation of the force deviation.
Controls
Group of non gene-expanded (Non-HGEC) participants included in the study.
HGEC1
Group of gene-expanded participants with a survival score higher than 0.75 ((0.75, 1]) that has a low risk of showing symptoms of HD.
HGEC2
Group of gene-expanded participants with a survival score between 0.5 and 0.75 ((0.5, 0.75]) that has an increased risk of showing symptoms of HD.
HGEC3
Group of gene-expanded participants with a survival score between 0.25 and 0.5 ((0.25, 0.5]) that has a high risk of showing symptoms of HD.
HGEC4
Group of gene-expanded participants with a survival score lower or equal 0.25 ((0, 0.25]) that has the highest risk of showing symptoms of HD.
Statistics
Statistical analyses were conducted with R version 4.1.2, using the tidyverse framework.18,19 If not explicitly noted otherwise, R-core packages have been used for computations. We considered a type I error of
To identify the set of independent Force Matching outcome variables with the highest sensitivity for HD progression, we applied a random forest model using the party package R function
For group comparisons, the Wilcox Rank Sum test has been used to determine group differences. Visualization of group comparisons was done using notched box plots, with boxes that denote the inter quartile range (IQR) of the data sample, and whiskers that reach to the highest and lowest value within a
Correlation to clinical measures and brain imaging results were computed using Spearman's
To determine and separate the influence of age, sex, education and study site, we computed a linear model using the formula
Comparisons of correlation coefficients were done using the WRS package. 22 We followed Wilcox et al., using a percentile bootstrap approach with B = 500 bootstrap samples, to investigate whether differences in correlation coefficients between Q-Motor Grasping and Lifting task and Q-Motor Force Matching task can be considered statistically significant. 23
Results
Sample demographics
Year 3 of the TRACK-HD study was completed by 110 Non-HGEC participants, 101 pre-symptomatic and 119 symptomatic HGEC participants (see Table 1), as enrolled for year 1 of the study. For three subjects of the symptomatic group, no valid data was available for the low target force assessment, while 4 data points of symptomatic HGECs were missing for the high target force matching data.
While demographic characteristics were generally well balanced, the group ages differed between groups; the pre-symptomatic HGEC group was significantly younger than Non-HGEC controls (p < 0.001) and symptomatic HGECs (p < 0.001).
Group comparisons
Figure 2 shows boxplots of two sample parameters of the matching task (coefficient of variation of the applied force (force variability) and absolute deviation of residual force (force deviation variability)) plotted against the classical staging model groups (top) and the groups that resulted from the survival scores (bottom). While the performance distribution of participants from less to more affected participants followed a sigmoid shape in the staging groups, it was almost linear when looking at survival score groups.

Comparison of force matching variables performance in different grouping strategies. Top: Force matching variability and force deviation variability of the left hand grouped by classical staging model. Bottom: Force matching variability and force deviation variability of the left hand grouped into four equidistant survival score subgroups. The sample sizes of subgroups using the survival score strategy are more balanced and the relationship with the Q-Motor variables was more linear, compared to the classical staging model.
With the exception of the least affected HGEC group (HGEC1), all HGEC subgroups differed significantly from the Non-HGEC control group (p < 0.001). In the survival score grouping model, also the four HGEC groups were differentiated from each other (HGEC1 ∼ HGEC2 p < 0.001; HGEC2 ∼ HGEC3 p < 0.05; HGEC3 ∼ HGEC4 p < 0.001), using Wilcox Rank Sum tests.
In the following analysis, group comparisons of force matching performance are reported for the survival score groups, only.
Figure 3 shows the un-adjusted p-values with 95% confidence intervals from comparisons of HGEC groups against Non-HGEC group for all variables and test paradigms, using Wilcox Rank Sum tests. In all categories, the force coefficient of variation showed the best performance in terms of differentiation from controls.

Unadjusted estimates and 95% confidence intervals from Wilcox Rank Sum tests, comparing the control group to the individual survival score subgroups for each test paradigm (high and low forces, right and left hand). The subgroup with the lowest risk to show symptoms soon (HGEC1) is the only one that did not significantly differ from controls in every test.
A detailed table of the un-adjusted estimates, confidence intervals and p-values is presented in the Supplementary Material (see Table 6).
Variables of importance and correlations
While the force coefficient of variation qualitatively showed the best performance of extracted matching task features, we decided to verify this impression with a statistical approach, using a random forest model under consideration of correlations among the predictors.
The importance distribution was similar in all test cases, Table 3 shows the mean results, ordered from highest to lowest.
Mean variable importance, conditionally determined using a random forest model over each test paradigm.
For each test paradigm, the force coefficient of variation (force matching variability) turned out to be the most influential and important variable. In line with this finding, the force coefficient of variation also yielded the overall highest correlation with clinical scores and imaging results (see Appendix Table 7).
Correlations and linear regressions
All correlation coefficients between Q-Motor variables and clinical scores or imaging outcome were performed using Spearman's
Highest correlation was observed between force variability (low target force, left hand) and the UHDRS-TMS (
A detailed correlation table of all variables of interest can be found in the Supplementary Material (see Figure 6 and Table 7).
In Figure 4, correlations between the best performing force matching coefficient of variation left hand (force variability) and various outcome measures of the Track-HD study were plotted, with 95% confidence interval ellipses in ordinal scale data and linear regression lines in interval scale data for individual groups.

Top: scatter plot with 95% confidence interval ellipses, showing force matching coefficient of variation of the left hand (force variability) versus UHDRS total functional capacity (TFC) and Total Motor Score (TMS). Bottom: Scatter plots with linear regression lines and standard error of Spearman correlations between force variability versus Single Digit Modality Test correct score (SDMT) and striatum volume change normalized to intra cranial volume (Striatum / ICV).
Strongest correlations were observed with the clinical TMS, ranging from
Correlations for the CAP and Survival Score (Figure 5) were not computed per group, since both variables, as well as the grouping strategy, were based on age and CAG as factors. Both showed strong correlations (CAP:

Linear regression with standard error of spearman correlations between force matching coefficient of variation of the left hand (force variability) versus the disease burden score (CAP) and survival score.
Linear model
Except for the least affected gene-expanded group HGEC1, all groups of HGECs showed significantly more variance in the Force Matching task (
Coefficient table of the linear model. With respect to further factors like age, sex, education and study site, the least affected HGEC group HGEC1 does not show significant performance differences compared to controls, while the difference in all other groups reach highly significant levels.
Comparison to Q-motor lifting task performance
When comparing Q-Motor Force Matching results with grasping force variability of the Q-Motor Lifting task, we found higher correlation coefficients for Force Matching variability for all computed correlations (Table 5).
Comparison of correlation coefficients. Spearman's
However, comparison of correlation coefficients using the bootstrapping approach as suggested by Wilcox et al. showed no statistically significant differences between Force Matching and Lifting correlation coefficients. 23
Discussion
We found that assessing the Force Matching task in the multicenter TRACK-HD study was feasible with a very high level of data availability and -quality.
Grasping and holding an object is a task we perform in everyday life, e.g., when eating and drinking, brushing our teeth or holding a pen or smartphone. Impairment of grip force control has a direct impact on these daily activities and as such on quality of life. The measures defined and reported are therefore aimed at assessing functionally relevant tasks.
One limitation of this study were unbalanced groups when looking at classical HD categories (pre-manifest, stage 1, stage 2, etc), induced by the late implementation of the Force Matching task into TRACK-HD in year 3. Furthermore, being an observational study, neither participants nor physicians and raters were blinded for the participants’ genetic status, which may have introduced a bias in the clinical rating scales the HD categories depend on.
We addressed these limitations, using an alternative approach to HD categories, using the Langbehn Survival Score which is in nature similar to the CAG-Age product (CAP) scores and does not rely on qualitative rating scales. 16 Another possibility of categorizing participants could have been the HD Integrated Staging System (HD-ISS), a framework that has been developed to become the standard tool e.g., for cohort stratification in HD clinical research. 24 However, in lack of some variables required for correct computation of HD-ISS stages, a reliable comparison with our data was not possible. Particularly the volumetric ratios of putamen and caudate to intracranial brain volume have to be computed using a specified imaging processing pipeline, to receive robust landmarks for HD-ISS stage 0 and 1. The Independence Scale which serves as landmark for stage 2 and 3 was not included in the TRACK-HD protocol at all.
Force Matching allowed for very good discrimination between HGEC groups and Non-HGEC group, particularly for HGEC groups with a Survival Score of
In contrast to earlier findings in a smaller cohort (see Medzech et al.), the correlations were evident not only in 2 N target force matching but also with the 10 N target force. 13 Still, the lower target force differentiated more sensitively between Non-HGEC and HGEC groups, particularly in early stages of the disease, while high forces tended to be harder to match for more affected participants.
Grasping force variability showed strong and highly significant correlations with clinical rating scales such as the UHDRS-TMS and TFS, with CAG-Age product scores like CAP and the Survival Score and also with imaging variables, particularly with caudate and striatum volumes.
The Force Matching assessment was designed to assess aspects of grasping force control and -variability in a more defined environment, compared to the already available Q-Motor Grasping and Lifting task, which does record grasping force variability but does not include a visual feedback or the premise to match a target force.
As both tasks have been assessed in the TRACK-HD study, we could compare the performance in correlations using a percentile bootstrap approach. All correlation coefficients were increased in the Force Matching task, however, significant differences could only be observed in correlations with the non-normalized striatum and putamen volumes. Still, in summary the results suggest an increase in sensitivity when using the Force Matching task to assess grasping force.
Our findings suggest, that sensitivity for a plain force matching task may suffer from a floor effect in patient populations with little to no motor symptoms. While the task may be a useful marker of disease progression in trials of symptomatic therapies, pre-symptomatic trials are gaining popularity in pharmaceutical HD research. However, the visual feedback design of the presented task yields the potential of adding additional motor and cognitive challenges to the assessments e.g., through variable target patterns or in dual-task designs. It could potentially be used to answer further questions, regarding fine motor control, force endurance and fatigue, and shifting sensitivity towards less motor symptomatic populations.
Further analyses of longitudinal performance will be necessary to assess the question whether force matching can still be a sensitive marker for subtle performance changes also in pre-manifest HGECs. In combination with the robust and sensitive hardware and the high degree of standardization of the task, careful tuning of task paradigms may enable detection of smallest performance differences.
Quantitative digital assessments like the one presented here potentially can increase effect sizes while reducing population sizes in clinical trials, resulting in more accurate and cost effective trials and less burden for participants, compared to categorical, qualitative scales. We have previously observed that compared to categorical clinical rating scales, quantitative Q-Motor data was free of patient- and rater bias, yielding placebo-free results.6–8 Several Q-Motor measures were more sensitive than clinical scales in detecting signals early and in treatment studies.6–8,10,25,26 The data observed in this study suggest that the Q-Motor Force Matching task investigated here is a candidate for further exploration as part of the Q-Motor assessment battery in future studies.
Supplemental Material
sj-docx-1-hun-10.1177_18796397251386987 - Supplemental material for Isometric force matching in the TRACK-HD study - a novel quantitative assessment for clinical applications?
Supplemental material, sj-docx-1-hun-10.1177_18796397251386987 for Isometric force matching in the TRACK-HD study - a novel quantitative assessment for clinical applications? by Robin Schubert, Pascal Barallon, Benjamin Habbel, Michael Deppe and Ralf Reilmann in Journal of Huntington's Disease
Footnotes
Acknowledgments
We acknowledge the contribution of the TRACK-HD study team and sites in Leiden, London, Paris & Vancouver. TRACK-HD including the development of Q-Motor tasks at GHI was funded by the CHDI-Foundation. Q-Motor tasks can be made available for the research community in collaborations and sponsored projects. The Isometric Force Matching task is protected by patent DE102012018124A1 (QuantiMedis GmbH. Isometrische finger-greifkraft-ziel-messvorrichtung. DE102012018124A1, 2012.
(accessed 21 February 2025)).
Ethical approval and informed consent statements
The study was approved by the local ethics committees. All enrolled participants gave written informed consent.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The George-Huntington-Institute received funding for Q-Motor assessments and data analyses in the TRACK-HD study.
Declaration of conflicting interests
Robin Schubert, Benjamin Habbel and Pascal Barallon are employees of the George-Huntington-Institute and QuantiMedis, both developing and distributing the Q-Motor system for clinical studies and research.
Ralf Reilmann is founder and CEO of the George-Huntington-Institute and QuantiMedis.
Data availability
Artificial intelligence policy
No artificial intelligence has been used for generation or enhancement of content of any kind in any part of this manuscript.
Supplemental material
Supplemental material for this article is available online.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
