Abstract
Background:
Gray matter (GM) atrophy in the striatum and across the brain is a consistently reported feature of the Huntington Disease (HD) prodrome. More recently, widespread prodromal white matter (WM) degradation has also been detected. However, longitudinal WM studies are limited and conflicting, and most analyses comparing WM and clinical functioning have also been cross-sectional.
Objective:
We simultaneously assessed changes in WM and cognitive and motor functioning at various prodromal HD stages.
Methods:
Data from 1,336 (1,047 prodromal, 289 control) PREDICT-HD participants were analyzed (3,700 sessions). MRI images were used to create GM, WM, and cerebrospinal fluid probability maps. Using source-based morphometry, independent component analysis was applied to WM probability maps to extract covarying spatial patterns and their subject profiles. WM profiles were analyzed in two sets of linear mixed model (LMM) analyses: one to compare WM profiles across groups cross-sectionally and longitudinally, and one to concurrently compare WM profiles and clinical variables cross-sectionally and longitudinally within each group.
Results:
Findings illustrate widespread prodromal changes in GM-adjacent-WM, with premotor, supplementary motor, middle frontal and striatal changes early in the prodrome that subsequently extend sub-gyrally with progression. Motor functioning agreed most with WM until the near-onset prodromal stage, when Stroop interference was the best WM indicator. Across groups, Trail-Making Test part A outperformed other cognitive variables in its similarity to WM, particularly cross-sectionally.
Conclusions:
Results suggest that distinct regions coincide with cognitive compared to motor functioning. Furthermore, at different prodromal stages, distinct regions appear to align best with clinical functioning. Thus, the informativeness of clinical measures may vary according to the type of data available (cross-sectional or longitudinal) as well as age and CAG-number.
Keywords
INTRODUCTION
Huntington disease (HD) is a progressive, delayed-onset disorder primarily associated with involuntary motion (chorea) and also characterized by cognitive alterations spanning executive functioning (including task-switching, response inhibition, and working memory) [1–3], odorant recognition [4], and emotional processing [5, 6]. HD-associated disturbances also extend beyond the central nervous system, affecting cardiac, respiratory, and sleep systems.
HD is a rare condition, affecting around 1 in 7,300 individuals in the United States (global prominence is variable) [7]. However, as both a proteinopathy and the most common PolyQ disease (Q is the symbol for glutamine, the amino acid encoded by the CAG codon), it shares key features with disorders that are more difficult to disentangle due to increased genetic complexity. Proteinopathies, including HD, Alzheimer’s disease, and Parkinson’s disease, are distinguished by regionally-selective neuronal death (brain structural alterations are widespread in HD but concentrated in medium spiny striatal neurons) and protein misfolding (manifesting as expanded huntingtin in HD, Lewy bodies in Parkinson’s disease, and β-amyloid plaques in Alzheimer’s disease). These conditions typically demonstrate delayed onset and disproportionately affect the elderly [8].
There is no known cure for HD, though its cause has been known since the nascence of human genome sequencing. In 1993, an abnormally expanded cytosine-adenine-guanine (CAG) repeat at an exon 1 locus of the chromosome 4 HTT gene was identified as causal for HD [9]. The normal range of CAG-expansion at this locus is 17–20 [10]; repeat-numbers of 36 and above confer highly elevated risk of HD, with higher CAG-numbers being associated with earlier onset and more rapid symptom progression [11]. In individuals with abnormally-expanded CAG, brain structural and cognitive differences from non-expanded individuals are measurable a decade or more before HD diagnosis, during a period known as the prodrome. The relationship between CAG and onset makes it possible to index the prodrome using CAG-number and age [12]; many researchers have leveraged this fact to identify the earliest prodromal changes as potential targets for HD interventions. Because HD typically manifests in mid-to-late life, treatments that further delay symptoms could prevent disease manifestation in many individuals.
PREDICT-HD is among the most extensive research groups to undertake characterization of the prodrome; this multi-site study has collected genomic, brain imaging, and clinical (motor and cognitive) data from 33 sites in the United States, Canada, Australia, and Europe [13]. PREDICT-HD has identified many early prodromal indicators, including widespread gray matter (GM) concentration reductions [14], longitudinal motor, cognitive, and volumetric changes (in the putamen, caudate, and nucleus accumbens) [15] as well as clinically-associated subcortical brain volume differences [2, 17], and resting state functional connectivity alterations [18].
Brain structure in prodromal HD
GM atrophy, particularly in the basal ganglia, is a long-established symptom of the HD prodrome. More recently, prodromal white matter (WM) deterioration around the striatum [19–25] and across the brain [16, 26–29] has also been reported. Cerebral WM can offer independent power for predicting diagnosis proximity in prodromal HD [27]. Prodromal WM changes may also precede those observed in GM, although reports have differed.
To date, most prodromal HD WM studies have examined diffusion magnetic resonance imaging (dMRI) measures such as fractional anisotropy (FA), mean diffusivity (MD), radial diffusivity (RD), and axial diffusivity (AD) to compare WM integrity between prodromal and control participants as well as among prodromal groups with varying estimated proximities to diagnosis. This work has generally demonstrated prodromal anisotropy reductions and diffusivity increases in WM adjacent to the striatum [22, 30–32] as well as in the thalamus [22, 31–34], corpus callosum [22, 35], and parietal and occipital areas [22, 35]. Frontal regions are also common sites of reported prodromal diffusivity differences [22, 34–36]. In one of the only studies probing GM-adjacent-WM, the effect sizes for atrophy increases were more significant in GM-adjacent-WM than whole WM [22].
Longitudinal findings have been more limited and conflicting. The largest longitudinal diffusion-weighted imaging (DWI) study to date (N = 191 pre-HD and 70 control participants) reported cross-sectional and longitudinal differences that were consistent with degeneration in WM tracts connecting the striatum to premotor, primary motor, and sensory cortices, with the most prominent longitudinal effects observed in putamen-connected tracts [25]. Another two-year longitudinal WM diffusivity study of 64 prodromal participants reported differences in longitudinal diffusivity that were restricted to the superior fronto-occipital fasciculus [33]. An additional two-year study that included 211 prodromal individuals found increased rates of atrophy in cerebral WM that were most pronounced in the frontal lobe [37]. Notably, no changes were observed in lobe-specific or cortical GM volumes, and disease-related whole-brain WM atrophy exceeded striatal atrophy when age-related atrophy was included as a statistical nuisance variable. Other studies, however, did not report any changes over a 12- or 30-month period [26, 39].
Prodromal WM and clinical functioning
Most prodromal studies linking WM to motor and cognitive functioning have also been cross-sectional, associating volumetric and FA reductions as well as MD, RD, and AD increases with clinical impairment [16, 41] (Table 1). In the HD prodrome, correlations between WM and clinical measures have indicated widespread regional involvement in cognitive and motor functioning. Few studies have compared WM and clinical change trajectories concurrently. In one such study, motor symptom increases were associated with diffusivity changes in the superior fronto-occipital fasciculus as well as volumetric changes in the corpus callosum, cerebrospinal fluid, and lateral ventricle [33]. In another longitudinal study, the Unified Huntington Disease Rating Scale Total Motor Score (UHDRS-TMS) was the only independent predictor of longitudinal FA decline in both prodromal and symptomatic HD [38].
Reported Prodromal Cross-sectional WM-Clinical Associations
Previously reported associations between prodromal WM volumes/tracts and cognitive and motor functioning variables. R, right; SDMT, Symbol Digit Modalities Test; SC, Stroop Color; SW, Stroop Word; SI, Stroop Interference; TMTA, Trail-Making Test A; TMTB, Trail-Making Test B; UHDRS-TMS, Unified Huntington Disease Rating Scale Total Motor Score; RD, Radial Diffusivity; FA, Fractional Anisotropy; MD, Mean Diffusivity. All reported FA differences are reductions, and all other diffusivity differences reflect increases.
The present study
This work complements and extends the current literature regarding brain-structural and clinical changes in the HD prodrome by: investigating co-occurring regional patterns in the brain; examining cross-sectional and longitudinal WM changes; concurrently comparing change trajectories in WM and clinical functioning variables; and analyzing one of the largest prodromal HD brain imaging data sets ever collected.
To accomplish these goals, we extracted brain imaging patterns primarily consisting of GM-adjacent-WM from our population using source-based morphometry (SBM), an independent component analysis (ICA)-based method optimized for brain imaging data [42]. The ICA-derived structural patterns allowed us to examine co-occurring changes across multiple brain regions and reduce the number of required statistical tests (compared to voxel-by-voxel testing). We applied Linear Mixed Model (LMM) analyses to concurrently compare cross-sectional differences and longitudinal trajectories of these WM profiles with those of clinical (cognitive and motor) functioning measures.
Cross-sectionally, we predicted that the prodromal group closest to estimated diagnosis would most drastically differ from controls, whereas the group furthest from onset would present with the most modest differences. We also expected the most robust longitudinal changes in the near-onset group, particularly in motor functioning and in areas underlying diagnosis-associated motor symptoms, such as the striatum. Across groups, we anticipated that frontal changes would be the most similar to cognitive changes, whereas changes in the striatum, thalamus, and cortical areas important for movement (such as motor cortices) would align most with changes in motor functioning. These predictions reflect the broader underlying hypothesis that different brain structural changes accompany early prodromal cognitive changes compared to the delayed motor dysfunction that typically instigates diagnosis.
MATERIALS AND METHODS
Participants
The study data included 1,336 PREDICT-HD participants (1,047 prodromal and 289 control) who took part in at least two separate MRI sessions. A total of 3,700 sessions were analyzed. Participant demographics are presented in Table 2. Prodromal participants were stratified into Low, Medium, or High disease progression levels based on estimated time to diagnosis as indexed by CAG-age-product (CAP) score, a widely used metric for establishing clinical diagnosis likelihood that was developed based on an accelerated failure time model analysis with the entire PREDICT-HD dataset [12]. CAP was calculated using the formula CAP = (age at first MRI scan session)×(CAG– 33.6). Based on this grouping, the participant pool (at baseline) included 288 Low (CAP scores <287.16), 365 Medium (CAP: 287.16–367.12), and 394 High (CAP > 367.12) prodromal participants, with High group designation indicating the highest likelihood of a looming HD diagnosis. Because CAP-score encompasses age and CAG-repeat-number, age differences between CAP-groups were anticipated. The High group was older than the Medium and Low groups at their earliest scans, and the Medium group was older than the Low group. The Low and Medium groups were also younger than the Control group. The High group had fewer years of education than the Control and Medium groups, and the Low group had fewer years of education than the Control group. Significant CAP-group differences in gender were also observed (there were more female participants overall) but were driven by the High vs. Low contrast. These statistics are presented in Supplementary Material 1.
Participant Demographics
Participant demographics, including baseline CAG-age-product (CAP), age, gender, years of education, and the number of sessions.
All PREDICT-HD participants provided written, informed consent and were treated in accordance with protocols approved by each participating institution’s internal review board. Participants underwent genotyping before study enrollment; those with less than 36 HTT CAG-repeats were designated as controls and those with 36 or more as prodromal. Participants with any other central nervous system condition or an unstable medical or psychiatric disorder were excluded from the study.
Imaging data acquisition
High-resolution anatomical MR images were collected at 33 sites using General Electric, Philips, and Siemens scanners with 1.5 T (Tesla) or 3 T field strengths and various acquisition parameters. Each site underwent a calibration process to minimize inherent differences between site and scanner manufacturer capabilities. The intent was to collect images with similar properties at each site under the constraints of each site’s scanner capabilities. T1 images were obtained using three-dimensional (3D) T1-weighted inversion recovery turboflash (MP-RAGE) sequences. The Siemens protocol was designed to be similar to the General Electric scan parameters: GRAPPA factor, 900 ms TI (inversion time), 2,530 ms TR (relaxation time), 3.09 ms TE (excitation time), 256 mm×256 mm field of view (FoV), 10° flip angle, 240 coronal slices (1 mm thickness), 256×128 matrix with 1/4 phase FoV, and 220 Hz/pixel receiver bandwidth. 3 T scanner protocol commonly included a sagittal localizing series followed by an axial 3D volumetric spoiled gradient recalled acquisition in steady state (GRASS) sequence acquisition and the following scan parameters: ∼1 mm×1 mm×1.5 mm voxel size, 0 mm gap, 18 ms TR, 3 ms TE, 24 cm FoV, 20° flip angle, 124 slices (1.5 mm thickness), 256× 192 matrix with 3/4 phase FoV, number of excitations (NEX) = 2.
Imaging data preprocessing
T1-weighted images were processed using a Nipype pipeline optimized for efficient large-scale multicenter data processing [43] using the BRAINSTools suite (http://github.com/BRAINSia/BRAINSTools.git) and the advanced normalization tools (ANTs) package [44]. Images were spatially normalized to an anterior commissure-posterior commissure (AC– PC) and interhemispheric fissure reference orientation, then segmented into tissue types (GM, WM, and cerebrospinal fluid) using an expectation maximization (EM) and a fuzzy k-Nearest Neighbor (KNN) classification incorporating bias-field correction and image registration [45–47]. The extracted WM probability maps were used in further analyses.
Clinical variables
Seven clinical variables were selected for analyses based on their established clinical reliability and sensitivity to prodromal HD progression [1], as well as data availability (e.g., although speeded tapping and emotion recognition exhibit strong effect sizes, data were available for <1000 participants): Symbol Digit Modalities Test (SDMT), three conditions from the Stroop Color Word Test, Trail-Making Test parts A (TMTA) and B (TMTB), and the Unified Huntington Disease Rating Scale Total Motor Score (UHDRS-TMS). Table 3 includes examples of each clinical measure. The SDMT, an adaptation of the Wechsler Digit Symbol subtest, is an indicator of working memory, complex scanning, and processing speed capabilities [48, 49]; participants are tasked with matching numbers to corresponding symbols using a provided key. The Stroop Color and Word Test includes the color (SC), word (SW), and interference (SI) conditions. The color and word conditions measure basic attention, whereas the interference condition also entails inhibition of a dominant (automatic processing) response [50]. For the color condition, participants identify colors presented on stimulus cards. For the word condition, the task is to read color names that are presented in black ink. For the interference condition, participants identify the ink color in which a color-name is presented (e.g., for the word “blue” printed in green ink, the correct response is “green”). TMTA primarily measures visual attention, whereas its more complex counterpart, TMTB, also measures task-switching [51, 52]. TMTA entails sequentially connecting a series of numbered circles (e.g., 1-2-3-4). For TMTB, consecutive numbers and letters are alternately connected in ascending/alphabetical order (e.g., 1-A-2-B-3-C). The UHDRS-TMS encompasses several indicators of motor functioning, including oculomotor function, bradykinesia, chorea, dystonia, gait, and postural stability [1, 53].
Clinical Variables
Examples of each clinical variable analyzed in the present study. For SC, SW, SI, TMTA, and TMTB, correct responses are shown. For SDMT, an example key and uncompleted response sheet are displayed, and two sample questions from the UHDRS-TMS are presented.
Source-based morphometry
ICA, a blind-source separation technique, was applied to processed WM probability maps using SBM via the GIFT Matlab toolbox (http://mialab.mrn.org/software/gift) to extract maximally independent spatial maps and their subject loading parameters [42, 55]. This method was chosen because it enables the examination of co-occurring brain structural patterns across several brain regions, in addition to requiring far fewer statistical tests than voxel-by-voxel comparisons. In order to extract a spatial map for every participant at every time point, enabling longitudinal examination of structural changes in the ensuing analyses, all available imaging data (3,700 sessions) were included in the SBM analysis. The number of components representing the 3,700 images was estimated to be 45 using a minimum description length (MDL) criterion modified to account for correlated voxels [56]. In a general ICA model, X = AS: X is an observation (i.e., participant-by-variable matrix); S is a statistically independent component matrix (component-by-variable); and A is the loading coefficient or mixing matrix, the representation of each component in the participant or sample (participant-by-component). SBM is an extension of ICA optimized for brain imaging data; each image is converted into a one-dimensional vector (a participant-by-voxel data matrix) [42]. This matrix is then decomposed into a mixing matrix (participant-by-component, where components are the 45 white matter profiles) and a source matrix (component-by-voxel) using the infomax algorithm for spatial ICA. The goal of ICA is to determine the mixing matrix (or rather, its inverse, the unmixing matrix), which represents how much each participant’s brain structure resembles each maximally independent white matter pattern extracted from the population. Rows of the mixing matrix are weights (or loading coefficients) representing how much each of the 45 components/profiles contributes to one participant’s data, and columns represent how strongly a component is represented in each participant. For the source matrix, by contrast, rows indicate the degree to which each voxel is represented in a component and columns denote how much a single voxel contributes to each of the 45 components. For each participant session, this decomposition produces a loading coefficient for each component, and each component is a spatial map.
As quality control, component stability was confirmed using Icasso software [57] and the most representative decomposition was moved forward for analysis [58]. All 45 components were stable, with 41 components having minimum stabilities above 0.95 and only two components with minimum stabilities below 0.90 (components C and A had stabilities of 0.85 and 0.72, respectively).
Linear mixed model analyses
To determine mean baseline (intercept) differences and rates of longitudinal change (slopes) in the SBM WM profiles among CAP and Control groups, we employed LMMs for longitudinal data [59]. LMMs are particularly appropriate for large, complex datasets because they can furnish unbiased parameter estimates even when some data is missing, as long as the data is missing at random (MAR). MAR describes variables with missing datapoints, for which the missing information can be accounted for by complete information at other datapoints. Maximum likelihood estimation (MLE) can be used to provide unbiased parameter estimates of the mean and variance of the MAR variable under assumptions of normality. We employed the MLE method to estimate random effects and random error [59].
For each WM profile/component (outcome variable), two models were fit and compared via a likelihood ratio test to assess the omnibus null hypothesis of no intercept or slope group differences: (1) a full model, inclusive of CAP-group-specific intercepts and slopes; (2) a reduced model that lacked CAP-specific effects, in which each group shared the same slope and intercept. Any component for which the two models were not significantly different was excluded from further analyses and results.
Both LMMs included a time variable representing years of study participation (with 0 denoting study entry), as well as covariates for gender, education (years), and CAP-group. Because the study was longitudinal, random intercepts for each participant were included in the model to account for correlations due to repeated measurements. Because data from several collection sites and scanners were included in the study, random site intercepts were also added to account for correlations due to common scanner or site-specific features. General linear model hypothesis testing was used to compare component intercepts and slopes among groups.
Stacked linear mixed model analyses
To compare longitudinal trajectories of the SBM WM profiles to those of clinical (cognitive and motor) functioning measures, we used what we hereafter refer to as a “stacked” LMM regression analysis, which permitted simultaneous analysis of the brain structural and clinical data. A detailed description of the statistical approach is provided in the supplemental materials of Shaffer et al., 2017 [25]. Concurrent testing of these variables enabled us to address hypotheses involving direct comparisons of clinical functioning and brain structural integrity, both at baseline and over time across groups. For example, we tested the hypothesis that motor functioning in the prodromal group furthest from estimated HD-onset deteriorates at a rate (and in a direction) similar to that of WM in motor-implicated frontal regions (e.g., supplementary and premotor cortices), whereas motor functioning changes in the group closest to onset is more comparable to striatal and subcortical changes.
To allow meaningful comparisons across variables that are scaled differently, outcome vectors (i.e., cognitive and motor variables) were standardized by subtracting the vector mean (across participants and times) and dividing by the standard deviation. Following standardization of the outcome vectors, the repeated measures outcome variables and their respective covariate data were combined into a stacked format to enable concurrent analysis [60]. For example, to test the hypothesis that a supplementary motor WM profile and motor performance are changing at similar rates, vectors containing the repeated measure values for each participant’s supplementary motor profile were combined with vectors containing each participant’s repeated UHDRS-TMS scores. The covariate matrices were likewise concatenated. Like the previously described LMMs, the stacked analysis included fixed effect covariates for time (years of study participation), gender, years of education and CAP-group, and random intercepts by participant and site. The stacked analysis also included random intercepts by ‘domain’, with ‘domain’ referring to the two stacked outcome variables.
Dummy variables were coded for each outcome variable and its covariates, enabling estimation of baseline and longitudinal covariance for outcome variables. The use of these dummy variables facilitates direct comparison of two outcome variables by generating a confidence interval representing the slope difference between two outcome variables. Random effects (site, participant, and domain) were assumed to be joint-normally distributed and have a general covariance matrix. To permit accurate comparisons of outcome variable trajectories, component (WM profile) values were multiplied by – 1 for comparisons with clinical variables for which higher scores indicate poorer performance (TMTB, TMTA, and UHDRS-TMS).
RESULTS
In the combined sample, 22 out of 45 components (∼49%) yielded Chi-square values with associated p-values of <0.001, representing the most robust CAP group effects observed in our cohort. Nine components exhibited no differences between the null and CAP models; these components were excluded from further analyses.
Source-based morphometry
A summary of the 15 SBM components (WM profiles) that exhibited the most prominent effects in our results (based on the number and strength of significant contrasts) is presented in Table 4. An expanded table with all components is available in Supplementary Material 2, and accompanying images are provided in Supplementary Material 3. A glossary of regions within components is provided in Supplementary Material 4.
SBM Component Regions
Regions within SBM components displaying the most robust significant effects, including maximum z-scores (Max. Stat. column) and coordinates of the most strongly represented regions. Components were ordered alphabetically based on the number and strength of significant contrasts across all analyses (i.e., component A displayed the maximum observed number of significant results). Primary contributing regions corresponding to maximum z-scores appear in bold, and regions in each component are listed in order of largest-to-smallest voxel cluster size in the most prominent cluster in the component. In cases where regional representation is hemisphere-specific, this is denoted with a superscript L (left) or R (right).
Baseline (intercept) differences in WM profiles
The t- and p-values for components with three or more significant pairwise CAP-group comparisons are presented in Table 5, and an expanded table including all 36 analyzed components is provided in Supplementary Material 5.
Baseline (intercept) pairwise group comparisons for components with 3 + significant contrasts
Baseline (intercept) pairwise group comparisons. Results (t-value/p-value) for components yielding three or more significant pairwise contrasts are displayed. Bold denotes significant contrasts, and the most robust effects (p < 1×10–4) are also underlined.
Cross-sectionally, components most sensitive to differences between Controls and all three CAP groups (i.e., those significant in all three Control group contrasts: X, N, O, AB, and C) were most commonly parietal. Areas of overlap among these components included angular gyrus (in three of the five components), superior temporal gyrus, superior and middle occipital, and inferior parietal lobule. Component X also included premotor, primary motor, and supplementary motor cortices.
Three components (T, Q, and N) exhibited the most robust intercept effects across prodromal and control group comparisons, with significant differences in five out of six contrasts (no component was significant in every intercept contrast). These components had few overlapping regions; component T most strongly represented cerebellar posterior lobe (along with cerebellum crus 1 and uvula), component Q sub-gyral inferior parietal WM, and component N superior and middle temporal WM (with weaker parietal and occipital contributions). These three components are superimposed in Fig. 1. Although the non-significant contrast was different among the three components (Control vs. Low for component T, Control vs. Medium for component Q, Low vs. Medium for component N), it reassuringly never included the High group (i.e., the High group was always different from the other groups for these components).

Components with the most robust cross-sectional group differences: superimposition of the three components with the strongest cross-sectional (intercept) effects across groups (components T, Q, and N, thresholded at p = 0.05). These components were significant in all but one of the possible comparisons, highlighting differences across prodromal groups. Primary regions represented in each component are listed along with corresponding Montreal Neurological Institute (MNI) coordinates (x, y, z).
The most common significant baseline differences were observed between the High and Low groups (16 significant components, four at p < 1×10–4), the High and Control groups (16 significant components, seven at p < 1×10–4), and the Low and Control groups (15 significant components, minimum p = 1×10–4). Significant differences from the Medium group were slightly less common, observed in 11 components for the Medium vs. Control (minimum p = 1×10–4) and Medium vs. Low contrasts (one component significant at p < 1×10–4) and 12 components in the Medium vs. High contrast (four components significant at p < 1×10–4).
Longitudinal (slope) differences in WM profiles
Component t- and p-values for pairwise CAP-group slope comparisons are presented in Table 6, and an expanded table is provided in Supplementary Material 6. The rate of change in parietal component Q was the most distinct among groups, exhibiting significant differences in every group contrast Table 6. Component Q was also strongly significant, yielding p < 1×10–4 in every group contrast except Control >Low. Other components with the most robust rate of change differences among groups were frontal and sub-cortical, including F (premotor, supplementary motor, medial frontal), H (cuneus, middle occipital), T (cerebellum posterior lobe, cerebellum crus 1, uvula), AA (left middle frontal), and AF (thalamus, ventral anterior nucleus), which were each significant in all but one group contrast.
Longitudinal (slope) pairwise group comparisons for components with 3 + significant contrasts
Longitudinal (slope) pairwise group comparisons for components with three or more significant contrasts (t-value/p-value) in the unstacked LMM analyses. Bold denotes significant contrasts, and the most robust effects (p < 1×10–4) are also underlined.
Like the baseline comparisons, the most common significant slope differences were seen between the High and Low groups (22 significant components, four at p < 1×10–4) and the High and Control groups (21 significant components, three at p < 1×10–4). The next most frequently observed rate of change differences were between the Low and Medium groups (17 significant components, four at p < 1×10–4), the Medium and Control groups (16 significant components, five at p < 1×10–4), and the Low and Control groups (11 significant components, minimum p = 1×10–4).
Baseline (intercept) differences between WM profiles and clinical variables
For the LMM analyses comparing WM profiles at baseline and over time among groups, we were most interested in significant group differences. The stacked analyses, by contrast, compared baseline values (intercepts) and change trajectories (slopes) between clinical functioning variables and WM profiles; consequently, here we were most interested in non-significant differences, as they indicate the most similarity between clinical variables and WM profiles. Ergo, in Table 7, we summarize the results by listing the components yielding the highest p-values in comparisons with clinical variables. Full results from the clinical test comparisons, including p-values for every contrast and group, are provided in Supplementary Materials 7–13. Across the clinical variables, WM aligned least with cognitive and motor functioning in the High and Medium groups (in that order) and most in the Low and Control groups. The UHDRS-TMS and TMTA scores were most similar to baseline WM, with the UHDRS-TMS being the only clinical variable that did not differ from at least one WM component at baseline (Table 7).
WM components most similar to clinical variables at baseline
The maximum number of non-significant (NS) contrasts (out of four) observed in at least one component is provided for each test along with corresponding component identities and the participant groups in which these associations were observed (e.g., components H and I were NS different from UHDRS-TMS at baseline in all four groups). Regions most contributing to these effects are listed in the far-right column and color-coded according to location (frontal = green, temporal = blue, parietal = purple, occipital = red, sub-gyral/other = black).
Longitudinal (slope) differences between WM profiles and clinical variables
Generally, the similarities between WM profiles and clinical functioning variables were much more apparent longitudinally than cross-sectionally (i.e., similarities between rates of change in WM profiles and clinical functioning variables were more readily detectable than similarities at baseline); the sole exception to this pattern of findings was the UHDRS-TMS. In Table 8 we list regions within components that were most similar to clinical variables. In Table 9, we display significant contrasts (indicating dissimilarity) as well as groups aligning most and least with each clinical variable in the study. These results are visualized in Fig. 2 and Table 10, and p-values for every contrast and group are provided in Supplementary Materials 7–13. Supplementary Material 14 contains superimposed clinical variable and component plots.∥

WM Changes and Clinical Functioning across the Prodrome: simplified schematic depicting areas in which rates of change were most similar to changing motor (UHDRS-TMS) and cognitive functioning in each CAP group (yellow = Low, orange = Medium, red = High). UHDRS-TMS, Unified Huntington Disease Rating Scale – Total Motor Score; TMTA, Trail-Making Test part A; TMTB, Trail-Making Test part B; SDMT, Symbol Digit Modalities Test; SW, Stroop word condition; SC, Stroop color condition; SI, Stroop interference condition.
WM profiles and clinical variables with similar change trajectories
The maximum number of non-significant contrasts (out of four) observed in at least one component is listed for each test, and components with two or more NS contrasts for a given clinical measure are listed for each group. Regions most contributing to these effects are displayed in the far-right column and color-coded according to location (frontal = green, temporal = blue, parietal = purple, occipital = red, sub-gyral/other = black).
Significant component - clinical variable trajectory differences
Significant differences between component and clinical variable trajectories, presented as: (# of significant differences)/(# significant at p < 1×10–4). Bold denotes the most similarity with a given group (e.g., in the High group, WM changes were most similar to changes in Stroop Interference performance), as significant differences indicate less similarity. The two rightmost columns list groups showing the most and least alignment with each clinical variable in the study (e.g., change in UHDRS-TMS was most similar to WM change in the High group).
Summary: Prodromal WM and clinical functioning
There were overlaps and differences among WM regions that most readily changed with prodromal progression (unstacked analyses) and those most implicated in cognitive and motor functioning across the prodrome (stacked analyses). Two components (A and B) stood out in both sets of analyses (Fig. 3). One of these (A) is a mixture of parietal, temporal (especially middle) and to a lesser extent, occipital areas; the other (B) is mostly parietal (superior and inferior) and occipital (superior and middle). Both of these components have strong representation from the angular gyrus and displayed prominent differences and changes across the prodrome as well as similarity to clinical changes.

Components with Robust Effects Across Analyses: two WM profiles containing angular gyrus exhibited robust baseline and longitudinal group differences in addition to reliably aligning well with clinical functioning measures.
DISCUSSION
This study used T1-weighted images and tissue-specific probability maps to investigate WM morphology. It should be noted that the findings thus do not reflect WM integrity to the same extent that DWI studies do. Many of our findings were in GM-adjacent-WM, and changes to GM and CSF may influence the morphology of adjacent WM. Nonetheless, our findings aligned quite well with those of previous DWI studies as well as work examining GM-adjacent-WM in this population. Like previous studies, our analyses detected widespread WM differences inclusive of regions in every lobe of the brain. The anticipated Low-to-High prodromal disease gradient of WM degradation was observed both cross-sectionally and longitudinally; cross-sectional and longitudinal differences were most commonly detected in comparisons with the High group.
Hemispheric asymmetry
Although WM changes were mostly bilateral, the left hemisphere seemed overall more affected, mainly by the late prodrome. Few prior studies have directly addressed brain asymmetry in HD outside of the striatum, where left-hemispheric asymmetry is often reported [61, 62]. One volumetric meta-analysis reported converging evidence for left-biased atrophy [61] and an additional study reported bilateral GM atrophy that became more left-lateralized with prodromal progression [62]. In our previous cross-sectional GM analysis, hemispheric effects were more mixed [14], indicating that WM atrophy may proceed along a more uniformly left-hemispheric trajectory than GM atrophy; this may in part reflect connectivity increases that have been observed in right hemispheric regions (such as the dorsolateral prefrontal cortex, or DLPFC), particularly in the late prodrome, which have been proposed as possible compensatory mechanisms [63].
Striatal changes
Regarding the striatum, we observed the most prominent striatal WM changes in the putamen, consistent with reportedly greater WM connectivity disruption within the putamen compared to the caudate [25]. In this study, striatal WM signal came mostly from two putamen-driven components; one contained the frontal lobe, cingulate cortex, and caudate, and the other co-localized with temporal regions (including the amygdala and parahippocampal gyrus) and lacked caudate contributions. The former, fronto-striatal WM component exhibited group differences as early as the Control vs. Low contrast, but the latter, temporo-limbic striatal WM component (which lacked caudate) did not exhibit significant differences until the High prodromal phase. These results suggest that early WM changes in the putamen may be more robust than those in the caudate and occur in conjunction with changes in other functionally-related areas. Although the caudate is generally thought to be affected before the putamen in HD [64], greater atrophy in the putamen relative to the caudate (50.1% and 27.7% reductions compared to controls, respectively) has also been reported [65], and putamen volumes have been identified as more reliable indicators of disease progression than caudate volumes [1, 66].
Motor cortices
The prominence of supplementary motor and premotor cortices in our results is also in keeping with previous findings from our group and others. In two previous PREDICT-HD studies, a similar motor cortical profile in GM exhibited robust group differences [14], alignment with clinical functioning, and an association with a genetic profile related to brain-derived neurotropic factor [67] (Fig. 4). Furthermore, in the most extensive longitudinal study of prodromal WM tract changes to date, these regions also exhibited the most robust effects [25].

Supplementary Motor Components in WM and GM: components that maximally represented supplementary motor cortex in three separate PREDICT-HD studies. A) GM source-based morphometry (SBM) component, from [14]. B) WM SBM component, from the present study. C) GM parallel independent component analysis (pICA) component, from [67]. Images thresholded at p < 0.05, with cross-hairs placed at the global maxima. Coordinates and Z-scores corresponding to the global maxima are shown to the right of each component.
WM changes most coinciding with clinical functioning changes
Rows are clinical variables and columns are components thresholded to show effects (z-scores) of p = 0.05 or less. These components correspond to the plots in Fig. 2 (solid lines).
These converging findings provide compelling evidence that motor cortical regions change across the prodrome and likely play a vital role in both early and late prodromal symptoms. In the present study, this supplementary motor profile was also unique in its relevance to both cognitive and motor functioning, which otherwise was evident only in parietal regions. The prominence of inferior frontal regions in our results is also in agreement with their known roles in planning and coordinating complex movements (supplementary motor) and selecting and timing motor sequences (premotor cortex), and aligns with previous studies implicating these regions in HD pathology. For example, inferior frontal (but not striatal) cortical thinning distinguished HD patients with more prominent bradykinesia, rigidity, and dystonia from those with more prominent chorea, indicating that cortical thinning may underlie clinical heterogeneity in HD that has previously been attributed to the striatum [68].
Default mode network
The robust group differences, changes, and alignment with clinical functioning variables that we observed in angular gyrus and precuneus are also in keeping with previous findings, and may reflect an early disruption of the default mode network DMN). The striatum is structurally and functionally connected to DMN hubs, including the precuneus, posterior cingulate cortex, anterior cingulate, inferior parietal and DLPFC. DMN disturbances have been reported in prodromal and diagnosed HD during rest as well as task performance, including increased DMN connectivity that was evident before volumetric loss [69], intrinsic functional connectivity increases [70], and greater M1 connectivity with a posterior cingulate DMN hub (which was associated with cognitive and motor dysfunction) [71]. Although the angular gyrus is not typically reported as a primary region of change in prodromal HD, one study examining post-mortem brain tissue from HD patients found a 55% reduction in angular gyrus pyramidal neurons, and the authors suggested that HD may disproportionately affect posterior cortical regions [72].
WM throughout the HD prodrome
Regarding comparisons with the Control group, frontal components that were most significant longitudinally were least likely to show noticeable cross-sectional effects (e.g., components AC and K), and conversely other components like striatal component S and parietal component Q demonstrated significant intercept and slope effects. Thus, the comparison with the Control group suggests that prodromal differences from Controls in striatal and parietal WM may be more detectible in cross-sectional comparisons than some frontal differences, even though changes in all these areas occur throughout the prodrome.
The suggested patterns of most robust change among the intercept (cross-sectional) and slope (longitudinal) results are similar for the Medium and High groups but distinct for the early prodrome. In other words, cross-sectional and longitudinal findings highlighted many of the same regions in the Medium and High groups, whereas areas most implicated in the Low group differed in the intercept compared to slope results. All results highlighted inferior parietal component Q as strongly representative of changes in the middle and late prodrome. However, the most prominent early (Control vs. Low) cross-sectional differences manifested in temporal and superior parietal regions, whereas the longitudinal results highlighted many early frontal effects, with the most substantial early changes observed in premotor, supplementary motor and middle frontal component AC (in addition to commonly reported striatal and sub-gyral areas in component S). Across groups, cross-sectional effects were weaker overall, but it is comforting for the sake of study feasibility everywhere that similar interpretations become apparent across cross-sectional and longitudinal analyses, especially at later prodromal phases, when intercept and slope results shared more regional overlap.
Taken together, these results suggest robust early prodromal changes in premotor, supplementary motor and middle frontal gyral GM-adjacent-WM, in agreement with previous reports of robust changes in these regions. These findings illustrate a widespread prodromal pattern of WM change that becomes increasingly sub-gyral with prodromal progression (with steeper changes in the cingulum, thalamus, and other sub-gyral and striatal regions). Cerebellum component T, inferior parietal component Q, and frontal component AA (inclusive of premotor and supplementary motor cortices and frontal eye fields) were highly and commonly significant in both intercept and slope contrasts, hinting that these regions may show the most robust group differences and changes over time. If longitudinal data collection is not possible, measures from these regions (in addition to the striatum) may offer the most insight into disease progression. The parietal and frontal regions within these components are in complete agreement with striatal connectivity findings [25, 73]. Parietal projections to striatal convergence zones begin in the inferior parietal lobule (the primary contributor to component Q) and angular gyrus (another prominent region highlighted in our results), and two prefrontal connectivity clusters that displayed prodromal group differences in an independent study included middle frontal gyrus and superior frontal gyrus, respectively [25]. Previous studies have also implicated the cerebellum in HD pathology, including cerebellar volume loss in both GM and WM [74]. Most recently, Wolf et al. reported reduced volumes in right cerebellar lobule VII in early manifest HD patients, in addition to abnormal coupling with this region; right cerebellar lobule VII was functionally coupled with paracentral, lingual, and inferior frontal regions in early HD and bilateral cerebellar, right prefrontal, and cingulate areas in control participants [75]. Furthermore, paracentral connectivity was related to motor symptoms and disease burden in HD patients. An additional study reported abnormal diffusion in cerebellar GM and WM that was associated with poorer motor functioning (e.g., higher total motor score and saccade, finger tapping, and tandem walking impairments) [76], and a further post-mortem study of eight HD patients found multiple neurodegenerative cerebellar features that included macroscopic atrophy and Purkinje cell loss, in addition to nerve cell loss in all deep cerebellar nuclei [77]. These features were present even in patients with milder striatal atrophy.
Clinical functioning and WM throughout the HD prodrome
WM components that aligned most with clinical functioning typically also exhibited many significant group differences cross-sectionally and over time, indicating that areas essential for prodromal clinical functioning also differ across prodromal stages as well as between prodromal and control individuals. The overall patterns of alignment among component and clinical variable trajectories support our prediction that distinct regional changes accompany changes in different clinical functioning domains (Fig. 2). However, we observed some regional overlap in brain areas that were most implicated in clinical functioning, particularly in superior occipital (SC, SI, SW, TMTA, TMTB), precuneus (SC, SI, TMTA, TMTB), superior parietal lobule (SC, SI, TMTA, TMTB), angular gyrus (SI, TMTB, UHDRS-TMS), middle occipital (SDMT, SI, TMTB), associative visual cortex (SW, TMTA, TMTB), supramarginal gyrus (SI, TMTB), cuneus (SW, TMTA), and inferior parietal lobule (SI, TMTB).
Additionally, the results overall suggest that clinical functioning is best reflected by different brain regions depending on prodromal stage, although there was also some overlap among regions implicated at different prodromal stages. Parietal regions (including precuneus, superior parietal lobule, and angular gyrus) appear to play an influential role in clinical functioning throughout the prodrome, with the angular gyrus standing out as one of the only regions actively involved in both cognitive and motor functioning.
The involvement of parietal and frontal regions in cognitive functioning during early stages of (and throughout) the prodrome is consistent with proposed compensatory mechanisms involving increased cortical recruitment of these regions during movement (e.g., the Simon button press task) in HD patients [78]. Intrinsic neural activity alterations have been observed in the precuneus and angular gyrus of HD patients, and significant correlations between cognitive performance and precuneus activation and thinning have also been reported [79, 80]. Furthermore, parietal regions BA 5 and 7 contain neurons that selectively respond to specific hand and mouth movements [81], and these regions were both present in components with noticeable clinical effects in our studies. Frontal regions implicated in movement (premotor, supplementary motor, and middle frontal) showed some of the strongest similarities with clinical functioning in the early prodrome. In a positron emission tomography (PET) study comparing brain activation during hand movements in HD patients and controls, HD patients exhibited abnormal movement-related activation in the supplementary motor cortex, premotor cortex, and anterior cingulate, as well as the striatum, in conjunction with enhanced parietal activity that may reflect compensatory mechanisms [82]. Similarly, an additional PET study reported frontal lobe and putamen activity alterations in HD patients during a movement task, with inadequate activation in supplementary and premotor cortices (in addition to the cingulum) correlating with performance accuracy [81]; here, enhanced left parietal cortex activation was associated with improved performance in HD patients.
By the middle prodrome, the results suggest a more pivotal role of the angular gyrus, limbic lobe, and middle occipital gyrus in clinical functioning, with comparatively diminished frontal involvement. Previous work has shown that middle occipital diffusivity changes can distinguish prodromal stages, and poorer Stroop Word performance has been associated with increased diffusivity in the superior and middle occipital gyri [83]. An additional study found that the only diffusivity changes distinguishing prodromal groups were in the superior fronto-occipital fasciculus, with the most considerable changes observed in the group closest to diagnosis. Diffusivity increases in the superior fronto-occipital fasciculus were also associated with worsening motor functioning over time (i.e., motor symptom increases) [33].
By the late prodrome, parietal areas implicated in early prodromal clinical functioning in our results (including the precuneus and superior parietal lobule) again aligned well with clinical functioning, and change in the angular gyrus continued to strongly resemble changes in both cognitive and motor functioning. Other regions that appeared to play an increasing role in late prodromal clinical functioning included the superior occipital gyrus and inferior parietal lobule. Our results suggest shifting occipital involvement in prodromal clinical functioning, with early prodromal clinical changes aligning most with associative visual cortex changes, and middle and superior occipital being most implicated in middle and late prodromal clinical changes, respectively.
Conclusions
Overall, these results support the hypothesis that distinct regions most underlie cognitive compared to motor impairments in the HD prodrome, and further suggest that regions underlying these processes differ at different prodromal stages. These findings portray a regionally widespread prodromal pattern of WM changes marked by intense, early motor cortical and middle frontal gyral (in additional to striatal) changes that increasingly extend sub-gyrally with increased prodromal progression. The results are also in keeping with previous reports highlighting UHDRS-TMS as the best indicator of prodromal WM changes. However, at the High prodromal stage, cognitive variables (particularly Stroop Interference) outperformed UHDRS-TMS, and TMTA exhibited the most similarity to WM profiles overall across groups (especially cross-sectionally).
Many observations were in keeping with predictions; WM in the High and Low groups respectively differed most and least drastically from that of the Control group, and the most robust longitudinal clinical functioning changes were also observed in the High group. The prediction that motor performance changes would most resemble striatal and motor-implicated frontal change was partially supported, but results indicated a stronger frontal involvement in the early prodrome than anticipated; supplementary motor and striatal changes most agreed with changing UHDRS-TMS performance in the Low and High groups, respectively. Prodromal frontal WM changes also did not always align most with cognitive changes. Instead, many structural components that best reflected changing clinical functioning also highlighted non-frontal regions with prominent striatal connections and essential roles in cognition, such as the angular gyrus.
Limitations
The present study used T1-weighted images and tissue-specific probability maps to examine WM morphology. These findings can thus not be interpreted as analogous to DTI and DWI studies examining WM tract integrity. As the structural patterns in this study are adjacent to GM and CSF, it is likely that changes in these adjacent areas influence the morphology of, and are not completely separate from, the WM morphology examined in this study. Nonetheless, the findings largely overlapped with previous WM studies, including a DWI LMM study involving a cohort of the same participants included in this study, and the issue of prodromal changes in GM-adjacent-WM is in itself an interesting question that has not been examined in many previous studies. It will be necessary and interesting to compare these findings to those in the GM morphology maps from the same data set to investigate this issue further.
Because HD is a rare condition, there is an unavoidable tradeoff between maximizing power (afforded by a large sample) and data homogeneity (enabled by uniform data collection). Like most studies involving many participants with an uncommon condition, PREDICT-HD collected data from multiple sites using different MRI scanners. Several steps were taken to minimize the complications of analyzing multi-site data. Before data collection, PREDICT-HD established uniform protocols across sites. Following data collection, sites were examined for outliers and unbalanced participant demographics. For analyses, random site intercepts were included to account for correlations due to inherent scanner or site-specific features. Notably, the SBM method itself has also been shown to eradicate site effects [84], and the reliability of DWI data collected across these sites has also been previously demonstrated [85].
Future directions
Reports regarding the chronology of brain structural changes in prodromal HD are conflicting. It is still unclear whether WM and gray matter (GM) changes occur mostly simultaneously or if robust changes in WM precede those in GM (or vice versa); thus, a promising future direction for this work is to directly compare WM changes with those in GM, which is achievable by applying SBM to the GM probability maps extracted in the first phase of this study and performing similar stacked LMM analyses to compare WM and GM trajectories from the same participants and scans. This analysis could also address some of the uncertainty regarding how much GM morphology changes influence the morphology of adjacent WM (examined in this study). It is also possible that prodromal GM change more aptly reflects cognitive or motor changes, which could similarly be addressed in a similar LMM study assessing GM in conjunction with clinical variables.
CONFLICT OF INTEREST
The authors have no conflict of interest to report.
Footnotes
ACKNOWLEDGMENTS
We thank Drs. Jeffrey Long and Spencer Lourens for their assistance developing the statistical approach.
This project was supported by 1U01NS082074 (V.C. and J.T., co-principal investigators) from the National Institutes of Health, National Institute of Neurological Disorders and Stroke. The PREDICT-HD study was supported by NIH/NINDS grant 5R01NS040068 awarded to J.P.; CHDI Foundation, Inc., A3917 and 6266 awarded to J.P.; Cognitive and Functional Brain Changes in Preclinical Huntington Disease (HD) 5R01NS054893 awarded to J.P.; 4D Shape Analysis for Modeling Spatiotemporal Change Trajectories in Huntington 1U01NS082086; Functional Connectivity in Premanifest Huntington Disease 1U01NS082083; and Basal Ganglia Shape Analysis and Circuitry in Huntington Disease 1U01NS082085 awarded to Christopher A. Ross. This work was also supported by grants from the National Institutes of Health (5R01NS040068, 1U01NS082083, 5R01 NS054893, NS050568, P20GM103472, and R01REB020407) and the CHDI Foundation (A-5008), National Alliance for Medical Image Computing (NAMIC; EB005149 / Brigham and Women’s Hospital), and Enterprise Storage in a Collaborative Neuroimaging Environment (S10 RR023392/NCCR Shared Instrumentation Grant).
We thank the PREDICT-HD Investigators and coordinators of the Huntington Study Group; PREDICT-HD sites: University of Iowa, Cleveland Clinic, University College London, Indiana University, Washington University in St. Louis, University California San Francisco; the study participants; the National Research Roster for Huntington Disease Patients and Families; the Huntington Disease Society of America.
