Abstract
Background:
Patient-reported outcomes (PROs) for mental health are important for persons with Huntington disease (HD) who commonly experience symptoms of depression, anxiety, irritability, anger, aggression, and apathy. Given this, there is a need for reliable and valid patient-reported outcomes measures of mental health for use as patient-centered outcomes in clinical trials.
Objective:
Thus, the purpose of this study was to establish the psychometric properties (i.e., reliability and validity) of six Neuro-QoL and PROMIS mental health measures to support their clinical utility in persons with HD.
Methods:
294 individuals with premanifest (n = 102) or manifest HD (n = 131 early HD; n = 61 late HD) completed Neuro-QoL/PROMIS measures of Emotional and Behavioral Dyscontrol, Positive Affect and Well-Being, Stigma, Anger, Anxiety, and Depression, legacy measures of self-reported mental health, and clinician-rated assessments of functioning.
Results:
Convergent validity and discriminant validity for the Neuro-QoL and PROMIS measures of Emotional and Behavioral Dyscontrol, Positive Affect and Well-Being, Stigma, Anger, Anxiety, and Depression, were supported in persons with HD. Neuro-QoL measures of Anxiety and Depression also demonstrated moderate sensitivity and specificity (i.e., they were able to distinguish between individuals with and without clinically significant anxiety and depression).
Conclusions:
Findings provide psychometric support for the clinical utility of the Neuro-QoL/PROMIS measures of mental health measures in persons with HD. As such, these measures should be considered for the standardized assessment of health-related quality of life in persons with HD.
Huntington disease (HD) is an autosomal dominant neurodegenerative disease that principally affects the basal ganglia [1]. Age of disease onset is related to the number of repeats of the CAG triplicate repeats in the HD gene. With typical expanded repeat numbers, people gradually begin to develop signs and symptoms of HD over one to two decades, which is then diagnosed once motor signs of HD are identified unequivocally. HD symptoms include mental health symptoms (e.g., irritability and impulsivity), motor signs (chorea and dystonia), as well as cognitive decline [2]. Patients may experience problems in all of these domains, or may present with clinically significant symptoms in only one area of function. Between 73–98% of individuals with premanifest or manifest HD experience neuropsychiatric symptoms [3–8] including elevated levels of depression, anxiety, irritability, anger, aggression, and apathy [3, 9–28]. These neuropsychiatric symptoms are associated with negative outcomes, including strained inter-personal relationships and poor health-related quality of life. In addition, persons with HD are at increased risk for psychiatric conditions [29–33], as well as increased suicide risk [34–43].
Much of the work examining mental health symptomatology has focused on the examination of clinician-rated symptoms, either through examination of comorbid neuropsychiatric diagnoses [12, 22] or a standard clinician-rated behavioral examination (either the short form of the Problem Behavior Assessment [PBA-s] or the Behavioral Exam from the Unified Huntington Disease rating Scale [UHDRS] [6, 28]). Only a handful of studies have examined how these symptoms impact health-related quality of life (HRQOL) [5, 14]. This is unfortunate given the increasing emphasis for patient-centered outcomes and the importance of patient-reported outcomes (PROs) as relevant and meaningful endpoints for clinical trials [44]. Therefore, the purpose of this study was to examine the clinical utility of using PROs to assess the different mental health symptoms that can impact HRQOL.
As such, we were interested in providing psychometric support for two PRO measurement systems designed to measure key symptoms and health concepts. The Quality of Life in Neurological Disorders (Neuro-QoL) [45, 46], and the Patient Reported Outcomes Measurement Information System (PROMIS) [47, 48] are HRQOL measurement systems that include different item banks that capture self-reported mental health (including the following neuropsychiatric symptoms: Depression, Anxiety, Anger, Stigma, Emotional and Behavioral Dyscontrol, and Positive Affect and Well-Being). Neuro-QoL focuses on the evaluation of mental health constructs in individuals with neurological conditions (i.e., stroke, Parkinson’s disease, multiple sclerosis, child and adult epilepsy, amyotrophic lateral sclerosis, and muscular dystrophy [45], whereas PROMIS focuses on assessing similar concepts across chronic medical conditions. Both systems offer the advantage of computerized adaptive test (CAT) administration technology, a test method where each individually administered item is uniquely tailored based on the previous test response. CATs allow clinicians and researchers to ascertain a person’s level of functioning with minimal items and maximal precision. In addition, these systems allow for cross-disease comparison (generic item banks) and disease-specific sensitivity (disease-specific item banks). Many identical items (i.e., common data elements; CDEs) are used on both Neuro-QoL and PROMIS to allow “linking” between measures, such that a score on one measure (e.g., Neuro-QoL) can be used to estimate a score on the other (e.g., PROMIS).
The purpose of this paper was to provide data to support the psychometric properties of the Neuro-QoL and PROMIS mental health PROs so that they can be used with confidence to measure mental health in persons with HD. Specifically, we examined the reliability (internal consistency) and validity (convergent, discriminant, and known groups validity) of several Neuro-QoL and PROMIS mental health PROs in persons with or at-risk for HD. Specifically, we hypothesized that the Neuro-QoL/PROMIS mental health item banks would demonstrate acceptable reliability [i.e., Cronbach’s Alpha >0.70; 49] in this population and that floor and ceiling effects for these measures would be better than other commonly used self-report measures. In addition, convergent validity would be supported in our sample by moderate to high correlations (i.e., ≥0.4) among self-report measures of mental health, and discriminant validity would be supported by negligible to small correlations (i.e., 0.0–0.2) between the mental health PROs and other measures of clinician-rated functioning [50]. We also hypothesized that premanifest HD participants would report better mental health functioning than manifest HD groups, and that early-stage HD participants would report better functioning that late-stage HD participants (i.e., known groups validity); effect sizes for self-reported neuropsychiatric symptoms (i.e., Cohen’s d) were expected to be larger for individuals with more clinician-rated mental health problems. Finally, we hypothesized that the PRO measures of depression and anxiety would demonstrate adequate sensitivity and specificity relative to established cutoff scores on established measures for clinically significant anxiety or depression as evidenced by AUC values ≥0.70 in other clinical populations [51]. Results will provide evidence for the clinical utility of these PRO measures of mental health in individuals with premanifest and manifest HD.
MATERIALS AND METHODS
Participants
Study participants were individuals with either premanifest (gene-positive status for the HD CAG expansion and no clinical diagnosis) or manifest HD. Inclusion criteria were as follows: participants had to be at least 18 years of age, able to read and understand English, and able to provide informed consent. A convenience sample of persons with HD were recruited from HD treatment centers at the University of Michigan, University of Iowa, University of California-Los Angeles, Indiana University, Johns Hopkins University, Rutgers University, Struthers Parkinson’s Center, and Washington University. Recruitment was also supported by the National Huntington Disease Roster and existing online medical record data capture systems [52]. In addition, several study participants (n = 96) were recruited in conjunction with the Predict-HD research study [53], a nationwide cohort which aimed to examine symptoms in participants with premanifest and early manifest HD. All study procedures were completed in compliance with institutional review board requirements, and informed consent was obtained from each participant prior to the administration of study measures.
Measures
Self-reported HRQOL
We administered six different measures from the Neuro-QoL/PROMIS systems. Specifically, we administered Neuro-QoL measures of Emotional and Behavioral Dyscontrol, Neuro-QoL Positive Affect and Well-Being, Neuro-QoL Stigma, Neuro-QoL Anxiety, Neuro-QoL Depression, and the PROMIS measure of Anger (given that Neuro-QoL does not include a measure of this construct). Details for these measures are provided elsewhere [45, 54]. These measures were designed to evaluate important aspects of HRQOL (including aspects of symptoms and functioning) according to the Internal Classification of Functioning, Disability, and Health (ICF) framework [55, 56]. Briefly, each measure was administered as CAT, followed by fixed 8-item short-forms (SFs; participants completed items in the CAT, followed by any items on the SF that were not already administered as a part of the CAT). This was repeated for each Neuro-QoL/PROMIS measures to allow for the comparison of different administration methods. For CAT administrations, participants completed a minimum of 4 items, and test administration stopped after either a standard error (SE) ≤0.3 was achieved or the participant answered 12 items. Scores for both CATs and SFs are on a T metric (M = 50, SD = 10); higher scores indicate more of the trait that is being assessed. That is, higher scores for Positive Affect and Well Being indicate better mental health, whereas higher scores on all other PROs indicate poorer mental health. While these measures are commonly recommended as common data elements for several neurological populations (http://www.commondataelements.ninds.nih.gov/ and [57]), this data is not yet available for persons with HD.
Generic self-report legacy measures of HRQOL
We administered three generic measures of HRQOL: the WHODAS 2.0 [55], RAND-12 Health Status Inventory [HSI; 58] and the EQ5D [59]. The WHODAS 2.0 [55] is a 12-item self-report measure that evaluates HRQOL as it relates to disability across six subdomains: understanding and communication, self-care, mobility, interpersonal relations, work and household roles, and community and civic roles. Total scores ranging from 0 (highest level of health) to 48 (low health). The RAND-12 his [58] is a 12-item self-report measure that produces a physical health composite (PHC) and a mental health (MHC). Scores on PHC and MHC are on T metric (M = 50, SD = 10); higher scores indicate better health. The EQ5D [59] is a self-report measure of HRQOL that measures mobility, ability to do self-care, ability to do usual activities, pain, and anxiety/depression. We examined scores on both the Health Scale (which range from 0 [poor health] to 100 [perfect health]) and the Index Value (scores range from 0 [worst health] to 1 [best health]). There is data to support the clinical utility of all three these measures in persons with clinical conditions, and for the WHODAS 2.0 and the Rand-12 in persons with HD, specifically [60–101].
Legacy self-report measures of mental health
We administered two legacy measures of mental health symptoms, the Patient Health Questionnaire 9 (PHQ-9) [102] and the Generalized Anxiety Disorder-7 (GAD7) [103]. The PHQ-9 is part of the Patient Health Questionnaire and is a 9-item self-administered questionnaire used for assessing frequency of symptoms for clinical depression. Response categories were Not at all (0), Several days (1), More than half the days (2), and Nearly every day (3). Total scores range from 0 to 27, with cut point scores of 5, 10, 15, and 20 representing mild, moderate, moderately severe, and severe depression [102]. The GAD-7 is a 7-item self-administered patient questionnaire used to measure the severity of symptoms for Generalized Anxiety Disorder, with responses of Not at all (0), Several days (1), More than half the days (2), and Nearly every day (3). Total Scores range from 0–21, with cut point scores of 5, 10, and 15 representing mild, moderate, and severe anxiety [103]. Given that there are no consensus PRO measures for mental health in persons with HD [https://www.commondataelements.ninds.nih.gov/HD.aspx#tab=Data_Standards and 104], the PHQ-9 and GAD-7 were selected given their broad clinical applicability across different clinical populations [102, 105–128], the presence of established diagnostic cutoff scores, and the relative brevity of administration items.
Clinician rated assessments
For all participants, clinicians completed the Independence Scale, the Total Motor Scale, and Total Functional Capacity measure from the Unified Huntington’s Disease Rating Scale (UHDRS) [129], as well as the Problem Behaviors Assessment (PBA-s) [130]. The Independence Scale requires the clinician to rate the participants overall level of independence on a scale of 0 to 100; higher scores reflect better functioning/greater independence. The Total Motor Scale provides a clinician-rated measure of motor functioning. We used the final question on this scale to determine if a participant had premanifest or manifest HD; in cases where the clinician indicates with >99% certainty that motor symptoms indicate unequivocal signs of manifest HD participants were classified as having manifest HD; otherwise the participant was classified as having premanifest HD. The TFC is a 5-item clinician-rated measure of day-to-day functioning across the domains of occupation, finances, domestic chores, activities of daily living, and care level. Scores range from 0 (low functioning) to 13 (highest level of functioning) with higher scores indicating better functioning. TFC scores were used to classify participants with an HD diagnosis as either early stage (Stages I –II; sum scores of 7–13) or later-stage (Stages III – V; sum scores of 0–6) [131]; staging data was combined due to the relatively small number of participants in the later stages of the disease (i.e., n = 7 Stage IV and n = 1 in Stage V), as well as to maximize the power for group comparison analyses. The PBA-s [130] provides a clinician-rated assessment of 11 different behaviors (depression, suicidality, anxiety, irritability, anger/aggression, apathy, perseverative thinking, obsessive compulsive behaviors, delusions, hallucinations, and disorientation). Each item is rated for severity (rated on a scale of 0 [symptom absent] to 4 [severe] in regards to the last 4 weeks) and frequency (rated on a scale of 0 [never/almost never] to 4 [daily/almost daily for most or all of the day]). We examined scores on individual items, which is the product of the frequency and severity score for that item (which can range from 0 to 16); higher scores indicating more problems.
Analyses
Reliability
Cronbach’s alphas were calculated to evaluate internal consistency reliability for the SF version of each mental health measure in our sample. These values were compared to the Cronbach’s alphas for the CATs, which were drawn from the Neuro-QoL and PROMIS calibration samples [48, 54]. A critical cut-off of ≥0.70 was considered minimal acceptable reliability [49].
Floor and ceiling effects
The proportion of participants with the lowest or the highest possible scores on each of the mental health measures were calculated to establish the floor and ceiling effects of these measures. A priori acceptable rates for floor and ceiling effects were ≤20% [132, 133].
Timing data
Due to concerns about test burden in HD participants (especially for participants in the later stages of the disease), median and SD timing data were examined for both CAT and Short Form versions of the mental health measures.
Convergent and discriminant validity
A correlation matrix was used to examine the associations between the NeuroQoL/PROMIS PROs, other self-report measures of mental health symptoms (PHQ9 and GAD7), generic measures of HRQOL (WHODAS, EQ5D, and RAND), clinician-rated mental health (PBA-s), and clinician-rated assessments of functioning (UHDRS Motor, Functional Assessment and TFC) in our sample. Convergent validity was supported by moderate to high correlations (i.e., ≥0.4) between the mental health PRO measures and other self-report measures of mental health [50]. Discriminant validity was supported by small correlations (magnitude ≥0.1) between Neuro-QoL/PROMIS PROs and corresponding clinician-rated measures of emotion (i.e., PBAs items) and negligible to small correlations (i.e., 0.0 – 0.2) between the mental health PROs and other measures of clinician-rated functioning [50].
Known-groups validity
One-way analysis of variance was used to examine group differences (premanifest, early- and late-HD) for each of the different mental health PROs; Tukey’s post hoc analyses were used to identify significant group differences. Specifically, we hypothesized that premanifest participants would report better mental health than manifest HD groups, and that early-stage HD participants were report better mental health than late-stage HD participants.
Effect sizes
In order to evaluate the relative influence that neuropsychiatric symptom severity (as determined by a median split using the matched “established” measure) has on Neuro-QoL/PROMIS scores, effect sizes (i.e., Cohen’s d) were calculated. Specifically, a median split on the PBAs Irritability item was used for Neuro-QoL Emotional and Behavioral Dyscontrol, a median split on the WHODAS was used for Neuro-QoL Positive Affect and Well Being, a median split on the PBAs Anger/Aggression item was used for PROMIS Anger, a median split on the PBAs Anxiety item was used for Neuro-QoL Anxiety, and a median split on the PBAs Depression item was used for Neuro-QoL Depression. Effect size were calculated for each of the Neuro-QoL/PROMIS PROs (based on comparison of each group relative to the means and standard deviations from the PROMIS/NQ normative sample; Neuro-QoL normative sample [N = 549, M = 50, SD = 10; 46, 54] and PROMIS normative sample [N = 858; M = 50, SD = 10; 48]. Effect sizes were expected to be larger for the groups with more clinician-rated severity.
Impairment rates
Clinical impairment rates (participants whose scores were >1 SD worse than the Neuro-QoL normative sample mean[N = 549, M = 50, SD = 10; 46, 54] or participants whose scores were >1 SD worse than the PROMIS normative sample mean [N = 858; M = 50, SD = 10; 48] were used to determine if individuals with HD were at greater risk than the general population for mental health problems. In a normal distribution, 16% of the scores would fall 1 SD below the mean (i.e., impaired); therefore, we treated impairment rates that exceeded 16% to indicate greater impairment than would be expected compared to demographically-comparable neurologically healthy peers [134].
Classification accuracy (sensitivity/specificity)
Finally, we conducted logistic regression models to determine the accuracy with which Neuro-QoL Anxiety and Depression scores could discriminate between those individuals with and without anxiety or depression (as determined by the published cutoff scores of ≥5 on the GAD-7 [103] or the PHQ9 [102], for anxiety and depression, respectively). Criterion for likelihood ratios (i.e., sensitivity/[1-sensitivity]) for clinical decision making should be ≥2 [135]. In addition, receiver operating characteristic (ROC) analysis was used to compare diagnostic performance of the mental health PROs. The resulting area under the curve (AUC) values criterion was specified as ≥0.70 [51].
RESULTS
Two-hundred-ninety-four individuals with premanifest (n = 102) or manifest HD (n = 131 early-stage HD and n = 61 late-stage HD; Table 1) completed the mental health PROs as part of a larger study [136]. There were group differences for gender, χ{2 (2, N 294) = 7.97, p = 0.02; there were more females than males for the premanifest group relative to the early-stage manifest group. There were also group differences for education, F (2, 290) = 10.97, p < 0.0001; the early-stage HD and late-stage HD groups had slightly less education than the premanifest group. As disease progresses with age, it was not surprising that there were significant groups differences on age, F (2, 291) = 26.91, p < 0.0001; the premanifest group was significantly younger than both manifest groups. Groups did not differ on race, χ2 (3, N 294) = 7.35, p = 0.12, or ethnicity χ2 (3, N 294) = 0.82, p = 0.94.
Demographic Information for Huntington disease participants
Note.* = significant group differences.
Internal consistency
All Cronbach’s alphas in the sample were ≥0.90 for both the CAT and SF versions of the Neuro-QoL/PROMIS measures; values comfortably exceeded the a priori minimum standard for internal consistency reliability (≥0.70; see Table 2).
Descriptive information and reliability data for neuro-QoL/PROMIS and comparator measures
Note. CAT = computer adaptive test; SF = short form; HRQOL = health-related quality of life; UHDRS = Unified Huntington’s Disease Rating Scales; PBAs = Problem Behaviors Assessment; * = higher scores = better functioning; °the paper form of the EQ5D was used for administration and timing data was not recorded;× = CAT scores for the HD sample were simulated, and thus reliability coefficients could not be calculated, the provided reliability coefficients are based on the Neuro-QoL [54] or the PROMIS [48] calibration samples; All SF are 8 items except for Positive Affect and Well-Being which was 9 items.
Floor and ceiling effects
The Neuro-QoL/PROMIS PROs were free of floor and ceiling effects (meeting our a priori criterion of ≥20%), with the exception of the Stigma and Depression SFs (which both had floor effects >20%; see Table 2). In all cases, the CATs performed better than the SFs.
Timing data
Median administration times for the PROMIS/Neuro-QoL mental health PROs were brief (all <48.5 seconds; Table 2).
Convergent and discriminant validity
In general, convergent and discriminant correlations were consistent with the proposed hypotheses, with moderate to large correlations among self-report measures, less robust correlations between the Neuro-QoL/PROMIS PROs and clinician-rated emotion items, and negligible to small correlations with clinician-rated measures of functioning (Table 3). Results were virtually identical for the SFs, thus we only present data for the CATs.
Pearson correlations to support convergent and discriminant validity of the new Neuro-QoL/PROMIS mental health PROs
Note. PROs = patient-reported outcomes; HRQOL = health-related quality of life; CAT = computer adaptive test; SF = short form; UHDRS = Unified Huntington’s Disease Rating Scales; PBAs = Problem Behaviors Assessment. **Correlation is significant at the 0.01 level (2-tailed). *Correlation is significant at the 0.05 level (2-tailed).
Known-groups analyses
While there were significant group differences for Positive Affect and Well-Being and Stigma (as hypothesized), we did not see group differences on Anger, Anxiety of Depression (which was contrary to our hypotheses); see Table 4.
Known groups validity for Neuro-QoL/PROMIS mental health computer adaptive tests
*a = significant group differences between premanifest and early, b = significant group differences between premanifest and late, c = significant group differences between early and late.
Effect sizes
Effect sizes are included in Table 5. With the exception of anxiety, those with higher clinician rated mental health problems had larger effect sizes.
Cohen’s d effect sizes for Neuro-QoL/PROMIS mental health patient-reported outcome measures
Note. CATs = computer adaptive tests; SF = short form.
Impairment rates
In general, impairment rates for our HD participants were comparable to the general population (general population ≤16%); the only elevated impairment rates were for late-stage HD participants for stigma, anger, anxiety and depression (Table 3).
Classification accuracy (sensitivity/specificity)
ROC results showed that when Neuro-QoL Depression equaled 50.3, the measure demonstrated moderate sensitivity (71.6%) and moderate specificity (85.9%), with an AUC of 0.85 and likelihood ratio of 2.52 for distinguishing between those individuals with and without clinically significant “mild” depression, meeting a priori criterion for clinical decision making. In addition, ROC results showed that when Neuro-QoL Anxiety equaled 55.1, the measure demonstrated moderate sensitivity (82.6%) and moderate specificity (86.6%), with an AUC of 0.91 and likelihood ratio of 4.75 for distinguishing between those individuals with and without clinically significant “mild” anxiety symptoms, also meeting a priori criterion for clinical decision making.
DISCUSSION
Study findings support the reliability and validity of several Neuro-QoL and PROMIS mental health PROs to assess mental health in individuals with premanifest and manifest HD. With regard to reliability, results provided strong support for the internal consistency reliability of all of the new CAT and SF administrations to assess PRO mental health in persons with HD (i.e., all Cronbach’s α >0.92). In addition, these reliability coefficients were either equivalent to, or higher than other generic measures of HRQOL (i.e., higher than internal consistencies for the RAND 12, and equivalent to internal consistencies for the WHODAS). All of the examined Neuro-QoL and PROMIS PRO measures required relatively brief administration times (i.e., <1 minute for both CATs and SFs) and generally, all the Neuro-QoL/PROMIS mental health PROs exceeded acceptable standards for floor and ceiling effects. The exceptions to this were Stigma and Depression SF administrations, which both had a small floor effects. This indicates that these measures lack sensitivity for persons that are experiencing little to no depression or stigma.
Convergent and discriminant validity of the Neuro-QoL/PROMIS mental health PROs were also supported in our sample. Moderate to strong associations between Neuro-Qol/PROMIS mental health PROs and measures of self-reported mental health suggest convergent validity in persons with HD. Discriminant validity was supported by less robust correlations between the Neuro-QoL/PROMIS PROs and clinician-rated assessments of mental health, as well as by small to negligible correlations between the Neuro-QoL/PROMIS PROs and clinician-rated measures of functioning.
Findings for known groups validity were mixed. While Neuro-QoL Stigma differentiated between all three HD groups (with increasing reports of stigma as HD stage progressed), and Neuro-QoL Positive Affect and Well-Being differentiated those with premanifest and late-stage HD (with premanifest reporting better functioning that late-stage participants), none of the other PROs (i.e., Depression, Anxiety, or Anger) were able to differentiate between the different HD staging groups. While these findings for anxiety, depression and anger were not consistent with our proposed hypotheses, the literature suggests that HD stage may not be related to anxiety [31, 137] or depression [3, 32]. It is also possible that anosognosia and/or frontal dysfunction, which can occur in HD (especially in the later stages, as well as for symptoms such as depression, apathy, and anger) [138–140] may have precluded our ability to detect group differences in this study. These findings, in conjunction with the fact that overall impairment rates for the PROs were consistent with impairment rates in the general population, would suggest that the absence of group differences in our data does not mitigate support for the validity of these measures to assess mental health in persons with HD. Since sensitivity and specificity of the Anxiety and Depression measures met standards for clinical decision making, our findings provide some evidence of these measures’ construct validity to assess mental health in persons with HD. And finally, as expected, effect sizes for the new PROs were higher for participants with worse clinician-rated mental health functioning than those with better functioning (with the exception of anxiety where both groups showed moderate effects, but the effects for lower rated severity were higher than those with higher rated severity). There were generally moderate to large effects for individuals with higher clinician-rated severity and small to moderate effects for those with lower clinician-rated severity. Together, these findings support construct validity of these new measures of mental health in HD.
Our findings have implications for the clinical care of mental health problems in persons with HD. Specifically, all of these measures are scored on a T-metric with a mean of 50 and a standard deviation of 20. This means that persons with scores ≥60 are likely experiencing clinically significant symptoms (i.e., their scores are worse than 84% of the general population) and persons with scores ≥70 are almost definitely experiencing clinically significant symptoms (their scores are worse than 95.5% of the general population). Thus, scores on these measures can be used to guide clinical referrals for mental health treatment. In addition, for Neuro-QoL Depression and Anxiety, where our findings provide support for the sensitivity and specificity of these measures due to an established marker of clinically significant depression and anxiety, respectively. As such, persons with scores ≥50.3 on Neuro-QoL Depression and persons with scores ≥55.1 on Neuro-QoL Anxiety would be appropriate for a mental health referral. This is especially important given the elevated rates of psychiatric diagnoses in persons with HD [29–33].
While this study provides important psychometric support for Neuro-QoL/PROMIS measures of mental health in persons with HD, there are some study-specific limitations. First, this convenience sample may not represent the HD population at large. The participants in this study were recruited from well-established HD clinics, and thus may have better access to mental health services than the general HD population. Our inclusion criteria also required participants to be cognitively capable of providing informed consent, and thus excludes persons with HD whom are in the latest stages of the disease. In addition, our sample included more females and the education levels for premanifest participants was higher than the manifest participants, limiting generalizability. In addition, while clinician-rated assessments of mental health were administered, these ratings do not correspond to mental health diagnoses. Therefore, although we used established cutoff rates for determining those with and without anxiety and depression, these rates were based on self-reported ratings and do not provide information about how well these measures may or may not be at differentiating between individuals with and without clinical diagnoses of anxiety and/or depression. Also, given the fact that there are no consensus measures for assessing self-reported mental health symptoms in HD [104], we elected to use measures that were in common use in other clinical measures, but did not always have existing psychometric data to support their use in HD. As such, it is possible that our findings may slightly over or underestimate clinical impairment rates in HD. More work is needed to directly examine the relationship between these self-report measures and the actual rates of psychiatric diagnoses in persons with HD. Finally, patient-reported outcomes depend largely on participants’ ability to accurately assess their own symptoms. Anosognosia, or symptom unawareness, has been found to be present even in the prodromal stages of the disease [141], and previous research has shown that cognitive impairment can impact the reliability of PROs especially in individuals with late-stage HD and associated cognitive declines [142]. Specifically, data has shown that there is increased variability in PRO responses and associated decreased in psychometric reliability as HD progresses; however, psychometric reliability for the PROs is typically still within acceptable limits (i.e., >0.70 [143]). For those individuals working with later-stage HD, there are established clinical cutoffs for cognitive scores that can be used to maximize PRO reliability among those with cognitive decline. Regardless, more work to be done examining anosognosia and its relationship to PRO reporting in persons with HD.
In summary, the Neuro-QoL/PROMIS mental health PROs are brief, reliable, and valid assessments of emotional functioning and health-related quality of life in HD. Findings support further utilization of these measures by clinicians and researchers. Future work focused on examining change over time will be useful to inform their utility in repeated trials or longitudinal study. Ultimately, these measures can help fulfill a significant gap in PRO measurement in individuals with HD.
CONFLICTS OF INTEREST
The authors declare that there is no conflict of interest.
Work on this manuscript was supported by the National Institutes of Health (NIH), National Institute of Neurological Disorders and Stroke (R01NS077946) and the National Center for Advancing Translational Sciences (UL1TR000433). In addition, a portion of this study sample was collected in conjunction with the Predict-HD study. The Predict-HD study was supported by the NIH, National Institute of Neurological Disorders and Stroke (R01NS040068), the NIH, Center for Inherited Disease Research (provided supported for sample phenotyping), and the CHDI Foundation (award to the University of Iowa). The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH.
Footnotes
ACKNOWLEDGMENTS
We would like to acknowledge the HDQLIFE Site Investigators and Coordinators: Noelle Carlozzi, Praveen Dayalu, Stephen Schilling, Amy Austin, Matthew Canter, Siera Goodnight, Jennifer Miner, Nicholas Migliore (University of Michigan, Ann Arbor, MI); Jane Paulsen, Nancy Downing, Isabella DeSoriano, Courtney Shadrick, Amanda Miller (University of Iowa, Iowa City, IA); Kimberly Quaid, Melissa Wesson (Indiana University, Indianapolis, IN); Christopher Ross, Gregory Churchill, Mary Jane Ong (Johns Hopkins University, Baltimore, MD); Susan Perlman, Brian Clemente, Aaron Fisher, Gloria Obialisi, Michael Rosco (University of California Los Angeles, Los Angeles, CA); Michael McCormack, Humberto Marin, Allison Dicke (Rutgers University, Piscataway, NJ); Joel Perlmutter, Stacey Barton, Shineeka Smith (Washington University, St. Louis, MO); Martha Nance, Pat Ede (Struthers Parkinson’s Center); Stephen Rao, Anwar Ahmed, Michael Lengen, Lyla Mourany, Christine Reece, (Cleveland Clinic Foundation, Cleveland, OH); Michael Geschwind, Joseph Winer (University of California – San Francisco, San Francisco, CA), David Cella, Richard Gershon, Elizabeth Hahn, Jin-Shei Lai (Northwestern University, Chicago, IL).
We would also like to thank the University of Iowa, the Investigators and Coordinators of this study, the study participants, the National Research Roster for Huntington Disease Patients and Families, the Huntington Study Group, and the Huntington’s Disease Society of America. We acknowledge the assistance of Jeffrey D. Long, Hans J. Johnson, Jeremy H. Bockholt, and Roland Zschiegner. We also acknowledge Roger Albin, Kelvin Chou, and Henry Paulsen for the assistance with participant recruitment.
Work on this manuscript was supported by the National Institutes of Health (NIH), National Institute of Neurological Disorders and Stroke (R01NS077946) and the National Center for Advancing Translational Sciences (UL1TR000433). In addition, a portion of this study sample was collected in conjunction with the Predict-HD study. The Predict-HD study was supported by the NIH, National Institute of Neurological Disorders and Stroke (R01NS040068), the NIH, Center for Inherited Disease Research (provided supported for sample phenotyping), and the CHDI Foundation (award to the University of Iowa). The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH.
