Abstract
INTRODUCTION
A clinical dashboard is a “method of visually displaying aspects of performance data relevant to patient care” [1]. Dashboards typically integrate large amounts of numerical information, and assimilate these into a visually-powerful interface, which can then be interpreted by clinical staff to inform and influence clinical practice [1]. They have been embraced across a range of healthcare specialties such as emergency care [2, 3], neonatal intensive care [4], and maternity [5] – and were noted by NHS England as a powerful way of “inform[ing] the daily decisions that drive quality improvement” [6].
Computers perform well in fields that human brains may struggle, such as in rapidly processing large amounts of disparate numerical data [7]. Consequently, there is huge potential for dashboards to complement clinician expertise in developing a holistic approach to informing care. Despite this, the use of a Huntington’s disease clinical dashboard has not previously been reported in the literature.
Access to reliable data is an important factor that shapes the quality of tools such as clinical dashboards [8], and the development of international multi-centre HD data collection, via studies such as Enroll-HD, offers great potential. In HD, the clinical manifestations of progressive motor dysfunction, cognitive deterioration and psychiatric disturbance [9] occur in patients with at least 35 Huntingtin (HTT) gene CAG repeats (CAGn ≥35) [10]. The ability to quantify this in every individual, and the systematic recording of symptomatic changes, using the Unified Huntington’s Disease Rating Scale (UHDRS) [11], facilitates the comparison of the individual against the group.
We note that it remains challenging for a single clinician to assess whether symptoms of HD in an individual are progressing at a typical rate for someone of that demographic. This uncertainty is compounded by conflicting research data, some of which conclude that there is a link between CAGn and clinical HD progression [12–14], and others suggesting there is an ambiguous association [15, 16] or none at all [17].
We therefore postulated that pooling the longitudinal multi-centre trial data from Enroll-HD would provide a foundation to develop a clinical dashboard that compares, and visually represents, the symptomatic progression of an HD-manifest patient with a cohort of similar peers. The development of such a tool could help clinicians rapidly identify whether a specific patient is experiencing disproportionate decline compared to that predicted from group data, and could be used alongside a clinician’s judgement as a triaging tool to suggest specific intervention points.
Our aim was to develop a dashboard that was tailored to the individual, and required minimal data entry, to represent the comparison outputs immediately [18] and in a visually-powerful manner [19]. These features have been previously recognised [18, 19] as highly valued features of innovative tools, and our intention was to incorporate these elements into the dashboard design.
METHODS
Data from 745 motor-manifest participants in the Enroll-HD study [20] were used, from the January 2015 data cut. All participants were assessed using the UHDRS and the data were passed through quality control and de-identification processes [20]. Figure 1 illustrates the selection criteria for the participants, including that patients must be manifest, have a specific age recorded, and have clinical assessment scores available, either for baseline only (cross-sectional part of dashboard), or for both baseline and follow-up (longitudinal part of dashboard).

Flow diagram showing the recruitment of participants to the dashboard. The process undertaken by Enroll-HD (adapted from Enroll-HD Data Handling Manual [20]) and the process undertaken in this publication have been distinguished.
CAGn was selected as an important demographic variable due both to its role in determining age of symptomatic onset [10] and also its possible association with clinical HD progression [12–16]. Once participants’ CAGn was identified, they were then allocated one of five CAGn bands (a CAGn of ≤41, 42–43, 44–45, 46–48, ≥49), depending on their CAGn. These CAGn bands were used instead of absolute CAGn values to provide sufficient data points in each group for a trend to emerge. With the expansion of the data source and more individuals present for each CAGn, over time, these bands can be eliminated and replaced with regression lines grouped by absolute CAGn values.
Total motor score (TMS), total functional capacity (TFC) and symbol digit modality test (SDMT) were selected as the key UHDRS clinical assessment scores. TMS, where an increasing score indicates a higher burden of motor pathology [21], defines clinical diagnosis [22], is widely used [22], and is well validated as a marker of clinical progression, particularly following diagnosis [11, 23]. TFC, where a low score indicates worse functional ability [11, 21], is an important outcome measure in clinical trials [24]. The SDMT, where a low score indicates worse performance [21], is very sensitive for follow-up of HD patients [25] and is also widely-used.
The demographic breakdown (age, CAGn, sex and geographical distribution) of participants is provided by Table 1.
Demographic information of participants
Three cross-sectional, and three longitudinal, regression equations were defined for later use in the output display of the dashboard. The three cross-sectional regressions used raw TMS, TFC or SDMT score as the dependent variable, and age of participant at baseline as the independent variable. In the three longitudinal regression equations, annualised TMS, TFC or SDMT change over the year prior to consultation was defined as the dependent variable, and age of participant at follow-up as the independent variable.
The independent variables of the longitudinal regression slopes were calculated using annualisation, a previously-described technique [26]: baseline UHDRS clinical assessment scores were subtracted from the follow-up score, then divided by the number of days between these two recordings, and multiplied by 365. Consequently, all scores were standardised as the amount of change in 365 days prior to the consultation. Scores were rounded to the nearest integer. Each participant’s age at follow-up was calculated as age at baseline plus one year. Longitudinal calculations included only participant data from baseline and one-year follow-up. This is because the small amount of data from two- and three-year follow ups in the January 2015 Enroll-HD data cut meant that generating a meaningful regression was impractical with such sporadic data.
To assess whether bias had been introduced by participants not continuing to follow-up, independent-samples t-tests were conducted, comparing those who continued to follow-up and those who did not for age and CAGn.
The dashboard was developed using Microsoft Excel 2016 with the aim of graphic comparison of an individual patient’s key UHDRS clinical assessment scores with those of Enroll-HD participants in the same CAGn band. The input section of the dashboard required a specific patient’s CAGn, date of birth and date of consultation (demographic information), along with TMS, TFC and SDMT scores from each consultation.
For each cross-sectional longitudinal regression slope, the independent and dependant variables were plotted in the dashboard and grouped based on CAGn band. Regression lines of slope y = mx+c (Tables 2 and 3) were fitted, along with 95% confidence and 95% prediction intervals, to which polynomial best fit lines were added. Specific data points were then hidden, showing only the lines. The dashboard was programmed to display only the regression line of the CAGn band that matched the patient’s input values. Visual Basic for Applications (VBA) code was created and assigned to a button which, when selected, plotted a patient’s cross-sectional and longitudinal scores onto the regression lines.
Descriptive statistics of the cross-sectional regression slopes
Descriptive statistics of the longitudinal regression slopes
Further VBA code was created and assigned to a button that copies past clinical assessment scores into a ‘Past record’ section, which stores every score previously entered for that patient. VBA code was also created and assigned to the RESET button, which, when selected allows all patient data to be instantly cleared, ready for a new patient.
All data were provided for this project anonymously from REC-approved sites participating in the Enroll-HD Study.
RESULTS
Annual follow up consultations were completed at a mean interval of 381 days, with a standard deviation of 49 days and a range of 791 to 285 days (n = 270). There was no significant difference in age (mean 51.6 vs 53.2 years; p = 0.09) or CAGn (mean 43.7 vs 43.8 repeats; p = 0.7) for those who continued with follow up and those who did not.
The dashboard used the CAGn regression data as previously described in the Methods section. Figure 2 shows the full dashboard, which, to demonstrate its functionality, includes data from three consultations for an anonymised Enroll-HD participant. The input screen, which is visible in the top left corner of the dashboard, has been designed for ease-of use [18]. Figure 3 demonstrates the dynamic nature of the dashboard, where the output is dependent upon the CAGn entered onto the patient’s input screen. This shows how longitudinal outcomes are highly dependent upon CAGn, and offer visualisation of the potential relevance of an individual’s actual performance in relation to their predicted ability.

Screenshot of the populated dashboard. On the input screen (top left corner), data input into the “CAGn” cell from the “Demographic information” section informs which trend lines to display on the graphs. “DOB” is used to calculate the patient’s age at each consultation. “TMS”, “TFC” and “SDMT” cells on the “Most recent consultation” section are the scores recorded in clinic. The date in this section is used alongside DOB to calculate age at consultation. Button 1 links to VBA code that clears the “Most recent consultation data” and stores it in the “Past record” section, thus allowing new consultation data to be entered. Button 2 links to VBA code that plots cross-sectional and longitudinal data onto the graph. “RESET” links to VBA code that clears all data, thus resetting the system. The purple box automatically updates to remind the user what CAGn band the patient belongs to. Each black shape (diamond, triangle, square) represents patient clinical assessment scores from separate consultations, with the square representing the most recent consultation data. The ‘Past record’ table expands to the right with each subsequent consultation. On the output screen, the patient’s cross-sectional (top graphs) and longitudinal (bottom graphs) data are visualised, showing regression lines fitted to each CAGn band (unbroken straight line). 95% prediction intervals are the outermost lines with longer dashes. 95% confidence intervals of the best fit line are the dashed lines closest to the unbroken best fit line. The bottom graphs will always have one fewer data points compared to the top graphs because at least two consultations are required to make a longitudinal assessment.

Left: graphical display if the patient’s CAGn is 45. Right: graphical display if the patient’s CAGn is 48. The black shapes (patient clinical assessment score / change in score) have the same values on the left and right. Once a patient’s demographic data are entered, regression lines of Enroll-HD data relevant only to the patient’s CAGn band appear on the graphs. Consequently, the individual is only being compared to Enroll-HD participants who share a similar demographic.
DISCUSSION
In HD, assessment of the patient’s journey involves collection and assimilation of information from a range of clinicians [27] over a prolonged period of time. Truly comprehending the long-term significance of these data, where there is such great symptomatic variation, is a major challenge. Monitoring of symptoms through clinical dashboards has great potential for powerfully integrating these data, informing clinicians and improving outcomes [18]. This work developed a dashboard that offers support for clinicians in building an overall picture of a patient’s clinical status relative to similarly assessed peers, and shows the practical contribution that large multi-centre data repositories such as Enroll-HD can provide to the care of the individualpatient.
The six regressions encompassing the output display demonstrate how the cross-sectional and longitudinal components of the dashboard can complement each other to build an overall clinical picture for the patient, as has been noted in other areas of HD research [28]. The three cross-sectional graphical displays illustrate the patient’s absolute clinical assessment scores compared to his or her peers, whilst the longitudinal displays compare the rates of change. This asset has potential to provide real clinical benefit. For example, a rapidly-declining patient may experience a noticeable improvement in longitudinal score following initiation of medical treatment, while still scoring relatively poorly on clinical assessment scores within the cross-sectional displays. This longitudinal improvement could offer the basis of an encouraging discussion in clinic. The use of clinical dashboards in such a manner could improve, and even standardise, aspects of care and care protocols by providing an objective marker for specific intervention points. The pooled Enroll-HD dataset used in this dashboard encompasses a large experience of HD clinical assessment, thereby facilitating benchmarking against a larger cohort of equivalent peers than is available to an individual clinician at aspecific site.
Graphical representation of patient data is a well-established way of improving decision making and positively improving healthcare standards [18], and this was an attribute we aimed to capture within this dashboard. The clean input screen and rapidly-updating displays (through adopting VBA codes and formulae) were designed with the appreciation that the aid would be unlikely to be embraced if information retrieval takes more than about 30 seconds [18]. These factors all shaped its design as a tool which immediately visualises individual performance, and helps with identification of those most at risk of poor outcome, providing clinicians with the potential to intervene early.
LIMITATIONS
There are some limitations to this work which we regard as proof-of-concept and first-of-a-kind in HD. Firstly, we plotted regression lines based on CAGn bands, rather than absolute CAGn values. It would be preferable to group the regression lines based on absolute CAGn values, but by grouping into CAGn bands, it ensured that enough data points were present in each group to generate a strong trend. As the data source is further enriched with new individuals, these CAGn bands will be able to be removed to allow regression lines to be grouped based on absolute CAGn values.
Furthermore, the width of the 95% prediction intervals illustrates a limitation of the dashboard’s ability to identify outliers. An attempt to reduce this interval size in TMS was made by identifying dystonia [28, 29] and chorea [30] as those aspects suffering particularly from inter-rater and intra-rater variability, and eliminating them from the TMS. However, this proved on qualitative assessment to make a negligible difference to the size of the prediction intervals, and so the attempt to remove chorea and dystonia from TMS was abandoned.
The low R2 values of the linear regression equations, as shown in Tables 2 and 3, highlights a limitation in the extent to which these regressions fully capture the dynamic nature of HD as a complex neurodegenerative condition. Consequently, the future identification of additional input variables –beyond CAGn and age –could help to tailor the output more appropriately to the patient in clinic, thus improving its inherent predictive value.
There was a potential risk of annualisation exaggerating small changes in clinical assessment scores if two consultations occurred close together. However, no follow-up in the Enroll-HD data occurred earlier than 294 days after baseline (as per Enroll-HD protocol for annual visits [20]), a sufficient time for annualisation to be largely representative of the overall rate of change. Therefore, when using the dashboard, it is recommended that patients’ longitudinal graphs are only considered if follow up occurs at least 294 days (or approximately 9 months) after baseline (the lower limit of the range used in the Enroll-HD data). Age, rather than time since diagnosis, was selected as the independent variable on the graphs as objective identification of an absolute time of diagnosis has been recognised as unreliable [31]. On balance, we concluded that absolute age would be the most reliable independent variable.
Finally, it should be noted that North American participants contributed to 56% of the cross-sectional data, and 94% of the longitudinal data (Table 1), creating the potential for bias in the regression equations used in the dashboard. Over time, with expansion of the number of observations in the Enroll-HD dataset available for modelling, this is likely to become less relevant.
FUTURE DIRECTIONS
As research into the symptomatic progression of HD strengthens, so will the understanding of genetic and non-genetic influencers of long-term clinical outcomes. This clinical dashboard could be advanced by introducing new variables to the input screen (alongside the current variables of CAGn and date of birth), in order to allow for more tailored information to be displayed. For example, body mass index (where a small observational study has previously associated a higher score with slower symptomatic progression [33]), or concomitant medications (some of which have been associated with increased rates of motor decline [33]) could feasibly be included as variables if indicated as significant factors in symptomatic progression. Meanwhile, recent work into potential genetic modifiers of HD –such as the role specific loci [34] and polymorphisms [35] –may lead to a more comprehensive genetic picture of HD, which could also be built into the input screen of the dashboard, and the subsequent regression calculations. Further stratification into predominantly choreiform versus hypokinetic-rigid disease may help further refine progression norms for the individual compared with the group.
Future implementation of an input screen which more accurately reflects the patient’s complex clinical history should go some way to countering these limitations. However, it is important to recognise that as the output becomes more tailored, fewer relevant data points will contribute to the graphs, and so continually adding comprehensive source data from Enroll-HD should be a priority.
The implementation of an online clinical dashboard with real-time updates of the source data would also be a significant advance –an idea which has already been postulated in other fields [36]. This would provide a constantly-expanding pool of source data, which should remove the requirement for CAGn bands, while also possibly shrinking the prediction intervals. Such a feature could be provided within the Enroll-HD platform, and provide a direct, and immediate, clinical benefit to research participation.
CONCLUSION
The accumulation of multi-trial longitudinal data through the Enroll-HD study affords an excellent opportunity for further development of the clinical dashboard concept. This work has demonstrated a powerful way of utilising such data to aid clinicians’ evaluation of an individual’s symptomatic progression in HD. While dashboards should be used holistically alongside –rather than instead of –a clinician’s expertise, this dashboard’s ability to rapidly compare a specific patient in clinic with global peers makes it a highly valuable tool for flagging disproportionate clinical decline. This would then prompt review to determine the causes, and potential reversibility, of such decline. Conversely, it may be beneficial in identifying individuals progressing at a slower rate than expected, providing encouragement, and possibly offering support for any interventions (pharmacological and non-pharmacological alike). Accurately modelling chronic degenerative diseases such as HD is highly complex, and this dashboard’s effectiveness in improving clinical care is yet to be determined. However, the success of dashboards in other fields of medicine makes us optimistic that this concept, once adopted, will also improve patient outcomes.
CONFLICTS OF INTEREST
The authors have no conflict of interest to report.
Footnotes
ACKNOWLEDGMENTS
CMK received support from the National Institute for Health Research (NIHR) Collaboration for Leadership in Applied Health Research and Care (CLAHRC) Wessex. The views expressed are those of the authors and not necessarily those of the NHS, the NIHR or the Department of Health.
Enroll-HD is a longitudinal observational study for Huntington’s disease families intended to accelerate progress towards therapeutics; it is sponsored by CHDI Foundation, a nonprofit biomedical research organization exclusively dedicated to developing therapeutics for HD. Enroll-HD would not be possible without the vital contribution of the research participants and their families.
