Abstract
Cognitive impairment is a core clinical feature of traumatic brain injury (TBI). After TBI, cognition is a key determinant of post-injury productivity, outcome, and quality of life. As a final common pathway of diverse molecular and microstructural TBI mechanisms, cognition is an ideal endpoint in clinical trials involving many candidate drugs and nonpharmacological interventions. Cognition can be reliably measured with performance-based neuropsychological tests that have greater granularity than crude rating scales, such as the Glasgow Outcome Scale-Extended, which remain the standard for clinical trials. Remarkably, however, there is no well-defined, widely accepted, and validated cognition endpoint for TBI clinical trials. A single cognition endpoint that has excellent measurement precision across a wide functional range and is sensitive to the detection of small improvements (and declines) in cognitive functioning would enhance the power and precision of TBI clinical trials and accelerate drug development research. We outline methodologies for deriving a cognition composite score and a research program for validation. Finally, we discuss regulatory issues and the limitations of a cognition endpoint.
Introduction
O
The GOS-E is an eightpoint ordinal rating scale, but it is typically dichotomized for analysis. It involves asking patients and/or their caregivers a series of questions (e.g., “Are you able to shop without assistance?”) to determine the overall degree of disability. The International Mission on Prognosis and Clinical Trial Design in TBI 7 and agencies such as the United States Department of Defense 8 have recognized the limitations of these blunt tools, especially for mild to moderate TBI, and called for research to advance outcome measures for TBI clinical trials.
Although improved methods for measuring global functional outcome would be valuable, the thesis of this article is that developing and validating a cognition endpoint should also be a top research priority. Further, the ability to generate a cognition composite from differing neuropsychological batteries will enable interrogation of existing databases and thereby greatly accelerate the validation process. We outline the advantages of a cognition endpoint and sketch a road map for its development and validation.
Cognitive impairment is a core clinical feature of TBI. Depending on the severity of brain injury, cognition is often severely affected in the acute post-injury stage and gradually improves over time, with a slowing trajectory. 9 –11 Even the mildest of TBIs at least transiently impacts cognition. 12 The risk of persistent or permanent cognitive impairment increases with the severity of injury. 9 –11,13 –15 Patient characteristics such as age further modify the recovery curve. 16 –19 Cognition is associated with functional outcome in most studies, as measured by rehabilitation gains, functional independence, community reintegration, and employment. 20 –25 Given its prevalence and clinical significance, cognitive impairment is a key therapeutic target after TBI and requires sound measurement tools. It has been the primary outcome for efficacy in major TBI trials 26,27 but far less frequently than the GOS-E and similar global outcome scales. 6
Cognition has several distinct advantages as an endpoint for TBI trials. First, cognition can be reliably measured by performance-based neuropsychological tests, which have a rich scientific literature in general 28 and in TBI in particular. 10,11,29 Second, a cognition endpoint can provide improved granularity over global functional outcome scales. In particular, neuropsychological tests may extend the dynamic range upward by detecting subtle differences among persons who score at the ceiling of the GOS-E. 30,31 Third, compared with disability rating scales, cognition is more proximal to the TBI neuropathology such that it should be more responsive to treatments that alter neurophysiology while it should also be less influenced by non-TBI factors such as social support and comorbid bodily injures.
Fourth, cognition is functionally relevant. It underlies TBI-related challenges with daily functioning, or in the World Health Organization's International Classification of Functioning framework, cognition links body structure and activity/participation. 32 Fifth, a cognition endpoint could facilitate the transition from pre-clinical drug development to human clinical trials. Cognition, typically measured in animals by the Morris Water Maze task or variant, is one of the most widely used endpoints in rodent models of TBI. 33,34 A discordance between the endpoints in pre-clinical studies (cognition) and Phase III clinical trials (global disability) might partly explain why promising pre-clinical evidence fails to translate into positive human trials. 35
Sixth, cognition has value as a prognostic marker. Cognition can be assessed early, before hospital discharge. 20 The strongest predictors of outcome from moderate to severe TBI (e.g., pupil response, computed tomography (CT) findings, and Glasgow Coma Scale) 36 lose their prognostic power when applied to patients with mild TBI. 37 Early neuropsychological testing, on the other hand, robustly predicts long-term outcome across the range of TBI severity. 20,25,38,39 Therefore, a cognition outcome can be used to enrich clinical trials (e.g., enrolling only patients who are likely to have persistent cognitive impairment) or enhance the power of clinical trials through risk stratification or covariate adjustment. 40
Finally, cognition is a final common pathway of diverse TBI mechanisms, such as focal contusions, hematomas, diffuse axonal injury, edema, cellular dysfunction (e.g., excitotoxicity, calcium overload, oxidative stress, mitochondrial dysfunction, and inflammation), impaired synaptic transmission, cell death (necrosis or apoptosis), and axonal degeneration. 41 Cognition is therefore well suited to measure benefit from therapeutics that influence multiple neurophysiological processes. For example, citicoline was recently evaluated in a large TBI trial, where it was hypothesized to exert neuroprotective effects and promote neurorecovery through numerous mechanisms. 26 Combination therapies, which are increasingly studied in TBI, by nature have diverse mechanisms of action and so require a downstream endpoint. 42 Most nonpharmacological interventions also have complex neurophysiological mechanisms (e.g., hyperbaric oxygen therapy) 43 or they target cognition directly (e.g., attention training). 44
Remarkably, there is no well-defined, widely accepted, and validated cognition endpoint for TBI. There is incomplete to no overlap in the test batteries used in past and ongoing clinical trials and observational studies. Further complicating matters, neuropsychological test data have been scored and analyzed in several different ways, reflecting a lack of agreement as to how multidimensional outcomes in clinical trials should be handled. Selecting a cognition endpoint and analytic approach are not trivial decisions. They directly influence sample size requirements, stopping rules for futility, and conclusions about treatment efficacy. Before initiating efforts to validate a cognition endpoint, it is essential to decide which tests to use and how to combine information from those tests. Only then will we have a single “measure” to validate.
Which Tests?
No single neuropsychological test captures the diversity of cognitive impairments seen after TBI. Adequate measurement of cognitive outcome from TBI requires administering a battery of tests that, at minimum, cover the domains of attention, memory, processing speed, and executive functioning. 45,46 A large number of neuropsychological tests are available to measure these functions. 28
Over the past 25 years, there have been several attempts to endorse and promote particular neuropsychological tests for TBI clinical trials. The National Institutes of Health/National Institute of Neurologic Disorders and Stroke (NIH/NINDS) sponsored a conference in 1991 to identify appropriate outcome measures for TBI clinical trials. 47 The NINDS Head Injury Centers subcommittee met the following year and advised a broader battery to use when a richer characterization of cognition is desired. 48
The National Institute of Child Health and Development-sponsored TBI Clinical Trials Network was established in 2008 to support multicenter clinical trials. 46 They selected a battery of neuropsychological tests for the Citicoline Brain Injury Treatment Trial 26 and presumably others to follow. More recently, the NINDS Common Data Elements (CDE) project was undertaken to standardize outcome measurement for TBI research. One aim of this project was to improve the precision and consistency of measuring treatment effects. 45 In 2010, the NINDS-CDE TBI Outcomes Workgroup recommended a set of core measures across multiple domains, including neuropsychological functioning, with a revised set (version 2.0) in 2012. 49 The specific tests and group endorsing them are summarized in Table 1.
NIH/NINDS, National Institutes of Health/National Institute of Neurologic Disorders and Stroke; TBI, traumatic brain injury; NINDS-CDE, NINDS Common Data Elements; BVMT-R, Brief Visuospatial Memory Test-Revised; CVLT, California Verbal Learning Test-2; COWA, Controlled Oral Word Association test; MAE, Multilingual Aphasia Examination; PASAT, Paced Auditory Serial Addition Test; RAVLT, Rey Auditory Verbal Learning Test; SDMT, Symbol Digit Modalities Test; CWIT, Color-Word Interference Test; WAIS, Wechsler Adult Intelligence Scale; WCST, Wisconsin Card Sorting Test.
Not recommended for severe TBI.
Designated as “supplementary.”
A large number of additional tests (not shown in this table) were designated as “supplementary.”
The NIH Blueprint for Neuroscience Research (
Another option, rather than settling on a particular set of tests, is to instead validate a composite scoring system that can be used for differing or even nonoverlapping test batteries, provided they meet some minimal criteria (e.g., coverage of cognitive domains, reliability, and adequate normative data). Recent research supports that a cognition composite based on different tests that measure the same construct (e.g., Rey Auditory Verbal Learning Test vs. California Verbal Learning Test) perform very similarly. 52 Batteries that are composed of different tests cannot be expected to perform comparably to one another when the differences in battery composition become extreme. For example, a composite score derived from a brief battery of mostly speeded tests would likely have different properties than one derived from a more comprehensive battery with balanced coverage of processing speed, attention, memory, and executive functioning.
Further empirical work will be necessary to identify the limits of flexibility with respect to the number and type of tests. A composite that can be derived from multiple neuropsychological batteries will permit harmonization of these disparate measures across existing databases, thereby permitting interrogation of measures with more highly powered analyses, thus accelerating the validation process.
How to Combine Information from the Tests in the Battery?
Neuropsychological test data have been analyzed in numerous ways, reflecting a lack of agreement as to how to handle multidimensional outcomes in clinical trials. Each of these methods has unique limitations. The most common approach has been to perform group comparisons for each neuropsychological test in the battery. For example, High and associates 53 compared patients receiving growth hormone replacement versus placebo on a comprehensive battery of neuropsychological tests. They found an “encouraging” (pg. 1573) treatment effect on total correct responses on the Wisconsin Card Sorting Test, but no significant group differences on another 27 measures. This approach risks mixed findings or chance positive findings (Type I error). Corrections for the family-wise error rate can guard against Type I error, but at the cost of sacrificing statistical efficiency.
Multivariate techniques such as multivariate analysis of variance/multivariate analysis of covariance (MANOVA/MANCOVA) are also widely used, such as in a study of prescribed rest for sport-related concussion. 54 Methods for translating the variance-accounted-for effect size (e.g., eta-squared) yielded by MANOVA/MANCOVA into a clinically meaningful metric are lacking. 55 Multivariate techniques can also provide misleading, biologically implausible results. 46 For example, a large improvement on one cognitive test and small decline on several other cognitive tests in the battery could lead to rejection of the null hypothesis.
Another solution implemented in at least one TBI trial (evaluating citicoline) 26 is a global test procedure based on the logistic model. This approach does not grade the severity of impairment on each test and assumes a common treatment effect across all tests in the battery.
Finally, there are nonparametric alternatives. For example, a study investigating the side effects of valproate in TBI 56 rank ordered neuropsychological test scores within the sample, averaged the ranks across the tests in the battery, and performed nonparametric statistics on the resulting mean rank score.
There are several advantages to analyzing a cognition composite endpoint instead of a set of neuropsychological test scores. First, having a single prospectively defined primary outcome is required by the Food and Drug Administration (FDA) to license new drugs. 57 Second, a single endpoint is also more readily subjected to validation studies such that the endpoint can be qualified for drug development research. Third, more flexible and powerful statistical approaches can be used to analyze a single endpoint. For example, generalized linear modeling and extensions can be used for an endpoint with any distribution that was measured across any number of time points. Fourth, clinical trials with a single endpoint can be readily meta-analyzed to more definitively establish treatment efficacy. Finally, a clinical trial with a single endpoint produces findings that will be easier for patients and their caregivers to understand, 58 empowering them to make informed treatment decisions.
Although a cognition composite endpoint integrates information from multiple tests, it can be constructed in a manner that facilitates straightforward interpretation, with higher scores indicating better (or worse) cognition. This is in contrast to truly multidimensional endpoints that integrate diverse outcomes, such as mortality and functional, neuropsychological, and patient-reported measures. 59
Approaches to Creating a Cognition Endpoint
The simplest and most widely used contemporary method for creating a composite score is to place the individual test scores on a common metric (e.g., norm-referenced z, T, scaled score, percentile, or other standard scores) and average them, producing what can be called the Overall Test Battery Mean (OTBM). 60 The OTBM has been shown to discriminate between levels of TBI severity. 29 and it has been used for clinical trials in TBI. 43
An alternative approach is to use Item Response Theory (IRT)-based methods to model latent ability based on a set of neuropsychological test scores. A model is built using confirmatory factor analysis in which cognitive test scores serve as indicators of the latent trait, and the model that fits the available data and is concordant with cognitive theory is selected. 61 The latent ability model captures variance across cognitive tests and can also account for covariance attributable to methods effects or theoretical similarities across cognitive subtests. 62
A latent ability composite score is then computed for each participant in the dataset; the scale is typically and arbitrarily anchored to a mean of 0 and variance of 1. The latent ability composite score can be validated by comparing it with constituent cognitive test scores or other relevant data with respect to strength of association with markers of cognition or disease (e.g., imaging or other biomarkers) or ability to detect change over time. This approach has proven valuable for dementia research 63,64 but it has not yet been applied to TBI.
The primary advantages of a latent ability composite are that it has linear measurement properties and accounts for differences among tests in the battery with regard to precision and relationships with demographic variables. Linear measurement refers to the notion that differences between points on a test or scale are equal across the range of the scale. Linear scaling has the potential to improve precision across a wider range of cognitive ability and reduce bias in estimating changes in cognition over time. 65
When constituent cognitive tests vary in difficulty and precision (e.g., one test measures high cognitive functioning with good precision and another precisely measures low cognitive functioning), the latent ability score can measure the underlying trait well across the full range of ability. A further advantage of this approach to modeling is its ability to quantify and account for differences in test performance across demographic groups that persist after controlling for the underlying ability measured by all of the tests (differential item functioning). 66
Each test that contributes to the latent ability composite can be evaluated for differential item functioning effects; that is, systematic bias for people with certain demographic profiles. It is possible to model the direct effect of pre-injury variables (e.g., pre-morbid intelligence) on latent ability, as well as their relationship with the test scores independent of latent ability. Latent ability composite scores have been created for Alzheimer disease 63,64 but not yet for TBI.
A third possibility is to aggregate information about the number and severity of low scores in a neuropsychological battery. This may be particularly important for TBI because the neuropathology and resulting neuropsychological deficits after TBI are heterogeneous. 67,68 As well, the pathophysiological mechanisms underlying cognitive impairment may be different during the acute versus chronic phases. Information from a neuropsychological battery should be combined in a manner that does not “wash out” individual differences. In a clinical trial, prominent treatment effects on certain tests could be averaged out by relatively stable performances in other areas.
We refer to a cognition composite score created by weighting the number and severity of low performances as a Neuropsychological Deficit Score (NDS). The NDS can be derived by assigning a pre-specified weight to the norm-referenced percentile score for each test (where greater weights are assigned as observed scores deviate farther from normative expectations), summing the weights, and then dividing the sum by the number of tests in the battery. The NDS has a legacy in neuropsychology, with the Halstead Impairment Index (circa 1955), Average Impairment Rating (circa 1970), General Neuropsychological Deficit Scale (circa 1988), and Global Deficit Scale (circa 1994). 69,70
An NDS for TBI could extend these methods by (1) creating finer gradations, because responsiveness (sensitivity to change) is a highly desirable property of clinical trial endpoints; (2) raising the ceiling, to potentially better detect impairment in patients with milder injuries and/or high pre-morbid ability; and (3) differentially weighting deficit levels informed by multivariate base rates in healthy persons. In analyzing multivariate base rates, we have previously shown 71 –75 that (1) a substantial percentage of healthy persons obtain one or more low scores when administered a battery of neuropsychological tests; (2) the more tests that are administered, the more likely it is for a healthy person to obtain a low score; and (3) there are differences in the prevalence of low scores in healthy persons associated with level of education, race, and level of intellectual ability. Therefore, these factors need to be considered in the development of an algorithm for the NDS.
Validating a Cognition Endpoint
Extensive research is required before a cognition endpoint will be ready for clinical trials, and more broadly, for use as an outcome measure in TBI research. The three approaches to creating a cognition endpoint outlined above (OTBM, latent ability modeling, NDS) should be evaluated and compared on multiple aspects of validity, including (1) diagnostic validity, i.e., the ability of the composite scores to discriminate between patients with TBI versus with bodily injuries not involving the head (controls) throughout the TBI severity spectrum; (2) concurrent validity, i.e., the strength of the relationships between the composite scores and other TBI outcomes such as trauma-related intracranial abnormalities on magnetic resonance imaging and patient-reported outcomes; (3) responsiveness to TBI recovery, i.e., sensitivity of the composite scores to change with time since TBI; (4) prognostic validity, i.e., the ability of early neuropsychological testing to independently predict long-term TBI outcomes.
It may be possible to further improve a cognition composite derived from any of the above approaches (OTBM, latent ability modeling, and NDS) by accounting for pre-injury individual differences. Age, sex, race, education level, and pre-morbid intelligence explain at least 40% of the variability in cognitive performance before injury. 76 Neuropsychological tests may therefore systematically over- or underestimate cognitive impairment after TBI in patients with certain demographic profiles. TBI “signal” could theoretically be better isolated by removing the “noise” variance associated with pre-morbid functioning. Demographic characteristics, however, appear to contribute to TBI outcome through multiple pathways 77 such that attempting to remove their contribution to a cognition composite might weaken, not strengthen, outcome prediction. 66 This needs to be examined carefully through applied research studies with TBI patients across the spectrum of concussion to lengthy coma.
The cognition endpoint emerging from head-to-head validity testing will ideally further demonstrate that it can capture small but important intra-individual changes. Its Minimal Clinically Important Difference could be derived through distribution-based methods (e.g., Reliable Change Index) and by anchoring to external criteria (e.g., change from one level of the GOS-E to the next among persons below the ceiling of the GOS-E). 78 Reanalyzing clinical trial results or simulating treatment effects in cohort databases could help contrast the responsiveness of a cognition composite with traditional approaches to analyzing neuropsychological outcomes (e.g., group mean comparisons on individual tests).
Finally, if not tied to a fixed battery of neuropsychological tests, the context in which the cognition endpoint will be used must be further refined. The endpoint cannot be expected to perform similarly for all possible neuropsychological test batteries. It will be important to examine the equivalence of partially overlapping and nonoverlapping test batteries, of variable size (e.g., 4 through N tests) and coverage of cognitive domains, with simulation studies using healthy normative samples as well as TBI samples. This will inform the requirements for a minimally adequate set of tests on which to derive the composite.
Much of this work can be accomplished using existing databases. Responding to the US Department of Defense Psychological Health and Traumatic Brain Injury Research Program's January 2014 call for proposals to address the problem of insensitive outcome measurement, the TBI Endpoints Development (TED) Initiative received a $17 million 5-year award to improve clinical trial methodology by advancing outcome measurement. The investigators have compiled the TED metadataset, which integrates data from eight TBI trials involving more than 3500 patients. More patients and studies are expected to be added.
The TED metadataset contains longitudinal demographic, clinical, biomarker, and neuropsychological data from civilian, sport, and military cohorts (
Regulatory Requirements
An important goal of TBI clinical trials is to achieve market-ready therapeutics that improve patient outcomes. Qualification of a primary endpoint through the FDA's Drug Development Tool program can facilitate FDA approval for new therapeutics. To be considered for qualification, the FDA requires that a Clinical Outcome Assessment be supported by evidence of being “well-defined and reliable” in measuring a Concept of Interest (e.g., a symptom, such as fatigue) in a specified Context of Use (COU). 80
The FDA defines COU as a comprehensive statement that fully and clearly describes the way an outcome measure is to be used and the drug development-related purpose of the use (e.g., enriching a clinical trial in mild to moderate TBI by selecting only patients with early neuropsychological impairment). The COU delineates the boundaries within which the available data adequately justify use of an outcome measure and describes important criteria regarding the circumstances under which the measure is qualified.
The FDA recognizes different types of clinical outcome measures. The cognition composites described above are performance outcomes for cognition (the construct of interest). They are derived from a series of standardized tasks performed by a patient according to instructions from a trained professional.
Like all endpoints, a cognition composite will only be valid in a defined COU. We anticipate that a cognition composite will be appropriate as a primary or secondary endpoint for studying interventions that target cognition directly (e.g., attention training) or underlying neurophysiological processes that support brain function (e.g., with a neuroprotective drug). It is not appropriate as the sole endpoint for interventions designed to promote survival, but might enhance such a trial with further understanding of the implications of those interventions on cognition.
For interventions with a known specific mechanism, a more proximal outcome would be preferable (e.g., slowed intracranial hemorrhage growth on serial CT for tranexamic acid), at least in the early stages of drug development. 81 For interventions that target a narrow aspect of cognition (e.g., mental imagery for prose memory), 82 a specific cognitive domain score may be more appropriate than a global cognition composite.
Neuropsychological tests are not designed to capture the transition from coma to minimally conscious state. An aspirational goal is to develop a cognition composite that will be suitable for patients with mild to severe TBI who are in the post-acute stage of recovery, excluding those with a persistent disorder of consciousness. Studies are needed to determine whether the cognition composite retains psychometric robustness (1) across the full range of pre-morbid ability, TBI severity, and time since injury, and (2) with variations in the specific tests that comprise it. Empirically based consensus recommendations on specific measures for TBI clinical trials can then provide guidance on the COU (i.e., what measures to use based on acuity, population, setting, and form of intervention under study).
Limitations of Cognition Endpoint
We advocate for a cognition composite score as a primary endpoint for clinical trials that target cognition directly and/or multiple neurophysiological processes upstream to cognition. A number of other outcome assessments should be developed and validated alongside, however. Most neuropsychological tests are insufficiently process-pure for mechanistic targeting. 83 Cognition is also influenced by medical and psychiatric comorbidities (e.g., mood disturbance, sleep disorders, and chronic pain) commonly encountered after TBI. Especially in the proof-of-concept and dosing phases of drug development, it will be important to have intermediate biomarkers that quantify the drug mechanisms (e.g., reducing inflammation, edema, ischemia, or oxidative stress) and link the mechanism(s) to changes in cognition. 3 We expect that genotyping, 84 serum proteins, 85 neuroimaging, 86 and other biomarkers obtained soon after TBI will improve outcome prediction above and beyond neuropsychological testing, further refining risk stratification and enhancing the power of clinical trials.
A comprehensive picture of the therapeutic benefits of a new intervention requires supplementing neuropsychological testing with measures of neurobehavioral symptoms, psychological health, physical functioning, and life participation. 45,48 Some researchers have proposed that a multidimensional endpoint that captures several or all of these domains is most appropriate for TBI clinical trials. 7,59,87 For example, a battery of neuropsychological tests supplemented with functional, mood, and behavioral outcome measures could be analyzed using statistical methods that accommodate multiple dependent variables. Figure 1 illustrates this model.

Framework for multidimensional outcome assessment for traumatic brain injury clinical trials.
We believe that a single pre-specified endpoint supplemented by a comprehensive battery of secondary outcomes has more advantages (summarized above) and less problematic limitations than multidimensional outcomes. The main advantages of a multidimensional endpoint are statistical efficiency and the ability to capture a range of important outcomes in a single metric. Disadvantages of multidimensional endpoints include having to make assumptions about the magnitude of the treatment effect across outcome domains, including domains that are less influenced by the treatment, obtaining stronger treatment effects for less important outcomes, susceptibility to post hoc “cherry-picking,” and that interpretation (explaining what the treatment effect means) can be challenging. 58,88
Several potential threats to the validity of a cognition endpoint are anticipated. First, a cognition endpoint may be systematically biased in patients with diverse language and cultural backgrounds. 89 Second, interpreting neuropsychological performance as a measure of cognitive ability rests on the assumption that the examinee put forth good effort. We know that this assumption is sometimes violated, especially in TBI-related disability claim contexts but in other settings as well. 90 –92 Several relatively brief performance-based measures of response bias are now available, and could be included in a research battery.
Third, when a neuropsychological test battery is administered at baseline (pre-treatment) and again after treatment, some improvement is expected on the basis of previous exposure (e.g., via practice effects). Fortunately, this is only an issue for certain clinical trial designs (e.g., ABA) and can sometimes be mitigated by “washing out” practice effects with repeated baselines 93 or by using psychometrically comparable alternate forms. Finally, patients with severe TBI may be untestable at the first time point, creating “Missing Not at Random” data. Detailed completion codes may help attenuate or at least quantify this bias. 46,94
As mentioned above, patients who have a persistent disorder of consciousness will likely fall outside of the COU for a cognition endpoint because neuropsychological testing will be insensitive to clinically meaningful improvements in alertness and responsiveness. Alternative methods for measuring therapeutic benefit within this lower end of the ability spectrum, such as neurobehavioral rating scales, will be preferable. 95
Whereas cognitive impairment remains detectable many years after severe TBI, 13 –15,19 impairment is no longer detectable at the group level by 3 months after a mild TBI in most controlled studies. 11 There is a debate in the literature as to whether neuropsychological tests are inadequately sensitive to detect subtle deficits in post-acute mild TBI (i.e., the tests are too imprecise to detect the signal) or whether mild TBI does not result in lasting neurocognitive impairment (i.e., there is no signal to detect). 96 One possible explanation is that a minority of patients with mild TBI have residual impairments and that this subgroup is masked by group comparisons. 97 Case-control studies that address selection and attrition bias are needed to further evaluate this hypothesis. An impaired subgroup argues strongly for having validated neuropsychological metrics for enriching clinical trials.
Conclusions
TBI intervention research will benefit greatly from a well-validated global cognition endpoint. Adequately measuring cognitive outcome from TBI requires a conceptually sound and psychometrically sophisticated method for integrating information from a battery of neuropsychological tests. A program of research is needed to develop, refine, and evaluate a cognition composite score for TBI in a manner that aligns with FDA regulatory requirements.
A starting point is to derive and compare candidate cognition composite scores created through both traditional and innovative methodologies. Preliminary validation studies will help identify the most promising cognition composite score and assess its readiness for TBI outcome research. Examining how the cognition composite performs across the full range of pre-morbid ability, TBI severity, and time since injury (acute, subacute, and chronic) will lead to evidence-based recommendations for how and when the cognition composite should be used as an endpoint for TBI clinical trials.
Footnotes
Author Disclosure Statement
GLI has been reimbursed by the government, professional scientific bodies, and commercial organizations for discussing or presenting research relating to MTBI and sport-related concussion at meetings, scientific conferences, and symposiums. He has a clinical practice in forensic neuropsychology involving individuals who have sustained mild TBIs. He has received honorariums for serving on research panels that provide scientific peer review of programs. He is a co-investigator, collaborator, or consultant on grants relating to mild TBI funded by several organizations. He has received grant funding from pharmaceutical companies to do psychometric research using neuropsychological tests. He has received research support from neuropsychological test publishing companies in the past (not in the past 5 years). For the remaining authors, no competing financial interests exist.
NDS receives salary support from a Vancouver Coastal Health Research Institute Clinician-Scientist Career Development Award. GLI acknowledges support from the INTRuST Posttraumatic Stress Disorder and Traumatic Brain Injury Clinical Consortium funded by the Department of Defense Psychological Health/Traumatic Brain Injury Research Program (X81XWH-07-CC-CSDoD), Harvard Integrated Program to Protect and Improve the Health of NFLPA Members, and the Mooney-Reed Charitable Foundation. GTM acknowledges support from NIH U01 NS086090-01, DOD USAMRAA W81XWH-13-1-0441, DOD W81XWH-14-2-0176, and One Mind. This work is related to the TBI Endpoints Development Initiative and a grant entitled Development and Validation of a Cognition Endpoint for Traumatic Brain Injury Clinical Trials.
