Abstract
Diffusion tensor imaging (DTI) of brain white matter (WM) may be useful for characterizing the nature and degree of brain injury after sport-related concussion (SRC) and assist in establishing objective diagnostic and prognostic biomarkers. This study aimed to conduct a systematic review using an a priori quality rating strategy to determine the most consistent DTI-WM changes post-SRC. Articles published in English (until June 2020) were retrieved by standard research engine and gray literature searches (N = 4932), using PRISMA guidelines. Eligible studies were non-interventional naturalistic original studies that conducted DTI within 6 months of SRC in current athletes from all levels of play, types of sports, and sex. A total of 29 articles were included in the review, and after quality appraisal by two raters, data from 10 studies were extracted after being identified as high quality. High-quality studies showed widespread moderate-to-large WM differences when SRC samples were compared to controls during the acute to early chronic stage (days to weeks) post-SRC, including both increased and decreased fractional anisotropy and axial diffusivity and decreased mean diffusivity and radial diffusivity. WM differences remained stable in the chronic stage (2–6 months post-SRC). DTI metrics were commonly associated with SRC symptom severity, although standardized SRC diagnostics would improve future research. This indicates that microstructural recovery is often incomplete at return to play and may lag behind clinically assessed recovery measures. Future work should explore interindividual trajectories to improve understanding of the heterogeneous and dynamic WM patterns post-SRC.
Introduction
Sport-related concussion (SRC) is a relatively common form of traumatic brain injury (TBI). It can occur in any sport when there is a blow to the head, neck, or body that sends a strong force to the head, with diagnosis based on international consensus guidelines. 1 Compared to other forms of TBI, SRC is characterized by a high chance of repetitive concussive blows and contact given that playing sport is often an ongoing activity. SRC is an important public health issue, given that it impacts all levels of play (amateur to professional) across a variety of sports. 2,3 It is estimated that 1.6–3.8 million incidences occur every year in the United States. 4 After an SRC, persons may experience rapid-onset short-lived neurological impairments and a range of clinical signs and symptoms, including possible loss of consciousness, headaches, balance problems, dizziness, and cognitive changes, which usually subside within the first 2 weeks post-injury. Persistent post-concussive clinical symptoms can occur for some athletes, 1 with ∼15% of athletes returning to sport after >30 days. 5
The neuropathophysiology of SRC is not well understood. 6,7 Typically, there appears to be no gross brain pathology, but a range of biochemical and neurophysiological changes that cause dysfunction at the cellular level, including transient neurotransmitter dysregulation, metabolic changes, and neuroinflammation. 8 Additionally, abnormal brain perfusion and microstructural damage can occur post-SRC, particularly in deep subcortical white matter (WM). 9 WM sequences (such as T1W, T2W, and T2 fluid-attenuated inversion recovery) are not sufficiently sensitive to reliably detect WM microstructural changes. In contrast, more advanced brain imaging techniques, such as diffusion tensor imaging (DTI), can provide information about changes in WM microstructural integrity. DTI changes can reflect myelin alteration, differences in WM fiber density, alterations of axonal membrane permeability, and changes in axon density. 10 In the past decade, diagnostic guidelines have recommended further studies to determine the clinical utility of DTI for improving SRC diagnosis and prognosis. 11 This has resulted in a rapidly expanding SRC-DTI literature in recent years.
There have been six systematic reviews between 2012 and 2019 investigating the characteristic patterns of WM post-SRC, 9,12 –16 with the most recent review including peer-reviewed literature up to January 2019. 9 The reviews have reported evidence for widespread WM-DTI metric changes after SRC, with major inconsistencies in the direction and magnitude of these changes. In these reviews, higher, lower, and no change in magnetic resonance (MR) diffusion metrics (i.e., axial diffusivity [AD], fractional anisotropy [FA], mean diffusivity [MD], and radial diffusivity [RD]) have been reported post-SRC when compared to controls. Thus far, no clear conclusions regarding DTI's potential clinical utility in SRC populations have been reported.
Divergent findings within the literature relate to variability in study characteristics (e.g., SRC diagnosis, participant demographics, and sports played), study design (e.g., cross-sectional, longitudinal, or period of study post-injury), substantial interstudy variability in DTI methodology (e.g., b-value, number of gradient directions), reported DTI parameters (e.g., AD, FA, MD, and RD), the analytical protocol (e.g., brain areas studied, tractography), as well as other underlying pathology. Further, divergent conclusions across previous systematic reviews relate to disparities in study eligibility criteria, including measurement at different time points post-SRC (e.g., acute vs. chronic outcomes 9,12,13,15,16 ), eligibility of retired athletes, 12,13,16 eligibility of participants with subconcussive impacts rather than formally diagnosed SRC, 9,15,16 and eligibility of various study designs (e.g., prospective, longitudinal studies 9 ). These methodological differences contribute to large variations in study quality.
In light of these disparities, this systematic review aimed to synthesize the rapidly expanding literature and determine the clinical utility of DTI in the acute (0–6 days), subacute (7–30 days), and chronic stages (1–6 months) post-SRC when applying strict eligibility criteria and quality assessment measures to overcome previous study and review limitations. The current review focused on studies performing assessments within 6 months of formally diagnosed SRCs in current athletes. This time frame reflects the most beneficial diagnosis period. The exclusion of undiagnosed SRC avoids confounds of other sport-related pathology or other forms of mild TBI, 17 and the exclusion of retired athletes avoids WM variance related to older age. 15
Additionally, and most important, because previous reviews have shown heterogeneous methodologies and major inconsistencies in the direction/presence of effects for DTI metrics after SRC, the current review used an a priori study-quality strategy that only extracted and compared results from high-quality studies. This approach has not been used in previous systematic reviews of this literature. Importantly, this strategy aligns with current efforts in the field to improve the use of DTI in research (e.g., TRACK-TBI group 18 and National Institute of Neurological Disorders and Stroke [NINDS] Common Data Elements [CDEs] 19 ) and address current gaps that limit clinical translation. 14
Methods
Methods for analysis and inclusion criteria were registered in the PROSPERO (the International Prospective Register of Systematic Reviews) database for systematic reviews (protocol ID: CRD42018094340). The methodology for the current systematic review uses PRISMA (Preferred Reporting Items for Systematic reviews and Meta-Analyses) guidelines. 20
Search methods
Study retrieval occurred by searching the following databases (from inception to June 1, 2020): Medline, EMBASE, PsycInfo, PsycExtra, PsycArticles, Cumulative Index to Nursing and Allied Health Literature, Academic Search Premier, SPORTdiscus, PubMed, Scopus, Informit, Web of Science, and multiple Cochrane databases (i.e., Reviews, DARE, and Centre for Controlled Trials). Search strategies were created with the assistance of Research Librarians (see Supplementary Table S1). Search terms (n = 56) were generated from the keywords “diffusion tensor imaging,” “concussion,” and “sport” (see Supplementary Table S2 for further details).
To cover the gray literature, the first 200 articles (with no date restriction) from a manual Google Scholar search titled “diffusion tensor imaging in sports concussion” were searched. A snowballing technique was also applied with the retrieval of references from citation by the Google Scholar citation option from all eligible studies. All processes and outcomes are outlined in Figure 1.

PRISMA flow chart.
Eligibility criteria
Studies were eligible for inclusion in the review if they met the following pre-specified criteria: 1) included only sports-related TBI; 2) examined current human athletes of either sex from any level of play and types of sports; 3) DTI conducted within 6 months of injury to ensure that any SRC-related brain injury was captured within a clinically meaningful period; 4) reported on cross-sectional, observational, cohort, correlational, or longitudinal study design to allow a comprehensive overview of naturalistic articles; and 5) the study was original research published in a peer-reviewed journal. Studies were excluded if participants included retired athletes, if the study reported on non-naturalistic study designs (e.g., conference abstracts, reviews, commentary articles, or randomized controlled trials), or if the study was not available in English.
Study screening
One reviewer (N.E.) screened all titles and abstracts to identify studies that were potentially relevant for inclusion in the review (k = 491). All decisions were checked by a second reviewer (L.C.). If needed, study authors were contacted by e-mail to obtain further information to determine the eligibility of the study (authors of 15 studies were contacted; nine authors did not respond, four provided the requested information, and two did not provide the requested information). The snowballing technique was then applied where the reference lists of identified articles were screened for suitable studies, and one additional eligible record was obtained. Next, full-text versions were screened to further determine the eligibility for inclusion.
Data extraction process
Information was extracted regarding general study characteristics (i.e., authors, publication details, funding, study design, and inclusion/exclusion criteria), sample characteristics, SRC diagnosis, clinical assessment, DTI acquisition and analyses, statistical analyses, study findings, and study strengths and weaknesses. Study findings of interest included conventional DTI metrics (i.e., AD, FA, MD, and RD) and associations between clinical or cognitive assessments and DTI results. A quantitative synthesis was conducted by directly extracting and/or computing effect sizes for all available data in each individual study. Effect sizes were computed from group means and standard deviations (SDs), t values, or p values depending on what data were available. Average effect sizes for each DTI metric reported in each study were calculated by averaging all regional effects (Figs. 2 –4). Cohen's d was used as the measure of effect, where 0.2 = small, 0.5 = medium, and 0.8 = large. Group-level regional effects are provided in Supplementary Figures S1–S3.

Acute effects of sports-related concussions from high-quality studies (0–6 days).

Subacute effects of sports-related concussions from high-quality studies (7–30 days).

Chronic effects of sports-related concussions from high-quality studies (1–6 months).
Risk of bias assessment and optimization of the National Institutes of Health quality assessment tool
Extracted studies were rated to assess the risk of bias attributable to flaws in study methodology. The National Institutes of Health (NIH) Quality Assessment Tool for Observational and Cohort and Cross-Sectional Articles 21 was modified to suit the context of SRC and DTI research (see Supplementary Table S3). Details of the modified tool are provided in the Supplementary Materials. Briefly, key modifications included: 1) in-depth quality assurance analysis of the SRC definitions, the DTI methods and analytical protocol (given that these aspects were central to the primary aim of this review); 2) subdividing quality assessment questions into multiple subdomains with independent scores; 3) including subsections applicable to specific study designs (e.g., attrition rate in longitudinal design); 4) implementation of a percentage-weighted scoring system; and 5) grouping of studies into three quality categories based on their weighted ratings and quartile. (i.e., low [quartile 1], medium [quartile 2], and high [quartiles 3–4]).
The modified quality tool included 14 questions related to study population, sample size, study design (e.g., cross-sectional vs. longitudinal), validity of measures, analysis procedure, and appropriate consideration of confounding factors. Question 11 was used to assess the DTI protocol and DTI analyses in detail (see Supplementary Table S3 and legend).
Quality ratings were completed by two independent raters (N.E. and L.J.). Agreement between raters was adequate with an intraclass correlation of 0.74 (p = 0.004). This was followed by a final consensus discussion between raters to fully review the unmatched low-, medium-, and high-quality classifications. After this discussion (with L.C.), the absolute agreement intraclass correlation coefficient on the quality rating scores was excellent (0.96; p < 0.001). The reviewers who completed the rating had expertise in neuropsychology (N.E., L.C.), neuroimaging (L.C., L.J.), and TBI (N.E.).
Results
Studies' demographic, sampling, ascertainment, timeline, clinical, and diffusion tensor imaging characteristics
There were 29 eligible studies (20 longitudinal, nine cross-sectional), published between 2010 and 2020 (Fig. 1; Table 1). Studies report on athletes in the United States (16 of 29 studies) and Canada (13 of 29 studies). There was considerable growth in the number of published studies, with 12 of 29 published since the last systematic review (2019–2020).
Summary of Study Characteristics (k = 29)
AAN, American Academy of Neurology; ACRM, American Congress of Rehabilitation Medicine; AD, axial diffusivity; ADC, apparent diffusion coefficient; ADHD, attention-deficit hyperactivity disorder; AIS, Abbreviated Injury Scale; ANAM, Automated Neuropsychological Assessment Metrics; BESS, Balance Error Scoring System; CT scan, computed tomography scan; dir, directions; FA, fractional anisotropy; GAD-7, Generalized Anxiety Disorder 7-item; GCS, Glasgow Coma Scale; HAM-A, Hamilton Anxiety Rating Scale; HAM-D, Hamilton Depression Rating Scale; ImPACT, Immediate Post-Concussion Assessment and Cognitive Testing; Kax, axial kurtosis; Krad, radial kurtosis; LD, learning disorder; LOC, loss of consciousness; MD, mean diffusivity; MK, mean kurtosis; MRI, magnetic resonance imaging; mTBI, mild traumatic brain injury; N/A, not applicable; PCSC, Post-Concussion Symptom Checklist; PCSS, Post-Concussion Symptom Scale; PTA, post-traumatic amnesia; PTSD, post-traumatic stress disorder; RD, radial diffusivity; SAC, Standardized Assessment of Concussion; SCAT, Sport Concussion Assessment Tool; SDMT, Symbol Digit Modality Test; SRC, sports-related concussion; T, Tesla; TBI, traumatic brain injury; TMT, Trail Making Test; PHQ-9, Patient Health Questionnaire- 9; WM, white matter; WTAR, Wechsler Test of Adult Reading.
The timing of the DTI post-SRC ranged from acute (0–6 days), subacute (7–30 days), to chronic (>1 month to ∼6 months) periods, with 20 studies conducting a DTI scan during the acute phase. 22 –41 Two studies did not provide information regarding timing of DTI post-SRC 42,43 ; subsequently, study authors clarified that it was within the eligible 6-month time frame.
All but three studies 23,37,38 conducted clinical assessments post-SRC, with 11 studies also conducting further follow-up assessments 24 –29,31,36,39,40,44 and 11 conducted pre-SRC clinical assessment. 22,25,30,31,33,36,39 –42,44 Clinical assessment was primarily conducted through brief screening assessment tools. One study used a qualitative measure of concussive symptomatology. 45 Traditional neuropsychological measures were rarely used (6 of 29 studies 29,35,39,40,46,47 ).
FA was quantified in all but one study, 39 MD in 20 of 29 studies, AD in 15 of 29 studies, and RD in 14 of 29 studies. DTI analytical techniques were varied and included whole-brain or ROI (region of interest) voxel-based analyses, ROI-based analyses, and whole-brain or ROI tractography.
Quality ratings
Ten of 29 studies (35%) were rated high quality, 14 of 29 (48%) were rated medium quality, and 5 of 29 (17%) were rated low quality (see Table 2). Mean weighted quality score was 66% (SD = 10%) for both raters. A small, non-significant positive correlation was observed between mean weighted consensus quality scores and year of publication (r = 0.19, t = 1.03, p = 0.31).
Quality-Assurance–Weighted Scores Rating after Consensus
Studies were rated as medium quality if only one rater rated them as low quality.
H, high (≥Q3, i.e., 72% for both raters); M, medium (Q2 and Q1); L, low (<Q1, i.e., 62% for rater #1 and 60% for rater #2).
High-quality studies were those that reported demographic information 28,29,40,47,48 and inclusion/exclusion criteria 24 –26,28,29,36,47,48 and where DTI results were reported accurately based on a DTI protocol that was able to assess the stated study aims. These studies also adjusted for key confounding variables and were all longitudinal except for one, 48 although only three studies collected DTI data pre- and post-SRC. 33,36,47 Six did not have significant attrition. 24 –26,36,40,47 Nevertheless, the high-quality studies had some limitations. Only three studies used control participants with no previous concussion history, 24,47,48 and one provided no information about control participants' concussion history. 29 The SRC diagnosis method was limited in another two studies, 47,48 and two had suboptimal reliability and validity of clinical assessment. 29,47 Only one study reported that imaging analysis was conducted blinded of clinical status. 47 Finally, none of the high-quality–rated studies reported sample-size justification, power description, or variance and effect estimates relevant for populations using DTI.
Diffusion tensor imaging findings in high-quality studies
Computed average effect sizes from the 10 high-quality studies are presented in Figures 2 (acute effects), 3 (subacute effects), and 4 (chronic effects). Only conventional DTI metrics (i.e., AD, FA, MD, and RD) from high-quality studies are synthesized in the text because there remain too few high-quality studies reporting on diffusional kurtosis imaging metrics to compare findings across studies (k = 2) adequately.
Group differences in AD were examined in seven studies. Compared to controls post-SRC, three of seven studies reported significantly higher AD in SRC groups, 24,26,28 three of seven studies reported significantly lower AD in SRC groups, 25,26,48 and two of seven studies found no significant group differences in AD. 29,36 Significant moderate-to-large effect sizes were observed across the acute (k = 4) and subacute stages (k = 2), and large effect sizes were observed in the chronic stage (k = 2; see Figs. 1–3). Of the studies that investigated within- or between-group changes over time, 24,25,36 no significant longitudinal changes were identified. However, cross-sectional group differences observed in the acute stage of one study were no longer significant in the subacute or chronic stages. 28
FA was examined in 10 studies; group differences were explored in 9 of 10 studies, and individual-level differences were explored in 1 of 10 studies. Compared to controls post-SRC, four of nine studies reported significantly higher FA in SRC groups, 24,26,28,48 three of nine studies reported significantly lower FA in SRC groups, 29,40,47 and two of nine studies found no group differences in FA at any time points. 25,36 Significant moderate-to-large effect sizes were observed at the acute (k = 4), subacute (k = 3), and chronic stages (k = 4). Of the studies that investigated within- or between-group changes over time, 24,29,36,47 three of four studies reported no significant longitudinal changes, 24,29,36 and one study showed further reductions in FA over 6 months for the SRC versus control group (d = 1.18; large effect). 47 At the individual level, abnormal voxels for FA were observed among the SRC group, with limited spatial overlap observed between the acute and chronic time points. 33
MD was examined in seven studies; group differences were explored in six of seven studies, and individual-level differences were explored in one of seven studies. Compared to controls post-SRC, four of six studies reported significantly lower MD in SRC groups, 24 –26,48 one of six studies reported significantly higher MD in the SRC group, 40 and one of six studies found no group differences in MD. 36 Significant moderate-to-large effect sizes were observed across the acute (k = 4) and subacute stages (k = 3), and large effect sizes were observed in the chronic stage (k = 3). Of the studies that investigated within- or between-group changes over time, 24,25,36 no significant longitudinal changes were identified. At the individual level, abnormal voxels for MD were observed among the SRC group, with limited spatial overlap observed between the acute and chronic time stages. 33
Group differences in RD were examined in six studies. Compared to controls post-SRC, three of six studies reported significantly lower RD in SRC groups, 25,26,28 two of six studies reported no significant group differences, 36,48 and one of six studies reported significantly higher RD in the SRC group. 29 Significant moderate-to-large effect sizes were observed in the acute stage (k = 4), moderate effect sizes were observed in the subacute stage (k = 1), and large effect sizes were observed in the chronic stage (k = 1). Of the studies that investigated within- or between-group changes over time, 25,29,36 two of three studies reported no significant longitudinal changes, 25,36 and one study found that the significantly higher RD in the acute stage for the SRC group was no longer significantly different from controls in the subacute or chronic stages. 29
Clinical and cognitive assessment correlations with diffusion tensor imaging metrics in high-quality studies
Seven of 10 high-quality studies reported on correlations between DTI metrics and clinical assessment measures (i.e., symptom severity, duration, and SRC frequency; see Fig. 5). Associations between symptom severity and DTI metrics were investigated in six studies, with 19 of 27 correlations (70%) showing a significant relationship (2 of 19 small effect; 11 of 19 moderate effect; and 6 of 19 large effect). The largest correlations were observed between symptom severity with AD and FA, albeit an inconsistent direction of effects was reported. Symptom duration and SRC frequency were investigated less frequently against DTI metrics (k = 4), and when this was examined, significant correlations were rarely observed (3 of 22 correlations were significant; 14%). No high-quality studies reported on correlations between cognitive task performance and DTI metrics.

Correlations between DTI metrics and clinical assessments in high-quality studies. A = acute; SA = subacute; C = chronic; ns = non-significant (value not provided). Green values = small effect size; blue values = moderate effect size; purple values = large effect size; uncolored values = non-significant correlations.
Discussion
Main findings of high-quality studies
Moderate-to-large effects on DTI metrics were typically observed post-SRC in current athletes, with WM abnormalities usually showing stable trajectories throughout the acute, subacute, and chronic stages. The evidence therefore suggests that microstructural recovery is often incomplete at return to play, which typically occurs after recovery of clinical measures. Without regional specificity, decreased MD and RD were most commonly observed after an SRC, whereas the FA and AD findings were inconclusive (i.e., both higher and lower FA and AD observed). WM abnormalities observed using conventional DTI metrics were generally associated with greater severity of clinical SRC symptoms, but less frequently associated with symptom duration or SRC frequency, suggesting that microstructural recovery may lag behind clinically assessed recovery measures.
Diffusion tensor imaging changes after sport-related concussion
The results from high-quality studies suggest that some DTI metrics may be sensitive to acute and early chronic microstructural WM changes after an SRC. Regional effects were reported in group-level analyses in high-quality studies, indicating that there is common WM alteration attributable to SRC (Supplementary Figs. S1–S3). The lack of reproducible regional effects across studies may be attributable to SRCs exerting broadly distributed biochemical sequelae on the brain, variability of athlete characteristics and concussive injury mechanisms, as well as methodological limitations such as small sample size and differences in DTI parameters. 49 Four studies observed increased FA post-SRC (and decreased MD/RD, which are inversely associated with FA 50 ). This is thought to reflect diffuse axonal injury, with subtle cytotoxic or vasogenic edema and axonal swelling increasing anisotropic diffusion and causing increases in FA. 51,52 However, it is important to note that whereas four studies reported increased FA post-SRC, three studies reported decreased FA when comparing SRC athletes to controls. Decreased FA post-SRC is thought to reflect tract disruption, progressive Wallerian degeneration, and loss of myelin integrity. 53,54
Although it has been theorized that FA is increased acutely after SRC and decreased thereafter, 55 the current findings reveal a more complex picture whereby studies have reported both broadly distributed increases and decreases in FA (and inverse association with MD/RD) in the acute, subacute, and chronic stages, with limited evidence of direction of effects being tied to temporality (at least over a ∼6-month period). Similarly, a recent meta-analysis examining tract-based spatial statistics in SRC did not find evidence to support a simple time effect, but more dynamic effects likely to reflect interindividual variability in brain recovery processes. 56 Decreased FA and increased MD/RD could reflect chronic and cumulative physiological injuries subsequent to repetitive subconcussive impacts often observed among athletes in contact sports. 57 In contrast, increased FA and decreased MD/RD may reflect specific microstructural neuroinflammatory injuries post-SRC. 57 Of relevance, Churchill and colleagues found that although the SRC group reported decreased FA compared to controls, elevated FA was associated with greater SRC symptom severity, 40 which could reflect both SRC neuroinflammatory injuries and cumulative impacts of subconcussive knocks.
Although this review attempted to limit the effect of subconcussive impacts on DTI outcomes by only including studies with formally diagnosed SRC, it is impossible to completely parse out these effects among athletes in contact sports. Further, it may be the case that both increased and decreased FA coexist in the same person depending on the region of the brain. Indeed, when comparing the abnormal voxels of individual athletes, Churchill and colleagues identified both abnormally elevated and depressed FA and MD voxels post-SRC. 33 Future longitudinal work into individual-level differences post-SRC can provide greater clarification on diverging direction of effects in DTI metrics and their timeline. It has also been suggested that FA directions are ambiguous, with differences simply reflecting alterations in microstructure, and further interpretations must be backed by strong theoretical foundations. 10
There was no evidence of further DTI metric changes in studies with follow-up assessments over 1–6 months, meaning that the DTI differences did not significantly recover or worsen across time. This is consistent with recent reports of sustained neuroanatomical differences years after concussive hits. 58 Future longer-term studies with larger sample sizes are needed to confirm this conclusion. 59
Methodological considerations
The validity of DTI findings in SRC remains limited by a number of methodological issues. Besides observing that DTI SRC research has typically been exploratory in nature (with no power analyses), the most significant quality issues noted included limited demographic information and limited assessment validity (i.e., examination of associations between clinical variables, SRC, and DTI), among other issues affecting risk of bias (i.e., absence of assessor blinding and significant attrition in some longitudinal studies). In high-quality–rated studies, extracted WM areas were reported either by ROI, voxel-based WM, or tractography with clear exclusion of gray matter. In contrast, several low- to medium-quality studies included gray matter, which fundamentally affects the magnitude and nature of DTI parameters. 60
Methodological issues were also present in high-quality studies. There was an unequal representation of sex and sport type, which are known to impact SRC recovery and outcome. 61,62 Critically, SRC diagnostic methods also varied between studies (including limited or absent reporting in some instances), which is concerning given that this diagnosis is central to the study of SRC-related DTI changes. The presence of previous concussion history (i.e., ranging from 1 to 4 across athletes with available information) may be contributing to DTI directionality differences and impacting recovery and outcome post-SRC. 59 Yet, only two studies examined correlations between SRC frequency and DTI metrics. This makes it difficult to ascertain whether the etiology of observed DTI-WM changes were from a singular concussion during the study or cumulative change from multiple previous hits. 9,13
Methodological limitations were also observed with the control groups among high-quality studies. Ideally, the control group should be well matched demographically, and information regarding concussion history and sport involvement during the study should be clearly reported, given that interpretation of the extent of WM changes could be further complicated by any subconcussive hits experienced. 63 In this regard, two control groups may be required: one with no SRC history that does not play sport and another with no SRC history that regularly engages in sport. Finally, the ability to comment on the extent to which DTI changes reflect the severity of SRC is limited because of heterogeneous cognitive measures used, the common lack of injury severity information (i.e., Glasgow Coma Scale, loss of consciousness, retrograde amnesia, and duration of post-traumatic amnesia), or tests sensitive to subtle cognitive deficits and their changes over time. 64 Standardized SRC diagnosis and better neuropsychological measures would improve future imaging research and study replication. 65
Common among both high- and low- to medium-quality studies was the relatively small sample size and limited neuroimaging before SRC. Larger sample sizes may be achievable through collaborative cohorts and consortiums. Greater representation of female athletes is also necessary. Additionally, longitudinal study design is a fundamental requirement given that SRC is a dynamic process, which requires serial evaluation and extraction of individual trajectories. Future work should include imaging pre-SRC and examine DTI changes over a longer follow-up period to determine microstructural recovery time. 34
Strengths and limitations
This is the first systematic review of the SRC and DTI literature to apply an a priori study-quality strategy that only extracted and compared results from high-quality studies. This approach aligns with current efforts in the field to improve the use of DTI in research (e.g., TRACK-TBI group 18 and NINDS CDEs 19 ) and address current gaps that limit clinical translation. 14 The standard NIH quality rating tool was adapted (Supplementary Material) for use in this review and can be used in future reviews of neuroimaging studies. Despite adequate inter-rater reliability, it is recommended that two raters, with cross-disciplinary expertise, independently perform the quality assessment followed by an inter-rater discussion to achieve 100% concordance for high-, medium-, and low-quality ratings.
Several limitations of this review warrant consideration. First, this review focused on the effect of SRC on WM microstructure, whereas the impact on white and gray matter volume was not evaluated. Integration of all three outcomes in future reviews of high-quality studies will yield a more complete understanding of the effect of SRC. Second, the variability in magnitude and direction of DTI metrics observed in this review may relate to continued maturation of white matter microstructure throughout adolescence and young adulthood. 66 Some high-quality studies reported on in this review recruited participants as young as 13 years; thus, variation in neurodevelopmental stage could be an important moderating factor for microstructural outcomes, although this should be controlled by age- and sex-matched controls.
Conclusion
Decreased MR diffusivities (MD and RD) in SRC samples (maximum of 6 months after SRC) compared to controls are the most consistent DTI-WM changes, as extracted by an a priori quality review strategy. Future DTI SRC research should aim to further standardize the SRC diagnosis protocols and neuropsychological measures used, conduct longitudinal studies that capture pre-SRC DTI metrics, and ensure analyses are adequately powered. The development of a “standard” diffusion protocol that performs equally across different scanner types and sites, and clearly specifies all diffusion acquisition parameters (e.g., ABCD imaging protocol for young people, 67 ENIGMA DTI harmonization processing steps 68 ), would benefit DTI SRC research.
Footnotes
Acknowledgments
We thank the library staff at Macquarie University (Sydney, New South Wales, Australia) who provided valuable input into the review process.
Funding Information
This work was supported by the NHMRC Career Development Fellowship with UNSW/NeuRA extension to 2019 (APP1045400; CIA/PI, Cysique) and an NHMRC Scholarship (GNT1169377; to Lees).
Author Disclosure Statement
No competing financial interests exist.
Supplementary Material
Supplementary Figure S1
Supplementary Figure S2
Supplementary Figure S3
Supplementary Table S1
Supplementary Table S2
Supplementary Table S3
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
