Abstract
Objective:
The current study sought to clarify and harness the incremental validity of emotional dysregulation and unawareness (EDU) in emerging adulthood, beyond ADHD symptoms and with respect to concurrent classification of impairment and co-occurring problems, using machine learning techniques.
Method:
Participants were 1,539 college students (Mage = 19.5, 69% female) with self-reported ADHD diagnoses from a multisite study who completed questionnaires assessing ADHD symptoms, EDU, and co-occurring problems.
Results:
Random forest analyses suggested EDU dimensions significantly improved model performance (ps < .001) in classifying participants with impairment and internalizing problems versus those without, with the resulting ADHD + EDU classification model demonstrating acceptable to excellent performance (except in classification of Work Impairment) in a distinct sample. Variable importance analyses suggested inattention sum scores and the Limited Access to Emotional Regulation Strategies EDU dimension as the most important features for facilitating model classification.
Conclusion:
Results provided support for EDU as a key deficit in those with ADHD that, when present, helps explain ADHD’s co-occurrence with impairment and internalizing problems. Continued application of machine learning techniques may facilitate actuarial classification of ADHD-related outcomes while also incorporating multiple measures.
ADHD is characterized by symptoms of inattention and hyperactivity/impulsivity (American Psychiatric Association [APA], 2022), with the disorder being diagnosed in up to 5% of emerging adults (Song et al., 2021). ADHD in emerging adulthood (aged 18–29 years; Arnett, 2000) has been linked to functional impairment across academic, occupational, and social domains, as well as increased risk for co-occurring internalizing problems including depression and anxiety (Anastopoulos et al., 2016). Despite trends showing striking increases in ADHD’s prevalence (Chung et al., 2019), standardized assessment approaches that accurately capture the disorder’s expression and impact across multiple domains remain lacking, particularly for emerging adults. Such approaches, while also helping to explain ADHD’s co-occurrence with impairment and internalizing problems, could have benefits both in terms of transdiagnostic assessment and tailored treatment; specifically, they may help shift the focus from symptom reduction alone to enhancing quality of life and addressing co-occurring issues. Ongoing research in this area has primarily focused on ADHD in childhood, with studies using contemporary computational tools such as adaptive testing and machine learning algorithms to improve prediction of ADHD and its key correlates (e.g., functional impairment; Caye et al., 2019; Goh, Elkins, et al., 2023; Goh, Martel, et al., 2023; Mooney et al., 2023). In contrast, little work has focused on understanding the nature and improving classification of impairment and co-occurring problems in emerging adults with ADHD, despite a growing body of research underscoring their vulnerability and unmet needs across a wide and heterogeneous range of functional domains (Kosheleff et al., 2023).
Limitations in ADHD’s Symptom List During Emerging Adulthood
Commonly used assessment tools for ADHD in adulthood rely primarily on the Diagnostic and Statistical Manual of Mental Disorders, 5th Edition, Text Revision (DSM-5-TR) ADHD symptom list (APA, 2022). Yet, though ADHD is considered to be a chronic, lifelong concern (Faraone et al., 2021), many adults may not meet the threshold for a diagnosis potentially because symptoms of ADHD, particularly those within the hyperactive/impulsive symptom domain, were not written with adults in mind (e.g., “leaves seat in situations when remaining seated is expected;” Gibbins et al., 2010). Further stressing this point are the results of studies suggesting that although only 20% to 35% of children with ADHD continue to meet diagnostic criteria as emerging adults, a majority—60% to 72%—continue to experience clinically significant levels of corresponding impairment (Biederman et al., 2010). These findings indicate that ADHD symptoms, in isolation and as they stand in DSM-5-TR, may not be sufficient to adequately capture ADHD’s expression and wide-ranging impact particularly in emerging adulthood. Although the core DSM ADHD symptom list will never fully incorporate all related functional impairment and co-occurring problems like anxiety and depression, consideration of other well-validated risk markers may be needed to facilitate a better understanding of the nature of these difficulties in emerging adults with ADHD. Such consideration could also inform more comprehensive assessment and intervention tools that accommodate a focus on co-occurring impairment and internalizing problems in addition to symptoms (Fedele et al., 2010).
To address the limitations of the ADHD symptom list, studies have begun examining the incremental validity of non-symptom measures, with findings suggesting that some of these measures demonstrate incremental validity beyond symptoms in diagnosing ADHD (Sawaya et al., 2024). Yet, almost no work has been conducted investigating the incremental validity of non-symptom measures for classification of those with versus without impairment and internalizing problems that commonly accompany the disorder. Such work could identify non-symptom risk markers of ADHD that, despite being excluded from current diagnostic criteria, may be crucial in explaining the disorder’s link with common impairment domains and internalizing problems. If so, these risk markers could potentially be incorporated into more comprehensive assessment protocols, as well as interventions, to provide broader benefits beyond symptom reduction in emerging adults with ADHD.
Emotional Dysregulation and Unawareness
One proposed aspect of ADHD in emerging adulthood that may help address this critical issue is emotional dysregulation. Emotional dysregulation has been characterized as the expression and experience of emotions which are excessive relative to social norms, context, and developmental stage; indeed, emotional dysregulation involves rapid and poorly controlled changes in emotion, along with an anomalous allocation of attention to emotional stimuli (Shaw et al., 2014). A large body of work has highlighted emotional dysregulation as a multidimensional construct, with one model by Gratz and Roemer (2004) defining six dimensions: (a) Non-Acceptance of Emotional Responses, (b) Difficulties Engaging in Goal-Directed Behavior, (c) Impulsive Control Difficulties, (d) Lack of Emotional Awareness, (e) Limited Access to Emotion Regulation Strategies, and (f) Lack of Emotional Clarity. Of note, work investigating this model has suggested that the Lack of Emotional Awareness dimension, although associated with ADHD (Factor et al., 2016), may be distinct from the broader construct of emotional dysregulation (i.e., whereas other dimensions assess reactions to emotions, Lack of Emotional Awareness assesses noticing of emotion; Hallion et al., 2018). As such, instead of grouping dimensions together into a single construct of emotional dysregulation, we henceforth refer to these two constructs as Emotional Dysregulation and Unawareness (EDU).
Clinic-based studies have suggested rates of EDU in up to 70% of adults with ADHD, with the combination of ADHD and EDU associated with high risk for impairment, emotional difficulties, and substance use (Bodalski et al., 2019; Factor et al., 2016; Shaw et al., 2014; Soler-Gutierrez et al., 2023). Conceptual studies have proposed several models for understanding the connection between ADHD and EDU. In particular, research with adolescents and adults has suggested that facets of EDU related to negative emotional responses and lability, while not entirely overlapping with ADHD symptoms, may represent important features of the disorder that uniquely help explain its relation with negative outcomes (Beheshti et al., 2020). Such findings are consistent with the idea that EDU, when present in those with ADHD, may be an important component in explaining commonly observed impairment and co-occurring internalizing problems. If so, measurement of EDU as a supplement to symptom-based ADHD assessment may facilitate improved insight into risk for specific types of impairment and inform transdiagnostic assessment of problems that commonly co-occur with the disorder but are not necessarily encompassed by ADHD’s symptoms.
Using Machine Learning to Clarify EDU Dimensions’ Incremental Validity
Despite evidence supporting the incremental validity of EDU beyond ADHD symptoms (Hirsch et al., 2018), there remains a lack of understanding in how to best incorporate measurement of EDU-related deficits into assessment protocols in a robust manner. This is particularly evident in emerging adulthood; despite recommendations by Marshall et al. (2021) encouraging the use of multiple measures in assessing ADHD in emerging adults, no work, to our knowledge, has proposed a standardized method of integrating scores from these measures.
Advances in machine learning provide a potential means to address this issue; specifically, random forest regression can be a technique used to inform data-driven classification algorithms (Walsh et al., 2017). Praised for its robustness and accuracy, this technique involves the creation of a “forest” of decision trees generated from varying sets of indicators designed for classification (i.e., with versus without impairment or co-occurring internalizing problems; Scornet et al., 2015). It both complements and provides numerous benefits over methods applied in prior work (e.g., cut-scores, linear regression, logistic regression, SEM, network analysis). Such benefits include random forest regression being data-driven, aimed at classification/prediction, less sensitive to outliers, not bound to linear relations between variables, and able to parse variable heterogeneity even in the context of multicollinearity. Prior work mostly in children has shown some success in using machine learning to incorporate measures of IQ, depressive symptoms, oppositional defiant and conduct disorders, and cognitive function, among others, with ADHD symptoms to predict outcomes (Caye et al., 2019; Goh, Elkins, et al., 2023; Lorenzi et al., 2023; Mooney et al., 2023). However, to our knowledge, no work has sought to apply the machine learning data analytic approach to the classification of ADHD’s correlates in emerging adults, despite evidence that ADHD phenotypes and, correspondingly, elements that best link the disorder with its external correlates, change throughout development (Olson, 2002).
One additional benefit provided by machine learning concerns clarification of the relative importance of different indicators via Variable Importance Scores. In random forest models, such scores are derived by ranking indicators according to their relative contribution to accurate classification (Ishwaran, 2007). Such an analyses could be helpful for determining the relative importance of the different EDU dimensions in the context of ADHD symptoms, particularly in light of prior work suggesting heterogeneity in EDU dimensions’ unique relations with several types of psychopathology (Ruan et al., 2023). Further, in contrast to inattention and EDU which may persist throughout the lifespan, hyperactivity/impulsivity has been found to decrease in salience throughout development (Faraone et al., 2021). As a result of this decrease, EDU may grow to be more important in capturing the impact of ADHD during emerging adulthood. Ultimately, identification of EDU dimensions most important for classification of ADHD-related impairment and co-occurring problems could inform the development of improved assessment tools and interventions focused on these dimensions to produce the greatest downstream benefits.
The Current Study
The current study had three primary aims. The first was to clarify EDU’s incremental validity, beyond ADHD symptoms, in improving classification accuracy of functional impairment and internalizing problems using machine learning techniques. We hypothesized that the inclusion of EDU would significantly increase accuracy of classification beyond ADHD sum scores, thus suggesting that EDU, when present, may help to explain commonly observed impairment or co-occurring internalizing problems in emerging adults with ADHD. Second, we validated accuracy of the classification model incorporating ADHD symptoms and EDU dimensions using data from a “test sample” distinct from that used for Aim 1 (although drawn from the same larger study), hypothesizing that model performance would remain consistent with results of Aim 1. Third, we sought to clarify the importance of EDU dimensions compared to ADHD symptom domains, hypothesizing they would be more important than hyperactivity/impulsivity and just as important as inattention, which would still emerge as the most important classifier.
Methods
Participants
Participants were 1,539 college students aged 18 to 25 years (Mage = 19.47 years, SD = 1.54) who self-reported a prior diagnosis of ADHD. Data from these participants was collected across eight universities in the United States. In this sample, 30.2% participants identified as male, 69.2% as female, 0.1% as intersex, and 0.5% preferring not to answer. Further, 17.5% participants identified as Hispanic/LatinX and 66.8% identified as White. Detailed demographic information is available in Table 1.
Descriptive Data for Demographic and Study Variables.
A total of 42 participants had missing racial data or specified an unlisted racial category
ADHD Symptom Count scores were determined by counting the number of symptoms endorsed as “Often” or “Very Often.” The finding that many participants, despite endorsing prior ADHD diagnoses, did not meet current symptom criteria for ADHD (i.e., ≥5 symptoms in either domain) is consistent with prior work suggesting that ADHD severity decreases with age (Faraone et al., 2006). However, given work suggesting that patterns of ADHD may fluctuate over time (Sibley et al., 2022), we included all participants that reported prior diagnoses of ADHD.
In line with test recommendations (Weiss et al., 2018), any domain that had one item rated 3 or two items rated 2 was be considered to be impaired.
Cut-off sum scores greater than 6 and 5 met study criteria for depression and anxiety, respectively (Lovibond & Lovibond, 1995).
Procedures
Ethics approval was obtained from the Institutional Review Board at Syracuse University for this multisite study of ADHD symptoms in college students (i.e., Undergraduate Learning, Emotion, and Attention Research Network). Undergraduate students across the universities were recruited primarily via online participant management sites (i.e., Sona Systems, https://www.sona-systems.com), as well as classroom announcements and flyers. The survey was hosted on a secure online survey platform (i.e., Qualtrics, https://www.qualtrics.com) through which participants gave their informed consent to participate and then completed a series of measures. Data was collected from 12,501 college students, although only those endorsing a prior ADHD diagnosis (12.3%, consistent with prior ADHD research on online surveys; Wymbs & Dawson, 2019) were included in the current study. Of these 1539 participants, 1200 provided data on medication use with 579 (48%) endorsing current use of stimulants for ADHD. Four attention check questions were included, and data from participants who answered more than one attention check incorrectly were removed (n = 34). Participants were compensated with research credits for study participation. Some sites also offered entry into a raffle for a $100 gift card to incentivize participation.
Measures
ADHD Symptoms
To test the incremental validity of EDU over and above DSM’s ADHD symptoms, a self-report checklist was created based on the exact wording of ADHD symptoms in the DSM-5 (APA, 2013). DSM-5 symptoms used in the current study were identical to those in the newer DSM-5-TR (APA, 2022), with examples being “Fail to give close attention to details or make careless mistakes in schoolwork, at work, or during other activities (overlook or miss details or work is inaccurate)” and “Fidget with or tap hands or feet or squirms in seat.” Participants rated their current symptoms using the following scale: 0 (Never or rarely), 1 (Sometimes), 2 (Often), or 3 (Very often). Item scores were then added to create sum scores representing symptom severity on inattention (IA; α = .92) and hyperactivity/impulsivity (HI; α = .87) symptom dimensions.
EDU 1
The 36-item Difficulties in Emotion Regulation Scale (DERS; Gratz & Roemer, 2004) measures how frequently participants experience difficulties related to EDU. Items included statements such as “I am confused about how I feel” and “When I’m upset, I lose control over my behavior.” Participants responded to items using the following scale: 1 (Almost never), 2 (Sometimes), 3 (About half the time), 4 (Most of the time), or 5 (Almost always). Response scores were averaged into six subscales corresponding with Non-Acceptance of Emotional Responses, Difficulties Engaging in Goal-Directed Behavior, Impulse Control Difficulties, Lack of Emotional Awareness, Limited Access to Emotional Regulation Strategies, and Lack of Emotional Clarity dimensions (αs = .82–.93).
Functional Impairment
The Weiss Functional Impairment Rating Scale (WFIRS; Weiss, 2000) is a 70-item self-report measure of impairment that has shown robust psychometric properties for use in college students (Canu et al., 2020). Example items include “having problems with family” (Family domain), “problems taking notes” (School domain), and “problems keeping an acceptable appearance” (Life Skills domain). Participants responded to items using the following scale: 0 (Never or not at all), 1 (Sometimes or somewhat), 2 (Often or much), or 3 (Very often or very much). Items fall within Family, Work, School, Life Skills, Self-Concept, Social, and Risk-Taking Behavior domains (αs = .87–.93). For the current study, and in line with test recommendations (Weiss et al., 2018), any domain that had one item rated 3 or two items rated 2 was be considered to be impaired.
Internalizing Problems
The Depression, Anxiety, and Stress Scale (DASS-21) contains 21 items that assess depression, anxiety, and stress (Lovibond & Lovibond, 1995). Examples of items included “I felt scared without any good reason” and “I felt down-hearted and blue.” Participants rated items using the following scale: 0 (Did not apply to me at all), 1 (Applied to me some of the time), 2 (Applied to me a good part of the time), 3 (Applied to me very much, or most of the time). Item responses were summed to create depression (α = .91) and anxiety (α = .84) scores. Past studies have supported the validity of these dimensions (Zanon et al., 2021). Cut-off sum scores of 6 and 5, corresponding with moderate levels of depression and anxiety, respectively, were used to determine prevalence or absence of these internalizing problems (Lovibond & Lovibond, 1995).
Statistical Analyses
Transparency and Openness
We report how we determined our sample size, data exclusions (if any), manipulations, and measures in the study, and we follow JARS (Kazak, 2018). Data and research materials are available upon request. Pre-registered analyses (https://osf.io/2upw7/?view_only=6ff4ed43afc54d398c56700deef2e01e) and R-code (https://osf.io/t7x48/?view_only=9e377479232c45b4aa57425d436b8ea1) are available on Open Science Framework.
Random Forest Development
Random forests were created using the caret package (Kuhn, 2008) in R (version 4.3.2). Based on procedures recommended by Nguyen et al. (2021), participants were first randomly split into training (70% of the data) and testing samples (30%). Participants in the training sample who met study criteria for impairment were up-sampled (i.e., randomly duplicated with replacement) to match the size of the non-impaired sample to avoid bias in resulting algorithms, in line with recent studies suggesting random oversampling as the best means to address moderately imbalanced data (Wongvorachan et al., 2023). No adjustments were made to the testing sample. The training sample was then used to develop random forest models and tune them via the mtry hyperparameter (i.e., the number of random indicators considered at each split within trees), using fivefold cross-validation repeated 5 times. All other hyperparameters were set to the default criteria in the caret package (e.g., 500 trees per forest, one observation for terminal nodes). Optimal models were determined based on the receiver operator characteristic (AUC) curve metric, which provides an indication of the algorithm’s overall discriminative ability (Kumar & Indrayan, 2011).
Integrating EDU with ADHD Symptoms and Testing the Final Model
To test Hypothesis 1 (i.e., the inclusion of EDU would significantly improve model performance), we compared the performance of two random forest classification models across each impairment domain and internalizing problem using data from the training sample. First, we created an ADHD Algorithm by including IA and HI as indicators to generate the random forest model, with the mtry parameter set to 2. Second, we created an ADHD + EDU Algorithm with IA and HI along with EDU dimensions as indicators, while also testing all mtry values from 2 to 8 to determine the value that led to the best-performing model while maximizing the incremental validity of EDU dimensions. We set the seed prior to random sampling, so the same resampling datasets were used to create ADHD and ADHD + EDU Algorithms. This resulted in paired estimates of performance across training resamples. Hence, we could test whether the AUCs of model pairs significantly differed using paired t-tests, with any increase in AUC attributable to the inclusion of EDU dimensions (Kuhn & Johnson, 2018). For these comparisons, a cut-off of p < .05 was used to determine statistical significance. To test Hypothesis 2 (i.e., consistent performance in a distinct testing sample), we evaluated the algorithm in the distinct testing sample. Model performance was evaluated using AUC, with 0.5 suggesting no discrimination, 0.7 to 0.8 considered acceptable, 0.8 to 0.9 excellent, and greater than 0.9 considered outstanding (Hosmer et al., 2013).
To test Hypotheses 3 (i.e., EDU dimensions would be more important for machine learning models than HI, and just as important as IA) we calculated variable importance scores with respect to classification in the training sample for each impairment domain and internalizing problem. Scores were created by comparing classification error (i.e., mean-squared error) before and after each variable’s values were randomly shuffled (Ishwaran, 2007). These scores were scaled to fall between 0 and 100, with higher scores indicating greater importance (i.e., increased discrimination power between classes) within machine learning algorithms. To our knowledge, there is no existing statistical test to determine the significance of differences in variable importance scores; consequently, we evaluated the differences qualitatively.
Results
Testing Hypotheses 1 and 2: Examining the Incremental Validity of EDU 2
See Tables 2 and 3 for detailed results of machine learning analyses. Initial analyses of the ADHD + EDU algorithm suggested that mtry = 2 yielded the best performing model for all outcomes. Our first hypothesis was that the inclusion of EDU would significantly improve classification performance with respect to impairment and co-occurring internalizing problems above and beyond ADHD sum scores. This hypothesis was supported. Comparison of AUC values for models without versus with EDU dimensions suggested that the inclusion of EDU significantly improved model classification (average increase in AUC = 0.14; range of AUC increase = 0.10–0.18; ps < .001). Further, average sensitivity across outcomes (i.e., the model’s ability to correctly identify those meeting impairment or internalizing problem criteria, ranging from 0 to 1 with higher values indicating greater accuracy) increased from 0.64 to 0.80, and specificity (the model’s ability to correctly identify those not meeting criteria on the same scale as sensitivity) increased from 0.71 to 0.80, after adding EDU to the classification model. Our second hypothesis was that model classification of the ADHD + EDU algorithm in a distinct testing sample would remain consistent with the training sample across outcomes. This hypothesis was partially supported. Consistent with those in the training sample, testing sample results suggested that the model demonstrated excellent performance in discriminating those meeting study criteria for impairment in Life Skills (AUC = 0.83) and Self-Concept (AUC = 0.85) domains, as well as Depression (AUC = 0.83) and Anxiety (AUC = 0.82). The model demonstrated acceptable performance with respect to Social (AUC = 0.73), School (AUC = 0.76), Family (AUC = 0.77), and Risk-Taking Behavior (AUC = 0.70) domains, and poor performance with respect to the Work domain (AUC = 0.66). However, compared to the training sample, examination of model accuracy in the testing sample suggested sensitivity decreased by 0.07 to 0.73, and specificity by 0.13 to 0.67, on average. Overall, results suggested that the inclusion of EDU significantly improved classification accuracy beyond IA and HI sum scores, with the resulting ADHD + EDU model also demonstrating acceptable to excellent performance in a distinct sample except with regard to classification of Work impairment.
Results of Machine Learning Analyses Classifying of ADHD-Related Impairment and Co-Occurring Problems.
Note. Sensitivity = the model’s ability to correctly identify those meeting impairment or internalizing problem criteria. Specificity = the model’s ability to correctly identify those who do not meet criteria. Acc = accuracy; Sens = sensitivity; Spec = specificity.
Mtry hyperparameter was set to 2 for all algorithms. All other hyperparameters were left at their defaults.
Detailed Statistics of Final Model Performance in the Testing Sample a .
Note. Sensitivity = the model’s ability to correctly identify those meeting impairment or internalizing problem criteria. Specificity = the model’s ability to correctly identify those who do not meet criteria. Positive predictive value (PPV) = the proportion of participants classified as meeting criteria by the model who truly meet criteria. Negative predictive value (NPV) = the proportion of participants classified as not meeting criteria by the model who truly do not meet criteria.
Mtry hyperparameter was set to 2 for all algorithms.
Youden’s index (sensitivity + specificity − 1; Youden, 1950) was used to select the threshold on the receiver operator curve (ROC) that optimized both sensitivity and specificity in determining accuracy metrics.
Testing Hypothesis 3: Importance of ADHD Symptoms and EDU Dimensions Across Algorithms
Detailed results are depicted in Table 4 and Figure 1. Our third hypothesis was that EDU dimensions would emerge as more important for classification than HI and just as important as IA. This hypothesis was supported. Examining rank order of ADHD sum scores and EDU dimensions across domains, IA emerged as most important for classification of four domains (Life Skills, Self-Concept, School, and Work), HI for two (Family and Risk-Taking Behavior), the Limited Access to Emotion Regulation Strategies dimension for two (Depression and Anxiety), and Non-Acceptance of Emotional Responses for one (Social). At least one EDU dimension emerged as more important than HI across six of nine impairment domains and internalizing problems. Similarly, at least one EDU dimension emerged as more important than IA across four impairment domains and internalizing problems.
Variable Importance in the Training Sample.
Note. Scores were estimated by randomly shuffling each variable’s values and then comparing the resulting classification error of a forest to the error obtained before the shuffle. These values were averaged across all trees and then scaled to fall between 0 to 100. Higher scores indicate greater importance.

Variable importance scores in machine learning analyses encompassing ADHD sum scores and EDU dimensions.
When comparing variable importance scores, results suggested that across impairment domains, IA was most important for classification (average variable importance = 78.87). The Limited Access to Emotion Regulation Strategies EDU dimension emerged as the next most important (54.11), followed by HI (45.07) and the Non-Acceptance of Emotional Responses (37.52) EDU dimension. Overall, results suggested IA as most important for facilitating model classification, with the Limited Access to Emotion Regulation Strategies dimension generally emerging as more important than HI.
Post Hoc Analysis: Replicating Findings in Those Without ADHD and Testing the Incremental Validity of Sex
It is possible that analyses described above might have suffered from a restriction of range problem, given that the sample used consisted only of those with prior ADHD diagnoses and thus likely elevated levels of ADHD symptom severity. To address this, we also explored the incremental validity of EDU beyond ADHD symptoms using data from all participants with and without prior ADHD diagnoses (n = 12,501). Consistent with primary analyses, comparison of AUC values suggested that the inclusion of EDU significantly improved model classification across all outcomes (increase in AUC with the inclusion of EDU: Life Skills = 0.07; Self-Concept = 0.11; Social = 0.20; School = 0.08; Work = 0.15; Family = 0.18; Risk-Taking Behavior = 0.19; Depression = 0.20; Anxiety = 0.18; ps < .001). Detailed information is available upon request from the corresponding author.
Past findings have suggested sex differences in ADHD’s expression during emerging adulthood (Fedele et al., 2010). As such, we tested whether the addition of sex into the ADHD + EDU algorithm significantly improved accuracy. These comparisons were conducted similarly to those described above (i.e., ADHD + EDU vs. ADHD + EDU + sex algorithms), although new bootstrapping procedures were conducted and accounted for any differences between primary and post-hoc results. Results suggested that sex did not significantly improve accuracy across any impairment domain or co-occurring problems (ps ranged from .30 to .92; average p = .56).
Discussion
The current study sought to investigate whether EDU, when experienced by emerging adults with self-reported ADHD, may help to explain commonly observed impairment or co-occurring internalizing problems. To do so, we used machine learning to develop an algorithm encompassing EDU dimensions and ADHD sum scores that could discriminate between those reporting impairment and internalizing problems in a reliable and statistically robust manner. Consistent with our first hypothesis, results suggested that EDU significantly improved model performance, with AUC increasing by 11.4% with respect to classification of Life Skills impairment, 19.5% for Self-Concept impairment, 23.9% for Work impairment, 14.9% for Family impairment, 21.9% for Social impairment, 13.4% for School impairment, and 16.4% for Risk-Taking Behavior impairment beyond ADHD sum scores alone. Combining EDU with ADHD sum scores also increased AUC with respect to classification of Depression and Anxiety by 25.4% and 26.6%, respectively. In support of our second hypothesis, the ADHD + EDU classification algorithm achieved acceptable to excellent performance in a separate sample across all impairment domains and internalizing problems except for Work-related impairment, although some decreases in sensitivity and specificity were noted. Finally, and in support of our third hypothesis, results of variable importance analyses indicated IA as most important for classification across all impairment domains and internalizing problems. Further, the Limited Access to Emotional Regulation Strategies EDU dimension was more important for classification of impairment and co-occurring problems than HI, with the Non-Acceptance of Emotional Responses EDU dimension also demonstrating some importance for classification. Overall, results suggested that EDU, when present, may help to explain the presence of commonly observed impairment or internalizing problems in emerging adults with ADHD. Further, results provided support for the use of machine learning as a methodology to facilitate actuarial classification of co-occurring impairment and internalizing problems in emerging adults with ADHD.
EDU Demonstrated Incremental Validity Beyond ADHD Symptoms
In general, results supported EDU as playing a key role in explaining ADHD’s relation with impairment and co-occurring problems. Specifically in the training sample, average classification performance of only ADHD symptoms across outcomes was Acceptable, whereas performance of ADHD symptoms and EDU dimensions was Excellent. Additionally, overall classification accuracy increased by 0.11 on average, and appeared to be primarily driven by increases in sensitivity (0.16 on average, in comparison to specificity which increased by 0.08 on average). Unsurprisingly, the greatest benefits pertained to the classification of Depression and Anxiety, although classification of impairment domains, particularly Self-Concept, Social, and Work, also improved with the inclusion of EDU. Recent studies evaluating the accuracy of ADHD-related screening tools has set 0.8 as the cut-off for acceptable sensitivity and specificity (Mulraney et al., 2021). Based on these criteria, the ADHD algorithm would not have met criteria for any domain tested, whereas the ADHD + EDU algorithm met criteria for Life Skills, Self-Concept, School, and Family domains based on performance in the training sample. Although findings supporting EDU dimensions’ incremental validity are not necessarily surprising given existing research highlighting their role in contributing to ADHD-related phenotypes (Shaw et al., 2014), results are notable in supporting the idea that measures of ADHD symptoms and EDU dimensions can be incorporated into actuarial classification tools that significantly outperform those only including ADHD symptoms. In fact, results suggested that it may important to harness both ADHD symptoms and EDU dimensions’ unique validity within actuarial ADHD-related algorithms.
Application of the ADHD + EDU Algorithm to the testing sample also revealed that at the optimal threshold, the model classified those who met impairment and internalizing problem criteria more accurately than those who did not (i.e., sensitivity > specificity, average difference = 0.15) with respect to Life Skills, Social, Family, Risk-Taking Behavior, and Depression domains. Such findings suggested that EDU may be particularly helpful for identifying those experiencing difficulties in these areas, with further addition to the algorithm needed to improve its ability to correctly identify those who do not. The opposite may be true for Self-Concept, School, and Work domains (specificity > sensitivity, average difference = 0.08).
It should be noted that model performance in the testing sample decreased compared to the training sample with respect to AUC (average decrease = 0.1), sensitivity (0.07), and specificity (0.12). Although performance still ranged from Acceptable to Excellent, such decreases may have been due to upsampling procedures, which may have led to some overfitting of the model to the training sample. This is evidenced by follow-up findings that the greatest decreases in AUC from training to testing sample results occurred for impairment and internalizing problem domains that had the greatest class imbalances between those meeting versus not meeting criteria. Hence, although the addition of EDU appeared to provide meaningful benefits to performance beyond ADHD sum scores, further refinement of the model, including tuning in more balanced samples (without oversampling), may be needed before deployment into real-world settings.
Post-hoc analyses also provided support for EDU’s incremental validity even when we included participants without ADHD, which is notable in light of dimensional models conceptualizing ADHD as falling on a continuum (Haslam et al., 2006; Marcus & Barry, 2011). Individuals demonstrating subthreshold levels of ADHD have still been found to exhibit impairments and benefit from intervention (McLennan, 2016), and results suggested that EDU may be relevant for explaining ADHD’s relations with impairment and internalizing problems across the entire spectrum of ADHD symptom severity. Overall, findings provided additional support for the idea that EDU, when present in emerging adults with ADHD, may play a key role in explaining the presence of impairment or co-occurring internalizing problems and thus warrant attention in actuarial assessment and intervention protocols.
Limited Access to Emotion Regulation Strategies Was More Important for Classification Than HI
In general, results provided support for the idea that certain dimensions of EDU may be almost as important as IA, and more important than HI, for differentiating between emerging adults with ADHD who have co-occurring impairment and internalizing problems and those who do not. In particular, the Limited Access to Emotional Regulation Strategies dimension emerged as the most important for classification of Anxiety and Depression, and also demonstrated the second highest average variable importance value. IA emerged as most important for Life Skills, Self-Concept, School, and Work impairment classification, and also demonstrated the highest average variable importance value.
Conversely, although HI emerged as the most important for classification of two impairment domains (Family and Risk-Taking Behavior), its variable importance value fell below that of IA sum scores and the Limited Access to Emotion Regulation Strategies dimension, and was only slightly above the Non-Acceptance of Emotional Reactions EDU dimension. Such results provide support for prior work suggesting HI as demonstrating less salience during adulthood compared to IA (Biederman et al., 2010; Gibbins et al., 2010; Sibley et al., 2012). On the other hand, EDU has been found to remain stable or even worsen throughout adulthood (Deutz et al., 2020). As such, ADHD assessment during emerging adulthood may benefit from an increased focus on IA and EDU, particularly regarding acceptance of emotional reactions and access to emotional regulation strategies. Further, given the seeming importance of the Limited Access to Emotion Regulation Strategies and Non-Acceptance of Emotional Reactions dimensions, focusing intervention effects on these dimensions via emotional regulation strategies is likely to produce benefits beyond those focused only on symptom reduction.
Potential Benefits of Incorporating EDU Into ADHD-Related Clinical Practices
Despite elevated rates of impairment and co-occurring internalizing problems being identified as key features of ADHD in emerging adulthood (Anastopoulos et al., 2016; DuPaul, 2022), there has been very limited research providing guidance on how to incorporate different measures into assessment tools that can accurately provide insight into risk for these difficulties. The minimal work in this area has mostly proposed incorporating measures based on clinical judgment (Marshall et al., 2021), despite a large body of work supporting the superiority of actuarial methods for integrating different types of data (Sawyer, 1966). Results of the current study provided support for the idea that assessment of certain EDU domains, namely Limited Access to Emotion Regulation Strategies and Non-Acceptance of Emotional Reactions, may provide insight into ADHD’s connection with commonly co-occurring impairment domains and internalizing problems. As such, including measures of these domains in ADHD assessment protocols may help facilitate accurate classification of impairment and internalizing problem profiles in emerging adults with ADHD. Conversely, the Lack of Emotional Awareness domain consistently performed poorly across all models, so assessment of this domain may provide little benefit in classifying impairment and internalizing problems in those with ADHD. Results also provided support for the use of machine learning to incorporate different types of data and inform robust and replicable assessment practices. In contrast to many past studies that have created machine learning algorithms using time-consuming or costly measures (e.g., Das & Khanna, 2021), our findings suggested that accuracy of classification of ADHD-related impairment and co-occurring problems can be significantly improved through the addition of low-cost and relatively brief measures to the ADHD symptom list.
Limitations and Constraints on Generality
Despite EDU significantly improving the algorithm’s performance beyond ADHD sum scores, there was still room for improvement through refinement of the model or the addition of new relevant risk markers of ADHD (e.g., executive function, temperament/personality; Goh et al., 2020), particularly with respect to Work impairment classification which fell below the acceptable range in the testing sample. One important future direction concerns an examination of reasons for EDU dimensions’ importance in classification, due to the nuances of random forests methodology occurring in a “black box” that is relatively inaccessible. Hence, although random forest may be useful for identifying what variables are important for classification, the precise reasons why such variables are important are difficult to untangle. Some research has been conducted in this area, with findings suggesting EDU may stem from similar neural underpinnings as ADHD and mediate the disorder’s relation with common correlates (Shaw et al., 2014). However, additional work is needed to fully clarify reasons for EDU’s importance in explaining ADHD’s relation with impairment and internalizing problems. All measures, along with ADHD diagnosis, were based on self-report. Future studies should thus seek to validate findings using gold-standard diagnostic procedures and different reporters (i.e., informant report; clinical interview). Although correlations between predictors did appear to be strongly indicative of multicollinearity (>.8), some were moderate to large in strength. Thus, future work should replicate findings while determining whether these correlations impacted results. Since the data was collected at a single time point, an important future direction involves the use of longitudinal studies.
Further, the current work used a convenience sample of college students who mostly identified as female, which contrasts with prior work estimating prevalence of ADHD in males versus females at a 1:1 to 2:1 ratio (Cortese et al., 2016). Such a focus represents an important clinical aim, given that ADHD symptoms were created primarily based on observations of males who tend to express higher levels of hyperactivity/impulsivity compared to females (reviewed in Babinski, 2024). Females with ADHD remain underdiagnosed despite some evidence suggesting that they may report levels of impairment higher than males (Fedele et al., 2012), so the development and validation of actuarial assessment protocols in females represents an important aim. However, further validation is needed to determine the incremental validity of EDU in samples with more male-identified participants as well. Relatedly, the racial makeup of this cohort, which was predominantly White, imposes limitations to the degree to which results may generalize to other racial and ethnic groups. Overall, further work is needed to confirm results in other populations outside of that used in the current study to provide further validation for the incremental validity of EDU and the potential of incorporating measures of it into algorithms with ADHD symptoms to facilitate actuarial classification.
Conclusion
The ability to explain ADHD’s link with impairment and internalizing problems is of great clinical importance, particularly in emerging adulthood. Results of the current study suggested that EDU dimensions, particularly Limited Access to Emotion Regulation Strategies and Non-Acceptance of Emotional Reactions, may provide unique insights in line with this aim. Further incorporation of EDU dimensions into actuarial ADHD assessment procedures via machine learning could meaningfully improve classification of impairment and co-occurring internalizing problems in emerging adults with ADHD. Additionally, continued establishment of the role of EDU in ADHD could facilitate the implementation of interventions aimed at addressing the most salient outcomes.
Footnotes
Acknowledgements
The authors acknowledge Jasmine Black and Abigail Cummings for their help in writing the manuscript. The authors thank all participants for making this work possible. The current study uses data from an ongoing multisite study of ADHD symptoms in college students (i.e., the Undergraduate Learning, Emotion, and Attention Research Network [U-LEARN] study). This study has been ongoing for several years with manuscripts published using data from prior academic years. Some of this data (24%) partially overlaps with those used in the current study. However, the current study is the first to use data collected between Fall 2021 and Spring 2023. Further, no prior manuscripts have examined relations between emotional dysregulation and functional impairment/internalizing problems as was done in the current study.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
