Abstract
Keywords
The Behavior Rating Inventory of Executive Functioning–Preschool version (BRIEF-P) is an ecological measure for the assessment of executive functions in 2- to 5-year-old children as perceived by their parents or their teachers (Gioia, Espy, & Isquith, 2003). Together with the other versions of the questionnaire (BRIEF for parents/teachers of 5- to 18-year-olds, BRIEF–Self-Report for 11- to 18-year-olds, and the BRIEF–Adult for ages 18 to 90), it permits the study, from a developmental perspective, of executive functioning as observed in daily functioning. The characteristics and utility of the version that address observed executive functioning in the earliest developmental stage (the BRIEF-P) and the extent to which the scale relates to disruptive behavior disorders are not yet known. For this reason, it would seem necessary to assess the scale further.
The questionnaire, which is quick and easy to answer, is made up of 63 items on five clinical scales: (a) Inhibit (IN), reflecting problems in inhibitory control; (b) Shift (SH), reflecting difficulties in moving freely among situations, activities, or aspects of a problem; (c) Emotional Control (EC), covering difficulties for modulating emotional response; (d) Working Memory (WM), measuring difficulties in holding information in mind for completing a task; and (e) Plan/Organize (PO), assessing problems for anticipating future events and taking appropriate steps, and for putting information to achieve an objective. Principal components analysis of the five clinical scales on data for normative and clinical samples of the answers from parent or teachers yielded a three-factor model: (a) Inhibitory Self-Control Index (ISCI), made up of IN and EC; (b) Flexibility Index (FI), made up of SH and EC; and (c) Emergent Metacognition Index (EMI), made up of WM and PO, explaining 90% (parents) and 92% (teachers) of the variance (Gioia et al., 2003). The items of the five clinical scales also provide a Global Executive Composite (GEC). The questionnaire has two validity scales (Negativity and Inconsistency) that help consideration of the validity of the data provided. The simplicity of application of the BRIEF has made this questionnaire one of the most widely used in the assessment of executive functioning in clinical, educational, and research settings.
In contrast to the growing body of recent literature on the 5-to-18-version (BRIEF), which has shown evidence that broadly supports its psychometric properties, few studies have focused on the characteristics of the BRIEF-P. Valid and reliable assessment tools that help to identify early in life the specific disabilities of these impairing disorders could be beneficial for the planning of interventions in children affected. Two studies have focused on the psychometric properties of the BRIEF-P. In the teacher’s normative samples (n = 302, 2- to 5-year-old children and subgroups), the questionnaire has shown good reliability (Cronbach’s α between .86 and .95), low parent–teacher agreement (overall M correlation = .19), good mean test–retest correlation in 2 to 5 weeks (M correlation = .83), and significant association with Attention and Externalizing Behavior scales (Gioia et al., 2003). WM and PO (and EMI) obtained the strongest correlations with inattention and inhibit with hyperactivity. Bonillo, Araujo, Jané, Capdevila, and Riera (2012) carried out a second-order factor analysis of the five scales of the parent and teacher Catalan version for 3- to 6-year-old children and obtained a satisfactory three-factor solution (emergent metacognition, flexibility, and inhibitory self-control) with very good reliability indexes, but they failed to confirm the complete structure (63 items in five scales and the three broader indexes) in the two forms of the questionnaire (teachers and parents) through confirmatory factor analysis (CFA) because of the lack of fit.
The executive functions are a core component of ADHD (Barkley, 2012; Lambek et al., 2011). There is scarce literature regarding the usefulness or incremental validity of the BRIEF-P for identifying disruptive behavior disorders. Mahone and Hoffman (2007) compared twenty-five 3- to 5.9-year-old children with and without ADHD using the parents’ BRIEF-P and found that all the scales and global indices significantly discriminated between groups, with the greatest effect observed for the WM scale. They also reported significant correlations between BRIEF-P indexes and Conners’ Parent Rating Scale–Revised (Conners, 1997), which assess ADHD behaviors. In this line, Sherman and Brooks (2010) have noted some degree of overlap between the behaviors described in the BRIEF-P scales and those included in ADHD screening scales. Gioia et al. (2003) pointed out that although some specificity in the relationships between BRIEF-P scales and ADHD was found, in comparison to the school-age samples, the associations were less differentiated in the preschoolers. Therefore, more information is needed about the validity of the questionnaire in preschoolers.
The objective of this work is to evaluate the psychometric properties of the teacher BRIEF-P in preschool children at age 3, in an extensive Spanish community sample, and to report data about its usefulness for clinical assessment of disruptive behavior disorders. The specific goals are (a) to confirm the five scales plus three global indices structures proposed by Gioia et al. (2003); (b) to analyze the internal consistency reliability of the derived factors; (c) to provide validity evidence in relation to external variables relevant in mental health, such as psychopathology, temperament, functional impairment, or executive functioning; (d) to analyze whether the BRIEF-P adds significant information for the identification of disruptive behavior disorders when used in combination with other measures of behavior problems or performance-based executive functioning; and (e) to assess the screening accuracy of the BRIEF-P for identifying the Diagnostic and Statistical Manual of Mental Disorders (4th ed.; DSM-IV-TR; American Psychiatric Association, 2000) disruptive behavior disorders.
Method
Participants
Data are from a large-scale longitudinal study of behavioral problems in preschool children from age 3 (Ezpeleta, de la Osa, & Doménech, 2011). The measures were applied in a two-phase design, starting with a random sample of 2,283 children selected from the census of preschoolers in Grade P3 (3 years old) in Barcelona for 2009 to 2010. A total of 1,341 families agreed to participate in the first phase of the study (58.7%). Of these, 33.6% were of high socioeconomic status (SES), 43.1% middle, and 23.3% low. Children’s mean age was 3.0 years (SD = 0.18), 683 were boys (50.9%), and 89.3% were White. There were no sex differences (p = .95) between those who agreed to participate and those who declined, but semipublic schools were significantly more likely to refuse to participate than public ones (p < .001), and high SES families participated more than low-status families (p < .001). The parents of children participating in this first phase completed the Strengths and Difficulties Questionnaire (SDQ3-4)–Parent version (R. Goodman, 1997), which was used for screening purposes.
In the second phase, all children with a positive screening score for behavioral problems and a random sample of 30% of children with a negative screening score were invited to continue. The final second-phase sample included families of 622 children (10.6% of those invited refused to participate in the second phase). No statistical differences were found on comparing participants and refusals by sex (p = .82) or by type of school (p = .85). Children’s mean age was 3.0 (SD = 0.16), 311 were boys (50%), and 88.9% were White; 32.9% were of high SES, 45.7% middle, and 21.4% low. Weighted DSM-IV prevalence in the final sample (N = 622) was as follows: 3.6% of the children presented ADHD (3.1% hyperactive and 0.5% combined), 6.9% oppositional defiant disorder (ODD), 1.4% conduct disorder (CD), 0.4% major depression, 2.2% separation anxiety, 3.7% specific phobia, and 1.9% social phobia. A total of 17.6% of the mothers and 22.6% of the fathers had less than 7 years of education. BRIEF-P was unavailable for 2 children, resulting in a final sample of 620 for the analyses. At age 4, 604 children remained in the study.
Children showing intellectual disability or pervasive developmental disorders and families with difficulties with Spanish or Catalan, without a primary caregiver who could report about the child, or who were moving to another city within a year were excluded.
A total of 94 teachers from 54 schools answered the BRIEF-P (96.8 females, representative of the percentage of women working in infant education in our country). Participating teachers had known the 3-year-olds for a mean of 7.6 months (SD = 2.2). In total, 3.4% of the teachers indicated that they had met the children less than 3 months previously, and 18.6% less than 6 months before. A comparison was made between BRIEF-P mean scores of all the scales (including the two validity scales) of teachers who had known the children for at least 6 months and those who had not, yielding no significant differences (p from .10 to .88). Teachers spent between 5 and 8 hr a day with the children at the school. Since there were no differences according to whether the children had been known to the teacher for more or less than 6 months, and the number of hours the teachers shared with the children guaranteed the possibility of good observation, we retained in the analyses the assessments of all the teachers in the sample, so as to give more power to the statistical analyses.
Instruments
The BRIEF-P (Gioia et al., 2003), described previously, contains 63 items referring to behaviors in the last 6 months with 3 response options (0 = never to 2 = very often/always) reflecting the degree to which these behaviors are a problem. It takes approximately 10 to 15 min to complete. It was answered by teachers. The Spanish version provided by the publisher was used for the study.
Large differences in T-scores between the three BRIEF-P second-order scales ISCI, FI, and EMI (which preclude the use of GEC index as a summary measure) were found in 16.3% of the boys and 19.2% of the girls (differences by sex, p = .350). There were no differences in the percentage of inconsistent questionnaires for boys (3.2%) and girls (1.3%, p = .114) or in the percentage of high scores on the Negative scale (2.2% for boys and 3.6% for girls, p = .322).
The SDQ3-4 (R. Goodman, 1997) for parents of 3- to 4-year-old children was used as a screening instrument. The SDQ3-4 is made up of 25 items with three response options (0 = not true, 1 = somewhat true, 2 = certainly true) on five scales: Emotional Symptoms, Conduct Problems, Hyperactivity/Inattention, Peer Relationship Problems, and Prosocial Behavior. Two broader internalizing (sum of the items of Emotional and Peers subscales) and externalizing factors (sum of Conduct and Hyperactivity subscales; A. Goodman, Lamping, & Ploubidis, 2010) were also used. The Cronbach’s alpha reliability of the scales in the present sample ranged from .55 (peers) to .76 (externalizing) for parents and between .63 (peers) and .84 (hyperactivity) for teachers (Ezpeleta, Granero, de la Osa, Penelo, & Domènech, 2012).
The Diagnostic Interview for Children and Adolescents for Parents of Preschool and Young Children (DICA-PPYC; Ezpeleta, de la Osa, Granero, Doménech, & Reich, 2011) is a semistructured diagnostic interview designed to assess the most common psychological disorders at ages 3 to 7 years according to the DSM-IV-TR (4th ed., text rev.; American Psychiatric Association, 2000) criteria. Parents were interviewed about the presence of symptoms, and where information on symptoms was available, additional questions explored service use. Disorders were assessed over the lifetime.
The Children’s Behavior Questionnaire for ages 3-7 years (CBQ3-7; Rothbart, Ahadi, Hershey, & Fisher, 2001) measures reactive and self-regulative temperament, with 94 items on a 7-point Likert-type scale, ranging from 1 (extremely untrue) to 7 (extremely true). It was answered by parents. The three broad dimensions of temperament, negative affectivity (anger-frustration, discomfort, fear, sadness, soothability), effortful control (attention focusing, inhibitory control, low-intensity pleasure, perceptual sensitivity), and surgency (activity level, high-intensity pleasure, impulsivity, shyness), were used for this study. Cronbach’s alpha of the scales in the present sample ranged from .71 (negative affectivity) to .79 (effortful control).
The Children’s Global Assessment Scale (CGAS; Ezpeleta, Granero, & de la Osa, 1999; Shaffer et al., 1983) was applied by the interviewer after the diagnostic interview. It was used to rate global functional impairment associated with psychopathology (0 indicating the highest impairment and 100 the lowest). Scores above 70 indicate normal functioning.
The Kiddie–Continuous Performance Test (K-CPT; Conners, 2006) is a performance-based measure of attention function and response inhibition for children aged 4 to 5. The software present stimuli consisting of familiar pictures for very young children (doll, truck, ball, etc.), and response/no response is to be made depending on the picture that appears. The program provides 12 variables related to attention: omission means lack of a required response, commission is a response after a stimulus that requires none, hit reaction time is the mean response time over all five time blocks, hit reaction time standard error is a measure of erratic responding, variability of standard errors is the response time consistency, attentiveness (d′) denotes how well the individual discriminates between targets and nontargets, perseverations is a response that occurs less than 100 ms following a stimulus, and the remaining variables are measures of reaction time. K-CPT was administered in the 2nd year of the longitudinal study, when the children were 4 years old (as stipulated in the questionnaire manual).
Procedure
The longitudinal project was approved by the ethics review committee of the authors’ institution. The heads of the schools participating, as well as the children’s parents, received a complete description of the study. Families were recruited at the schools and gave written consent. All parents of children from P3 (3-year-olds) in the participating schools were invited to answer the SDQ3-4. Families who agreed and met the screening criteria were contacted by telephone and interviewed at the school. Interviewers were previously trained and were blind to the children’s screening group. After the interview, the interviewer completed the CGAS and parents answered the questionnaires. After obtaining permission from the parents, the teachers’ questionnaires were given to them for completion before the end of the academic year.
All the measures were completed when the children were 3 years old, except the K-CPT, which was completed at age 4. Assessments were carried out with a 1-year interval (M between assessments = 11.4 months, SD = 2.3).
Statistical Analysis
The statistical analysis was carried out with SPSS19 and Mplus6. Because of the multistage sample, data were analyzed through complex sample tools in SPSS, creating a plan file with sampling weights inversely proportional to the probability of participant selection and with the case weighting procedure in the Mplus6 program.
The 63 BRIEF-P items were submitted to CFA with Mplus6, using weighted least squares means and variance (WLSMV) adjusted for the categorical data method of estimation. Covariance matrices were analyzed. The model considered was a five-first-order-factor plus a three-second-order-factor model, following the scoring instructions (Gioia et al., 2003). The five first-order factors considered were as follows: IN (16 items), EC (10 items), SH (10 items), WM (17 items), and PO (10 items). The three second-order-factor models added were as follows: ISCI (26 items grouping IN and EC), FI (20 items grouping SH and also EC), and EMI (27 items comprising WM and PO). Goodness of fit was assessed with the common fit indices (Jackson, Gillaspy, & Purc-Stephenson, 2009): χ2, Tucker and Lewis index (TLI), comparative fit index (CFI), and root mean square error of approximation (RMSEA). The following thresholds were adopted: RMSEA < 0.05 and TLI and CFI > 0.95 are indicative of good fit, whereas RMSEA < 0.08 and TLI and CFI > 0.90 represent reasonable fit (Marsh, Hau, & Wen, 2004). Internal consistency of the derived scores was measured with Cronbach’s alpha.
The association between BRIEF-P and other psychological measures was carried out with the Pearson correlation; |R| ≥ .30 was considered a good effect size.
Hierarchical binary logistic regressions explored the incremental validity of BRIEF-P scale scores (added to the SDQ, CGAS, and K-CPT scores) to discriminate the presence of ADHD, ODD, and disruptive disorders (ADHD, ODD, or CD). The incremental validity was measured with the change/increase of Nagelkerke’s R2 coefficient (ΔR2), comparing the first block, including the SDQ, CGAS, or K-CPT, and the second block that added the BRIEF-P scores.
Binary logistic regressions valued the screening accuracy of BRIEF-P to identify ADHD and ODD at ages 3 and 4. The area under the receiver operating characteristic (ROC) curve (AUC) measured the global discriminative accuracy of the BRIEF-P scores.
The best empirical cut-off of each BRIEF-P scale was calculated to detect the presence of ADHD, ODD, and disruptive behavior disorders. Given that we were analyzing here the supposed use of the questionnaire for screening purposes, the selection was based on sensitivity of no less than 0.70.
Results
CFA for BRIEF-P Items and Internal Consistency
Table 1 shows the results for the model considered. To adjust the model correctly, 4 items showing an extreme endorsement were removed: Items 1 (“overreacts to small problems”), 13 (“has to be more closely supervised than similar playmates”), 35 (“has trouble changing activities”), and 58 (“has trouble remembering something, even after a brief period of time”). The five-first-order-factor model plus a three-second-order-factor model yielded acceptable goodness-of-fit indices: χ2 (1646) = 4,892.6, CFI = 0.89, TLI = 0.89, RMSEA = 0.056. All factor loadings and factor correlations were statistically significant, and standardized factor loadings were above 0.50. Internal consistency was satisfactory (α ≥ .87) for the resulting scales.
Confirmatory Factor Analysis for the BRIEF-P: Standardized Solution.
Note: BRIEF-P = Behavior Rating Inventory of Executive Functioning–Preschool version; IN = Inhibit; EC = Emotional Control; SH = Shift; WM = Working Memory; PO = Plan/Organize; ISCI = Inhibitory Self-Control Index (IN + EC); FI = Flexibility Index (SH + EC); EMI = Emergent Metacognition Index (WM + PO); GEC = Global Executive Composite. N = 620. Standardized factor loadings (λ), factor correlations (φ), and internal consistency (α). In italics: Cronbach’s alpha (no. items).
The following analyses are conducted with the direct scores derived, and therefore, Items 1, 13, 35, and 58 are not included.
Convergent Validity of BRIEF-P With Other Psychological Measures
Table 2 shows the correlations of the BRIEF-P with other psychological measures. Only BRIEF-P-IN correlated positively with SDQ-parents externalizing problems. However, most of the correlation coefficients between BRIEF-P and SDQ-teachers scales showed significant positive associations. CBQ-surgency correlated positively with BRIEF-P-IN and negatively with BRIEF-P-SH, whereas CBQ-effortful control correlated negatively with BRIEF-P-WM, PO, EMI, and GEC. None of the BRIEF-P scales were associated with the performance-based measure of attention (K-CPT).
Association Between BRIEF-P and Other Psychological Measures.
Note: BRIEF-P = Behavior Rating Inventory of Executive Functioning–Preschool version; IN = Inhibit; SH = Shift; EC = Emotional Control; WM = Working Memory; PO = Plan/Organize; ISCI = Inhibitory Self-Control Index (IN + EC); FI = Flexibility Index (SH + EC); EMI = Emergent Metacognition Index (WM + PO); GEC = Global Executive Composite; SDQ = Strengths and Difficulties Questionnaire; CBQ = Children’s Behavior Questionnaire. Pearson’s correlation coefficients. In bold: |R| ≥ .30.
Incremental Validity of BRIEF-P
The results of the hierarchical binary logistic regressions that valued the incremental validity of BRIEF-P to identify the presence of disruptive disorders (Table 3) indicate that BRIEF-P achieved incremental predictive accuracy to detect ADHD added to SDQ-parents (ΔR2 = 5.3%), CGAS (ΔR2 = 11.4%), and K-CPT (ΔR2 = 15.3%). The specific BRIEF-P scale scores that obtained significant incremental predictive accuracy for detecting the presence of ADHD were IN (ΔR2 = 4.9% added to SDQ-parents and ΔR2 = 11.0% added to CGAS), and IN plus SH (ΔR2 = 13.3%). No significant incremental validity was found when the BRIEF-P measures were added to SDQ-teachers scores for detecting the presence of ADHD. The BRIEF-P scales only achieved significant incremental predictive accuracy for the presence of ODD added to K-CPT, but the increase was low (ΔR2 = 1.9% for the whole BRIEF-P questionnaire and ΔR2 = 0.9% for EC). Considering the presence of any disruptive disorder (ADHD, ODD, or CD), the BRIEF-P achieved significant incremental validity added to K-CPT (ΔR2 = 5.6% for the complete questionnaire and ΔR2 = 3.2% for the specific scale with significant predictive association, EC; Table 3 online).
Incremental Validity of BRIEF-P to Identify ADHD and ODD.
Note: BRIEF-P = Behavior Rating Inventory of Executive Functioning–Preschool version; ODD = oppositional defiant disorder; SDQ = Strengths and Difficulties Questionnaire; CGAS = Children’s Global Assessment Scale; K-CPT = Kiddie–Continuous Performance Test; R2 = Nagelkerke’s R2 coefficient; RT = reaction time; SE = standard error; ISI = interstimulus interval; ΔR2 = increase/change in the Nagelkerke’s R2 coefficient for the second step. In bold: significant parameters.
New hierarchical regressions in three steps were carried out, so as to value the incremental validity of BRIEF-P (entered in the third step) on SDQ-parents (entered in the first-step) and K-CPT (entered in the second step). BRIEF-P achieved significant predictive increase for the presence of ADHD (ΔR2 = 6.6% for the complete BRIEF-P questionnaire and ΔR2 = 5.5% for IN) and ODD (ΔR2 = 2.0% for the complete BRIEF-P questionnaire and ΔR2 = 0.8% for IN; Table 3 online).
Screening Accuracy of BRIEF-P for Identifying DSM-IV Disorders
Table 4 (top) shows the means of BRIEF-P scale scores for children with and without the presence of ADHD diagnosis and the binary logistic regressions that valued the screening accuracy of the questionnaire to identify this disorder. The BRIEF-P achieved acceptable discriminative accuracy for ADHD: AUC ranged from 0.74 to 0.80, and Nagelkerke’s R2 was between 11.0% and 18.5%. The specific BRIEF-P scales with significant predictive accuracy were IN, ISCI, and GEC at ages 3 and 4, and FI at age 4. The discriminative accuracy for identifying the presence of ODD was low (AUC between 0.55 and 0.62, and R2 between 0.2% and 5.3%) and nonsignificant for the second-order scales ISCI, FI, and EMI (Table 4, bottom). The only scales with significant discriminative accuracy for the screening of ODD were EC and GEC.
Screening Accuracy of BRIEF-P to Identify ADHD and ODD.
Note: BRIEF-P = Behavior Rating Inventory of Executive Functioning–Preschool version; ODD = oppositional defiant disorder; OR = odds ratio; CI = confidence interval; AUC = area under ROC curve; ROC = receiver operating characteristic; ISCI = Inhibitory Self-Control Index; FI = Flexibility Index; EMI = Emergent Metacognition Index; GEC = Global Executive Composite; R2 = Nagelkerke’s R2. In bold: significant coefficient (.05 level).
Table 5 shows the empirical best cut-off of BRIEF-P scale scores for the screening of ADHD, ODD, and disruptive behavior disorders (ADHD, ODD, or CD).
Empirical Cut-Off of BRIEF-P to Detect ADHD, ODD, and Disruptive Behavior Disorders.
Note: BRIEF-P = Behavior Rating Inventory of Executive Functioning–Preschool version; ODD = oppositional defiant disorder; Se = sensitivity; Sp = specificity; ISCI = Inhibitory Self-Control Index; FI = Flexibility Index; EMI = Emergent Metacognition Index; GEC = Global Executive Composite. Disruptive behavior disorders include ADHD, oppositional defiant disorder, and conduct disorder.
Description of Scores by Sex
Annex 1 (online material) shows the means and standard deviations of raw scores for the BRIEF-P scales. Some statistical differences emerged by sex (p < .05; boys scored higher than girls in IN, WM, and PO, and ISCI, EMI, and GEC indexes). Effect sizes comparing mean scores by sex through Cohen’s d (Cohen, 1988) were <0.50, indicating that the effects were small.
Annex 2 (online material) shows the clinical thresholds for direct scores (percentiles 80, 90, and 95).
Discussion
This is one of the few studies analyzing the psychometric properties of the teachers’ BRIEF-P. The five-first-order-factor plus three-second-order-factor model was confirmed with satisfactory internal consistency, moderate convergent validity with other measures of psychopathology and temperament, and good ability to discriminate between those children with ADHD and those without. BRIEF-P scales were not associated with a performance-based measure of attention and inhibition. The teacher’s BRIEF-P added a considerable percentage of significant clinical information when used with other instruments that assess psychopathology reported by parents, functional impairment or performance-based attention and inhibition, showing that it might be a useful assessment tool, specifically for ADHD.
Regarding validity, it was possible to confirm the internal structure proposed by Gioia et al. (2003). Different alternative models were considered—a model containing the 63 items and five first-order factors, and another containing the 63 items and three first-order factors—but they yielded unsatisfactory solutions. Therefore, the five-first-order-factor plus a three-second-order-factor model emerged as most appropriate for explaining the structure underlying the items of the BRIEF-P, despite the fact that 4 items were omitted for fitting the model (2 in IN, 1 in EC, and 1 in SH). This deletion could limit slightly the comparisons with results of other studies using this questionnaire. However, the results of the psychometric properties reported throughout this work with the direct scores of the empirical factors support the reliability and validity of the structure proposed. To our knowledge, this is the first study in preschoolers that obtained this internal structure by factorizing items instead of scales.
In relation to external measures, there was strong convergence between the BRIEF-P and measures of psychopathology (SDQ3-4) when the informant was the same (teacher), but the association was low when the information from different informants (parent–teacher) was correlated. Teachers were consistent in reporting the behavior described in the two different instruments and reported that behaviors reflecting dysfunction in executive functioning were positively correlated with behaviors reflecting psychopathology. The lack of agreement between teachers’ and parents’ reports is a common phenomenon in psychological assessment, which has been explained by the fact that different informants observe different behaviors in different contexts (de los Reyes & Kazdin, 2005; Smith, 2007). Our results highlight the need to include both teacher and parent ratings in the assessment of disruptive behavior disorders in preschool children. In addition, there was a significant negative association between BRIEF-P-WM and PO and the temperament dimension effortful control, typically associated with ADHD (Martel & Nigg, 2006), and between surgency (extraversion) and difficulties in BRIEF-P-IN and low SH, supporting convergence with temperament measures.
In addition in relation to convergent validity, a lack of association has been observed between traditional performance-based measures of executive functioning and questionnaires in adults and in children, to the extent that researchers have asked themselves whether the BRIEF-P is more a measure of impairment than a measure of executive function (Bakar, Taner, Soysal, Karakas, & Turgay, 2011; Garcia-Molina, Tormos, Bernabeu, Junque, & Roig-Rovira, 2012). We found no significant association between BRIEF-P scales and performance-based measures of attention and inhibition, in line with the previous literature on preschoolers and older children (Mahone & Hoffman, 2007; Toplak, Bucciarelli, Jain, & Tannock, 2009). However, most interestingly, we extended our study to incremental validity, and also studied the amount of variance explained by the BRIEF-P when used in combination with other measures. The incremental validity of the BRIEF-P has not been tested previously in preschoolers. We found that the BRIEF-P added a substantial amount of information for identifying ADHD to questionnaires of psychopathology answered by parents, to diagnostic interview with parents for rating functional impairment, and to K-CPT performance-based measures of attention and inhibition answered by the child. Used with these measures and these reporters, the BRIEF-P answered by teachers contributes specific and significant clinical information on dysfunction in executive functioning. However, when the teachers answered a measure of psychopathology (such as SDQ3-4 Conduct and Hyperactivity scales) and the BRIEF-P, the incremental validity of the BRIEF-P was negligible, and in this case, the results yielded by examination of everyday behavior for assessing executive function as observed in school are very close to those yielded by the observation of psychopathological behavior. Consequently, different informants must be used to obtain the incremental information with the BRIEF-P for identifying ADHD when used together with other measures of psychopathology.
In the area of test–criterion relationships, the BRIEF-P also showed good discriminative ability to identify cases with and without ADHD cross-sectionally and 1 year later. The most predictive scale was IN, and the best index was ISCI. Considering that in our sample the most frequent ADHD subtype was hyperactive, these results are congruent with executive dysfunction found in this disorder (Willcutt, Doyle, Nigg, Faraone, & Pennington, 2005). In contrast, the discriminative ability to identify ODD was poor. Several reports have indicated that deficits in executive functioning are more specific of ADHD than of ODD (Brocki, Nyberg, Thorell, & Bohlin, 2007; Thorell & Wahlstedt, 2006) and that ADHD is associated with inhibition problems, whereas ODD is more related to emotional dysregulation (Espy, Sheffield, Wiebe, Clark, & Moehr, 2011).
Finally, we also provide different cut-offs that might be useful for identifying disruptive behavior problems if the questionnaire is used for screening purposes. If executive dysfunction was to be identified early in life, preventive measures could be implemented. Early detection is especially relevant in the general population, as used in the present study.
Some limitations should be taken into account on interpreting the present results. As parents’ ratings on the BRIEF-P were not obtained, the findings cannot be generalized to the use of the questionnaire by parents. In addition, we studied only 3- and 4-year-old children, and this limits the generalization of the results to other ages. However, we focused the study on this age group because it corresponds to the start of schooling, when the detection of dysfunctions for prevention is most important. The methods used in this study were standard in Western clinical psychology, as far as diagnostic criteria (DSM-IV-TR) and assessment techniques are concerned, so that the constructs assessed are defined similarly in other Western cultures. However, despite the fact that the internal structure and internal consistency we found are similar to those presented in other published studies with samples of other ages and from other countries, teachers may evaluate the behaviors differently across different cultures. Consequently, the present results may not be generalizable to other cultures without the support of accumulated empirical evidence guaranteeing the equivalence of the constructs and their correlates. We obtained a higher percentage of differences in T-scores between the three BRIEF-P second-order-scale scores than that estimated in the manual (10%), so that caution should be exercised on interpreting the GEC as a summary executive index measure. Furthermore, in a small percentage of cases, there were inconsistencies in the answers to the questionnaire, as measured by the Inconsistency scale. We should bear in mind the authors’ indication in the questionnaire that the content of the Inconsistency scale items, although very similar, is not exactly the same. Analysis of the inconsistencies found indicates that 80% of them were between Items 48 (“talks or plays too loudly”) and 54 (“has trouble putting the brakes on his or her actions even after being asked”), and between 1 (“overreacts to small problems”) and 26 (“small events trigger big reactions”), where the interpretation of the teachers might have been different. Another possible limitation is that because we studied a sample of the general population and psychopathology is not very frequent in community samples, the discriminative power may have been underestimated. Finally, few families of low SES participated, and this could have led to bias, insofar as our results might be better representing executive dysfunction in more favored socioeconomic classes.
Footnotes
Acknowledgements
We thank the participating schools and families.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: Funding was from Spanish Ministry of Science and Innovation Grant PSI2009-07542.
