Abstract
The current study investigated the cognitive deficit profiles among individuals with mathematics difficulties (MD) and potential moderators and mechanism for these profiles. Seventy-five cognitive profiling studies on MD were included, representing a total of 13,001 individuals and 126 independent samples. Results showed that compared with typically developing individuals, individuals with MD showed deficits (from most severe to less severe) in phonological processing, processing speed, working memory, attention, short-term memory, executive functions, and visuospatial skills. Moderation analyses indicated that comorbidity (with reading disabilities) and types of MD screening affected the cognitive deficits. Severity of MD was related to processing speed deficits. Deficits in phonological processing and attention were more severe in younger individuals with MD. Deficits in processing speed and working memory were most severe in the numerical domain. Deficits in low-level cognitive skills (i.e., processing speed and short-term memory) could not completely explain the deficits in high-level skills (i.e., working memory, attention, and executive functions), partially supporting the bottleneck theory. These findings, taken together, suggest that (a) deficits in processing speed and working memory are most salient and stable cognitive markers of MD, (b) numerical-processing deficit and the cognitive deficits of MD are relatively independent from each other, and (c) MD may be a discrete construct with heterogeneity reflected by MD subtypes and age. Implications for incorporating cognition in the diagnosis and the interventions for MD are discussed.
While mathematical competency is critical for competing successfully in today’s high-technology world, learning mathematics is challenging for many children. Converging evidence shows that approximately 6% of the school-aged population has some form of mathematics difficulties (MD) even with average or higher IQ and adequate instruction (Berch & Mazzocco, 2007; Gross-Tsur, Manor, & Shalev, 1996; Shalev, 2007). Despite the prevalence of MD, MD is studied far less than reading difficulties (RD; Gersten, Clarke, & Mazzocco, 2007). In recent years, an increasing number of studies have examined the factors associated with MD and suggested two major approaches to understanding the deficit profiles of MD. One is through the investigation of deficits in domain-specific skills (Geary, 1993; Gersten, Jordan, & Flojo, 2005). The other is through the investigation of deficits in domain-general cognitive skills (e.g., Johnson, Humphrey, Mellard, Woods, & Swanson, 2010; Peng & Fuchs, 2016; Swanson & Jerman, 2006).
Regarding the domain-specific skill deficits among MD, there is a consensus that MD is related to deficits in the numerical-processing system. For example, Geary (1993) conducted a literature review of MD research with a focus on conceptual and procedural competencies of arithmetic and suggested that MD is related to deficits in the representation of arithmetic facts from semantic memory, the execution of arithmetic procedures, and visuospatial presentation of numerical information. The synthesis by Gersten et al. (2005) is consistent with Geary (1993), suggesting that the deficits in numerical-related skills, such as magnitude comparison, counting strategies, identification of numbers, and numerical working memory, are important markers for MD.
Regarding the domain-general skill deficits among MD, previous reviews focused on the memory system. Specifically, Swanson and Jerman (2006) and Peng and Fuchs (2016) used meta-analyses to examine the memory deficits of MD among elementary school children. Both reviews indicated that children with MD, regardless of comorbidity in mathematics and reading difficulties (MDRD), show memory deficits. Children with MD are characterized with distinctive visuospatial and numerical memory deficits. Using a more comprehensive set of cognitive variables, Johnson et al. (2010) studied the deficit profiles of working memory, short-term memory, executive functions, and processing speed of MD. However, due to the limited number of effect sizes (less than 8 effect sizes for each skill), they could not draw a robust conclusion on these deficit profiles of MD or systematically examine moderators that influence the profiles.
Although previous reviews have advanced our understanding of domain-specific and domain-general deficits of MD, several important questions remain unanswered, which we aimed to address in the present study. Specifically, we examined (a) whether MD is related to deficits in other cognitive skills besides memory deficits; (b) whether the cognitive deficit profiles of MD are influenced by comorbidity (i.e., MDRD vs. MD-only), types of MD screening (i.e., calculation vs. problem solving vs. comprehensive mathematics), severity of MD, domains of task (i.e., verbal vs. numerical vs. visuospatial), or age; and (c) whether deficits in low-level cognitive skills (i.e., processing speed and short-term memory) explain the deficits in high-level cognitive skills (e.g., attention, working memory, and executive functions) among MD.
Investigating these questions is both theoretically and practically important. From the theoretical perspective, the deficit profiles in cognitive skills help further understand the nature of MD. That is, besides numerical-processing and memory deficits, is MD also related to deficits in other cognitive skills (Geary, 2005; Mazzocco, 2005)? If yes, would this relation reflect the numerical deficit nature of MD such that cognitive deficits among MD are related to numerical processing specifically? Is there heterogeneity among MD? If yes, does this heterogeneity reflect different cognitive deficit profiles among different types of MD or does this heterogeneity reflect the changing cognitive deficit profiles as a function of time? Moreover, does MD have deficits in both low-level and high-level cognitive skills? If yes, do deficits in low-level cognitive skills explain the deficits in high-level cognitive skills, as suggested by the bottleneck theory (Peng, Sha, & Li, 2013; Salthouse, 1996; Swanson & Sachse-Lee, 2001)?
From a practical perspective, answering these questions has implications for MD identification and intervention. Specifically, if the cognitive deficit profiles vary among different types of MD or vary with time, there should be more accurate identification of MD subtypes and thus more individualized instruction for a MD subtype at a specific period of time. Moreover, an increasing number of studies in recent years have investigated whether training high-level cognitive skills (e.g., working memory) improves cognition and whether such improvement transfers to mathematics outcomes (e.g., Barnes et al., 2016; Holmes, Gathercole, & Dunning, 2009; Kroesbergen, van’t Noordende, & Kolkman, 2014). Although some studies have found training effects on cognitive skills, most have failed to find the transfer effects (e.g., Jacob & Parkinson, 2015; Melby-Lervåg & Hulme, 2013; Morrison & Chein, 2011; Peng & Miller, 2016; Shipstead, Redick, & Engle, 2012). The lack of far transfer has often been discussed as being related to (a) variability in the types of cognitive training (Jacob & Parkinson, 2015), (b) variability in the domains of task training materials (Peng & Fuchs, 2016), and (c) variability in the population receiving the training (Shipstead et al., 2012). Investigating the moderation effects of types of MD, severity of MD, domains of task, and age on the cognitive deficit profiles of MD can provide empirical evidence that may help address the conflicting findings in the cognitive intervention research. In the subsequent sections, we provide the rationale for the inclusion of the cognitive skills, moderators, and relevant theories.
Cognitive Deficits Associated With MD
Processing Speed
Processing speed is the speed a person can encode, transform, and retrieve information (Conway, Cowan, Bunting, Therriault, & Minkoff, 2002) and is often operationalized as the speed of responding to simple tasks in which everyone would answer correctly if there were no time limits (Salthouse, 1996). Processing speed is assumed to influence early mathematics development by facilitating simple processes, such as fluent counting for finding answers, thereby permitting associations between problems and their answers to be held in working memory before committing to long-term memory (Geary, 1993). From a developmental perspective, processing speed is the major reason for the age differences in numerical competence (e.g., Holloway & Ansari, 2008). With slower processing, the interval increases for deriving counted answers and for pairing a problem stem with its answer in working memory; this creates the possibility that “decay” sets in before completing the computational sequence. In fact, some research indicates that low processing speed is a prominent cognitive feature among children with MD (e.g., Cirino, Fuchs, Elias, Powell, & Schumacher, 2015; L. S. Fuchs et al., 2006; L. S. Fuchs et al., 2008; Jordan & Montani, 1997). By contrast, others indicate that children with MD do not show deficits in processing speed (e.g., Andersson & Lyxell, 2007; Jordan, Hanich, & Kaplan, 2003; Moll, Göbel, Gooch, Landerl, & Snowling, 2016).
Types of MD, age, and domains of task may be potential factors contributing to these mixed findings. For example, some research suggests that processing speed may be especially important for the early mathematics development (e.g., L. S. Fuchs et al., 2006), and thus processing speed deficits may be most severe among young children with MD, especially for those with calculation difficulties (e.g., Bull & Johnston, 1997; L. S. Fuchs et al., 2006). Others suggest that slow processing may be related to word problem–solving difficulties, such that children with word problem–solving difficulties tend to use more time-consuming, inefficient word problem–solving strategies (Geary, 1993). Moreover, some research shows that children with MD only show slow numerical information processing, suggesting that processing speed deficits in MD may be numerical-specific (Geary, 1993).
Phonological Processing
Phonological processing refers to the ability to manipulate phonological representations (e.g., sounds) and efficiently retrieve phonological representations from long-term memory (Wagner, Torgesen, & Rashotte, 1999). According to the phonological representation hypothesis, solving calculation problems requires encoding and maintaining phonological representations in short-term memory as well as the retrieval of phonological codes from long-term memory (Hecht, Torgesen, Wagner, & Rashotte, 2001; Simmons & Singleton, 2008; Vukovic & Lesaux, 2013). For example, to solve the equation, (4 + 5)/3 =, one needs to encode this equation phonologically as “the sum of four plus five divided by three,” retrieve “9” from long-term memory for “4 + 5,” and hold the phonological information in short-term memory to solve the problem. Poor phonological processing skills may cause poor representation of numerical codes in both short-term and long-term memory, which may impede mathematics performance. This hypothesis explains the high prevalence of comorbidity between RD and MD in early childhood such that young children with RD with poor phonological processing skills often have MD (e.g., Simmons & Singleton, 2008). However, some research also documents that young children with MD, nonetheless, have good phonological processing skills (e.g., L. S. Fuchs et al., 2005; Landerl, Fussenegger, Moll, & Willburger, 2009; Simmons & Singleton, 2008), suggesting that phonological processing may not be an influential factor in early mathematics competence.
These mixed findings should be interpreted with respect to types of MD and how phonological processing was measured. For example, due to phonological processing deficits being closely related to RD, it is likely that MDRD, but not MD-only, is related to deficits in both manipulating and efficiently retrieving phonological representations (Simmons & Singleton, 2008). Previous research also shows that the manipulation of phonological representations may indirectly influence mathematical achievement via learning number words and the number-word sequence (Krajewski & Schneider, 2009), and thus deficits in phonological manipulation may be particularly related to MD identified with calculation difficulties early on (Hecht et al., 2001). Moreover, some suggest that the deficits in efficient retrieval of numerical information from long-term memory are distinct deficits in MD (Geary, 1993), and thus phonological retrieval deficits, compared with phonological manipulation, is more important to MD, regardless of comorbidity and age.
Visuospatial Skills
Visuospatial skills are the abilities to represent, transform, generate, and recall symbolic, nonlinguistic information (Halpern, 2004; Linn & Petersen, 1985). Previous research suggests that visuospatial skills are linked to mathematics achievement, and training visuospatial skills may improve mathematics performance (Cheng & Mix, 2014). However, whether deficits in visuospatial skills contribute to MD is unclear (e.g., Badian, 1999; Landerl, Bevan, & Butterworth, 2004; Swanson, 2012). Types of visuospatial skills and types of MD should be considered for the visuospatial deficit profiles of MD. Visuospatial skills are a comprehensive construct that includes (but are not limited to) several skills, such as visuospatial perception (i.e., determine visuospatial relationships with respect to the orientation of their own bodies, in spite of distracting information), spatial visualization (i.e., complicated and multistep manipulations of visuospatial information), and mental rotation (i.e., mentally rotate one stimulus to align it with a comparison stimulus; Linn & Petersen, 1985). While visuospatial perception is suggested to be related to early numerical skills, such as understanding place value, transferring and laying out numbers, and executing operations in calculations (Geary, Hoard, Byrd□Craven, Nugent, & Numtee, 2007; Mazzocco, Singh Bhatia, & Lesniak-Karpiak, 2006; Venneri, Cornoldi, & Garuti, 2003), mental rotation and spatial visualization are shown to be related to more complex and advanced mathematics skills, such as constructing schematic diagrams in word problem solving (Edens & Potter, 2008; Hegarty, Mayer, & Monk, 1995). Thus, it is expected that different types of MD may demonstrate deficits in different types of visuospatial skills, at least to varying degrees.
Memory
We focused on short-term memory and working memory. Short-term memory refers to the temporary storage of information, which is the base for early mathematics skills, such as counting, number sense, and simple arithmetic (e.g., single-digit addition and subtraction; Bull & Johnston, 1997; Hitch & McAuley, 1991; Noël, Désert, Aubrun, & Seron, 2001; Passolunghi & Siegel, 2001). Working memory refers to the ability to simultaneously store and process information, which is critical for the performance on all types of mathematics tasks (Peng, Namkung, Barnes, & Sun, 2016). Although previous research and reviews demonstrate that children with MD have deficits in both short-term memory and working memory, one question remains to be answered: Are deficits in the memory system of MD, especially working memory, domain-general or domain-specific?
The literature is mixed on this point. Previous reviews on the memory deficits among MD (Peng & Fuchs, 2016; Swanson & Jerman, 2006) indicate that individuals with MD have memory deficits across numerical, visuospatial, and verbal domains. However, Swanson and Jerman (2006) suggested that children with MD may have distinctive visuospatial working memory deficits, while Peng and Fuchs (2016) suggested that children with MD may have distinctive numerical working memory deficits. One explanation for these mixed findings may be the small and different body of studies included in those reviews. Swanson and Jerman (2006) included 28 studies before 2004 that tapped working memory measures across verbal, numerical, and visuospatial domains. Peng and Fuchs (2016) included 29 studies before 2014 and only focused on working memory tasks in the verbal or numerical domains.
Types of MD may also affect the domain specificity of the memory deficit profiles of MD. Some research shows that numerical and visuospatial processing skills are closely related to calculations, while verbal skills are more closely related to word problem solving (e.g., Raghubar, Barnes, & Hecht, 2010; Peng et al., 2016). Thus, it is expected that MD identified with calculation difficulties are likely to show more severe numerical and visuospatial memory deficits, whereas MD identified with word problem–solving difficulties may demonstrate more severe verbal memory deficits. Yet individuals with both calculation and word problem–solving difficulties may have memory deficits across verbal, numerical, and visuospatial domains to similar degrees.
Attention
Attention refers to the appropriate allocation of cognitive resources to relevant stimuli (Posner & Petersen, 1990). Because attention is often considered as the hub of cognition, children who have attention problems often demonstrate learning difficulties (e.g., Willcutt, Pennington, Olson, Chaabildas, & Hulslander, 2005). However, whether MD is necessarily associated with attention problems remains unclear (Busch, Schmidt, & Grube, 2015; Cirino, Fletcher, Ewing-Cobbs, Barnes, & Fuchs, 2007; Gold et al., 2013; McCall, 1999; Reimann, Gut, Frischknecht, & Grob, 2013; Swanson, 2012; Willcutt et al., 2013). How attention was measured may be connected to the relation between attention problems and MD. For example, studies that reported severe attention problems among MD mostly used subjective attention rating scale/checklist, such as Strengths and Weaknesses of Attention-Deficit/Hyperactivity-symptoms and Normal-behaviors (e.g., Cirino et al., 2007) and Diagnostic and Statistical Manual of Mental Disorders (e.g., Willcutt et al., 2013), whereas less severe attention problems among MD were often reported from studies using objective attention measures (e.g., Gold et al., 2013).
Age may also affect the relation between attention problems and MD. Research on children with attention-deficit hyperactivity disorder (ADHD) suggests that the negative effects of their attention deficits in mathematics are mostly due to their failure in automatized math facts retrieval. That is, the lack of automatized arithmetic facts retrieval consumes limited attention resources for children with ADHD, which hinders their mathematics performance and development (e.g., Zentall, 1990). Following this logic, if MD is related to attention difficulties, this relation may be stronger in the early learning stages, when children are actively engaged in learning and automatizing arithmetic facts. This age effect should be strong when one also considers that attention span is smaller in younger children (Ruff & Lawson, 1990). An alternative hypothesis is that the relation between attention difficulties and MD may be stronger among older individuals with MD. This may be because older individuals with MD also often struggle with automatized arithmetic facts retrieval (Geary, 1993), and the amount of attention needed in mathematics tasks may increase with more complex and advanced mathematics skills.
Executive Functions
Executive functions refer to a set of cognitive skills required to direct a behavior toward the attainment of a goal (Jacob & Parkinson, 2015). Executive functions usually include inhibition, updating, and switching (Miyake, Emerson, & Friedman, 2000). Inhibition is the ability to deliberately inhibit a dominant, automatic response in favor of a subdominant response (e.g., Passolunghi & Cornoldi, 2000; Peng, Congying, Beilei, & Sha, 2012; Swanson, 2006). Updating is the process of encoding and evaluating incoming information for relevance to the task at hand and subsequent revision of the information held in memory. Switching is the ability to shift attention or to shift between strategies or response sets. Theoretically, executive functions are critical for many mathematics tasks that involve multiple steps, sequential thinking, and complex strategies (e.g., multidigit computation, mental arithmetic performance, and problem solving; Peng et al., 2012; Raghubar et al., 2010).
Deficits in executive functions are assumed to be closely associated with poor mathematics performance, but findings on the deficit profiles of these executive functions in the MD literature are mixed (e.g., Andersson, 2008; Andersson & Lyxell, 2007; Hitch & McAuley, 1991; Passolunghi & Siegel, 2004; Peng et al., 2012; Peng et al., 2013; van der Sluis, de Jong, & van der Leij, 2004). Domains of task, types of MD, and age should be considered regarding the relation between executive function deficits and MD. For example, some research suggests that children with MD demonstrate executive function deficits across domains, but the deficits in the numerical domain are distinct (Peng et al., 2012). Some studies report that deficits in inhibition, updating, and switching are all related to calculations and word problem–solving difficulties (L. S. Fuchs et al., 2005; Passolunghi & Siegel, 2001; Passolunghi, Cornoldi, & De Liberto, 1999), while others suggest that inhibition deficits may be particularly related to calculation difficulties (Toll, Van der Ven, Kroesbergen, & Van Luit, 2011) but not to word problem–solving difficulties (Swanson & Beebe-Frankenberger, 2004). Moreover, research shows that the construct of executive function system may be different between older children/adults and younger children, such that the inhibition, updating, and switching in younger children (e.g., preschoolers) are less differentiated from one another, representing a unitary construct (Bull & Lee, 2014). Given this developmental feature of executive functions, if executive function deficits are related to MD, young children with MD should demonstrate comprehensive executive function deficits, while older children and adults with MD may show selective executive function deficits depending on types of MD (L. S. Fuchs et al., 2005; L. S. Fuchs et al., 2006; Passolunghi & Siegel, 2001; Swanson & Beebe-Frankenberger, 2004).
Factors Influencing Cognitive Deficit Profiles of MD
Types of MD
MD is often considered as a heterogeneous group, and the present study focused on several most studied subgroup comparisons in previous MD profiling research: (a) comorbidity (individual with MDRD vs. individuals with MD-only [with normal reading skills]) and (b) MD identified with calculation difficulties (MD identified with calculation measures; L. S. Fuchs et al., 2006) versus MD identified with word problem–solving difficulties (MD identified with word problem–solving measures; Fuchs, Fuchs, & Prentice, 2004) versus MD identified with comprehensive MD (identified with mathematics measures that tap calculations, problem solving, and/or other mathematics skills). Specifically, previous research suggests that different MD groups may demonstrate different cognitive deficit profiles. For example, some research suggests that compared with MD-only, MDRD show more comprehensive and severe deficits in cognitive skills (e.g., Cirino et al., 2015; Jordan & Montani, 1997; Peng et al., 2012; van der Sluis et al., 2004). In contrast, the cognitive deficit profiles among the subgroups within MD are less clear. For example, some research suggests that MD identified with calculation difficulties are mostly related to deficits in low-level cognitive skills, such as phonological processing (L. S. Fuchs et al., 2008; Swanson, 2006), processing speed (L. S. Fuchs et al., 2006; L. S. Fuchs et al., 2008; Geary et al., 2007), short-term memory (McCall, 1999), and visuospatial skills (e.g., Rovet, Szekely, & Hockenberry, 1994). By contrast, MD identified with word problem–solving difficulties are more closely associated with deficits in high-level cognitive skills, such as working memory and executive functions (L. S. Fuchs et al., 2006; L. S. Fuchs et al., 2008; Swanson, 2006; Swanson & Beebe-Frankenberger, 2004). MD identified with comprehensive MD seem to have more comprehensive and severe cognitive deficits (e.g., Cirino et al., 2015; Peng et al., 2012). Yet there are other studies suggesting that the deficit profiles among MD screened by different mathematics measures overlap (e.g., Gold et al., 2013; Landerl et al., 2004; Peng et al., 2012).
Severity of MD
Low-achievement identification is a widely adopted approach in the MD research (e.g., Andersson & Lyxell, 2007; Geary et al., 2007; Murphy, Mazzocco, Hanich, & Early, 2007; Passolunghi & Siegel, 2004; Siegel & Ryan, 1989). In accordance with low-achievement identification, individuals with MD are identified if their IQs are in a normal range but their performance is below a cutoff point on mathematics screening measures (Peng & Fuchs, 2016). However, one major problem associated with this approach is the inconsistent cutoff criterion on the mathematics screening measures. The cutoff criterion used to establish MD varies from the 35th percentile to less than the 5th percentile across studies (e.g., Landerl et al., 2004; Peng et al., 2012).
Different cutoff scores lead to different degrees of severity in MD (e.g., Murphy et al., 2007), which may affect cognitive profiles of MD. For example, Murphy et al. (2007) studied the relation between the cognitive profiles of children with MD and the mathematics achievement cutoff criteria. Murphy et al. (2007) found qualitative group differences in the profiles of mathematics related cognitive skills across groups defined by different cutoff scores (below the 10th percentile vs. between the 11th and 25th percentiles on mathematics measures). Children with less severe MD appeared to show less severe cognitive deficits than children with more severe MD. These results pose a theoretical question on whether MD is a continuous or a discrete construct, and highlight the potential effects of severity of MD on the cognitive deficits profiles. That is, if MD is a discrete construct as suggested by some research (e.g., Landerl et al., 2004), the severity of MD should not influence the deficit profiles. On the other hand, if MD is not a discrete construct, the severity of cognitive deficits in MD may change as a function of MD severity (e.g., Branum-Martin, Fletcher, & Stuebing, 2013; Murphy et al., 2007).
Age
Mathematics achievement is cumulative throughout and beyond the elementary school years with quantitative and qualitative changes occurring within and across grade levels. The required changes concern performance demands and the necessary prerequisite skills. Thus, performing a specific mathematics task (e.g., calculation) at different ages may involve different cognitive skills (Raghubar et al., 2010). Following this logic, a specific type of MD identified at different ages may relate to different underlying cognitive deficits. For example, some research shows that younger children (e.g., first grade) identified with calculation difficulties show deficits in visuospatial skills and working memory (e.g., Geary et al., 2007), while other studies indicate that older children (e.g., fourth grade and above) with calculation difficulties do not show deficits in these areas (e.g., McCall, 1999). This may be because young children have not established strong numerical-processing skills and tend to rely heavily on visuospatial skills and working memory to perform calculation tasks (Raghubar et al., 2010). Thus, deficits in visuospatial skills and working memory among young children may be related to their calculation difficulties. In contrast, older children are likely to use their numerical skills, not their visuospatial skills and working memory, to perform calculation tasks, and thus their calculation difficulties are less likely due to their poor visuospatial skills and working memory. These findings suggest that age, a proxy for skill level and experience with mathematics tasks, as well as the interaction between age and different MD types should be an important factor to understand the cognitive deficits profiles of MD.
Domains of Task
Information processing in different domains may play different roles in mathematics learning. Numerical and visuospatial information processing are critical in calculations (Geary, 1993; Swanson & Jerman, 2006) and verbal information processing is particularly important for word problem solving (L. S. Fuchs et al., 2006; L. S. Fuchs et al., 2008). Thus, it is reasonable to assume that domains of task may influence the cognitive deficit profiles of different types of MD. Indeed, previous reviews suggest that different types of MD may have domain-general working memory deficits, but the severity of cognitive deficits vary across domains, such that numerical and visuospatial cognitive deficits are more severe than verbal cognitive deficits (Peng & Fuchs, 2016; Swanson & Jerman, 2006). These findings, taken together, indicate that domains of task may be an important factor contributing to the working memory–deficit profiles of MD. Nevertheless, it remains unclear whether the domains of task affect the deficit profiles of other cognitive skills. This question is important for understanding the domain-general and domain-specific nature of cognitive skills in the context of MD. That is, the cognitive skills, especially those often considered as domain-general high-level cognitive skills (e.g., executive functions) may show domain-specific nature (deficits related to the core deficits of MD: numerical-processing deficits) among MD.
The Bottleneck Theory
One important aspect of MD profiling research that has been largely ignored is the fundamental mechanism of high-level cognitive deficits. That is, do high-level cognitive deficits among MD truly reflect the deficit in themselves or the deficits in low-level fundamental cognitive skills? This issue is related to the bottleneck theory, which claims that low-level cognitive skills, including processing speed and short-term memory, often serve as the foundations for many high-level cognitive skills, including working memory, attention, and executive functions (Salthouse, 1996). That is, processing speed and short-term memory are important for keeping incoming information available to be processed by high-level cognition, such as working memory, attention, and executive functions (Salthouse, 1996). Thus, any deficits in those low-level skills would lead to impairment in high-level cognitive skills (e.g., Peng et al., 2012; Swanson & Sachse-Lee, 2001). In mathematics, processing speed is responsible for saving more cognitive capacity for the execution of mathematics problems while short-term memory, working with processing speed, is responsible for retaining relevant intermediate information, such as interim numbers in a calculation, and verbal or numerical information in a word problem (Peng et al., 2012). These low-level cognitive skills directly support the process of working memory, attention, and executive functions in mathematics performance, such as regrouping, inhibiting irrelevant information, and forming effective word problem–solving strategies (e.g., Alloway, Gathercole, & Pickering, 2006; Noël et al., 2001; Swanson & Sachse-Lee, 2001). Thus, it is possible that inefficient low-level skills may create a “bottleneck” that constricts information flow to high levels of processing. Without controlling for low-level cognitive skills, it is not clear whether MD is related to impairments in high-level or low-level cognitive skills (Peng et al., 2012; Swanson & Sachse-Lee, 2001).
The bottleneck theory can also be linked to the Mathew effect, in which the rich get richer and the poor get poorer. Specifically, if low-level cognitive skills are foundational to high-level cognitive skills, those with weak foundational cognitive skills will also have deficits in high-level cognitive skills, which lead to worsening mathematics performance as mathematics learning becomes more complex and requires more high-level cognitive skills. Thus, deficits in low-level cognitive skills may cause the Matthew effect on the deficits of high-level cognitive skills of MD from a developmental perspective. That is, compared with the deficits in low-level cognitive skills, deficits in high-level cognitive skills among MD may become more severe as a function of age.
Aims
To sum up, the purpose of this meta-analysis was to examine three major questions. First, compared with the typically developing (TD) group, does the MD group show deficits in phonological processing, processing speed, short-term memory, working memory, visuospatial skills, attention, and executive functions? We hypothesized that individuals with MD show deficits in those skills, but the severity of deficits may vary. The severity of deficits may also vary among subtypes within some skills, such as phonological processing (manipulation vs. retrieval), visuospatial skills (visuospatial perception vs. mental rotation vs. spatial visualization), attention (subjective measures vs. objective measures), and executive functions (inhibition vs. updating vs. switching).
Second, are the deficit profiles of MD influenced by types of MD (i.e., comorbidity and types of MD screening), severity of MD, age, or domains of task? Regarding the types of MD, we hypothesized that the MDRD group may show more severe deficits across all skills than the MD-only group. MD identified with calculation difficulties would be related to deficits in phonological processing, processing speed, short-term memory, and visuospatial perception; MD identified with word problem–solving difficulties would be related to deficits in working memory, attention, and executive functions; MD identified with comprehensive MD are related to more severe deficits in all skills. With respect to severity of MD, we expected that the severity of cognitive deficits in MD may change as a function of MD severity. Age may influence the phonological processing, visuospatial, processing speed, attention, and executive function–deficit profiles of MD. Regarding the effects of task domains, we hypothesized that MD may show cognitive deficits across verbal, numerical, and visuospatial domains in processing speed, memory, attention, and executive functions, but these deficits may be more severe in the numerical or visuospatial domain.
Third, do the deficits in processing speed or short-term memory explain the deficits in working memory, attention, and executive functions of MD? Based on the bottleneck theory, we hypothesized that deficits in processing speed and short-term memory may “fully” or “partially” explain the variance in the deficits in attention, working memory, and executive functions of MD.
Method
Literature Search
Articles for this meta-analysis were identified in three ways. First, a computer search of the Education Resources Information Center and databases of ProQuest and PsycINFO was conducted. We used the earliest possible start date (1920) till June, 2017. The following terms were used to search in full text: (math*AND difficult* OR disabilit*) OR discalcul*. The terms math*, difficult*, disabilit*, discalcul* allow for inclusion of mathematics, difficulties/difficulty/disability/disabilities, dyscalculia/dyscalculic, and so forth. Second, we searched for unpublished literature through Dissertation and Masters Abstract indexes in ProQuest, Cochrane Database of Systematic Reviews, and relevant conference programs (e.g., Conference of Society for Research in Child Development; National Council of Teachers of Mathematics Annual Meeting; Psychology of Mathematics Education Annual Conference). Third, previous relevant reviews (Geary, 1993; Gersten et al., 2005; Johnson et al., 2010; Peng & Fuchs, 2016; Swanson & Jerman, 2006) were reviewed to include extra articles not identified in the first two search steps. The initial search yielded 908 studies. Two authors of this study then reviewed all studies by titles and abstracts. After excluding the duplicate 26 articles, the remaining 882 articles were closely reviewed using specific criteria. First, the study had to include a group of individuals with MD and a group of age-matched TD individuals. That is, the study had to report information showing that the MD group scored at least below the 35th percentile or below one standard deviation on mathematics screening measures, which is a common learning disability identification criterion (Swanson & Harris, 2013). Also, the study had to provide information showing that the IQ (nonverbal IQ, verbal IQ, or combination of nonverbal and verbal IQ) of individuals with MD were in the normal range (e.g., standardized score 80–120). We excluded 499 studies that did not meet these criteria. Then, we reviewed the remaining 383 studies to select studies that compared the MD with TD group on measures that tap at least one of the following cognitive skills: phonological processing, processing speed, short-term memory, working memory, attention, visuospatial skills, and executive functions. We excluded another 308 studies that did not meet this criterion. The final sample included 75 studies, including 4 dissertations, 1 book chapter, and 70 peer-reviewed articles. Table 1 shows the definition and examples of task for each cognitive skill.
Description of cognitive skills and examples
Note. SWAN = Strengths and Weaknesses of Attention-Deficit/Hyperactivity-symptoms and Normal-behaviors rating scale.
Coding Procedure and Interrater Reliability
These 75 studies were coded according to the characteristics of participants, mathematics, and IQ screening tasks, tasks used to measure phonological processing, processing speed, short-term memory, working memory, visuospatial skills, attention, and executive functions. The descriptive features of individual studies and coding information are provided in Supplementary Table S1 (available in the online version of the journal).
Variables were discussed until a consensus was reached between the first and the second author. Then, both authors used the coding system to conduct the final coding of all studies. Across the total variable matrix, the mean interrater agreement (i.e., the percentage of agreement indexed by the data points coded in agreement divided by total coded data points) was .98, ranging from .93 to .99 for all moderators of interest and all cognitive skills investigated in this study. Any disagreements between the raters were resolved by consulting the original article or by discussion.
Missing Data
Not all studies provided sufficient information on the variables of interest for the present study. In case of insufficient information, authors were contacted to obtain the missing information. However, if missing data could not be obtained for a particular moderator variable, the study was excluded from that particular moderator analysis but was included in other moderator analyses for which the data were present.
Analytic Strategies
Effect Size
Hedges’ g was used as the measure of effect sizes. We chose Hedges’ g because MD profiling research tends to have small sample sizes and Hedges’ g provides a more accurate estimate of effect sizes than Cohen’s d on small sample sizes (Hedges’ g pools using n − 1 for each sample instead of n; Grissom & Kim, 2005). For studies reporting means, standard deviations, and sample size, the following formulae were used:
with
in which
Effect sizes of deficits on all cognitive skills were estimated for MD in comparison to the TD group. Next, meta-regression analyses were used to examine the moderation effects of types of MD (i.e., comorbidity and types of MD screening), severity of MD, age, and domains of task on the deficit profile for each cognitive skill. For the moderation analyses, each moderator was examined with other moderators simultaneously controlled in one meta-regression model. For moderators that were categorical, we created different sets of dummy codes and entered them into the meta-regression model (Cohen, Cohen, West, & Aiken, 2013). To examine the “bottleneck theory,” we controlled for the group differences between MD and TD on processing speed or short-term memory for the group comparison between MD and TD on working memory, attention, or executive functions. Specifically, we first identified studies that provided correlations among at least one low-level cognitive skill and one high-level cognitive skill. Then, controlling for the group difference on the low-level cognitive skill and the correlation between low-level and high-level cognitive skills, we calculated the group difference between MD and TD on the high-level cognitive skill, which was synthesized to indicate the group differences on high-level cognitive skills, partialling out the effects of low-level cognitive skills.
Nested Structure of Effect Sizes
We considered all eligible effect sizes in each study. That is, studies could contribute multiple effect sizes as long as the sample for each effect size was independent. For studies that reported multiple effect sizes from the same sample (e.g., two effect sizes based on two working memory measures were calculated for MD vs. TD in one study), we accounted for the statistical dependencies using the random effects robust standard error estimation technique developed by Hedges, Tipton, and Johnson (2010). This analysis allows for the clustered data (i.e., effect sizes nested within samples) by correcting the study standard errors to take into account the correlations between effect sizes from the same sample. The robust standard error technique requires that an estimate of the mean correlation (ρ) between all the pairs of effect sizes within a cluster be estimated for calculating the between-study sampling variance estimate, τ2. In all analyses, we estimated τ2 with ρ = .80; sensitivity analyses showed that the findings were robust across different reasonable estimates of ρ.
Because we included studies from a wide age span and on different mathematics and cognitive skills, we hypothesized that the research body is reporting a distribution of effect sizes with significant between-studies variance, as opposed to a group of studies attempting to estimate one true effect size. Thus, we used a random-effect model for the current study (Lipsey & Wilson, 2001). Weighted, random-effects meta-regression models using Hedges et al.’s (2010) corrections were conducted with ROBUMETA in Stata (Hedberg, 2011) to summarize effect sizes and to examine potential moderators.
Publication Bias
Publication bias (the problem of selective publication, in which the decision to publish a study is influenced by its results) was examined using the method of Egger, Smith, Schneider, and Minder (1997) and funnel plot. We did not find a significant publication bias based on Egger et al.’s (1997) publication bias statistics (i.e., the standard errors of correlations did not significantly predict correlations among studies with ROBUMETA in Stata, ps > .05), except for the comparisons on phonological processing and working memory, ps < .05. Further funnel plot analyses showed reasonable symmetry in all reported comparisons (the significant Egger’s test for the comparisons on phonological processing may be due to four outliers and two outliers for working memory). Taken together, Egger’s test and funnel plot suggested that there was little influence of publication bias in the data and thus the original data set was used in all reported analyses and we treated the publication type as a potential moderator (Rothstein, Sutton, & Borenstein, 2006).
Results
The 75 studies included in the meta-analysis represented a total of 13,001 individuals (5,251 for MD and 7,750 for TD) obtained from 126 independent samples. There were 846 effect sizes that indicated the comparison between MD and TD on phonological processing (108 effect sizes, with 24 effect sizes for manipulation and 84 effect sizes for retrieval), processing speed (47 effect sizes), short-term memory (192 effect sizes), working memory (286 effect sizes), attention (49 effect sizes), visuospatial skills (20 effect sizes), and executive function (144 effect sizes, with 114 effect sizes for inhibition, 11 effect sizes for updating, 19 effect sizes for switching).
The Cognitive Deficit Profiles of MD
The estimated average effect size indicating the deficits of MD as compared with TD was as follows: phonological processing: Hedges’ g = −.91, CI95 [−1.24, −0.57] (Manipulation: Hedges’ g = −1.31, CI95 [−1.82, −.79]; Retrieval: Hedges’ g = −0.61, CI95 [−0.80, −0.43]), processing speed: Hedges’ g = −0.90, CI95 [−1.08, −0.72], short-term memory: Hedges’ g = −0.56, CI95 [−0.67, −0.46], working memory: Hedges’ g = −0.76, CI95 [−0.88, −0.64], visuospatial skills: Hedges’ g = −0.43, CI95 [−0.63, −0.23], attention: Hedges’ g = −0.72, CI95 [−0.92, −0.52] (Subjective: Hedges’ g = −0.44, CI95 [−0.73, −0.14]; Objective: Hedges’ g = −0.86, CI95 [−1.09, −0.63]), and executive function: Hedges’ g = −0.50, CI95 [−0.60, −0.39] (Inhibition: Hedges’ g = −0.37, CI95 [−0.48, −0.26]; Updating: Hedges’ g = −0.76, CI95 [−1.04, −0.48]; Switching: Hedges’ g = −0.75, CI95 [−0.87, −0.63]).
Factors That Influence the Deficit Profiles of MD
Next, we examined whether comorbidity, types of MD screening, severity of MD, age, or domains of task influenced the deficit profiles of each cognitive skill of MD. Table 2 presents the effect sizes of each subgroup within each categorical moderator for each cognitive skill. Specifically, across these cognitive deficits, the MDRD group showed more severe deficits than the MD-only group; MD identified by comprehensive mathematics measures showed more severe deficits than MD identified with calculation or word problem–solving difficulties; cognitive deficits were domain-general but numerical deficits were more severe on processing speed and working memory. Manipulation deficits were more severe than retrieval deficits in phonological processing. Attention deficits were more severe if measured by subjective measures than objective measures.
Descriptive statistics on subgroups within different cognitive skills for the comparison between MD and TD
Note. CI = confidence interval; ES = effect size; TD = typically developing; MD-only = individuals with only mathematics difficulties; MDRD = individuals with mathematics difficulties and reading difficulties; — = no results due to no/insufficient data points. The second group in each group comparison variable is the reference group (e.g., in MD-only vs. MDRD, MDRD is the reference group in the dummy coding of comorbidity).
Next, we used meta-regression models with all moderators in one model to examine the group comparison between MD and TD on each cognitive skill (see Table 3). Regarding the phonological processing deficit profiles of MD, age, comorbidity, types of MD screening, and type of phonological processing were significant moderators. Specifically, younger individuals with MD showed more severe phonological processing deficits than older individuals with MD, β = −0.15, p = .03; the MDRD group showed more severe phonological deficits than the MD-only group, β = 0.49, p = .03; MD identified with word problem–solving difficulties showed less severe phonological processing deficits than those identified with calculation difficulties, β = 0.76, p = .001. Manipulation deficits were more severe than retrieval deficits, β = −0.45, p = .02.
Meta-regression of the moderation analysis for the comparison between MD and TD on different cognitive skills
Note. MD-only = individuals with only mathematics difficulties; MDRD = individuals with mathematics difficulties and reading difficulties; TD = typically developing); — = no results due to no/insufficient data points; Comprehensive = MD identified with comprehensive mathematics difficulties; Word problem solving = MD identified with word problem–solving difficulties; Calculation = MD identified with calculation difficulties; CI = confidence interval. All moderators were entered in one model. Several models were run for thorough subgroup comparisons among moderators with more than 2 categories. For the convenience of presentation, subgroup comparisons within domains of tasks, types of MD screening, types of visuospatial skills, and types of executive functions are all listed in the model. The second group in each group comparison variable is the reference group (e.g., in MD-only vs. MDRD, MDRD is the reference group in the dummy coding of comorbidity). The values in bold indicate statistically significant moderators.
p < .05. **p < .01. ***p < .001.
Next, we examined moderators for the manipulation and retrieval deficit profiles of MD, respectively. For the manipulation deficit profiles, with all moderators (i.e., publication type, age, MD severity, comorbidity, and types of MD screening) in one model, only age and comorbidity were significant moderators such that younger individuals with MD and individuals with MDRD have more severe manipulation deficits than older individuals with MD and individuals with MD-only, β = 0.67/0.70, ps < .01. For the retrieval deficit profiles, with all moderators (i.e., publication type, age, MD severity, comorbidity, and domains of retrieval: numerical vs. nonnumerical, and types of MD screening) in one model, only types of MD screening was the significant moderator such that MD identified with calculation difficulties showed more severe retrieval deficits than MD identified with word problem–solving difficulties or comprehensive MD, β = 0.67/0.70, p = .01/.03. We also examined moderators for MD defined by different screening measures. Results showed that there were no significant moderators for the phonological processing deficit profiles among MD identified with calculation difficulties or MD identified with word problem–solving difficulties, ps > .05. However, for MD identified with comprehensive MD, manipulation deficits were more severe than retrieval deficits, β = 0.53, p = .01. Moreover, we examined moderators among the MD-only group and the MDRD group, respectively, and we did not find significant moderators, ps > .08.
Regarding the processing speed deficit profiles of MD, comorbidity, MD severity, and domains of task were significant moderators. That is, the MDRD group showed more severe processing speed deficits than the MD-only group, β = 0.48, p = .02. The processing speed deficits were more severe among those with more severe MD, β = −0.26, p = .02. Deficits in numerical-processing speed were more severe than deficits in visuospatial processing speed, β = 0.66, p = .02. We also examined moderators among MD identified with calculation difficulties, and we did not find significant moderators, ps > .06.
With respect to the deficit profiles of short-term memory and visuospatial skills, only comorbidity affected the group differences such that the MDRD group showed more severe short-term memory and visuospatial processing deficits than that of the MD-only group, β = 0.37/0.46, p = .001/.04.
With respect to the working memory deficit profiles of MD, types of MD screening and domains of task were significant moderators. That is, MD identified with word problem–solving difficulties showed less severe working memory deficits than those identified with calculation or comprehensive MD, β = 0.37/0.40, p = .03/.02. The deficits in numerical working memory were more severe than the deficits in visuospatial working memory, β = 0.30, p = .02. Next, we examined moderators for the working memory deficit profiles among MD defined by different screening measures. Results showed that only domains of task was a significant moderator for MD identified with comprehensive MD such that the visuospatial working memory deficits were less severe than the verbal and numerical working memory deficits, β = 0.35/0.49, ps < .05, with the numerical working memory deficits being the most severe. Moreover, we examined moderators for the working memory deficit profiles among the MD-only group and the MDRD group, respectively. For the MD-only group, we did not find any significant moderators, ps > .05. For the MDRD group, only types of MD screening was significant such that those screened with word problem–solving difficulties have less severe working memory deficits than those identified with calculation or comprehensive MD, β = 0.55/0.64, ps < .05.
Regarding the attention deficit profiles of MD, age and type of attention measure were the only significant moderators. That is, younger individuals with MD showed more severe attention problems than older individuals with MD, β = −0.10, p = .001; attention problems seemed more severe when attention was measured by subjective measures than objective measures, β = 0.42, p = .01.
With respect to the executive function deficit profiles of MD, comorbidity and types of MD screening were significant moderators such that the MDRD group showed more severe executive function deficits than the MD-only group, β = 0.21, p = .01; MD identified with comprehensive MD showed more severe executive function deficits than those identified with calculation difficulties or word problem–solving difficulties, β = 0.27/0.54, ps = .01. We did not examine the effects of domains of task because we did not have data on domains for switching (all switching tasks used materials across domains). That said, among inhibition and updating, we did not find that domains of task was a significant moderator, ps > .30. Next, we examined whether type of executive functions and age affected the executive function deficit profiles among MD defined by different screening measures. Results showed that type of executive functions did not influence executive deficits for MD identified with calculation difficulties or comprehensive MD, ps > .31. However, for MD identified with word problem–solving difficulties, updating deficits were more severe than inhibition deficits, β = 0.44, p = .03 (not enough effect sizes to make comparison with switching). Age did not explain the executive function deficit profiles of MD identified by different screening measures, ps > .31.
Testing the Bottleneck Theory
Next, we examined whether the group differences between MD and TD on high-level cognitive skills (i.e., working memory, attention, and executive function) were (partially) explained by group differences on low-level cognitive skills (i.e., short-term memory and processing speed) as suggested by the bottleneck theory of cognition. There were 12 studies that provided correlation tables to calculate 221 effect sizes on group comparisons on working memory, attention, or executive functions, partialling out the effects of short-term memory or processing speed.
Among these 12 studies, the mean group differences between MD and TD on working memory, attention, and executive function was Hedges’ g = −0.75, CI95 [−0.90, −0.60]. When the effects of short-term memory or processing speed were controlled for, the mean group differences between MD and TD on working memory, attention, and executive function decreased, Hedges’ g = −0.50, CI95 [−0.61, −0.38]. t-Test showed that there was a significant difference between these two effect sizes (i.e., with vs. without partialling out short-term memory or processing speed). This finding indicates that low-level cognitive skills significantly accounted for some variance (Hedges’ g = .25) in the group differences between MD and TD on high-level cognitive skills. However, the low-level cognitive skills could not completely explain the group differences between MD and TD on high-level cognitive skills.
Summary of Results
To sum up, our findings show that (a) MD is related to deficits (from most severe to less severe) in phonological processing, processing speed, working memory, attention, short-term memory, executive functions, and visuospatial skills; (b) regarding the comorbidity, the MDRD group showed more severe deficits than MD-only group on phonological processing, processing speed, short-term memory, visuospatial skills, and executive functions; (c) types of MD screening affected the deficit profiles of phonological processing, working memory, and executive functions such that MD identified with word problem–solving difficulties seemed to have less severe deficits on these cognitive skills than those identified with calculation difficulties or comprehensive MD. For MD identified with comprehensive MD, they had more severe numerical working memory deficits than their visuospatial working memory deficits, and they also had more severe phonological processing-manipulation deficits than their retrieval deficits; (d) severity of MD was only related to the severity of processing speed deficits; (e) age only affected deficits in phonological processing and attention such that younger individuals with MD have more severe deficits in these skills than older individuals with MD; (f) domains of task mostly affected the deficit profiles of processing speed and working memory such that deficits on these skills among MD are most severe in the numerical domain; (g) low-level skills (short-term memory and processing speed) significantly accounted for some (not all) variance in the group differences between MD and TD on high-level skills (working memory, attention, and executive functions).
Discussion
The current meta-analysis investigated the cognitive deficit profiles of MD and potential moderators and mechanism for these profiles. Results showed that compared with TD, individuals with MD have deficits in different cognitive skills to varying degrees. The moderating effects of comorbidity, types of MD screening, severity of MD, domains of task, and age on the cognitive deficit profiles of MD vary by different cognitive skills. The low-level cognitive skills could not completely explain the group differences between MD and TD on high-level cognitive skills. In the following sections, we discuss the findings for each cognitive skill in detail.
The Deficit Profile of Processing Speed and Moderators
We found that processing speed deficits were one of the most salient cognitive deficits among MD, in line with most previous research (Bull & Johnston, 1997; L. S. Fuchs et al., 2006; L. S. Fuchs et al., 2008; Geary, 1993). Some previous research suggests processing speed may be especially important for the early mathematics development (e.g., L. S. Fuchs et al., 2006), and processing speed deficits may be most severe among young children with calculation difficulties (Bull & Johnston, 1997; L. S. Fuchs et al., 2006). However, our findings showed that processing speed deficits were not affected by types of MD or age, indicating that processing speed deficits may be fundamental cognitive deficits for all types of MD (e.g., L. S. Fuchs et al., 2008; Geary, 1993) and can serve as a stable marker of MD across time (e.g., McCall, 1999; Mussolin, Martin, & Schiltz, 2011, Swanson, 2012).
In addition, Geary (1993) suggests that MD only demonstrate processing speed deficits in the numerical domain. Although our finding showed that numerical-processing speed deficits were most severe, in line with Geary (1993), we did find that MD was related to processing deficits across domains. One way to understand the relation between domain-general processing speed deficits and MD is based on the bottleneck theory. That is, low-level cognitive skills such as processing speed directly support the process of working memory, attention, and executive functions in mathematics performance (e.g., Alloway et al., 2006; Noël et al., 2001; Swanson & Sachse-Lee, 2001). Slow numerical facts processing during calculations (Cirino et al., 2015; L. S. Fuchs et al., 2006; L. S. Fuchs et al., 2008; Jordan & Montani, 1997) or use of inefficient and slow strategies during word problem solving (Geary, 1993) may create a ‘‘bottleneck’’ that constricts information flow (e.g., information may decay easily in short-term memory because of slow processing speed) to high levels of processing during mathematics performance. Indeed, our findings showed that the deficits in processing speed and short-term memory accounted for some variance in the deficits of high-level cognitive skills (i.e., working memory, attention, executive functions) among MD, partially supporting the bottleneck theory and highlighting the fundamental role of processing speed deficits in MD.
The Deficit Profile of Memory and Moderators
Regarding the memory deficits of MD, we found that MD has deficits in both short-term memory and working memory, which is consistent with previous reviews (Peng & Fuchs, 2016; Swanson & Jerman, 2006). Our findings showed that the deficits in short-term memory and working memory among MD are across verbal, numerical, and visuospatial, also consistent with previous reviews (Peng & Fuchs, 2016; Swanson & Jerman, 2006). However, numerical working memory deficits were more severe than the visuospatial working memory deficits, especially among MD identified with comprehensive MD. This finding is more in line with the findings of Peng and Fuchs (2016) and Gersten et al. (2005), indicating that numerical working memory, but not visuospatial working memory as suggested by Swanson and Jerman (2006), may be the most significant working memory deficits among MD.
We found that MD identified with word problem–solving difficulties had less severe working memory deficits than MD identified with calculation and comprehensive MD. This finding is contrary to our hypothesis that working memory deficits are closely related to word problem–solving difficulties, at least in comparison to calculation difficulties. This hypothesis was based on the assumption that solving word problems involves more complex steps, such as reading and comprehending the problem, identifying relevant/irrelevant information to set up an equation, and then solving calculations, thereby requiring more working memory. There may be two explanations for this unexpected finding. First, a recent meta-analysis on the relation between mathematics and working memory suggests that calculations and word problems actually show comparable relations with working memory (Peng et al., 2016), suggesting that word problem–solving difficulties may not necessarily be associated with more severe working memory deficits than do calculation difficulties. Second, among the reviewed studies, in which MD were identified with word problem–solving difficulties, these MD individuals were often reported to have relatively normal calculation competence whereas most studies, in which MD identified with calculation difficulties, did not report whether these MD individuals had normal word–problem solving skills. Thus, it is likely that some MD identified with calculation difficulties may also have word problem–solving difficulties. Future studies are needed to compare word problem–solving difficulties only to calculation difficulties only to further test whether these two groups differ on working memory.
The Deficit Profile of Phonological Processing and Moderators
We found that MD had deficits in phonological processing, with the MDRD group showing more severe deficits than the MD-only group. However, the phonological processing deficits of MD were not stable across time. That is, we found decreasing effects of age on the severity of phonological processing deficits among MD. Moreover, comorbidity and types of MD screening affected the phonological processing deficits profile of MD, and phonological manipulation is more severe than retrieval. These findings, taken together, suggest that phonological processing deficits are not only an important feature of RD but also an early marker for MD, regardless of comorbidity (although phonological processing deficits are worse in RDMD compared with MD-only). It appears that phonological processing plays a key role in early stages of mathematics learning, especially in calculation development (language and reasoning may be more important prerequisite skills for word problem solving as evidenced by L. S. Fuchs et al., 2006), but it may not be a distinct marker of MD among older individuals. This is in line with the phonological representation hypothesis that solving a simple calculation problem initially requires manipulating and maintaining phonological representation in short-term memory (Geary, 1993; Krajewski & Schneider, 2009; Simmons & Singleton, 2008; Vukovic & Lesaux, 2013). As children with MD build more fluency and automaticity, they rely less on phonological processing (especially phonological manipulation) to solve a mathematics problem. At the same time, phonological processing deficits are often alleviated with school instruction (Bus & Van IJzendoorn, 1999), which may reduce the relation between phonological processing deficits and MD among older population.
The Deficit Profile of Visuospatial Skills and Moderators
Regarding the relation between visuospatial deficits and MD, we were especially interested in the moderating effects of types of visuospatial skills and types of MD. However, we did not find those as significant moderators, nor did we obtain enough effect sizes to compare different types of visuospatial skills among different types of MD. Thus, we cautiously conclude that types of visuospatial skills do not affect the visuospatial deficit profiles of MD. That said, we do point out that the visuospatial deficits were the least severe cognitive deficits of MD, and the deficits in the visuospatial domain of other cognitive skills, such as attention, short-term memory, executive functions, and working memory all appeared to be nonsignificant or play a less important role in MD, compared with the deficits in the verbal or numerical domain.
There are two plausible explanations for this moderate relation between visuospatial deficits and MD. One is that the majority of MD individuals included in the present meta-analysis were elementary or middle school children (94% of included studies) and were identified as MD based on calculation or word problem–solving difficulties (77% of included studies). Prior studies suggest that visuospatial skills are more strongly related to geometry and other higher level mathematics skills (e.g., algebra) among older individuals. For example, Rohde and Thompson (2007) found that visuospatial skills significantly explained performance on college students’ mathematics scores on the Scholastic Aptitude Test. Similarly, Reuhkala (2001) found moderate correlations (r = .44–.57) among visuospatial processes and high school mathematics competence, which included items on algebra and geometry. In addition, Tolar, Lederberg, and Fletcher (2009) found visuospatial skills affected algebra achievement among college students. Another explanation is that the MD sample in this meta-analysis were individuals with MD and normal IQ. Because IQ, in our review, was often indexed via measures that assess visuospatial abilities (e.g., WASI matrix reasoning and Raven’s Progressive Matrix), IQ and visuospatial skills share significance variance and may be competing against each other in our analyses. Further studies are needed to clarify whether visuospatial deficits are related, but not a distinct marker for MD.
The Deficit Profile of Attention and Moderators
MD was also related to attention difficulties. This finding is consistent with previous research (Cirino et al., 2007; McCall, 1999; Swanson, 2012; Willcutt et al., 2013) and the view that individuals with learning disabilities (e.g., RD, MD, and MDRD), even without the diagnosis of ADHD, demonstrate attention difficulties to certain degrees (e.g., Mayes, Calhoun, & Crowell, 2000). As expected, the attention difficulties of MD were more severe if attention was measured by subjective measures than by objective measures. This suggests that different methods used to measure attention may affect the sensitivity of attention measures when screening MD. One possible explanation is that the subjective measures of attention, which mostly rely on teachers’ and parents’ ratings, may have been clouded by students’ poor mathematics performance. That is, the subjective ratings of attention may serve as a proxy for students’ mathematics competence (L. S. Fuchs et al., 2006)
Research on children with ADHD suggests the lack of automatized arithmetic facts retrieval consumes limited attention resources for children with ADHD, which hinders their mathematics performance (e.g., Zentall, 1990). Following this logic, we had two competing hypotheses with regard to the age effects on the relation between attention difficulties and MD. One was that the relation between attention problems and MD is stronger in the early learning stage when children are actively engaged in learning and automatizing math facts. The other was that the relation between attention difficulties and MD is stronger among older individuals with MD because older individuals with MD also struggle with automatized arithmetic facts retrieval to certain degrees (Geary, 1993) and the amount of attention needed in mathematics tasks may increase with more complex and advanced mathematics skills. Our finding is consistent with the former. The relation between attention problems and MD decrease with age possibly because of the increasing mathematics learning experiences and thus more fluent arithmetic facts retrieval abilities among older individuals with MD. That said, most of the MD screening measures we reviewed tap calculations or word problem solving, which may not reflect complex and advanced mathematics skills. Future studies should further investigate whether MD identified with more complex mathematics skills (e.g., algebra) reflects a different age effect on the relation between MD and attention problems.
The Deficit Profile of Executive Functions and Moderators
We found that individuals with MD have comprehensive executive function deficits, which is consistent with findings from previous research (Peng et al., 2012; van der Sluis et al., 2004). We did not find domains of task as a significant moderator, suggesting that executive function deficits among MD are domain general. These findings are interesting when compared with working memory deficit profiles of MD. Specifically, working memory deficits were more severe than the executive function deficits among MD. Also, the working memory deficits of MD showed domain specificity in the numerical domain whereas executive functions deficits of MD did not show such domain specificity. These inconsistent findings on the deficit profiles between working memory and executive functions of MD do not support the view that working memory and executive functions are the same construct (Engle, 2002). These findings, however, are in line with previous research showing that compared with executive functions, working memory seems to be a more influential component to mathematics (Friso-van den Bos, van der Ven, Kroesbergen, & van Luit, 2013; Jacob & Parkinson, 2015; Peng et al., 2016), especially among MD (Friso-van den Bos et al., 2013; Peng et al., 2016), which may be due to that working memory includes not only executive functions but also the memory component as well as the coordination between memory and executive functions (Baddeley, 1986). That said, we did not acquire sufficient data on executive functions (especially on updating and switching), nor did we have enough studies that included both working memory and other executive functions. Future studies are needed to investigate the difference on the deficit profiles between working memory and different types of executive functions among MD, ideally controlling for each other in the analysis (partialling out the common variance).
Regarding the specific components of executive functions, we found deficits in inhibition, updating, and switching were related to different types of MD, which is consistent with previous research (L. S. Fuchs et al., 2005; Passolunghi & Siegel, 2001; Swanson & Beebe-Frankenberger, 2004). This finding is consistent with the executive function framework that emphasizes the high intercorrelations among different executive functions (Miyake et al., 2000), suggesting that different executive functions may work simultaneously and collaboratively in mathematics tasks and thus deficits in executive functions are related to all types of MD. That said, we found that updating deficits were more severe for MD identified with word problem–solving difficulties. This provides further evidence that word problem solving draws heavily on updating as students continuously comprehend, integrate, and update the information read in the word problem, and thus word problem–solving difficulties are more related to updating deficits (Palladino, Cornoldi, De Beni, & Pazzaglia, 2001; Passolunghi & Pazzaglia, 2004). Moreover, based on the developmental feature of executive functions, we hypothesized that younger children with MD would demonstrate comprehensive executive function deficits, while older individuals with MD may show selective executive function deficits depending on types of MD (L. S. Fuchs et al., 2005; L. S. Fuchs et al., 2006; Passolunghi & Siegel, 2001; Swanson & Beebe-Frankenberger, 2004). However, we found that age did not explain the executive function deficit profiles of MD identified by different measures. This finding may suggest the executive function deficits are relatively stable among different types of MD across age.
Limitations
Our findings are based on the combined results of 75 studies conducted with more than 13,000 participants. Despite the scale of our literature search and sample size, we note several limitations when interpreting the findings. First, due to the small number of studies that reported comparisons between the MD and TD groups on visuospatial skills and attention, we could not run moderation analyses on types of MD screening for visuospatial skills and domains of task for attention. We may also be underpowered, as mentioned earlier, to run analyses on some moderators, such as types of visuospatial skills for the visuospatial deficit profiles. Thus, the moderation analyses for the profiles of MD on visuospatial skills and attention should be regarded as exploratory in nature and warrant further investigation.
Another limitation is that we were unable to investigate other interesting moderators due to insufficient data. For example, for attention, we could not examine the moderation effect of types of attention. Specifically, among the attention measures, we coded for alerting (becoming and staying attentive toward a specific stimulus) and orienting (directing attention to a specific stimulus) based on Posner and Petersen’s (1990) attention model, but found only less than five effect sizes that represented each of those attention components. For the same reason, for types of MD screening, we only focused on the most common MD screening measures (i.e., calculation vs. word problem solving vs. comprehensive mathematics), and did not include less common ones, such as nonsymbolic measures (e.g., Approximate Number System). As some research suggests that nonsymbolic measures may be especially related to numerical-processing and visuospatial skills (e.g., Dehaene, 2011; Halberda & Feigenson, 2008), MD children identified with these measures may show more severe deficits in visuospatial skills and cognitive skills measured in numerical materials. Moreover, we had a fair amount of IQ data missing (26% of the total effect sizes) in the original studies, so we were unable to examine whether IQ differences between MD and TD influence the cognitive deficits of MD. Future studies may further examine these potential moderators for the cognitive deficit profiles of MD.
Following the logic of the bottleneck theory, if low-level cognitive skills are foundational to high-level cognitive skills, then deficits in low-level cognitive skills may cause the Matthew effect on the deficits in high-level cognitive skills from a developmental perspective. That is, compared with the deficits of low-level cognitive skills, deficits in high-level cognitive skills among MD may become more severe as a function of age. However, our results did not support this Matthew effect, which may be because the deficits in low-level cognitive skills of MD only partially explained the deficits in high-level cognitive skills. That said, we used concurrent data from different samples to examine the age effect, which may not accurately reflect the accumulative age effects. A longitudinal design on the same sample may better investigate the Matthew effect related to the bottleneck theory.
Last, we did not use the Cohen’s benchmarks (Cohen, 1998) to indicate magnitude of effect size. Hill, Bloom, Black, and Lipsey (2008) argued that for education research, effect sizes should be interpreted with respect to empirical benchmarks that are relevant to specific population and measures. Given that there is a lack of empirical studies to establish meaningful benchmarks in MD profiling research, future studies should examine whether cognitive deficits of MD are causally related to mathematics performance and to what degrees the change of cognitive skills are associated with meaningful changes in mathematics among MD (Hill et al., 2008). This line of research can provide more refined effect size benchmarks for MD profiling research.
Implications
For Theory
With the limitations in mind, the present study, to our knowledge, was the first meta-analysis that systematically and comprehensively examined the cognitive deficit profiles of MD and the factors that influence these deficit profiles. Findings have theoretical and practice implications for MD. Theoretically, the findings add to our understanding of MD. First, MD is associated with comprehensive cognitive deficits that are not specifically related to numerical processing, with exceptions that deficits in processing speed and working memory are more related to the numerical domain. Thus, the cognitive deficits of MD are generally not affected by domains of materials. Any individual who is identified with MD is likely to experience both domain-specific (numerical processing) and domain-general cognitive deficits.
Second, MD is a categorical group with heterogeneity, which lies in two aspects. One is that there is heterogeneity among MD subgroups. The MDRD group is different from the MD-only group such that the MDRD group demonstrates more severe cognitive deficits. MD identified with different screening measures also differ from each other. Calculation difficulties are related to more severe deficits in phonological processing and working memory, and comprehensive MD are related to more severe deficits in executive functions and working memory. The other aspect of heterogeneity of MD is reflected by age. That is, deficits in attention and phonological processing are more severe among younger individuals with MD. This age effect may be attributed to the characteristics of early mathematics learning (or instruction) that emphasizes automatized arithmetic facts retrieval.
Third, previous research suggests that different cutoff points on MD screening may result in MD with different profiles, indicating that MD may be a continuous construct (e.g., Branum-Martin et al., 2013; Murphy et al., 2007). However, the present review found that after controlling for comorbidity, types of MD screening, and other moderators, severity of MD only affected deficits in processing speed, but not in other cognitive skills. This finding is consistent with previous studies and reviews (Landerl et al., 2004; Peng & Fuchs, 2016), suggesting that the severity of MD, in general, does not affect the deficit profiles of MD, and MD may be a discrete construct. That said, the current study only focused on concurrent data. Future studies, especially those that focused on at-risk MD and persistent MD (Vukovic & Siegel, 2010), should consider longitudinal data to further investigate this issue.
Fourth, the deficits in the high-level cognitive deficits are relatively independent of the deficits in low-level cognitive skills of MD. Compared with the deficits in low-level skills (except for processing speed), MD is more strongly and stably related to the deficits in high-level cognitive skills. Thus, it is likely that for individuals with MD, both their basic information-processing system (e.g., numerical processing) and the complex information-processing system (e.g., forming strategies and understanding conceptual knowledge) in mathematics are impaired.
For Practice
Our findings also have implications for practice. Regarding the diagnosis of MD, our findings suggest the efficiency of MD diagnosis, especially in early grades, might be improved by conducting screenings on skills with salient cognitive deficits among MD. For example, in early grades (e.g., kindergarten and first grade) where traditional MD screening tests (e.g., calculations) may be insensitive (i.e., most children reach the floor on those tests), teachers/practitioners can use measures, such as processing speed, phonological processing, and attention ratings to help identify children at-risk for MD.
Our findings may also have implications for interventions that consider cognitive skills. For example, interventions may be designed to compensate for the cognitive deficits of MD (L. S. Fuchs et al., 2014). Given that MD is related to distinct deficits in processing speed and working memory, especially in the numerical domain, instruction for MD may emphasize increasing numerical-processing speed such as building fluency (e.g., improving arithmetic facts retrieval during calculations; using/selecting more efficient strategies during word problem solving) to reduce working memory load in solving complex mathematics problems. This suggestion is also in line with the load reduction instruction theory (Martin, 2016). Moreover, interventions may directly address these cognitive deficits among MD (e.g., Kroesbergen et al., 2014), and these interventions can be individualized for a specific type of MD. For example, whereas updating training may be important to remediate word problem–solving difficulties, phonological processing training may benefit MD identified with calculation difficulties most, especially those identified at a young age. Interventions may address verbal, visuospatial, and numerical domains, but working memory training in the numerical domain seems most promising. With all being said, we remind readers that our findings are correlational in nature, and that intervention studies are needed to confirm the causal relation between the change of cognitive profiles and the change of mathematics performance among individuals with MD.
Footnotes
Authors
PENG PENG is currently an assistant professor in the Department of Special Education and Communication Disorders at University of Nebraska, Lincoln, 301 Barkley Memorial Center, Lincoln, NE 68583-0738, USA; email:
CUICUI WANG is currently a doctoral student in the Faculty of Psychology at Beijing Normal University, Beijing, China; email:
JESSICA NAMKUNG is currently an assistant professor in the Department of Special Education and Communication Disorders at University of Nebraska, Lincoln, 301 Barkley Memorial Center, Lincoln, NE 68583-0738, USA; email:
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
