Abstract
This meta-analysis extended the current literature regarding the effects of computer technology (CT) on mathematics achievement, with a particular focus on low-performing students. A total of 45 independent effect sizes extracted from 31 empirical studies based on a total of 2,044 low-performing students in K-12 classrooms were included in this meta-analysis. Consistent with previous reviews, this study suggested a statistically significant and positive effect of CT (
The use of computer technology in K-12 classrooms has been growing at a stunning pace over the past several decades. In the late 20th century, 85% of U.S. schools had computers, and 64% of U.S. schools had access to the internet (Coley et al., 1997). Another national report published in 2010 showed that almost all schools owned computers, and all teachers had access to one or more computers in classrooms (Gray et al., 2010). As evident in the increasing use of computer technology in schools, technology has become a critical instructional technology in teaching and learning.
Several policies documents—including the Common Core State Standards for Mathematics (National Governors Association Center for Best Practices & Council of Chief State School Offices, 2010), Principles and Standards for School Mathematics (NCTM, 2000), and Adding It Up (NRC, 2001)—have emphasized that technology plays a vital and centralized role in mathematics teaching and learning. Incorporating technology in mathematics classrooms makes substantial contributions to student mathematics learning in various ways, including aiding for mathematics conceptualizing and modeling (NCTM, 2000; Tucker, 2018); allowing students to make conjectures, collect and record data (Cullen et al., 2020; Martinez, 2017); and providing chance to engage in complex problem-solving or learning activities (Hobbs, 2012). Given a wide array of technology (i.e., computer program, graphing calculators, visual reality) applied in mathematics classrooms, in this work, we focused on computer technology (CT), which refers to the use of computer software/applications or web-based programs for providing instruction, managing or monitoring student learning, or any other learning purposes. Online tutoring or learning environments that could provide differentiated and individualized interventions are included, while some types of distance learning (i.e., watching recorded lectures) are excluded.
The increasingly popular use of CT in mathematics education has stirred up a huge amount of research that focuses on the effectiveness of different types of CT across multiple aspects of mathematics learning. For example, Chang et al. (2006) found that a computer-based problem-solving system produced a large significant effect (d = 0.78) on fifth-grade students’ problem-solving tests. Another widely used problem-solving software, Go Solve Word Problems, was also found to have a large effect of improving problem-solving skills for fifth graders (Fede et al., 2013) and for second graders (Leh & Jitendra, 2013). The possible explanation for the big impact is that this software was designed to include instructional elements that are particularly beneficial for student problem-solving, such as using schematic diagrams, strategy instruction, and instructional feedback. Hwang et al. (2006) found that a web-based multimedia whiteboard system significantly increased student performance of mathematics problem solving, especially when students’ critical thinking behind solutions was elicited and collaborative peer learning was encouraged.
Besides, improving students’ arithmetical skills and fluency is a big concern for the CT usage in mathematics classroom, particularly in lower grades. Researchers have conducted a series of studies focusing on how game-based technology or software is supported for early number skills development. Wilson et al. (2009) found that an adaptive computer game (The Number Race) had a positive effect for kindergarteners’ number sense. Other researchers found this game intervention was also effective for improving kindergarten students’ number skills, such as number comparison (Räsänen et al., 2009) and basic arithmetic (Salminen et al., 2015a). Salminen et al. (2015b) detected that another similar adaptive computer game (GraphoGame Math) had positive impact for kindergarten students’ verbal counting and dot counting fluency.
In sum, CT interventions were found to be effective in fostering student mathematics learning. As these CT interventions have become more sophisticated and more accessible, the expectations that CT can improve schooling mathematics outcomes have increased. Many researchers now see CT as a potential solution to the issues of low-performing mathematics students.
Improving Mathematics Achievement for Low-Performing Students
The No Child Left Behind Act placed great emphasis on high mathematics achievement for all students, especially for subgroup populations who have traditionally performed below expectations (Ohnemus, 2002). Addressing the need for improving mathematics achievement for low-performing students remains a big concern. Years of effort failed to enhance students’ proficiency levels in mathematics achievement. The report from National Assessment of Educational Progress (National Center for Education Statistics, 2005) showed that 63% of fourth graders and 70% of eighth graders performed below proficiency in mathematics. The updated report of NAEP in 2011 showed that there were still 60% of fourth-grade students and 65% of eighth-grade students who failed to reach the proficient level in mathematics (National Center for Education Statistics, 2011).
CT is highly anticipated to increase the success of academically disadvantaged students. Before we dive into how CT can address the issues of low-performing (LP) students, it is necessary to first explain our position regarding this term since there is inconsistent usage around the meaning and scope of low-performing (LP) and learning disability (LD) or mathematical learning disability (MLD). For instance, Robotti and Baccaglini-Frank (2017) defined students with MLD as students with persistent low achievement in mathematics and students who have been diagnosed with LD. While students are labeled as LD when they are clinically diagnosed with situations, such as mental or emotional disorder, or having severe difficulties in obtaining or retaining information (Bateman, 1992); hence, they are possible to be at risk for mathematics learning disability and tend to perform low in mathematics. In the current study, LP students means those who score below the basic proficiency level of mathematics tests but which is not caused by learning disability or mental disability. Given the inconsistent criteria of LP in literature, for this study, as long as students are clearly defined as scoring below basic proficiency level based on a mathematics test in the included study, they are incorporated as participants.
One early review found that LP students (
However, literature shows inconsistent results regarding CT intervention effects on LP students’ mathematics achievement. While CT interventions have significant and positive effects on LP students’ mathematics gains (Burns et al., 2012; Chappell et al., 2015; Kanive et al., 2014; Karner, 2016; Kulik et al., 1980; Ok & Bryant, 2016; Outhwaite et al., 2017; Perkins & Gilman, 2002; Räsänen et al., 2009; Rutherford et al., 2014; Zunker, 2008), some fail to show significant effects (Arnold, 2013; Cook, 2008; Leh & Jitendra, 2013).
Further, Brown (2018) found that instructions with laptops were less effective for LP students. Tienken and Maher (2008) reported a large and negative effect (
CT Interventions on Mathematics Achievement in Previous Reviews
In the past four decades, over twenty meta-analyses have been performed to examine the effects of CT interventions on student learning (Bayraktar, 2001; Blok et al., 2002; Demir & Başol, 2014; Fletcher-Flinn & Gravatt, 1995; Goldberg et al., 2003; Hsu, 2003; Kulik, 1994; Kulik et al., 1985; Niemiec et al., 1987; Schenker, 2007). These reviews examined a wide range of subjects (i.e., reading, science, writing, mathematics, social study, and statistics) and different grade levels from kindergarten to college stage. The majority of the reviews include students from K-12 and only three reviews focused on college students’ statistics achievement (Larwin & Larwin, 2011; Schenker, 2007; Sosa et al., 2011). Very few reviews (only eight) included LP students or students with special needs. Thirteen out of 23 reviews focused only on mathematics achievement. All the reviews reported that there were positive effects of technology on students’ mathematics achievement, with effect sizes ranged widely, from 0.09 to 1.80. Table 1 presents a summary of findings for mathematics outcomes for these major 23 reviews.
Summary of Results From Previous Meta-Analyses on CT Effect.
Note: Effect sizes were extracted from mathematics outcomes for studies focusing on multiple subjects. NS refers to the number of studies for mathematics. LP Coverage refers to studies cover low-performing students as separate groups. CAI: Computer-assisted instruction, CBI: Computer-based instruction, CBM: Computer-based modules, DGBL: Digital game-based learning, ITS: Intelligent tutoring systems.
Various CT Interventions
Meta-analyses before 2010 mainly focused on computer-assisted instruction (CAI) and computer-based instruction (CBI), which were the most prevalent technology-supported teaching and learning in the early years. CAI is a method of instruction in which computer contained instruction is designed to teach, guide, or aid students until they reach the desired level (AECT Task Force on Definition & Terminology, 1977). CBI is a broad concept that refers to either stand-alone computer learning activities or computer-mediated activities that teachers are also involved in (Cotton, 1991). Several meta-analyses found CBI had a medium effect on mathematics achievement for elementary and secondary students (Bangert-Drowns et al., 1985; Hartley, 1977; Jitendra et al., 2018; Kulik et al., 1985; Kulik & Kulik, 1991; Niemiec et al., 1987) and for college students (Kulik et al., 1980). While the effects of CAI on mathematics achievement varied from small to large (Kroesbergen & Van Luit, 2003; Liao, 2007; Slavin et al., 2009; Slavin & Lake, 2008; Xin & Jitendra, 1999). After 2010, the focus of review studies extended to technology that is beyond the traditional CAI or CBI. Another trend is to focus on reviewing specific types of technology, such as Intelligent tutoring systems (Steenbergen-Hu & Cooper, 2013) and digital game-based learning (Byun & Joung, 2018).
Factors Matter to CT Application
The literature on the usage of CT in mathematics classrooms suggests that the impact of CT on mathematics learning may rely on many other contextual and methodological factors. Of them, CT type is one of most crucial factors that has been widely examined. Lou et al. (2001) meta-analysis found that CBI-tutor programs (
Another two important factors related to treatment have been found to moderate the effects of CT interventions—that is intervention duration and intervention intensity. Consistent results were found that the shorter intervention had a larger effect on students’ mathematics achievement than the longer one despite the disunifying criteria that define whether an intervention is short or long. For example, Li and Ma (2010) found that shorter duration (less than one-year intervention) had a larger effect on students’ mathematics achievement than interventions which lasted longer (more than one year). Chauhan (2017) detected that technology produced the highest effect when intervention continued less than one week, compared to intervention lasting between one week and four weeks or intervention more than one month. Other previous reviews also found that longer interventions were less effective than shorter interventions (Gersten et al., 2009; Kroesbergen & Van Luit, 2003; Kulik et al., 1985). Regarding the intervention intensity, much fewer meta-analyses have focused on it. Only one meta-analysis found that higher intensity with more than 30 min CT intervention in a week had a bigger effect on students’ mathematics achievement than lower intensity with less than 30 min treatment in a week (Cheung & Slavin, 2013). Based on these findings, it seems that high intensity within shorter intervention duration was possibly more effective, but no studies had been conducted to check the combination effect of duration and intensity.
Intervention contents have also been found to moderate technology effect. Kroesbergen and Van Luit (2003) found that interventions on basic facts had the largest effect than on problem-solving or arithmetic skills for increasing the gain scores of students with special needs. This finding may be explained by the fact that it may be easier to teach basic skills to students with special needs than to teach problem-solving skills. However, when compared to other research syntheses, the effect of computer interventions on problem solving has also been detected (Xin & Jitendra, 1999).
Other factors that may contribute to mathematics learning with technology are the participant characteristics including, students’ grade level and student type. A consistent result by grade levels has been found that CT interventions are more effective for elementary students than for secondary students (Bangert-Drowns et al., 1985; Cheung & Slavin, 2013; Kulik et al., 1985; Li & Ma, 2010). As for student type, researchers found the effectiveness of CT interventions across different student populations. For instance, Traynor (2003) showed that CT improved mathematics achievement among various student populations, including students in special education, students in regular education, and students with less proficiency in English.
However, when comparing the effects of CT interventions between students with special needs and general education students, results have not been consistent. Some reviews found that CT was noticeably more effective for students with special needs than for general education students (Li & Ma, 2010; Niemiec et al., 1987; Sivin-Kachala & Bialo, 1994), while the review from Steenbergen-Hu and Cooper (2013) found that the effects of CT for helping LP students were smaller than for helping general education students.
Considering the inconsistent results regarding CT effects for LP students that widely existed in both review studies and a large number of empirical studies, as well as the vacuum of meta-analytical research only for LP students, it is emergent to conduct a meta-analysis merely for this particular population. The two existing meta-analyses conducted by Kroesbergen and Van Luit(2003) and Xin and Jitendra (1999) included LP students as participants, but they also used other non-computer intervention studies. In order to extend these reviews and provide a complete picture about CT effects for LP students, our meta-analysis focuses on computer intervention studies published after 2000.
Goals and Research Questions
Our meta-analysis is to explore the effects of CT interventions on K-12 LP students’ mathematics achievement. We are particularly interested in understanding how student characteristics and methodological features moderate the effect of CT on low-performers’ mathematics achievement. In order to accomplish our research goals, the following two specific research questions are addressed in our meta-analysis.
What is the overall effectiveness of CT interventions on the mathematics achievement of K-12 LP students, compared to students without CT interventions? What moderators (i.e., grade levels, types of CT, intervention components, duration of intervention, intervention intensity, and types of testing instrument) explain variation in the effect of CT interventions on the mathematics achievement of K-12 LP students?
Method
Inclusion Criteria
The following inclusion and exclusion criteria were developed based on our research purposes.
Study involves CT in the instruction or learning process. Study measures mathematics achievement as an outcome. Study allows extraction of LP student data as a separate set. Study includes treatment group where students received the CT intervention and control group where students did not. Study reports sufficient data for calculating effect sizes. Study involves students from K-12.
One challenging part of applying the inclusion criteria was to determine whether “at-risk” students had been identified because they were, in fact, low-performing or for some other reasons. Studies that explicitly defined at-risk students as being low performing were included; otherwise, those studies were excluded. Moreover, studies that reported students as being either “at-risk” or “low performing” as a consequence of learning disabilities were excluded.
Search Process
The literature search and screening process followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) protocol (Moher et al., 2009), as shown in Figure 1.

Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) Diagram of Selection Process.
Studies for our meta-analysis were identified by the following three major steps: (1) electronic database search, (2) ancestry searches, and (3) manual search on peer-reviewed journals. Our first step was a comprehensive literature search of electronic databases, including ERIC, Primary Search, Educational Administration Abstracts, Public Administration Abstracts, PsycBOOKS, PsycARTICLES, PsycINFO, and PsycEXTRA via EBSCOhost. In this electronic database search, several combinations of the following keywords were used: mathematics outcome keywords including mathematics achievement, mathematics performance, problem solving in mathematics, mathematics education, and mathematics research; intervention keywords including computer technology, computer-assisted instruction, computer-based instruction, technological Interventions, and computer interventions; and target population keywords including low-achieving students, LP students, low performance, low achievement, underachievement, students at risk, struggling learners, slow learner, below-average, poor mathematics performance, learning problems, and learning difficulties. The initial search resulted in 354 studies.
Then in the second step, references of studies in our initial step were searched for locating any additional studies. Eighty-six potential studies were found in this second step. In the third step, we manually searched 11 peer-reviewed journal outlets, including Educational Technology Research and Development, British Journal of Educational Technology, Journal of Research on Technology in Education, Journal of Computers in Mathematics and Science Teaching, Journal of Research in Mathematics Education, Educational Studies in Mathematics, Journal of Mathematics Behavior, Mathematics Thinking and Learning, Australasian Journal of Educational Technology, Computer and Education, and Journal of Educational Computing Research. Twelve of the related studies were found in this third step. As a result, with three different searches, we identified 452 studies as potential literature to be included in our meta-analysis.
Of more than 450 studies, the first step to remove 60 replicated records. The second screening process is based on titles and abstracts; as a result, 103 studies were excluded and 289 full-text studies were produced. The last step was a detailed review by applying our inclusion/exclusion criteria. A further 258 studies were excluded, of which 59 studies did not use CT as interventions, 31 studies did not measure mathematics as outcomes, 84 studies cannot allow to extract LP students as a separate set, six studies were not conducted for K-12 students, 54 studies had no control conditions, and 24 studies did not report sufficient data so that effect sizes can be computed. As a result, a total of 24 independent journal articles and seven dissertations were included in our meta-analytic review.
Study Coding
In order to examine the methodological and substantive features that might moderate the effect between interventions and student mathematics achievement, a set of study features was coded based on theoretically defined constructs and categories that emerged from the coding process. The six aspects were coded for each primary study, including grade level, types of CT, intervention components, duration of intervention, intervention intensity, and types of testing instrument.
Grade Levels (Kindergarten; Primary, 1–6; Junior High, 7–10; Senior High, 11–12, and Other)
We adopted the common way to categorize grade 1 to grade 12 into three groups, including primary school (grade 1 to grade 6), junior high school (grade 7 to grade 10), and senior high school (grade 11 to grade 12). We separated kindergarten as an individual group, considering that there is a certain number of CT particularly designed for improving kindergarten students’ number sense and skills. “Other” is the group that represented grade in other forms (i.e., age 9–18).
Type of CT (Tutoring; Game-Based; Computerized Practice; Problem-Solving System)
Different types of CT—ranging from early CAI, or computer-mediated communication, or computer or tablet game to recent intelligent tutoring system—have been developed and applied for enhancing mathematics teaching and learning. Of many classifications of CT types, the common one is suggested by Means (1994), which includes four main categories: (a) tutorial, (b) communication media, (c) exploratory environment, and (d) tools. These CT types have been widely examined by other meta-analytical research (Li & Ma, 2010; Lou et al., 2001). However, it is possible that some CT types have not yet been implemented with LP students. Hence, the existing categories are not the best way to capture the updated CT for improving students’ low performance. Therefore, based on what has been empirically used in the current literature, we categorized CT into four groups: (1) tutoring, (2) game-based technology, (3) computerized practice, and (4) computer-based problem-solving system.
The first category is tutoring, referring to web-based programs that provide instructional guidance or practices, especially for students who have difficulties in learning (Karner, 2016; Tsuei, 2012). Gamed-based technology covers mathematics concepts or presenting learning activities by using educational games in a computer-based environment. Compared with the traditional drill and practice, we defined computerized practice differently, which refers to software or applications that provide adaptive, iterative, and continuous practice on a stream of mathematics skills. Hence, it markedly differed from the traditional one in multiple ways (Cibulka & Cooper, 2017), including: (a) practice is adaptive to students’ performance level, (b) intensive practice and students’ performance report are provided, and (c) students receive immediate and corrective feedback based on their performance. Additionally, we added a new category—computer-based problem-solving system, considering the increasing trend of embedding problem-solving complexity into CT. This system refers to a computer-based learning environment or program that provides problem-solving instructions or guidance to assist students’ solving process (i.e., word problem solving procedures).
Intervention Components (Arithmetic Skill; Problem Solving; Mathematical Reasoning; Multiple Skills)
Kroesbergen and Van Luit (2003) claimed that it might be easier to teach basic arithmetic skills to students with special needs than to teach problem-solving skills. For testing whether this claim still remains the same when newly complex CT interventions were applied for LP students, we basically follow the definitions about intervention components from their meta-analysis. Arithmetic skills are the skills to do basic mathematical operations (i.e., addition, subtraction, and multiplication). For mathematics problem solving, it not only involves basic mathematical skills, but also involves how to apply knowledge into new situations. Another intervention component, multiple skills, includes more than one mathematics skill, such as basic computations and problem solving, etc. Mathematical reasoning is a kind of high-order thinking skill, that refers to integrate skills and knowledge to reason mathematically.
Duration of Intervention (1. 0–11 Weeks; 2. 12–23 Weeks; 3. 24–35 Weeks; 4 ≥ 36 Weeks)
The duration of intervention program is a very important contextual factor for the CT effects. Cheung and Slavin (2013) claimed that studies with less than 12-week intervention are brief experiment, which can often easily create experimental conditions that could not be maintained for a whole school year. Hence, we adopted this 12-week period as a cut-off point to divide the duration category.
Intervention Intensity (Low, Less Than 60 Min per Week; Medium, Between 60 and 150 Min per Week; High, More Than 150 Min per Week)
Intervention intensity is the frequency of CT usage. We used the common unit by how many minutes per week. Cheung and Slavin (2013) categorized less than 30 minutes per week as the low intensity program, between 30 and 75 min per week as the medium, and more than 75 min per week as the high intensity. The criteria are not suitable for our meta-analysis as the range of intensity is much bigger from 0 min per week to 275 min per week. We decided to use 150 min (near half of the highest dosage) as the cut-off point to differentiate medium and high intensity, and 1 hour as the point to cut off medium and low intensity.
Types of Testing Instrument (Standardized Tests, Teacher-Developed; Researcher-Developed)
When used to measure student mathematics outcomes, studies with non-standardized tests showed greater effects of CT than studies with standardized tests (Li & Ma, 2010). Standardized tests could present in different forms, such as state-standardized tests, district-standardized tests, or national standardized tests. In our work, non-standardized tests were specifically grouped into teacher-developed tests and researcher-developed tests.
Based on these six study features, thirty-one primary studies were coded. A detailed summary of primary studies with the author of the study, publication year, sample size (number of low performing), the cut-off points for identifying LP students, grade levels, types of CT, duration of intervention, intervention intensity, test types, intervention components, and effect sizes is provided in Table 2. To ensure consistency in interpretation of coding criteria, the weighted Kappa (k = 0.90) was used to measure interrater reliability. It showed there was a substantial agreement between two coders based on Cohen’s (1960) criteria.
Coded Information for the Included Studies.
Note: For test type, SST is the state standard test, NST is the national standard test, DST is the district standard test, TDT is the teacher-developed test, RDT is the researcher-developed test, and CBT is the course-based test.
Effect Size
The primary effect size for this study is a standardized mean difference (d), which quantifies the effect of CT intervention on the outcome. First, for studies using post-test control group design, the effect sizes were computed as the formula (Cooper & Hedges, 1994):
The associated variance for d computed via Equation (1) was calculated as
Second, when Eta-squared (η2) is reported in the studies for estimation of the effects of CT, the η2 is converted into Cohen’s d via this formulation provided by Cohen (1988):
Lastly, when Pearson Product Moment Correlation coefficient (r) was reported in the studies, the reported r was transformed to Cohen’s d via the following formulation provided by Rosenthal (1994).
The associated variance for d computed by Equation (4) was calculated via
Handling Dependent Effect Sizes
Multiple effect sizes can be generated from each study based on different experimental conditions, sample population, outcome measures (i.e., problem solving and mathematical reasoning), or different duration of intervention. Generally, there are two approaches used for extracting findings from studies in a meta-analysis: a single effect size per study and multiple effect sizes per study. According to Lou et al. (2001) and Li and Ma (2010), extracting one effect size per study can guarantee the independence of each finding. However, this approach may cause the loss of difference within a study between different sample groups, between different treatment conditions, or between different outcome domains. Among several available methods of handing dependent issues, Borenstein et al. (2011) suggested to averaging the dependent effect sizes if they come from the same outcome. Along with this method, we extracted multiple findings only if a study had multiple effect sizes from different sample groups, such as different grade levels (e.g., grade 3 vs. grade 4) or sample sources (school 1 vs. school 2). In other cases, such as the same subjects for different mathematics domains or different experimental conditions, we computed an overall average effect size for each study. This approach of taking the arithmetical average of effect sizes allows us to eliminate the problem of dependencies due to multiple effect sizes, and it ensures the usage of all study features for each primary study (Cheung & Slavin, 2013; Li & Ma, 2010). In our study, following this approach, we produced 45 effect sizes from 31 studies.
Publication Bias
Funnel plots, displaying in Figure 2, are simple scatterplots of effect sizes estimated from individual studies against the inverse values of their associated standard errors. Publication bias might not be a concern on the outcome for this study due to the symmetrical appearance of the funnel plot, as shown in Figure 2. Similar results were drawn from both Egger’s regression test (z = 1.08, p = .28) and Mazumdar’s rank test (p = .25), suggesting that little evidence exists to support for the publication bias.

Funnel Plots of Effect Sizes of Mathematics Achievement.
Data Analysis
The R statistical software (Core Team R, 2013) with the metafor package (Viechtbauer, 2010) was used to estimate the overall effect and a series of moderator effects.
The Overall Effect
The overall effect of CT interventions on LP students’ mathematics achievement was estimated. The Qtotal was first conducted to examine whether the effect sizes are consistent across studies. When Qtotal is found to be statistically significant, it indicates that the observed individual effects come from different populations, then the random-effects models were used to estimate the overall effect, where the between-study variance is estimated using Restricted Maximum Likelihood (REML) estimation method (Veroniki et al., 2016). Otherwise, the fixed-effects model is used to estimate the overall effect.
Moderator Analysis
For the moderator analysis, we first examine the significance of between-class homogeneity (Qwithin). When Qwithin was found to be significant, the mixed-effects moderator model was used. Otherwise the fixed-effects moderator model was used. If there is a significant group difference based on Qmodel, a pairwise comparison using Tukey adjustment was used to find the significant pairs of groups.
Results
Description of Studies
In total, forty-five effect sizes were extracted from 31 primary studies. In those 31 studies, a total of 2,044 LP students with individual ranges from 9 to 369 was included. Across 31 studies, six studies were published before 2010, while all other studies were published between 2010 and 2018. Of 2,044 LP students, there were 142 kindergarteners, 865 primary-school students, 707 junior high school students, and 279 senior high school students, and the remaining 51 students’ grade-level information was not reported. In terms of types of CT used in 31 studies, eight studies adopted tutoring, game-based technology, and computerized practice as interventions, respectively, and seven studies used a problem-solving system as instruction interventions.
In terms of intervention duration, most of the studies (16 out of 31) conducted the CT interventions with a range from one to 11 weeks, six studies with an intervention of between 12 to 23 weeks, three with a range from 24 to 36 weeks, five studies implemented more than 36-week interventions, and the intervention duration is not available in one study. The highest intervention lasted four years. In terms of intervention intensity, seven studies implemented a low intensity intervention with less than 60 min per week, nine studies with a medium intensity intervention of between 60 min and 150 min per week, and seven studies with a high intensity intervention of more than 150 min per week, while eight studies did not report the intensity information. The majority of studies (14 out of 31) implemented CT interventions on students’ arithmetic skills, nine studies focused on intervening on the aspect of problem solving, and only two studies worked on students’ mathematical reasoning. In addition, six studies adopted a comprehensive CT intervention with a combination of concepts, problem solving, or mathematical application. As for the tests that were used to test students’ performance, most of the studies (17 out of 31) preferred to use a researcher-developed test, and ten studies gained students’ mathematics scores by using the standardized tests and four studies by using teacher-developed tests.
Overall Effect of CT on Mathematics Achievement
The overall effect of CT on LP students was evaluated by the outcome variable of mathematics achievement. The test of the overall homogeneity of the effect size was significant (Q (44) = 291.28, p < .01), indicating that more than one population exists underlying the included studies. Therefore, the random-effects model was used to compute the overall effect size.
Under the random-effects model, CT was found to be significant in mathematics achievement for LP students (z = 5.83, p < .01). The standardized mean difference was 0.56, with a 95% CI of 0.37 and 0.75. The I-squared value of 85.34% suggests that around 85% of the total variation in effect sizes is due to between-study variation (Higgins & Thompson, 2002).
The significant estimated effect size value indicates that mathematics achievement and CT are significantly and positively related to each other with a large magnitude (
Moderating Effect of CT
A series of moderator analyses were performed to examine whether the relationship between CT usage and mathematics achievement of LP students was significantly different depending on six substantive features of the current study: types of CT, grade levels, duration of intervention, intensity, intervention components, and types of testing instrument. The estimation of mean effect sizes for each moderator factor under the mixed-effect model is provided in Table 3.
Summary for Moderator Analyses.
Type of CT
Under the mixed-effects model, results showing that Q statistics of 8.47 with 3 degrees of freedom was statistically significant, suggesting that intervention effects are found to be different depending on types of CT. The estimated intervention effects for three types of CT—tutoring, game-based technology, and problem-solving system—were found to be statistically significant (
Grade Levels
Results from the mixed-effects model indicate that the effect of CT usage on LP students’ mathematics achievement significantly differs depending on grade levels (Q (4) = 10.38 p < .05). Specifically, the estimated intervention effects were significant for kindergarteners (
Duration of Intervention
Results from the mixed-effects model showed that the duration of intervention did not statistically moderate the effect of CT interventions on mathematics achievement (Qmodel (3) = 5.76, p = .12).
Intensity of Duration
Results from the mixed-effects model suggested that the effect of CT did not significantly differ depending on the levels of intensity (Qmodel (3) = 1.04, p = .79).
Intervention Components
Results from the mixed-effects model indicate that the effect of CT usage on LP students’ mathematics achievement significantly differs depending on intervention components (Qmodel (3) = 8.18, p < .05). The estimated intervention effects were all found to be significant on problem solving (
Types of Testing Instrument
The mixed-effects model results show that the types of test have a significant effect on LP students’ mathematics performances under the CT usage as an intervention (Q (2) = 6.30, p < .05). The estimated intervention effects were found to be significant for the researcher-developed tests (
Discussion
The overall findings of this review indicated that CT has a significantly positive and large effect (
However, Steenbergen-Hu and Cooper (2013) discovered a reverse comparison that the effectiveness of CT is higher for improving general education students than for LP students. The opposite direction might be due to the use of a single CT intervention (i.e., intelligent tutoring systems) in their review.
Our overall effect size seems to be smaller when compared to reviews that both included general education students and students with special needs. For example, Li and Ma (2010) found a bigger overall effect size of 0.71 for improving K-12 students’ mathematics achievement. One possible reason for this difference may arise from sampling. In our study, we merely focused on LP students. However, Li and Ma (2010) included both general education and special needs students in their research, that included a wide range of populations, such as LP students, mental/physical/emotional disability students, and at-risk students.
Besides student types, the number of studies used for reviews had a great influence on the overall findings. Xin and Jitendra (1999) reported CT greatly impacts the mathematics learning of students with learning problems (
The significant results from our review are also supported by some ideas coming from primary studies. For instance, Leh and Jitendra (2013) found that teachers were willing to apply CT in their classrooms as it was more efficient in providing immediate feedback and afforded them more time to address the needs of LP students. In addition, Outhwaite et al. (2017) claimed that the features of CT, such as immediate feedback and continuous assessment, are beneficial for LP students’ mathematics learning by tailoring task levels based on students’ abilities and allowing them to repeat tasks. Overall, with supportive arguments from primary studies, our results provide well-founded evidence about the effectiveness of CT on mathematics achievement for LP students by adding more recent studies focusing on one specific student type.
In addition to these overall findings from our review, we also look at the different impacts of CT on LP students’ mathematics achievement by using various study features: types of CT, grade levels, implementation time (duration and intensity of intervention), intervention components, and types of testing instrument, respectively.
First, among four types of CT, problem-solving systems had the largest effect on mathematics achievement compared to the tutoring program, game-based intervention, and computerized practice. Our results conflicted with an earlier review (Niemiec et al., 1987), which found drill and practice was the most effective in raising students’ mathematics scores among tutorial and problem solving. This should come as no surprise for at least two reasons. First, CT has updated a lot since the last century, which allows for more targeted and individualized instruction or support during the problem-solving process, including creating more realistic problem scenarios that are more related to students (Lantz-Andersson et al., 2009), the use of schematic diagrams and solution trees (Leh & Jitendra, 2013; Reusser, 1996), and providing explicit procedure instruction and metacognitive scaffolding (Leh & Jitendra, 2013). Second, a series of representation techniques or strategies made the learning activities more appealing to students, at the same time, maintaining an appropriate cognitive level for LP students.
Second, our findings indicated that CT interventions had significant effect for kindergarten and primary school students. However, we found that CT did not significantly influence on high school students. The multiple comparison also showed that CT interventions had a significant higher effect for primary students than for those from high schools; in other words, interventions with CT are especially beneficial to students in lower grades. The direction of CT influence is consistent with previous reviews that focused on general education students (Bangert-Drowns et al., 1985; Cheung & Slavin, 2013; Li & Ma, 2010; Niemiec et al., 1987), suggesting that CT intervention had a higher effect on elementary students than on secondary students. The possible reason for less effect on high school students seems to be their different learning needs, as Kulik et al. (1985, p. 71) argued: “high school and college students apparently have less need for the highly structured, highly reactive instruction provided in computers.”
Third, the previous views indicated that the intervention time regarding the duration and intensity were critical moderator factors that influences the CT effect. Specifically, the shorter invention had a larger effect than the longer one (Gersten et al., 2009, Kroesbergen & Van Luit, 2003; Kulik et al., 1985, Li & Ma, 2010) and the high intensity produced a bigger effect than the low intensity on students’ mathematics achievement (Cheung & Slavin, 2013). However, in our review we found that both the levels of two intervention time factors (duration and intensity) did not significantly differentiate the CT effects on LP students’ mathematics achievement. Those insignificant findings might suggest that what matters less for low performers’ mathematics learning are learning time compared to other aspects, such as learning contents or needs that certain types of CT could afford.
The fourth interesting and contradictory finding, compared to the review by Kroesbergen and Van Luit (2003), is that CT interventions on problem-solving seem to be more effective than on arithmetic skills or on multiple skills. There might be two factors related to this different finding. First, in their review, they included both LP students and students with severe learning disabilities; it is possible that learning problem solving by using CT is very challenging for LD students, while LP students without LD may benefit from technology-aided problem solving. Second, the high effect of intervening on problem solving stems from, as mentioned above, the affordance of recent technology development. The advance of CT could realize the design of a complex computer-assisted problem-solving system, which could incorporate real-world problem-solving scenarios that are more understandable for LP students (NASEM, 2018). Chang et al. (2006) claimed that this kind of system was beneficial for students’ problem solving in two aspects: Decreasing the cognitive loading by providing systematic guidance and feedback, and providing cognitive scaffolding by using a step-by-step approach. Our findings provide some suggestive evidence that using CT for problem-solving intervention is feasible and effective.
Fifth, in terms of test types and effectiveness of CT, researcher-developed tests revealed larger effect than standardized tests or teacher-developed tests. This finding is consistent with Li and Ma (2010) study that found non-standardized tests show greater effect sizes than do standardized tests. Standardized tests may have good psychometric properties but fail to align with the content being studied, while teacher-developed tests may have some issues of reliability and validity, and as a consequence, it may inflate the effects of CT on mathematics achievement.
Implication for Practice
Quite a few reviews have explored the effects of CT on the mathematics performance of general education students or students with special needs. However, they did not specifically focus on LP students. In the end, low-performing students seem to have become the “forgotten middle” in the literature. An increasing number of empirical studies have explored the effects of CT on LP students’ mathematics achievement, while these studies offer conflicting results at numerous levels. This meta-analysis integrated the findings from relevant studies and identified the moderating factors that could influence the effects of CT, which could provide useful information or guidance for instructional design and practices.
It is encouraging to find that type of the problem-solving system is the most useful in determining students’ mathematics learning. This finding represented an important strand in the mathematics research community—that is, CT can be used to design and create advanced and appealing problem-solving activities or learning experiences in the technology-supported environment for better satisfying the learning needs of diverse students. This review proved that CT—specifically, a computer-based problem-solving system—has been a very valuable tool to help students with LP or at-risk for academic failure in mathematics to improve their mathematics learning.
Another implication speaks to the higher effect of CT in promoting mathematics achievement for students from lower grades. It demonstrates that CT seems to work well for elementary students. Along with the development and design of technology by incorporating more adaptive and visual support and attractions that fit the learning styles of elementary students, the impact of CT on mathematics achievement for lower graders could even be more dramatic. However, whether CT is more attractive to younger children and hence, further influences their engagement in learning activities, is beyond the scope of this study. Future research needs to investigate the effects of CT on students’ attitudes toward mathematics.
Limitations and Future Research
There are a number of methodological limitations that need to be discussed for the current meta-analysis. First, locating LP students based on percentage or percentile was not consistent across all studies we collected due to the variation in the essential standards. For instance, some studies labeled students as a low performer when they were at or below the 25th percentile, while some studies used the 15th percentile as their cut-off scores. Though we did not exclude studies based on their percentile cut-off scores, these different standards may influence the effect sizes, especially in studies just focusing on one specific group of students. Future meta-analyses that are based on a common percentile may allow for more specific conclusions. Moreover, demographic information such as ethnicity and gender were available for the general sample population in most of the studies, but the information was not provided for the group of LP students. Since we are focusing on LP students, we could not use gender or ethnicity information in our study. We recommend that detailed demographic information should be reported separately in future studies to be used for further reviews. Finally, because we focused on LP students, we could not investigate specific mathematics outcomes. Future research might look at the effectiveness of CT on specific outcomes such as problem solving, mathematical reasoning, or arithmetic skills, etc. Instead, the results of specific mathematic aspects would be more beneficial for educators.
Footnotes
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The authors received no financial support for the research, authorship, and/or publication of this article.
