Abstract
Evidence that teachers’ short-term instructional effects persist over time and predict substantial long-run impacts on students’ lives provides much of the impetus for a wide range of educational reforms focused on identifying and responding to differences in teachers’ value-added to student learning. However, relatively little research has examined how the particular types of knowledge or skills that teachers impart to students contribute to their longer-term success. In this article, we investigate the persistence of teachers’ value-added effects on student learning over multiple school years and across subject areas. We find that, in comparison with math teachers, English language arts (ELA) teachers’ impacts on same-subject standardized achievement scores are smaller in the year of instruction, but that teacher-induced gains to ELA achievement appear to reflect more broadly applicable skills that persist in supporting student learning in the long run across disciplines. Our results highlight important variation in the quality of teacher-induced learning for long-run success, distinct from the variation across teachers in more typically measured short-term learning effects.
Keywords
Introduction
The distinction between the magnitude and type of learning that teachers impart may be an important one. Recent research indicates that the type of outcome used to measure teachers’ short-term impact is a key factor when assessing teachers’ effects on long-term outcomes. For example, researchers have frequently observed that reading or English language arts (ELA) teachers have smaller short-term impacts on students’ relative achievement levels than math teachers (Kane & Staiger, 2008; Nye et al., 2004; Rockoff, 2004). However, although Chetty and colleagues (2011) similarly find that ELA teachers’ effects on achievement are smaller than that of math teachers, they also find that an English teacher who raises students’ reading test scores by 1 unit has an impact on long-term life outcomes approximately 1.7 times that of a teacher who does the same in math. Accounting for both of these differences, the authors find that, overall, similarly ranked math and ELA teachers predict long-run effects on students’ lives to a similar degree. In a separate example, Jackson (2012) finds that ninth-grade teacher effects on both test scores and on noncognitive short-term outcomes (e.g., attendance, grade progression) independently predict teacher effects on longer term outcomes such as high school graduation and college and career aspirations. Jackson (2012) also observes that accounting for teacher effects on noncognitive outcomes is particularly important when assessing ELA teachers’ long-term impacts; doing so triples the predictable variability of their effects on longer term outcomes. By contrast, accounting for the short-term noncognitive effects of Algebra teachers’ instruction augments their long-term effect sizes by just 20%.
The previous research provides evidence that teachers in different subject areas impart different types of skills that vary in their long-term relevance for students and that different short-term outcomes are not necessarily interchangeable as measures of teachers’ overall instructional impact (Jackson, 2012). If some types of knowledge have more generalizable or persistent impacts on students’ lives, then understanding these differences across measures can lead to insights as to how teachers or other educational interventions improve students’ long-run outcomes and to how best to measure teachers’ contributions to student success. In particular, the available evidence suggests that ELA teachers impart meaningfully different skills than math teachers (Jackson, 2012) and that the teacher-induced learning reflected in ELA achievement gains are differentially relevant to students’ future success than that of math achievement gains (Chetty et al., 2011).
In this article, we utilize data from two large urban school districts in different states to compare the generalizability and persistence of teachers’ instructional effects on ELA and math achievement. Consistent with prior research, we find that ELA and math teachers’ short-term effects persist at similar rates on future assessments within the same subject area. However, we observe that the learning induced by ELA teachers has substantially more long-term persistence across subject areas than the learning induced by math teachers. Our results indicate a channel through which ELA instruction influences students’ long-run outcomes to a greater degree than is immediately apparent from students’ short-term within-subject test score gains. Moreover, the results indicate that ELA teachers’ effects on student achievement may be comparatively diffuse across subjects and over time, complicating efforts to measure instructional impacts using only within-subject measures of student learning gains during the year students spend with the teacher.
Background
Persistence of Teacher Value-Added Effects
One way to explore the relationship between teachers’ impacts on short-term and longer-term student outcomes is to examine the persistence of value-added effects on academic achievement after a student leaves a teacher’s classroom. Researchers have observed that only a relatively small portion of teachers’ proximal year effects persists and continues to impact student achievement in a subsequent school year, with most estimates ranging from between one quarter and one third of the initial value-added effect size (Jacob, Lefgren, & Sims, 2010; Kinsler, 2012; Konstantopoulos & Chung, 2011; McCaffrey, Lockwood, Koretz, Louis, & Hamilton, 2004; Rothstein, 2010). 1 This persistent component decays less quickly in subsequent school years (Candelaria, 2016; Jacob et al., 2010). Value-added persistence rates have generally been observed to be similar in ELA and math.
Based on the limited duration of the majority of teachers’ measured effects and on the broader applicability and slower decay that they observe in their persistent effects, Jacob and colleagues (2010) hypothesize that persistent learning effects represent a qualitatively different type of “long-term knowledge” that teachers can impart. In this framing, long-term knowledge represents a type of “transformational learning” that has relevance to student performance in a way that is distinct from content- or test-specific short-term knowledge. The consistency of long-term learning over time and across different grade-level assessments also suggests that teachers’ effects on this long-term knowledge may be particularly relevant to students’ long-run life outcomes. Researchers have generally observed only modest correlations between teachers’ short- and long-term value-added effects (Candelaria, 2016; Mariano, McCaffrey, & Lockwood, 2010; Rothstein, 2010), 2 which indicates that there may be meaningful heterogeneity in the quality or types of knowledge that individual teachers impart to students.
No extant research that we are aware of has examined the persistence of teacher-induced learning effects across subjects. However, the persistence of long-term knowledge across subjects may offer valuable insight into the channels through which teachers influence students’ longer-term outcomes. To the extent that learning in different subject areas persists at comparable rates across subjects, this would support the notion that different teachers’ instructional effects, on average, include a comparable portion of generalizable knowledge that is useful in any academic context. Alternately, if learning of certain types is differentially persistent across subjects, then this would imply that teachers in some subject areas are providing systematically more (or less) generalizable knowledge, and likely having more (or less) future impact. ELA teaching may generate more generalizable knowledge if, for example, teacher-induced improvements in students’ reading skills promote their ability to acquire knowledge in a wider array of subjects.
Cross-Subject Instructional Effects
A growing body of evidence indicates that teachers can have meaningful effects on their students’ contemporaneous performance across subject areas, although much of this evidence comes from studies at the high school level. For example, Aaronson, Barrow, and Sander (2007) observe that ninth-grade math and ELA teachers in Chicago Public Schools have sizable cross-subject effects in addition to their within-subject effects. Buddin and Zamarro (2009) observe the same trend in students in Grades 9 to 11 in the Los Angeles Unified School District. Koedel (2009) finds that 9th- to 11th-grade teachers in both ELA and math in the San Diego Unified School District impact student performance on students’ reading achievement, although science and social studies teachers do not. Jackson (2012), however, does not find evidence of significant cross-subject effects of ninth-grade Algebra and English teachers in North Carolina.
Both theory and some empirical evidence indicate that ELA instructional effects may be especially generalizable across subjects. For example, students’ reading and language skills have been shown to be important across a range of other subjects and may be particularly important for students of lower socioeconomic status or those with limited proficiency in English (Abedi & Lord, 2001; Chang, Singh, & Filer, 2009; O’Reilly & McNamara, 2007). Moreover, in a recent study of teachers’ cross-subject effects, Yuan (2014) examines teachers in middle school grades in an anonymous urban school district and finds that ELA teachers contribute to student achievement in mathematics, ELA, science, and social studies, whereas, by comparison, mathematics teachers contribute significant cross-subject effects only in ELA achievement.
Teachers’ contemporaneous cross-subject effects may stem from a variety of mechanisms. For example, teachers may directly collaborate in their planning or instruction of a shared group of students, reinforcing a common set of skills. Alternately, knowledge generated in one subject area may consist of generalizable skills that directly support students’ learning in an otherwise unrelated subject area. Thus, it is difficult to disentangle the different ways in which teachers may be contributing to their peers’ contemporaneous cross-subject performance. Research on cross-subject spillover to-date generally does not distinguish between contemporaneous effects of peer collaboration in instruction and differences in the generalizability of acquired student knowledge.
Contribution
In this study, we expand upon the prior research related to both teachers’ persistent effects and to their cross-subject effects. Following a conceptual and methodological framework described by Jacob and colleagues (2010), we investigate the persistence of teachers’ value-added effects, distinguishing between short-term, test-specific knowledge and longer-term learning that accumulates. We extend this framework by differentiating between long-term knowledge that is content-specific and generic knowledge that persists across subjects.
Our approach allows us to identify the persistent impacts of the subject-specific learning that prior-year teachers impart to students. As a consequence, we are better able to isolate the extent to which different types of teacher-induced learning generalize across subject areas, distinct from teacher spillover effects that stem from contemporaneous instructional collaboration. Finally, to better gauge the generalizability of our findings, we consider evidence from two large urban school districts that are located in different states and that utilize quite different standardized assessments of students’ performance in ELA and math.
We specifically consider the following research questions:
The remainder of the article proceeds as follows. In the section “Data,” we describe the data utilized in the study. The “Method” section details our methods for estimating value-added measures and value-added persistence within and across subjects. In the “Results” section, we detail our results, and we conclude in the final section “Conclusion and Discussion” with a discussion of conclusions and potential limitations.
Data
Administrative Data
To investigate the within- and across-subject persistence of teachers’ value-added effects, we draw upon extensive administrative data about students, teachers, classrooms, and schools in two large urban school districts: New York City (NYC) and Miami-Dade County Public Schools (M-DCPS). In both NYC and M-DCPS, our available data include students in third through eighth grade from school years (SY) 2003–2004 through SY 2011–2012. Both district data sets include data on students’ annual achievement test scores in ELA and math, and identification of students’ primary teacher and classroom in each year and subject area. For the purposes of our analysis, we standardize students’ achievement test scores within each grade, subject, and year in each district.
The content assessed by the annual New York State ELA and math student achievement tests has been aligned with the State’s content standards in Grades 3 through 8 throughout this period of time, and exams in each subject area include a mix of multiple-choice and open-response questions. The ELA exams primarily assess students’ comprehension of reading passages and writing ability, whereas math exams address a range of topics including number sense, algebra, probability, and geometry, with overlapping topics across grades. Similarly, student assessments provided by M-DCPS include scores on the statewide Florida Comprehensive Assessment Test (FCAT), in the subject areas of reading and math. FCAT exams in reading and math consist primarily of multiple-choice questions, but also include a small subset of short or extended open-response questions. The reading exams primarily assess students’ comprehension of reading passages. Student writing ability is tested via a separate exam in Florida that is not administered across Grade Levels 3 to 8. As such, assessment scores from M-DCPS include much less data on student writing skills than assessment data from NYC. As in NYC, math and reading FCAT exams include content corresponding to the state’s content standards.
In addition to achievement data, we have access to a rich set of demographic and behavioral characteristics that we utilize in our analyses. For both NYC and M-DCPS students, these characteristics include their race/ethnicity and home language, as well as their absences, suspensions, school transfers, free or reduced-price lunch status, disability or special education status, and English language learner status in each school year.
We estimate teacher value-added effects using a sample of student-year records in Grades 4 through 8 for whom current and prior year achievement data are available. However, because we are investigating the persistence of teachers’ value-added effects on student achievement in the year(s) after they teach a student, the subsamples for our persistence analyses are restricted to student-year observations in which we can identify prior-year teachers with appropriate value-added scores, as detailed in the “Method” section below. In practice, these requirements reduce our analysis of teachers’ persistent effects on student outcomes to student-year observations in Grades 5 through 8, from SY 2005–2006 through SY 2011–2012. This sample includes only students who are present in each district for at least three consecutive years. Finally, as explained in our “Method” section below, we only estimate cross-subject persistence for students whose lagged math and ELA teachers are not the same person, further restricting our sample. Summary statistics for each district’s analytical sample are presented in Table 1.
Summary Statistics for Students, Teachers, and Schools in the Analytical Sample for Each District
Note. Analytical samples include students in Grades 5 through 8 in school years 2005–2006 through 2010–2011 for whom necessary historical math and reading teacher information is available to estimate 1-year persistence both within- and across subject areas. NYC = New York City; M-DCPS = Miami-Dade County Public Schools; ELA = English language arts.
The grade levels of students represented in our analyses vary by district and as a function of whether we are investigating 1- versus 2-year persistent effects. In M-DCPS, a sizable portion of students in Grades 4 and 5 are taught by different teachers in their ELA and math classrooms, and we can thus estimate persistent effects of students in the fifth through eighth grade. However, in NYC very few students are taught by different teachers in ELA and math until the sixth grade, and therefore we limit our analysis of persistent effects to students in the seventh or eighth grade. Moreover, when evaluating teachers’ 2-year persistent effects, our sample is further restricted (to students in sixth through eighth grade in M-DCPS and in eighth grade in NYC). Table 2 provides a summary of our available sample sizes by grade in each district and for each type of analysis.
N of Student-Year Observations, by Grade Level and Subject, in Analytical Samples Used for Estimation of 1- and 2-Year Persistence in Each District
Note. Eligible samples are restricted to students taught by different math and reading teachers with value-added scores in the prior school year or years. In NYC, students in Grades 5 and below are taught almost exclusively by dual-subject teachers. In M-DCPS, however, students in Grades 4 and 5 are taught by different subject-specific teachers in many schools. NYC = New York City; M-DCPS = Miami-Dade County Public Schools; ELA = English language arts.
Method
Teacher Value-Added Measures
To examine the persistence of teachers’ effects, we first generate teacher value-added estimates of the effects of each teacher on tested student achievement in each year. We employ a value-added model that has been used by the NYC Department of Education in the past to evaluate their teachers’ performance (Value-Added Research Center, 2010). Conceptually, this model compares teachers to other “similarly circumstanced” teachers by first predicting students’ achievement with both prior achievement measures (math and ELA) and a range of observable student, classroom, and school characteristics that may influence their achievement, and then attributing the remaining unexplained variation in student performance to individual teachers. The Appendix provides details of our value-added model specification.
As an additional specification check, we also duplicate our main results using an alternative value-added model that includes teacher fixed effects in combination with student and classroom control variables directly in our regression, rather than post hoc aggregation of student residuals. We provide details regarding this alternative model and corresponding results with respect to within- and across-subject persistence of teacher induced learning in the Appendix.
Estimating the Persistence of Teacher Value-Added Effects Within and Across Subjects
We estimate the persistence of teachers’ value-added effects within and across subject areas using a modified version of an instrumental variables approach first described by Jacob et al. (2010). As previously discussed, these authors conceptualize students’ tested knowledge as a combination of “short-term” knowledge that has no observed impact on future achievement, and “long-term” knowledge that is relevant to both contemporaneous and future achievement tests. In their formulation, observed student achievement
Here, current achievement is a function of contemporaneous impacts
In practice, achievement test scores do not directly reflect long-term knowledge, but rather the combination of long-term and short-term knowledge assessed in the prior period,
Following a similar approach, we estimate the proportion of teachers’ total instructional effects that consist of long-term knowledge in a given subject by instrumenting each student’s lagged knowledge (
Here, our subscripts refer to each student i in time t, classroom c, and taught by teacher j. The values of
Because student assignment to teachers is nonrandom, the measured quality of a student’s lagged teachers may be correlated with the quality of their current teacher in each subject. To minimize possible bias in our teacher persistence estimates due to persistent nonrandom patterns of teacher assignment, we include in Model (2) controls for both student covariates
Although we include controls for student characteristics and for contemporaneous school and classroom assignments, our estimates of the within-subject and cross-subject persistence of teacher-induced learning could still be influenced by two potential sources of bias. First, our estimates may be biased if schools systematically adjust the instructional inputs (other than classroom assignments) that students receive as a response to the quality of a prior-year teacher. This could occur, for example, if effective teachers raise students’ achievement and this in turn leads schools to provide fewer instructional supports to those students. It seems unlikely, however, that schools would be sufficiently attuned to differential student achievement gains (rather than levels) to make this a major factor. Second, our estimates of cross-subject persistence could be biased if teacher value-added measures themselves are biased due to nonrandom within-school student sorting to instructional experiences of differing quality and if, additionally, this sorting bias systematically affects teacher value-added estimates in one subject area more than another. We discuss the implications of this potential caveat to our results in our discussion section.
Estimating the Persistence of “Long-Term Knowledge” Across Subjects
In addition to estimating the persistence of teacher-induced learning across subjects, we also investigate the persistence of previously assessed long-term knowledge (not just teacher-induced knowledge) in one subject on future achievement in an alternate subject. To do this, we modify the instrumental variables approach employed by Jacob and colleagues (2010) to estimate within-subject long-term knowledge persistence by simply swapping the dependent variable, students’ same-subject achievement, with alternate-subject achievement. This approach measures the extent to which prior-year achievement in the first subject (
Results
Persistence of Long-Term Knowledge Within and Across Subjects
We begin to address our first research question by investigating the persistence of long-term knowledge within subjects, via a two-stage regression predicting current achievement outcomes using prior achievement, instrumented with twice-lagged achievement, as previously discussed. We compare these results to persistence estimates from an ordinary least squares (OLS) model in which we predict current test scores with observed prior achievement. These results are shown in Table 3. Consistent with prior research and with our intuitive understanding of long-term knowledge, we find that nearly all of previously assessed long-term knowledge (i.e., knowledge that is relevant across two prior school years) within a subject area also persists into a third year. Across NYC and M-DCPS, partial coefficients on long-term knowledge range in a fairly narrow band from 0.949 to 1.000, regardless of whether our models include an additional control for prior achievement in the alternate subject. The coefficients on long-term knowledge persistence are in contrast to those for students’ observed prior-year test scores, which reflect a mix of short- and long-term knowledge (including measurement error) and persist at substantially lower rates. Lower observed knowledge persistence in ELA may correspond to a higher degree of measurement error in ELA assessments.
Estimates for the Persistence of Observed Knowledge and Long-Term Knowledge, Within Subjects
Note. Coefficient for Observed Knowledge from a regression of current achievement on prior achievement in the same subject. Coefficient for Long-Term Knowledge in Model 1 is from an instrumental variables (IV) regression of current achievement on prior achievement in either subject instrumented with twice-lagged achievement in the same subject. Model 2 further includes a control for lagged achievement in the alternate subject. First-stage F statistics for all IV regressions exceed 100,000. NYC = New York City; M-DCPS = Miami-Dade County Public Schools; ELA = English language arts.
Next, in Table 4, we examine how previously assessed long-term knowledge in one subject area predicts current performance in the alternate subject. Here, as previously, we consider a simple OLS regression across subjects, as well as a long-term knowledge “Model 1” that, as previously discussed, predicts current achievement in a subject with instrumented prior achievement in the alternate subject. We also consider an alternate long-term knowledge “Model 2” that includes a control for prior-year achievement in the subject of interest.
Estimates for the Persistence of Observed Knowledge and Long-Term Knowledge, Across Subjects
Note. Coefficient for Observed Knowledge from a regression of current achievement on prior achievement in the alternate subject. Coefficient for Long-Term Knowledge in Model 1 is from an instrumental variables (IV) regression of current achievement on prior achievement in the alternate subject instrumented with twice-lagged achievement in the alternate subject. Model 2 further includes a control for lagged achievement in the same subject. First-stage F statistics for all IV regressions exceed 100,000. NYC = New York City; M-DCPS = Miami-Dade County Public Schools; ELA = English language arts.
Our results differ by model specification. Results for the OLS regression continue to be less than 1 and cross-subject coefficients are smaller than the same-subject coefficients reported in Table 3. Results from long-term knowledge Model 1 indicate that knowledge that is relevant over time and across assessments in one subject area persists at a substantial, but not perfect, rate into the alternate subject. Coefficients when predicting math with long-term ELA knowledge range from 0.852 to 0.859, whereas coefficients for predicting ELA with long-term math knowledge are somewhat smaller, ranging from 0.792 to 0.813. In other words, in contrast to our within-subject results, we observe that not all of the long-term knowledge that is relevant within a subject is relevant across subject areas. Including controls for prior achievement in the subject of interest (long-term knowledge Model 2) substantially reduces these coefficients, with a larger reduction in the adjusted persistence of long-term ELA knowledge (0.339 to 0.362) versus long-term math knowledge (0.401 to 0.486). Collectively, these results indicate that much, but not all, of the within-subject long-term knowledge that students possess reflects foundational skills, abilities, or other human capital that are relevant to student performance across multiple academic subjects. Controls for prior achievement in the same subject partial out much of this generalizable knowledge, and controls for prior math achievement reduce coefficients more, perhaps because they include less measurement error than ELA measures with respect to these generalizable skills.
Overall, our results are consistent with the idea that students’ foundational knowledge and skills in any subject are predictive of performance across multiple subjects. However, it does not necessarily follow that the cross-subject effects of teacher-induced learning are the same for math and ELA teachers. Some teacher-induced learning is reflected in long-term knowledge, but our approach thus far to isolating this persistent knowledge does not distinguish between learning induced by teachers and other types of knowledge.
The Persistence of Teacher-Induced Learning Within and Across Subjects
To address our second research question related to teacher-induced learning, we examine the extent to which teachers’ value-added effects on student learning persist over time (i.e., consist of long-term knowledge) within and across each subject area. In Table 5, we show results from a two-stage regression in which we predict current achievement in either subject area with students’ prior achievement in both the same and the alternate subject, instrumented by the prior-year estimated value-added quality of both their same-subject and alternate-subject teachers. In NYC, we find that 26% to 29% of teachers’ within-subject value-added effects persist into a subsequent school year, with similar results across math and ELA teachers. One-year within-subject persistence rates are higher in M-DCPS in both subjects, but are also fairly similar across math (0.411) and ELA (0.389). Standard errors are larger when estimating persistent impacts of ELA instruction than when estimating persistent impacts of math instruction.
Estimates for the 1-year Persistence of Teachers’ Value-Added Effects Within and Across Subjects
Note. Coefficients for teacher value-added effects from regressions of current achievement in a given subject area on prior achievement in both subject areas instrumented with the prior year teachers’ value-added quality in both subject areas. Models include controls for current student characteristics and classroom fixed effects from the test subject of the dependent variable. One-year persistence coefficients reflect the portion of initial teacher effects that continue to impact achievement in the following school year. NYC = New York City; M-DCPS = Miami-Dade County Public Schools; ELA = English language arts.
The differences that we observe in the magnitude of general value-added persistence across our two districts may stem from a range of factors, including differences in the year-over-year alignment and content covered by state tests in each district. Overall, however, our estimates of same-subject teacher persistence in both districts are in a comparable range to those reported in prior research, and align with previous research showing roughly equivalent persistence rates of teacher-induced learning in ELA and math.
In contrast, across both NYC and M-DCPS, we observe stark differences in the rate of cross-subject persistence between ELA and math teachers, with ELA teacher effects persisting at a much higher rate across subjects. For example, in NYC, ELA teachers’ persistence coefficient across subject is 0.179, which is 70% of their persistence rate within the same subject (0.257). In contrast, NYC math teachers’ persistence coefficient across subjects is 0.013, less than 5% of their same-subject persistence (0.289).
The differential cross-subject persistence of ELA teachers’ instruction is also apparent in M-DCPS. In this district, ELA teachers’ estimated cross-subject persistence rate (0.178) is approximately 46% of their within-subject persistence (0.389). In contrast, M-DCPS math teachers’ cross-subject persistence rate (0.020) is less than 5% of their estimated within-subject persistence (0.411).
Next, in Table 6, we show the same- and cross-subject persistence of teacher-induced learning 2 years following instruction. Because our available samples for estimating 2-year persistence are smaller and reflect a different composition of students by grade level, we also include here results for estimates of 1-year persistence of instruction for the same teachers teaching the same students. For example, we report on the 1-year persistence of a student’s teacher from Grade 6 on Grade 7 achievement, alongside the 2-year persistence of that same student’s teacher from Grade 6 on Grade 8 achievement. Presenting both 1-year and 2-year persistent estimates for this common sample allows us to distinguish between any trend in results that relate to our sample rather than from differences in patterns of 2-year versus 1-year cross-subject persistence. 3
Estimates for 1-Year and 2-Year Persistence of Teachers’ Value-Added Effects Within and Across Subjects, for a Common Sample of Students and Teachers
Note. Coefficients for teacher value-added effects from regressions of current achievement in a given subject area on prior achievement in both subject areas instrumented with the prior year teachers’ value-added quality in both subject areas. Models include controls for current student characteristics and classroom fixed effects from the test subject of the dependent variable. NYC = New York City; M-DCPS = Miami-Dade County Public Schools; ELA = English language arts.
In this restricted sample, 1-year persistence rates are somewhat higher than in our full sample, particularly in M-DCPS. However, we continue to observe no cross-subject 1-year persistence of math teacher effects on ELA in either district, with point estimates of −0.017 in NYC and −0.009 in M-DCPS. In contrast, ELA teachers’ 1-year persistent effects are substantial, with point estimates of 0.095 and 0.223 in NYC and M-DCPS, respectively.
In line with our results for 1-year persistence, we observe differential cross-subject persistence of teacher-induced learning in ELA 2 years after instruction, particularly in NYC where our estimates are more precise. In NYC, the estimated 2-year within-subject ELA persistence rate of teacher-induced learning (0.073) is approximately 42% of the within-subject 2-year persistence estimate (0.173). In contrast, we observe very little 2-year cross-subject persistence for math teachers’ instruction (0.001), an estimate that is less than 1% of math teachers’ within-subject 2-year persistence estimate (0.134). In M-DCPS, our 2-year persistent results are inconclusive with respect to differences in persistence across subjects. ELA teachers’ 2-year cross-subject persistence in this case (0.062) is 26% of their within-subject persistence (0.240), whereas math teachers’ 2-year cross-subject persistence estimate (0.044) is around 18% of their 2-year persistence estimates within the same subject (0.250). However, in M-DCPS, the estimates for 2-year persistence are at odds with the measured (lack of) persistence of those same teachers’ 1-year effects across subjects, and the relatively large standard errors around our 2-year estimates in this district limit our ability to draw clear conclusions from the results.
Overall, our analyses of the persistence of teacher-induced learning indicates that ELA teachers’ impacts on students’ assessed ELA skills are substantially more persistent in math than math teachers’ impacts on future-year ELA performance. This pattern is largely consistent for both 1-year and 2-year persistence measures, particularly in NYC where our point estimates are more precise. Learning due to ELA instruction appears to impart long-term knowledge and skills that are reflected not only in short-term ELA scores, but also in future test scores in both subject areas.
Comparing the Relative Magnitude of Teacher-Induced Learning in ELA and Math
The previous results suggest that instructional effects captured by students’ ELA test scores are qualitatively different than those captured by math test scores. This difference manifests as larger cross-subject persistent effects for ELA learning. We are unable to assess the extent to which ELA learning plays out differently with respect to nonacademic outcomes, but in light of our results it is worth reevaluating the relative magnitude of ELA teachers’ instructional effects on academic achievement when accounting for persistent cross-subject effects. To do this, we consider the relative impact of ELA and math teachers both with respect to longer-term (i.e., persistent) knowledge in particular, and for the combination of short- and longer-term knowledge. In Table 7, we provide estimates of the magnitude of teachers’ effects on long-term knowledge across multiple subjects and on the combination of short-term within-subject and longer-term, multisubject knowledge.
Estimated Relative Magnitude of Impacts of ELA and Math Instruction on Student Achievement Scores in ELA and Math Over 2 Years, Accounting for Cross-Subject Effects in Year 2
Note. Same-year impacts are estimates of unshrunken true teacher value-added effects in the year of instruction. Estimated same-and cross-subject persistence effects are a function of the point estimates for 1-year persistence rates from Table 5, multiplied by the estimated size of teacher effects in the year of instruction. Total persistent effects combine same-and cross-subject 1-year persistence impacts. Total impacts combine within-subject impacts in the year of instruction (i.e., same-year), and multisubject persistent impacts in the year following instruction. ELA = English language arts; NYC = New York City; M-DCPS = Miami-Dade County Public Schools.
In the first column of Table 7, we report our unshrunken estimates of the impact of a one standard deviation difference in ELA and math teachers’ effectiveness on adjusted student achievement in the same-subject in the year of instruction. Our results, which range from 0.235 to 0.283, are consistent with estimates from prior research on the raw standard deviation of teachers’ value-added effects (Rockoff, 2004), and in both districts our estimates are consistent with a common finding of ELA value-added effects that are somewhat smaller than (approximately 84% to 88% of) math value-added effects. When considering same and cross-subject effects that persistent into a second school year, however, the situation is reversed. For this multisubject longer-term knowledge, ELA teachers’ estimated value-added effects are somewhat larger than (approximately 110% to 126% of) math value-added effects. Accordingly, when we estimate teachers’ combined same-year and persistent impact on student test scores and account for cross-subject persistence, ELA teachers’ relative magnitude of impact on achievement moves substantially closer to that of (approximately 92% to 97% of) math teachers’ effects. Note that this sizable improvement in ELA teachers’ estimated magnitude of impact comes from accounting for just cross-subject persistence in the first year following instruction, and thus likely represents a lower bound for ELA teachers’ overall contributions. It does not account for other factors that may also differ across subjects, including possible cross-over instructional effects in the year of instruction (which we do not measure), any cross-subject effects on additional academic subjects such as science or social studies, or any cross-subject effects of ELA instruction on math (or the reverse) that are not mediated by ELA (or math) academic knowledge.
Conclusion and Discussion
This study offers new insight into K–12 teachers’ instructional effects, contributing to our understanding of how teachers’ efforts benefit students over time and on different assessments. We identify important differences in the type of learning that teachers of different subjects impart, as evidenced by a much higher rate of cross-subject persistence associated with teacher-induced learning in ELA than in math. These results may help to explain why ELA teachers’ effects on student achievement have been shown to predict students’ long-run outcomes to a similar degree as math teachers’ effects, in spite of smaller contemporaneous effects on within-subject standardized achievement scores (Chetty et al., 2011). The knowledge that ELA teachers impart and that is captured by ELA achievement scores may be differentially important to student learning more broadly. We hypothesize that due to their broader applicability, a given increase in ELA skills may yield greater long-term benefits to students’ life outcomes than a comparable increase in math skills. In line with this theory, the magnitude of ELA teachers’ estimated effects on measured achievement is substantially increased, relative to that of math teachers, when we examine longer-term knowledge over multiple subjects rather than only short-term, within-subject knowledge.
Although our study focuses on teachers’ contributions to student learning, our findings may also have broader implications for other interventions that aim to develop students’ skills in reading and language arts. The results suggest that a variety of educational interventions that focus on ELA skill development may yield long-term, generalizable benefits that are not fully captured by observed gains on ELA assessments in the year of instruction. Investments in developing students’ ELA skills, whether through classroom instruction or other means, may yield larger benefits to students than are generally recognized or immediately apparent. For example, across our two samples we estimate that between 32% and 41% of ELA teachers’ persistent effects on achievement consist of cross-subject impacts on math achievement. If this pattern of cross-subject impacts holds with respect to the overall academic gains from ELA-focused interventions, researchers may, by focusing only on ELA test scores, fail to identify a sizable portion of the overall benefits of developing students’ skills in reading and language arts.
To the extent that teachers’ instructional effects influence student achievement over time and across subject areas, educators and policymakers may miss valuable information if they rely only on short-term within-subject student learning to evaluate teachers’ “value added” to student achievement. Previous research has shown that teacher value-added rankings are sensitive to modeling choices around the inclusion of cross-subject spillover effects or the persistent effects of prior-year teachers (Kinsler, 2012; Yuan, 2014) and that individual teachers’ short- and longer-term academic effects are only moderately correlated (Mariano et al., 2010; Rothstein, 2010). We find that ELA teachers’ contributions to student learning are particularly diffuse, and thus a larger portion of their instructional impact may typically go undetected or be ascribed to teachers in other subject areas or school years. In the context of teacher performance evaluations that are linked to student achievement gains, our results reinforce the value of attending to team-level indicators of teachers’ contributions to student achievement, or to estimating value-added models that simultaneously account for multiple different subject-teachers’ contributions (Yuan, 2014).
We can only speculate as to whether impacts on proximal versus future-year test scores are more relevant to students’ lifelong outcomes, but nevertheless it is notable that persistent ELA instructional effects across subject areas are larger than teacher-induced persistent math effects. More persistent and generalizable effects on test scores may indicate a deeper learning that yields larger lifelong benefits for students, though we cannot rule out the possibility that short- and longer-term value-added reflect student learning of equivalent value.
Our findings do illustrate that different types of student learning play out differently in students’ academic performance over time. The results corroborate prior theory and research regarding the importance of students’ reading skills across a variety of contexts (Abedi & Lord, 2001; Chang et al., 2009; O’Reilly & McNamara, 2007). They also highlight the potential importance of considering multiple measures of teachers’ contributions to student learning, as different measures may pick up on qualitatively different instructional contributions (Jackson, 2012).
This study also contributes to a growing body of evidence regarding teachers’ cross-subject spillover effects. Although previous research has identified substantial contemporaneous spillover effects across different teachers’ classrooms (Aaronson et al., 2007; Buddin & Zamarro, 2009; Koedel, 2009; Yuan, 2014), these effects may reflect multiple contributing factors, including direct instructional collaboration with shared students in the year of instruction. Our method allows us to better isolate the persistent spillover effects of different types of previously observed teacher-induced student learning, and we demonstrate that ELA learning in particular is more broadly relevant to performance in another subject area. Our study is also among the first to explore the cross-subject effects of teachers in elementary and middle school grade levels, rather than in high school contexts. Consistent with Yuan’s (2014) findings for teachers in middle grades, we find greater spillover from ELA teachers’ instruction, relative to that of math teachers.
Our data and analyses have some key limitations that are typical of research on teacher value-added effects. First, we are unable to examine instructional effects in subject areas other than ELA and math. As a result, we are unable to examine whether the cross-subject generalizability of ELA learning extends to subject areas other than math. Although we hypothesize that this might be the case, future research in contexts where multisubject testing occurs across grade levels is needed to test this. Second, our analysis is limited by the fact that students are not randomly assigned to teachers in our sample. When estimating persistence, our results could be biased if in fact teacher value-added measures include bias from systematic patterns of nonrandom student assignments.
In particular, our findings could have been impacted if ELA teachers’ value-added estimates are systematically influenced by nonrandom patterns in the overall quality of instruction their students receive (e.g., through tracking that might afford some students systematically better learning experiences), whereas math teachers’ value-added estimates are not influenced to the same degree by this nonrandom sorting. If this were the case, ELA teachers’ measured value-added might correspond not just to their own instructional impact, but also to the impact of students’ overall instructional experience, whereas math teachers’ measured value-added would better reflect just their own instructional impact. That kind of lopsided bias in ELA value-added measures could lead to the larger apparent cross-subject effects of ELA teacher-induced learning that we observe. We are unable to verify whether this may be a factor in our results. That said, the best available evidence on teacher value-added measures in middle-school grades does not indicate any systematic bias in teacher value-added measures of the kind we estimate, even in the absence of random student assignment (Kane, McCaffrey, Miller, & Staiger, 2013). Additional research leveraging data from extant random-assignment studies could help to clarify whether the potential bias described here is present in value-added measures in middle school settings.
The consistency of our findings across two large urban school districts that utilize different standardized tests and content standards provides credible evidence that our findings are likely to generalize across a wide range of school settings where standardized math and ELA exams are the norm. That said, we would expect the persistence of teacher-induced learning to vary as a function of the overlap in assessed content between one year’s test administration and the next. Similarly, cross-subject effects may differ in magnitude as a function of the overlap in content between different subject area tests, particularly if some non-ELA tests require student reading and writing skills to a greater degree than others. More work is needed that evaluates competing measures of teachers’ instructional impact and that documents key differences in the types of student learning that are captured by different assessments. Such research could inform which types of assessments do a better job of reflecting teachers’ full instructional effects.
K–12 teachers have substantial impacts on students’ academic achievement, and the best evidence to-date indicates that these impacts are predictive of meaningful long-run effects on students’ life outcomes (Chetty et al., 2011). This prior research suggests that value-added measures are useful indicators of teachers’ contributions, at least in low-stakes settings. However, particularly in light of the frequent use of value-added measures in school accountability and personnel management systems, it is important to understand the mechanisms by which short-term instructional effects translate into longer-term student outcomes. Without a clearer understanding of those mechanisms, we may be unable to differentiate between short-term effects that are transient or context-dependent, versus those that have broader benefits for students. The findings from this study highlight the potential for meaningful variation in the type of learning that different teachers impart, as evidenced by substantial differences in how different instructional effects play out over time and across subject areas. Additional research is needed to illuminate the processes by which different types of learning yield broad and persistent benefits for students.
Footnotes
Appendix
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported by a grant from the National Center for the Analysis of Longitudinal Data in Education Research (CALDER). CALDER is supported by the Institute of Education Sciences, U.S. Department of Education Grant R305A060018.
Notes
Authors
BENJAMIN MASTER is an associate policy researcher at the RAND Corporation. His research is in the area of educational policy with a particular focus on school leadership and human capital management in schools. His work explores promising investments in human capital development, including interventions that support teacher and principal effectiveness and the impacts of educational interventions on students’ lives. Address: 1200 South Hayes Street, Arlington, VA 22202. Email:
SUSANNA LOEB is the Barnett Family Professor of Education at Stanford University. She specializes in education policy, looking particularly at policies and practices that support teachers and school leaders. Her work spans the range of age levels, including early education, K–12, and higher education. Her recent work focuses particularly on information barriers to teaching improvement and parenting. She is a member of the National Board for Education Sciences, codirector of Policy Analysis for California Education, a senior fellow at the Stanford Institute for Economic Policy Research, and a faculty research fellow at the National Bureau of Economic Research. Address: 520 Galvez Mall #524, CERAS building, Stanford, CA 94305-3084. Email:
JAMES WYCKOFF is the Curry Memorial Pro-fessor of Education and Policy, and director of the EdPolicyWorks, a research center at the University of Virginia. His research focuses on providing evidence for improving teacher quality through a variety of mechanisms including teacher preparation, recruitment, assessment, and retention. Currently he is working with colleagues to examine the effects of teacher evaluation in the District of Columbia and the ways in which principals improve teacher quality in New York City. Address: PO Box 400277, Charlottesville, VA 22904. Email:
