Abstract
Arizona enrolls a larger share of its students in charter schools than any other state in the country, but no comprehensive examination exists of the impact of those schools on student achievement. Using student-level data covering all Arizona students from 2006 to 2012, we find that the performance of charter schools in Arizona in improving student achievement varies widely, and more so than that of traditional public schools (TPS). On average, charter schools at every grade level have been modestly less effective than TPS in raising student achievement in some subjects. But charter schools that closed during this period have been lower performing than schools that remained open, a pattern that is not evident in the traditional public sector.
Introduction
Although it is clear that charter schools are an increasingly important educational option in Arizona, there is conflicting evidence on their effectiveness relative to traditional public schools (TPS). Solmon and Goldschmidt (2004) examined student achievement in reading over the 3-year period from 1997–1998 to 1999–2000, and found that elementary school students attending charter schools began with lower test scores than students in TPS, but showed faster achievement growth. The same study found no effects of charter attendance on achievement growth for middle school students and a modest negative effect for high school students, which the authors speculated could be due to charter high schools being less likely to have an academic orientation. But a recent report from the Center for Research on Educational Outcomes (CREDO, 2013) at Stanford University examined data from 2005–2006 to 2010–2011 and found that charter students performed slightly worse in both reading and math than a matched comparison group of students in TPS.
Both of these studies have limitations. Solmon and Goldschmidt (2004) is based on data from more than a decade ago that pre-dates Arizona’s most rapid period of charter school expansion. Its primary results focus on the subset of students who attended charter schools each of the 3 years in its panel, who represent just 20.6% of all students who attended a charter school in at least 1 of the 3 years. Finally, its outcome measure is a low-stakes test, which may not have been administered consistently across the two sectors. The CREDO (2013) study uses data on state math and reading tests from more recent years but relies on a value-added identification strategy that only compares year-to-year gains made by students in charter and TPS and thus ignores any persistent or cumulative effects of charter school attendance. For example, any effects charter schools have on student achievement before students are first tested in third grade would not be captured by CREDO’s methodology. The CREDO study also does not consider whether the effectiveness of charter schools varies by factors such as grade level, location, or educational mission.
In this article, we exploit statewide longitudinal data on individual students attending charter schools and TPS between 2006 and 2012 to offer the only comprehensive study of the effectiveness of Arizona charter schools in raising student achievement based on recent, high-stakes tests. We address concerns about selection into charter schools in a more credible way than many previous studies by focusing on middle and high schools, for which it is possible to restrict comparisons to students who previously attended the same school, while still presenting results for elementary schools based on conventional value-added methods.
Our study also exploits the unusual scale of Arizona’s charter sector to provide new evidence on the extent and nature of variation in effectiveness among charter schools. Most prior research on the charter sector focuses on the average effect of attending a charter school within a given geographic area (Betts & Tang, 2011), although some more recent studies have documented variations in effectiveness among charter schools, which they exploit to identify practices correlated with effectiveness (see, for example, Angrist, Pathak, & Walters, 2013; Dobbie & Fryer, 2011; Hoxby, Murarka, & Kang, 2009). Yet a central rationale of charter schooling as an education reform strategy has been that the greater autonomy afforded to charter schools should yield more variation in performance than is observed among TPS, but that the closure of less effective charter schools due to insufficient enrollment or to authorizer decisions should lead to steady improvements over time (see, for example, Finn, Manno, & Vanourek, 2001). We formalize these ideas by explicitly comparing the amount of variation in effectiveness among individual charter and TPS. Although we lack detailed data on the practices of specific charter schools, we also examine the extent to which variation in their performance can be predicted by such factors as whether their mission statements emphasize raising student achievement. Finally, we compare the performance of charter schools that closed during our study period to those that remained open.
We find that the performance of individual Arizona charter schools in improving student achievement varies widely. The variation in effectiveness among charter schools is significantly greater than the variation among TPS in both math and reading, even taking into account the fact that charter schools tend to be smaller. On average, however, Arizona charter schools are modestly less effective than TPS in raising student achievement—especially in the state’s non-urban areas. We find suggestive evidence that charter schools with mission statements indicating a focus on academic rigor are more effective in raising math achievement than schools with a more generic mission, whereas schools with an explicit focus on the arts are less effective. Finally, we show that charter schools that have closed in the past 6 years were markedly less effective than those which remained open, a pattern that is not evident among TPS and should, if it continues, lead to steady improvement in the relative performance of Arizona’s charter sector over time.
Data
Our study takes advantage of a statewide, student-level longitudinal data extract obtained from the Arizona Department of Education (AZDOE) by the Goldwater Institute, a research organization headquartered in Phoenix, Arizona. The extract contains test score, school enrollment, demographic, and program participation information for students in Arizona public schools from the 2005–2006 through 2011–2012 school years (hereafter referred to using the calendar year of the spring semester, that is, 2006–2012).
The Institute requested the data in March 2012 due to its long-standing interest in the state’s charter sector and concern that CREDO’s (2009) influential study might provide a misleading picture of its performance in raising student achievement. AZDOE was initially unresponsive to follow-up emails and phone calls from Goldwater about the data request, but resumed communications after the Institute threatened legal action.
In October 2012, AZDOE provided the data to the Institute via a secure transfer protocol, which enabled analyses to be conducted off-site by users authorized under a data use agreement. The data use agreement did not place any restrictions on the research questions that could be addressed. After receiving the data, the Institute invited the authors to propose a set of analyses that would address their questions about the performance of Arizona charter schools and inform charter school policy in the state. The Institute then added the authors as authorized users under the data use agreement and assisted them in accessing the secure transfer. As authorized users, we were not permitted to use the data to determine the identity of any student, to allow any unauthorized use of the data, to share any data with the potential to be personally identifiable, or to report data with cell sizes smaller than 10.
Although ultimately successful, the challenges the Institute faced in acquiring the data may explain the paucity of recent research using student-level data in Arizona. To our knowledge, the CREDO (2009, 2013) reports are the only examples of external research using information from the state’s current longitudinal data system. 1 The state’s apparent reluctance to work with external researchers also created some challenges for our own analysis. For example, when it became clear that the data extract was missing data on key demographic variables in some years, AZDOE staff were unwilling to provide updated files. We discuss how we addressed these challenges below.
The primary student outcome we examine is student test-score performance on Arizona’s Instrument to Measure Standards (AIMS), the statewide standards-based assessment in math, reading, writing, and science. Students take the AIMS math and reading tests in Grades 3 to 8, science test in Grades 4 and 8, and writing test in Grades 5 to 7. 2 There are also AIMS high school tests in all four subjects that students take in the second semester of their second year of high school. A passing score on the math, reading, and writing tests is required for graduation from high school, and students can retake the tests multiple times to achieve a passing score or improve on a previous (passing) score.
For tests taken in Grades 3 to 8, we standardize all scores by subject, grade, and year to have a mean of 0 and standard deviation of 1. For high school, we use the score from the first time the student took each test (usually in the spring of 10th grade) and standardize it using the distribution of all 10th-grade test takers in that subject and year. We also identify whether students have passed (at any point observed in our data) all three of the tests required for high school graduation.
An enrollment file contains demographic and program participation information for all K–12 students in Arizona’s public schools—roughly 1 million students each year. The data include school, grade, birth year, special education status, and gender for all years (2006–2012). Data on race/ethnicity are only available for 2011–2012 and free-lunch eligibility is only included for 2010–2012. Attempts to obtain these data for the earlier years from the AZDOE were unsuccessful, so we are unable to directly account for these student-level variables in our analysis. However, as we explain below, we were able to obtain school-level information on race/ethnicity and free-lunch eligibility from public sources.
An indicator for limited English proficiency (LEP) status is also included in the school enrollment file. However, the share of students identified as LEP jumps from 13% to 15% in 2005–2008 to 7% to 9% in 2009–2012, raising questions about the consistency of this measure over time. Instead of using this variable, we classify students’ English proficiency status using test-score data from the Arizona English Language Learner Assessment (AZELLA), a test administered to students for whom English is not the language used in their home, the language they speak most often, or the language they learned first. 3 This test is used to both classify students as English language learners (ELL) and to determine when they have achieved fluent English proficiency (FEP). For each year, we identify whether a student is ELL, FEP, or neither (i.e., they do not have an AZELLA score in the data file).
A school-level data file from the AZDOE links the school codes used in the test-score and enrollment files to school name, district name, an indicator for charter schools, and the range of years that the school was open (within the 2005–2012 window). We merge the AZDOE data with school-level data from the National Center for Education Statistics Common Core of Data (CCD) on racial/ethnic composition and the share of students eligible for free- or reduced-price lunch (because these data are not available for most of the years in the student-level data). The CCD files are only available through 2011, so we use the 2011 values for both 2011 and 2012. 4 We identify schools that closed as those with an open window that ends before 2012. For many of the closed charter schools, we are able to identify the official reason for closure using data from the Arizona State Board of Charter Schools.
Finally, we downloaded the mission statements of Arizona charter schools from the AZDOE website, and classified each charter middle school that provided a mission statement into one of five mutually exclusive categories: rigorous, progressive, arts, at-risk, or general. We classified as rigorous schools with mission statements with an exclusive or clearly dominant emphasis on improving student achievement in core academic subjects and/or preparing them for college. The schools we classified as progressive, in contrast, described themselves as cultivating each student’s unique strengths or educating the whole child and often emphasized goals related to respect for others, love of learning, and creativity. Arts schools were those claiming a special curricular emphasis on the arts, whereas at-risk schools were those seeking to serve at-risk students (often described in mission statements with language such as “students for whom traditional schools have not been effective and predictably will not be in the future”). We classified schools with mission statements that did not fall into any of these categories in a general category. We placed virtual charter schools into a sixth category and excluded them from the five mission categories. Virtual charter schools are identified as schools that either have one of four keywords in their name (virtual, online, AOI [Arizona Online Instruction], or distance) or are on a list of approved online programs published by the Arizona State Board of Education in April 2012. 5
Method
The effect of an intervention such as attending a charter school is most credibly estimated using random assignment, such as that which occurs when oversubscribed charter schools hold admissions lotteries. That is not possible in our study because we do not have access to lottery records for Arizona’s charter schools, so we instead use observational methods that compare students in charter schools and TPS with similar characteristics and academic achievement prior to entering those schools. One advantage of this approach over lottery-based studies is that we need not restrict our analysis to only charter schools that are oversubscribed. Our key identifying assumption is that students observed in charter and TPS do not systematically differ in unmeasured ways that impact their achievement. The method we use is similar to the non-experimental methods that have been shown in within-study comparisons to best replicate experimental estimates of the effects of attending charter schools (Fortson, Verbitsky-Savitz, Kopa, & Gleason, 2012) and magnet schools (Bifulco, 2012). 6
We focus our analysis on middle schools for two reasons. First, we are able to measure student achievement prior to entering middle school, which is not possible in elementary school because Arizona only tests students beginning in Grade 3. Second, we can track student performance on state tests for multiple years in middle school, whereas high school students are only tested at one point in time (at the end of 10th grade).
Specifically, we identify students observed entering a middle school in 2007 to 2012 (we exclude 2006 because no baseline data are available for students in the first year of our data extract). We define middle schools as having a minimum grade between 4 and 7, although in practice most middle schools begin in Grade 6 or 7. From the first year a student is observed entering a middle school we count the number of years that the student has spent in charter schools as of a given year. For example, a student who entered a charter middle school in Grade 6 in 2007 and stayed there for 3 years would be classified as having spent 1 year in charters in 2007, 2 years in 2008, and 3 years in 2009.
We measure students’ baseline math and reading test scores based on their standardized test performance in the year and grade prior to their first-time enrollment in a middle school. These baseline scores serve as control variables throughout the analysis. 7 Consequently, our analysis measures charter effects as the cumulative value-added to the student’s achievement since entering the school, not the relative test-score gains from 1 year to the next. We also identify the elementary school attended by each student immediately prior to entering middle school, which allows us to restrict charter–TPS comparisons to students who came from the same elementary school in the same year.
Table 1 shows descriptive statistics for students attending charter and traditional public middle schools. The most striking difference between the two groups is that charter students have baseline scores that are about 0.2 standard deviations higher than their counterparts in TPS. 8 Charter students are also less likely to be Hispanic, receive special education services, or participate in the free- and reduced-price lunch program. Finally, charter middle school students are much more likely to have attended a charter elementary school, with nearly half of them coming from such schools as compared with only 2% of students in traditional public middle schools.
Descriptive Statistics, Middle School Students
Note. Middle schools include all schools where the minimum grade is 4 to 7. Race/ethnicity data are only available for 2011–2012 and free-lunch data are only available for 2010–2012.
We measure the effect of time spent in charter middle schools on student achievement using the following model:
where
We estimate this model separately by subject area. We also estimate an alternative version of the model that does not condition on the baseline school-by-year dummies, which restricts the comparisons being made to charter school and TPS students who had attended the same elementary school in the same year (these models still condition on baseline year dummies). Whether the inclusion of baseline school-by-year dummies reduces any bias due to unmeasured student characteristics in the estimation of charter school impacts is theoretically unclear (cf. Hoxby & Murarka, 2008). They will reduce bias to the extent that students who attended the same elementary school share common characteristics correlated with their subsequent achievement growth. At the same time, they could aggravate bias to the extent that students who had the same local schooling options, yet made different decisions about which school to attend, are especially likely to differ in ways relevant for their achievement growth. Fortunately, our substantive conclusions about the effects of attending Arizona charter middle schools are the same regardless of which version of the model we estimate.
In all models, standard errors are adjusted for clustering by school. In addition to estimating average charter school effects based on all middle school students in our data, we estimate charter effects for subsets of middle schools defined in terms of location and mission.
We also estimate charter effects for elementary and high schools using variations on our preferred method. The method we use for high schools is most similar to our preferred method because we are able to observe students’ academic performance before entering high school. Specifically, we identify the first year that each student in our data was in ninth grade in a high school that begins with that grade (we drop schools where the entering grade is not ninth grade). We match these students to their performance on the eighth-grade math and reading tests (using their most recent data if they were in eighth grade more than once). Online Appendix Table A1 (available at http://epa.sagepub.com/supplemental) shows descriptive statistics for high school students by sector. There is a large difference in baseline test scores that goes in the opposite direction from that observed for middle school students, with charter high school students entering with eighth-grade scores 0.32 to 0.39 standard deviations lower than students in traditional public high schools. This difference likely stems in part from the larger share of charter high schools that are designed to serve at-risk students. High school students attending charters are more likely to be eligible for free lunch (58% vs. 38%) and more likely to have come from a charter middle school (33% vs. 3%).
We measure the effect of starting high school in a charter school as follows:
with the same notation as above except that we use a binary indicator for starting in a charter school
We estimate this model using five different dependent variables: scores on each of the four high school tests and a binary indicator for whether the student passed all three of the tests required for high school graduation. We adjust standard errors for clustering by school.
Finally, we estimate simple value-added models for elementary schools to provide some evidence on the performance of this important group of schools despite the analytical challenges to doing so. We define elementary schools as those in which the minimum grade that appears in the test-score file is third grade. Online Appendix Table A2 (available at http://epa.sagepub.com/supplemental) shows descriptive statistics for elementary school students as well as all middle schools students (not the more limited sample shown in Table 1 for our preferred analytic approach). We are not able to compare any baseline measures of academic performance, but the charter–TPS comparisons on demographic and program participation variables are largely similar for elementary and middle schools, with charter students less likely to be Hispanic or eligible for free lunch.
To estimate the effect of attending a charter elementary school, we regress test scores in a given year on test scores from the prior year and a set of control variables:
where the notation and control variables are the same as above except scores from the prior year are included (
Finally, we also use the value-added model to estimate the relative effectiveness of elementary and middle schools that closed during our period of observation (compared with schools that remained open). In this analysis, we replaced the dummy identifying charter schools with a set of dummies identifying charters that remained open, charters that closed, and TPS that closed (with TPS that remained open forming the omitted category).
Results
Middle Schools
Our preferred results for middle schools appear in the top panel of Table 2. The first coefficient, −0.021, indicates that each year spent in a charter middle school reduced student achievement in math by roughly 2% of a standard deviation, an effect that falls just shy of statistical significance at the 10% level (p = .103). We observe no difference between charter and TPS students in reading or writing performance, but there is a clear negative effect of charter school attendance on science scores—0.041 standard deviations per year spent in charter schools. This effect appears to be driven in part by the sample of students taking the science test, as we obtain modestly larger negative effects on math and reading scores for the subsample of students with science scores in our data. 10
Effect of Years Spent in Charter Middle Schools on Test Scores
Note. Robust standard errors adjusted for clustering by school appear in parentheses. Years in charter is measured as the number of years each student has been enrolled in charter schools between their entry into middle school and time they were tested (for a given observation). All models control for Grade × Year effects, baseline year effects, ELL status, FEP status, age, baseline test scores in math and reading, gender, and school-level race/ethnicity, free-lunch eligibility, gender, ELL, FEP, age, and minimum grade. ELL = English language learners; FEP = fluent English proficiency.
p < .05. **p < .01.
Recent evidence from Massachusetts (Angrist et al., 2013) and from a national evaluation of oversubscribed charter middle schools (Gleason, Clark, Clark Tuttle, & Dwoyer, 2010) indicates that the impact of attending a charter school may vary geographically, with urban charter schools producing more positive impacts (relative to the TPS their students would otherwise attend) than charter schools in non-urban settings. Table 2 confirms that a similar pattern is evident in Arizona. Each year’s attendance at a non-urban charter middle school is estimated to reduce achievement in both math and reading by roughly 3% of a standard deviation. In contrast, we find that attending an urban charter middle school has no effect on achievement in either subject.
A comparison of the top and bottom panels of Table 2 shows little qualitative difference in the results depending on whether we control for baseline school-by-year fixed effects. This is notable because including these variables limits the comparison group for charter students to only those TPS students who came from the same elementary school in the same year. The fact that the results are so similar may indicate that baseline test scores (and demographics) are a reasonable proxy for students’ academic ability.
Both the traditional and charter schools in the analysis presented in Table 2 include some non-traditional schools, such as schools identified by the AZDOE as alternative schools (typically because they focus on at-risk student populations) and virtual schools, which offer instruction online. Below we report estimates of the effect of attending at-risk and virtual charter schools, which are more negative than the estimates for other charter schools. But we find that excluding all alternative or virtual schools from the analysis does not alter the overall pattern of results. 11
Table 3 uses our preferred model (with baseline school-by-year effects) to examine whether the effects of attending a charter middle school on math and reading vary with the limited number of student-level background characteristics that are consistently available in our data. Consistent with the results for charter schools in non-urban areas, we find that the negative charter effect on math scores is concentrated among students with above-average baseline scores in that subject. For these students, each year of attendance at a charter middle school is estimated to reduce math achievement by 0.04 standard deviations. However, we do not find a similar pattern of results for reading scores. Our estimates of charter effects also do not vary notably with respect to gender.
Effect of Years Spent in Charter Middle Schools on Test Scores, by Student Characteristics
Note. Robust standard errors adjusted for clustering by school appear in parentheses. ELL and special education students are identified as those who were ever classified into one of these categories (during the years observed in the data). Years in charter is measured as the number of years each student has been enrolled in charter schools between their entry into middle school and time they were tested (for a given observation). All models control for Grade × Year effects, baseline school-by-year effects, ELL status, FEP status, age, baseline test scores in math and reading, gender, and school-level race/ethnicity, free-lunch eligibility, gender, ELL, FEP, age, and minimum grade. ELL = English language learner; ed. = education; FEP = fluent English proficiency.
p < .1. *p < .05. **p < .01.
ELL students make up 6% of Arizona middle school students attending a TPS and 4% of students in a charter school (see Table 1). For these students, we estimate positive annual effects of charter attendance of 0.04 standard deviations in both subjects. This provides suggestive evidence that Arizona’s charter middle schools may be particularly effective for students learning English, although it is important to note that ELL students attending charters may differ in unobserved ways from those attending a TPS. For special education students, the estimated effect of charter attendance is also positive but statistically insignificant.
Table 4 shows results from our preferred model for subgroups of charter schools defined in terms of stated mission. 12 The 50 charter middle schools with an academically rigorous stated mission increase math scores by 0.042 standard deviations per year spent in the school, an effect that is statistically significant at the 10% level. Progressive schools, of which there are 47, have no statistically significant impact on math or reading performance, while the 123 charters with a general (non-descript) mission have negative impacts of 0.033 standard deviations per year in math and 0.010 standard deviations in reading. Charters that specialize in the arts (16 in total) also have a substantial negative effect on math scores, which might be expected given their subject-specific focus. Charters focused on at-risk students are estimated to have a large negative impact in both subjects, although it is likely that these 20 schools serve students who face challenges that are not adequately captured by the control variables.
Effects of Years Spent in Charter Middle Schools on Test Scores, by Mission
Note. Robust standard errors adjusted for clustering by school appear in parentheses. Years in charter is measured as the number of years each student has been enrolled in charter schools between their entry into middle school and time they were tested (for a given observation). All models control for fixed effects for Baseline school × Year, Grade × Year effects, ELL status, FEP status, age, baseline test scores in math and reading, gender, and school-level race/ethnicity, free-lunch eligibility, gender, ELL, FEP, age, and minimum grade. ELL = English language learners; FEP = fluent English proficiency.
p < .1. *p < .05. **p < .01.
It is important to note that the variation in effects documented in Table 4 could reflect differences in patterns of selection into charter schools with a particular kind of mission based on unmeasured student characteristics. For example, the students who seek out a charter middle school with an academically rigorous mission statement may differ in ways that would have led them to make larger achievement gains in math for other reasons. Even so, the results provide suggestive evidence that the performance of Arizona charter schools varies in ways consistent with their stated mission, with schools emphasizing academic rigor producing positive results in math.
There are only five virtual charter middle schools that can be included in the analysis. Table 4 also indicates that these schools have large negative impacts on test performance (compared with TPS, almost all of which are non-virtual), with effect sizes of 0.25 and 0.11 standard deviations in math and reading, respectively. However, like students that attend schools for at-risk students, virtual school students may have unmeasured characteristics that confound these results.
Taken together, these results indicate that charter middle schools in Arizona produce results that, on average, are modestly worse than TPS. But these effects are not observed consistently across the sector. Some groups of charters produce similar test-score results as regular district schools, some produce more negative results than the average charter, and some produce more positive results. Could it be the case that performance of charter schools varies more in general than the performance of TPS?
We use two methods to answer this question. First, we use our middle school model to estimate the effect each individual school has on student test performance in math and reading by replacing the years in charter variable with a dummy for each school (both charter and TPS). We estimate this model separately by year (replacing the grade-by-year dummies with grade dummies), and do not include baseline school-by-year effects for computational reasons. Finally, we aggregate the effect estimates from each year into a single estimate using Bayesian shrinkage methods to adjust for imprecision in the estimation. 13
Charters show weaker performance in math and similar performance in reading, as we would expect from our main analysis, but in both subjects, the effects for charter schools are more widely dispersed than the effects for TPS (see Online Appendix Table A3, available at http://epa.sagepub.com/supplemental). In math, the fixed effects for charters have a standard deviation of 0.16 student-level standard deviations, as compared with 0.12 for TPS. In reading, the standard deviations are 0.09 and 0.08.
As a check on the robustness of these results, we also estimate random effects models that we implement via hierarchical linear modeling (HLM) using the same set of control variables. 14 HLM allows us to directly estimate the standard deviation of the random effects, as well as a measure of uncertainty for this estimate. This analysis suggests even larger differences in the distribution of school effects between charters and TPS that are evident in both subjects. In math, the random effects have a standard deviation of 0.26 for charters and 0.17 for TPS. In reading, the standard deviations are 0.18 and 0.13. These standard deviations are precisely estimated (each has a standard error of 0.01), so in both subjects, the 95% confidence intervals of the estimates for charters and TPS do not overlap.
Elementary and High Schools
We present our results for high schools in Table 5. Recall that our method for high schools is similar to our method for middle schools, except that for high schools, we only use one observation per student (because students are only tested once, rather than every year). Our preferred results, shown in the top panel, indicate that students who begin high school in a charter school are 9.3 percentage points less likely to pass all three tests required for high school graduation than students who attend traditional public high schools. Consequently, it is not surprising that scores on the individual tests are lower (by 0.07 standard deviations in math) or about the same (in reading and writing) among students who attend charter high schools. 15
Effect of Attending a Charter High School on Test Performance
Note. Robust standard errors adjusted for clustering by school appear in parentheses. Charter (0/1) is a binary indicator for whether the student began high school (ninth grade) in a charter school. All models control for year effects, ELL status, FEP status, age, eighth-grade test scores in math and reading, gender, and school-level race/ethnicity, free-lunch eligibility, gender, ELL, FEP, and age. ELL = English language learners; FEP = fluent English proficiency.
p < .05. **p < .01.
The analytical methods available for elementary schools are less ideal than those we use for middle and high schools. We cannot control for test scores upon entry to middle school, because students are not tested at such a young age, so instead we estimate value-added models that compare test-score growth from 1 year to the next in charter and TPS. To the extent that charter schools (or TPS) have positive effects in Grades K–3, those effects will be controlled away by the use of prior-year scores in these models.
Table 6 nonetheless indicates that charter elementary schools have negative effects on math scores of about 0.05 standard deviations each year, but no effects on reading, writing, or science scores. To shed light on the validity of these results, we also report estimates based on using the same value-added model to study middle schools. Here we find statistically significant negative impacts on math, writing, and science scores, whereas our preferred method showed a statistically significant negative effect only in science (Table 2). Moreover, the point estimates are consistently larger in Table 6 than in Table 2. 16 Extrapolating this pattern to elementary schools, it may be the case that the apparent negative effect of charter school attendance on math scores reported in Table 6 is overstated.
Effect of Attending a Charter Elementary or Middle School on Test Scores
Note. Robust standard errors adjusted for clustering by school appear in parentheses. Charter (0/1) is a binary indicator for whether the student is enrolled in a charter school in a given year. All models control for Grade × Year effects, ELL status, FEP status, age, prior-year test scores in math and reading, gender, and school-level race/ethnicity, free-lunch eligibility, gender, ELL, FEP, and age. ELL = English language learners; FEP = fluent English proficiency.
p < .1. *p < .05. **p < .01.
Closed Schools
Finally, we use the same value-added models from the elementary school analysis to compare test-score gains in charter schools and TPS that closed and those that remained open during the period of observation (2005–2012). For this part of the analysis, we combine elementary and middle schools to increase our sample size and therefore the precision of our estimates. Consistent with Table 6, Table 7 shows that the 311 charter schools that remained open throughout this period were modestly less effective in math than the 1,189 TPS that remained open but equally effective in the other three tested subjects. However, the 64 charter schools that closed were markedly less effective in all four subjects, by 0.10 to 0.13 standard deviations in math, writing, and science and by 0.06 standard deviations in reading.
Relative Effectiveness of Closed Schools, Compared With TPS That Remained Open
Note. Robust standard errors adjusted for clustering by school appear in parentheses. All models control for Grade × Year effects, ELL status, FEP status, age, prior-year test scores in math and reading, gender, and school-level race/ethnicity, free-lunch eligibility, gender, ELL, FEP, and age. ELL = English language learners; FEP = fluent English proficiency.
p < .1. *p < .05. **p < .01.
We also divide the charter schools that closed into four categories based on the closure reason reported by the Arizona State Board of Charter Schools: 34 schools whose charters were revoked (due to serious issues such as academic performance or financial mismanagement), 14 schools that closed voluntarily (usually due to lack of enrollment), 16 schools that closed for other reasons (such as converting to another kind of school or being unable to find a suitable facility), and 9 schools for which the reason for closure was unknown. The bottom panel of Table 7 provides evidence of subpar performance in math for all four categories of schools, but the most consistent evidence of negative impacts (across all four subjects tested) for schools that had their charters revoked.
These results indicate that charter schools that were reducing student learning appear to have been more likely to be shut down (or close on their own). The same does not appear to be the case for TPS. The 58 that closed during the period covered by our data were modestly less effective (compared with TPS that remained open) in writing, but not in the other three subjects. It is worth noting that slightly more charters than TPS closed during the period captured by our data, despite there being more than 3 times as many TPS as charters. Consequently, this represents at least suggestive evidence that the charter sector provides a better mechanism for weeding out poorly performing schools than the traditional sector. 17 This is further supported by evidence, reported in Online Appendix Table A4 (available at http://epa.sagepub.com/supplemental), that the negative effects of charter middle schools in three out of four academic subjects are concentrated among charters that have been in operation for fewer than the median number of years (7).
Conclusion
Enacted in 1994, Arizona’s charter school law seeks to both “provide additional academic choices for parents and pupils” and “provide a learning environment that will improve pupil achievement.” 18 The steady growth in enrollment indicates that the law has accomplished its first purpose. According to the Arizona Joint Legislative Budget Committee, Arizona charter schools received total funding of US$7,460 per student in fiscal year 2012, as compared with US$8,992 per student in schools operated by traditional districts. 19 In other words, Arizona’s charter schools appear to be providing parents with additional academic options in a cost-effective manner. Our results, however, raise questions about the extent to which the state’s charter school law has succeeded in raising student achievement.
Arizona’s charter school law is unique in allowing charter schools to operate for 15 years before coming up for review. Because the most rapid expansion of the Arizona charter sector occurred around the turn of the 21st century, many charters are poised to come up for review in the next few years. This provides an opportunity for rapid improvement through careful attention to quality in the reauthorization process, and the fact that lower quality charter schools have been more likely to have their charters revoked in recent years is encouraging in this regard. But our evidence also suggests that a 15-year period with little oversight of academic quality may be too long to wait to intervene in and potentially close schools that are producing subpar results. A shorter authorization period accompanied by vigorous efforts to measure quality along the way may strike a better balance between autonomy to innovate and accountability for results.
The Arizona State Board for Charter Schools, which serves as the authorizer for more than 90% of charter schools in the state, has already taken important steps in this direction. In particular, in October 2012, it adopted a new Academic Performance Framework that subjects each of its schools to an annual review based primarily on student achievement levels and growth and clarifies that charters may be revoked well before the 15-year contract expires. 20 Although standardized test scores should not be the only metric to judge quality, especially among schools with certain specialized missions, the framework appears to provide a sound basis for taking timely action to address consistent under performance.
The mediocre overall performance of the charter sector in Arizona should not overshadow the impressive work being done in some individual schools. The same can be said for the traditional sector, in which there is also substantial variation in quality. But part of what makes the charter idea compelling is that it provides opportunities for schools to innovate, while not tolerating persistent failure. For this ideal to be realized, policies need to be in place that drive continuous improvement of the sector over time.
Footnotes
Acknowledgements
We thank Mark Berends, Susan Dynarski, and two anonymous referees for helpful comments.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) declared receipt of the following financial support for the research, authorship, and/or publication of this article: Authors received financial support for their work on this article from the Goldwater Institute.
Notes
Authors
MATTHEW M. CHINGOS is a senior fellow at the Brookings Institution and research director of its Brown Center on Education Policy. He studies a wide range of education-related topics at both the K–12 and postsecondary levels.
MARTIN R. WEST is an associate professor of education at the Harvard Graduate School of Education and the deputy director of the Program on Education Policy and Governance at Harvard Kennedy School. His research focuses on the politics of K–12 education policy in the United States and the impact of policy on student learning and non-cognitive development.
