Abstract
Studies link principal effectiveness to lower average rates of teacher turnover. However, principals need not target retention efforts equally to all teachers. Instead, strong principals may seek to strategically influence the composition of their school’s teaching force by retaining high performers and not retaining lower performers. We investigate such strategic retention behaviors with longitudinal data from Tennessee. Using multiple measures of teacher and principal effectiveness, we document that indeed more effective principals see lower rates of teacher turnover, on average. Moreover, this lower turnover is concentrated among high-performing teachers. In contrast, turnover rates of the lowest-performing teachers, as measured by classroom observation scores, increase substantially under higher-rated principals. This pattern is more apparent in advantaged schools and schools with stable leadership.
Research suggests that teacher turnover has negative effects on school performance, particularly for schools that are low-achieving and serve larger high-needs populations (Hanushek, Rivkin, & Schiman, 2016; Ronfeldt, Loeb, & Wyckoff, 2013). Motivated by this inverse relationship between turnover and school outcomes and an assumption that teacher turnover rates are higher than optimal (e.g., Alliance for Excellent Education, 2014), numerous studies have investigated the factors that lead teachers to leave their schools or the profession altogether (see Borman & Dowling, 2008; Grissom, Viano, & Selin, 2016; Guarino, Santibañez, & Daley, 2006). Studies find that the quality of the school’s leadership is among the most important predictors of whether a teacher continues in the school from year-to-year (e.g., Boyd et al., 2011; Grissom, 2011; Ladd, 2011). This research suggests that more effective principals are better able to retain teachers because they create more positive school climates, supply teachers with greater support, provide more beneficial opportunity for professional growth, and otherwise positively shape teachers’ working conditions in ways that lead to greater job satisfaction and attachment (Ingersoll, 2001; Johnson, Kraft, & Papay, 2012; Kraft, Marinell, & Yee, 2016).
An important characteristic of prior studies linking leadership to teacher retention is that they find that more effective leadership is associated with lower turnover for the average teacher. From a school improvement standpoint, however, all teacher turnover is not created equal. Although schools no doubt benefit from retaining effective teachers, they likely also benefit from failing to retain teachers who are ineffective (Adnot, Dee, Katz, & Wyckoff, 2017; Hanushek, 2009; Ronfeldt et al., 2013; Staiger & Rockoff, 2010). We refer to this phenomenon—the retention of effective teachers coupled with the non-retention of ineffective teachers—as strategic retention. Past work has found that high-growth schools indeed have higher retention rates among high-performing teachers and higher turnover rates among low performers (Loeb, Béteille, & Kalogrides, 2012).
As the chief human capital managers in their schools, principals are uniquely positioned to pursue differential retention strategies. Personnel decisions are an important channel through which principals can influence student achievement (Jacob, 2011), and several studies provide evidence that principals can act strategically in the area of personnel management to try to improve school outcomes, particularly when they have access to good information about teacher performance (e.g., Cohen-Vogel, 2011; Goldring et al., 2014; Grissom, Loeb, & Nakashima, 2014; Rockoff, Staiger, Kane, & Taylor, 2012). Effective principals may pursue strategies to reduce the overall teacher turnover rate in their schools, but we expect that they may work particularly to lower turnover among their high performers. In contrast, effective principals may find ways to encourage turnover among low performers, either through administrative means, such as contract nonrenewal, or through less formal means, such as “counseling out” or finding ways to make the job less palatable (Drake et al., 2016).
Investigating strategic teacher retention is especially timely in the context of the widespread adoption of multiple-measure teacher evaluation systems across the United States in recent years (Steinberg & Donaldson, 2016). These systems typically incorporate both school leaders’ observations of classroom practice using a detailed instructional rubric and achievement-based metrics (such as value added), sometimes supplemented with other measures (Grissom & Youngs, 2016). Facilitation of strategic retention is one motivation for these systems, which are predicated on the idea that differentiating teacher performance—which prior, less rigorous evaluation systems failed to do (Weisberg et al., 2009)—can provide schools opportunities to tailor approaches to different teachers, including merit pay, career ladders, improvement plans, and dismissal (Steinberg & Donaldson, 2016). 1 Often ignored in policy discourse around teacher evaluation, principals are both important targets and important facilitators of these systems. They are targets in the sense that requirements for frequent, detailed teacher observations push principals to collect and process their own evidence about their teachers’ effectiveness. They are facilitators in that the implementation capabilities and investment choices of the principal directly influence both the quality of the collected data (in the form of observation scores) and the local response to evaluation information (see Grissom & Loeb, 2017; Kraft & Gilmour, 2017; Reinhorn, Johnson, & Simon, 2017). These roles suggest that principal effectiveness may be an important prerequisite for teacher evaluation systems to realize their potential to shape the effectiveness of a school’s teacher workforce.
In this article, we investigate whether schools led by more effective principals show evidence of strategic retention. We leverage longitudinal administrative data from Tennessee spanning the years 2011–2012 to 2016–2017, which correspond to the first 6 years of the implementation of the Tennessee Educator Acceleration Model (TEAM), the state’s principal and teacher evaluation system. TEAM collects multiple measures of performance for both principals and teachers, including both rubric-based observation measures and measures based on student achievement. The evaluation system thus provides us with rich data for operationalizing both principal and teacher effectiveness, which we further augment—for principals—with additional measures from low-stakes surveys that ask Tennessee teachers to rate leadership quality in their schools. Using these measures of principal and teacher effectiveness, we address several research questions. First, replicating prior work, we ask whether principal effectiveness is associated with lower teacher turnover on average. Second, we ask to what extent the association between principal effectiveness and teacher turnover differs for high-performing and low-performing teachers, and, furthermore, whether these associations vary by different measures of teacher performance. Third, we ask whether and how these patterns differ by other characteristics of teachers (e.g., novice vs. veteran teachers) and their schools (e.g., high-achieving vs. low-achieving schools).
We answer these questions using a school fixed-effects approach to exploit within-school variation in principal effectiveness over time, which helps eliminate some (but not all) threats to interpreting our estimates as causal. In contrast to most prior studies, which have relied on teacher perceptions of principals from surveys to operationalize principal effectiveness, our main analyses make use of ratings assigned to principals using a rubric scored by supervisors trained to rate principal effectiveness against state standards, which may provide more accurate measures of their performance. Our results contribute not only to the research base on teacher mobility and retention but also to the literatures on the talent management role of principals, which is increasingly recognized as a key component of principal work (Cannata et al., 2017; Goldring et al., 2014), and on the role of teacher evaluation systems in shaping the teacher workforce.
Strategic Teacher Retention
Numerous quantitative studies going back nearly two decades identify the effectiveness of school leadership as an important predictor of whether teachers stay or leave their schools (e.g., Griffith, 2004; Ingersoll, 2001; Stockard & Lehman, 2004). Many of these studies use data from teacher surveys to operationalize leader effectiveness, then use those perceptual measures to predict the likelihood of turnover. For example, Boyd et al. (2011) find that beginning teachers in New York City are less likely to leave their positions when they have more positive perceptions of school administrators, and when their colleagues do as well. Moreover, in a follow-up survey, nearly half of the teachers who left their position identified “support from administrators” as the most important reason for their decision. Likewise, Ladd’s (2011) study using teacher working conditions survey data from North Carolina finds that teachers working in schools with better leadership—as measured by aggregate responses to leadership perceptions questions among teachers in the school—express lower intent to leave and, in fact, turn over less often, controlling for other factors. Grissom’s (2011) analysis of data from the nationally representative Schools and Staffing Survey similarly finds that principal effectiveness, measured via school-level averages of teachers’ ratings of leadership, is associated with increased teacher satisfaction and a decreased likelihood of teacher turnover. These associations were larger in schools with larger numbers of historically marginalized students. In another analysis of New York City data—this time longitudinally—Kraft et al. (2016) again find teachers’ perceptions of leadership quality to predict lower rates of teacher turnover, more so than other school climate factors, such as quality of teacher relationships and collaboration.
Numerous mechanisms may link more effective leadership to lower turnover, but explanations in most studies center on effective leaders’ positive impacts on school climate (Burkhauser, 2017), which in turn improves teachers’ satisfaction and commitment to the school and makes it less likely that they leave. In this vein, early-career teachers interviewed by Johnson and Birkeland (2003) who chose to stay in their schools reported having principals who fostered supportive cultures and found ways to encourage teachers and recognize their efforts, while non-stayers instead tended to describe their principals as absent, controlling, overly critical, and not attuned to teacher support. Similarly, Allensworth, Ponisciak, and Mazzeo (2009) found that teachers who reported a high level of trust of their principal and viewed their principal as a strong instructional leader were substantially more likely to remain in their schools.
Although we do not discount the importance of the connections among more effective leadership, more positive school climates, and lower average teacher turnover rates, we propose that effective school leaders may pursue additional strategies to improve the quality of their teaching staffs, presumably in pursuit of more positive student outcomes. In particular, we suggest that more effective leaders may be more likely to target their retention efforts at more effective teachers while simultaneously working to not retain ineffective teachers. Such strategic retention behaviors may be more likely to lead to gains in student outcomes than strategies aimed at retaining all teachers. To this point, Loeb et al.’s (2012) study of Miami-Dade County Public Schools found that high-growth schools were more successful both at retaining teachers with high value added to student achievement and at turning over teachers with low value added.
The hypothesis that more effective principals are more likely to selectively retain teachers rests on a number of assumptions about what leaders know and can do with regard to teacher retention. In particular, strategic retention presumes that principals can identify their high and low performers, that they can influence retention decisions among teachers they wish to retain, and that they have the means to release or influence turnover among teachers they wish to not continue in their schools. We discuss each of these in turn.
Identifying High and Low Performers
Differentiated retention strategies targeted at high- and low-performing teachers rely on principals’ capacities to tell one type of teacher from the other. Even prior to the recent widespread implementation of multiple measures-based teacher evaluation systems, studies suggested that principals could differentiate teachers by performance (Harris & Sass, 2014; Jacob & Lefgren, 2008; Rockoff et al., 2012). In the post–Race to the Top era, when more rigorous teacher evaluation systems that make multiple measures of teacher performance accessible to principals have become commonplace, principals have even more information at their disposal for differentiating between high- and low-performing teachers (see Grissom & Youngs, 2016). More frequent principal observations of teachers using a standardized rubric are a hallmark of these new systems, providing principals with more opportunities to observe teacher practice and, in many cases, facilitating their engagement with other measures of teacher effectiveness (Goldring et al., 2014). As with prior studies in low-stakes settings, studies of principals’ formal evaluation ratings of teacher practice show that they can accurately differentiate teacher performance (Grissom & Loeb, 2017; Sartain, Stoelinga, & Brown, 2011).
Principals may be particularly responsive to teacher effectiveness measures derived from their own observations in making talent management decisions. As Goldring et al. (2015) document in a study of principals’ use of these measures in six urban districts, principals trust observation data because they collect the data themselves. In contrast, they discount other measures, such as value added, because of lack of transparency and because those measures often are returned to them too late to be useful in talent management decisions. Moreover, principals value dimensions of teacher practice that are not captured by student achievement-based measures but that often are reflected in observational rubrics (Grissom, Loeb, & Doss, 2016; Master, 2014). Thus, we expect that teacher effectiveness reflected in the observation score is more likely to drive principals’ teacher retention decisions than other components of an overall evaluation score.
Targeting Retention Efforts at High-Performing Teachers
Knowing who their high-performing teachers are does not guarantee that principals have tools available to influence those teachers to stay. In few districts do principals play any role in determining teacher compensation, meaning that principals’ work to retain their effective teachers must come through affecting their working conditions. 2 In several studies linking principal effectiveness to teacher turnover, effectiveness has been operationalized by teacher responses to survey questions focused on principals’ impacts on working conditions, such as the degree to which principals provide support and encouragement and recognize teachers for a job well done (e.g., Grissom, 2011; Ladd, 2011). Principals’ ability to retain their effective teachers likely depends on the degree to which they can differentiate their efforts to create more positive working conditions for their high performers.
A 2012 report from The New Teacher Project (TNTP) suggested that principals indeed could differentiate such efforts and recommended several straightforward strategies for retaining “irreplaceable” teachers (i.e., those in the top tier of performance), including recognizing their contributions, providing them direct encouragement to stay, giving them helpful feedback, and creating opportunities for them to show leadership. Yet in the four urban districts studied for the report, they found use of such strategies to be relatively infrequent. Other research suggests that principals may give more desirable work assignments—by, for example, matching teachers to higher achieving students—to more effective teachers or move those teachers into instructional coach or other leadership roles, suggesting some other strategies through which principals might seek to encourage retention among their high performers (Chingos & West, 2011; Grissom, Kalogrides, & Loeb, 2015; Kalogrides, Loeb, & Béteille, 2013; Player, 2010). Other efforts to retain effective teachers may be indirect. For instance, creating systems and processes to remove low-performing teachers may indirectly encourage high performers to stay put by emphasizing the value the school places on high-quality teaching (TNTP, 2012).
Targeting Release Efforts at Low-Performing Teachers
Strategic retention also requires that principals have the capacity to move ineffective teachers out of their schools when they identify them. Principals can induce low-performing teachers to turn over through formal or informal means.
Formal means are administrative processes by which a teacher can be removed from a school or district. Typically, schools have more discretion to dismiss low-performing early-career teachers who are untenured or on probationary status, while the administrative processes surrounding dismissal of a tenured or non-probationary teacher often make (or are perceived to make) forced removal of a more veteran teacher too difficult or costly except in extreme cases (Gordon, Kane, & Staiger, 2006; Painter, 2000; Weisberg et al., 2009). Recent reforms in several states, including Tennessee and Florida, have sought to make formal teacher dismissal procedures more accessible to school and district leaders by lengthening probationary periods, conditioning tenure on performance, and eliminating some due process rights associated with tenure, among other changes (Harrison & Cohen-Vogel, 2012; Wesson, 2012). Even in the wake of these changes, however, utilization of formal dismissal procedures appears limited, with few teachers in reform states removed for poor performance (Kraft & Gilmour, 2017).
Other evidence suggests, however, that, when given the tools, principals can utilize formal processes to move teachers out of their schools. For example, Jacob (2011) found that principals in Chicago Public Schools were less likely to renew teachers with higher absence rates, lower performance ratings, and lower value added. In Master’s (2014) study of an urban charter district, principals were substantially more likely to dismiss teachers to whom they had given low mid-year ratings. Grissom et al. (2014) found that principals in low-achieving schools in Miami who were given the option to nominate teachers to be involuntarily transferred out of their schools used the policy to move out veteran teachers with low value added and higher absence rates and replace them with higher performers. In another study, principals reported using data from their teacher evaluation system to place struggling teachers on formal intervention or assistance plans that would lead to dismissal if teachers did not respond to the plans and improve (Drake et al., 2016). The stigma associated with being on a professional improvement plan can be enough to lead ineffective teachers to leave (Cohen-Vogel, 2011).
Principals may make use of informal means to remove ineffective teachers as well. “Counseling out” by, for example, discussing with the teacher that they may be more successful in another environment, is one such strategy (TNTP, 2012), though principals in Grissom and Loeb’s (2011) study of principal management skills rated their ability to counsel out teachers among the lowest of 50 skill areas they assessed. Principals also may implement a strategy of changing teachers’ work assignments to make their jobs less desirable (TNTP, 2012), making it more likely that the teacher voluntarily exits. In an experiment in which principals were provided with teacher performance information, turnover among low-performing teachers in the treatment schools increased (Rockoff et al., 2012), evidence that principals indeed used the information and available mechanisms to push teachers out. Likewise, other studies have found that teachers are more likely to resign when assigned low evaluation ratings (Master, 2014).
Principal Effectiveness and Strategic Retention
Effective teachers are more likely to stay in their schools, and less effective teachers are more likely to move or exit (e.g., Boyd, Lankford, Loeb, Rockoff, & Wyckoff, 2008; Goldhaber, Gross, & Player, 2011). However, researchers have not yet examined whether these patterns are moderated by the quality of the school’s principal. We argue that there are good reasons to suspect that they might. Effective principals produce higher student outcomes (Coelli & Green, 2012; Grissom et al., 2015), and strategies to improve the effectiveness of teachers in the school are likely important to their approach (Cohen-Vogel, 2011; Loeb et al., 2012). To the degree that more effective principals are more engaged in and serious about teacher observation and evaluation (Goldring et al., 2014), they likely have better information with which to differentiate high- and low-performing teachers. They also are more likely to act on that information in strategic ways; to this point, Donaldson (2013) found that principals with greater human capital in the form of ingenuity, motivation, and training in instructional improvement were more likely to circumnavigate obstacles to increasing teacher effectiveness in their buildings through hiring, dismissal, and other talent management functions.
More generally, even in the absence of specific targeted strategies for retaining high performers and removing low performers, effective instructional leaders establish school cultures marked by high expectations and consistent, useful feedback to teachers on their performance (e.g., Hallinger, 2005; Neumerski, 2013). Such school cultures likely are especially conducive to increasing job satisfaction and commitment among teachers who excel and to signaling a lack of fit to ineffective teachers (TNTP, 2012).
Data and Measures
This study analyzes administrative data from Tennessee, a state made up of 146 districts operating roughly 1,800 schools that serve 996,000 students. Thirty-one percent of the state’s students are Black or Hispanic, and 58% are eligible for the federal subsidized lunch program. 3 Tennessee was a first-round winner of the Obama administration’s Race to the Top competition and instituted a number of educational reforms under its auspices. Particularly useful for this study, these reforms included implementation of a statewide educator evaluation system, the Tennessee Educator Acceleration Model or TEAM, beginning in the 2011–2012 school year. 4 We make use of multiple measures from the TEAM system to measure the effectiveness of teachers and principals, as we describe below.
An important piece of Tennessee context is that dismissal or release of teachers is substantially easier for districts than in most other states. A law passed in April 2011 ensured that districts have wide latitude to dismiss teachers deemed ineffective (Tenn. Code. Ann. § 49-5-501–515). During the period we study, teachers in the first five years of their teaching careers in Tennessee (“probationary” teachers) could be let go without cause; districts simply could elect not to renew their yearly employment contract. Even after entering the “non-probationary” period, teachers could be reverted to year-to-year contract status with two consecutive years of low overall evaluations. Permanent tenure status for teachers who already had tenure at the time of the law change was grandfathered, meaning those teachers could only be let go for cause and after due process. 5 Districts differ in how they approach non-renewal of teacher contracts, but principals have relatively broad discretion to release teachers—particularly early-career teachers—they deem ineffective or a poor fit.
For this analysis, the Tennessee Department of Education (TDOE) provided us with longitudinal administrative data files covering all public education personnel in the state from 2011–2012 to 2016–2017 via the Tennessee Education Research Alliance (TERA) at Vanderbilt University. These files contain information about employees’ personal and professional characteristics, including job positions, gender, race and ethnicity, years of experience, and highest degree earned. We use these files to construct additional experience measures, such as years employed in their current school. 6 We then merge these data with information on the characteristics of the schools and districts in which teachers and principals currently work from annual student demographic, enrollment, and achievement data from TDOE.
Table 1 contains descriptive statistics for teachers and schools, including our key measures that we discuss below. The teaching force in Tennessee is predominantly female and White, with a majority having a master’s degree or higher. More than a quarter of teachers have fewer than five years of teaching experience. The average teacher works in a school where 24% of the students are Black and 59% qualify for free/reduced price lunch. More than half of Tennessee’s teachers work in elementary schools.
Descriptive Statistics
Note. FRPL = free/reduced price lunch; TEAM = Tennessee Educator Acceleration Model; FTTT = First to the Top survey. All variables are shown at the teacher-by-year level covering the analysis period 2011–2012 to 2015–2016. Data from 2016–2017 is used only to construct a turnover outcome for teachers in 2015–2016. FTTT scores are available from 2011–2012 to 2013–2014. Teacher growth and achievement scores are unavailable in 2015–2016 because Tennessee cancelled statewide testing in that year.
Measuring Teacher Effectiveness
Under Tennessee’s statewide evaluation system, teachers’ overall summative evaluation scores are comprised of the weighted average of three components: scores from formal classroom observations, student test score growth, and student achievement. Classroom observations are conducted by trained raters—typically, the school’s principal—throughout the school year using the TEAM rubric, which defines levels of performance on 19 instructional indicators and four additional professionalism indicators. 7 The test score growth measure comes from the Tennessee Value-Added Assessment System (TVAAS), estimated for TDOE by the SAS Institute. 8 For teachers of tested grades and subjects, an individual growth score is determined by their students’ test score gains. Teachers of untested classrooms receive a growth score based on schoolwide growth. Achievement measures are locally determined and are not consistent across schools. 9 Scores on these individual components are aggregated using weighting formulas that vary based on a teacher’s subject assignment (i.e., tested or non-tested subjects) and the availability of prior student achievement data; the typical split over the years of the data is 50% from observations, 35% from student growth, and 15% from the achievement measure. 10 The weighted sum is used to assign each teacher a “Level of Overall Effectiveness” (LOE) rating on a discrete scale of 1 to 5, where 1 is “Significantly Below Expectations” and 5 is “Significantly Above Expectations.” Growth scores and LOE ratings typically are returned to teachers and schools in the early fall of the following school year.
For teachers of tested grades and subjects (approximately 44% of teachers in Tennessee), both the average classroom observation and growth scores represent individualized measures of teacher effectiveness. For the remaining majority of teachers, the average classroom score is the only available individualized measure of teacher performance. In this analysis, we use both observation and growth scores as measures of teacher effectiveness. Additionally, we examine “individual growth scores” for the relevant subset of teachers. 11 Growth scores, which range from Level 1 to Level 5, are taken directly from the evaluation files. 12 To facilitate the interpretation of non-linear relationships between observation scores and turnover, as well as to match the construction of the growth and achievement measures, we convert the continuous score to a categorical indicator with five groups. 13 The lowest category (1.00-2.75) includes teachers who are roughly two or more standard deviations below the mean. These measures of teacher effectiveness are available for approximately 58,000 teachers in each year. 14
Table 1 shows that the average observation score is 3.91, and the standard deviation (0.58) is much smaller than that of the growth or achievement score. Relatively few teachers receive very low (3%) or very high (16%) observation scores. The number of teachers in the lowest observation category is roughly equal to the number of teachers who receive the lowest summative evaluation rating from the state each year. There is much greater variation in growth scores, though nearly 42% of teachers receive the highest rating (Level 5). Additionally, far more teachers receive very low growth ratings (20%) than receive very low observation scores. Consistent with prior work, there is a modest positive correlation between observation and growth scores (r = 0.18). 15 Finally, the average achievement score is 4.12, with nearly 57% of teachers receiving a Level 5. Given that achievement scores come nearly exclusively from school performance, 16 we do not include them in our analysis as a measure of teacher effectiveness.
Measuring Principal Effectiveness
We employ two distinct measures of principal effectiveness. Our primary measure comes from summative supervisor practice ratings assigned to each principal each year as part of the TEAM system. These ratings, which comprise 50% of each principal’s overall evaluation score, are assigned by district leaders—the superintendent or his or her designee—who have been trained to rate principals using a rubric derived from the Tennessee Instructional Leadership Standards. The rubric contains 22 indicators of effective leadership practice grouped into four domains (as of 2015-2016): Instructional Leadership for Continuous Improvement, Culture for Teaching and Learning, Professional Learning and Growth, and Resource Management. Importantly, the number and content of these indicators and domains has changed several times since 2011–2012, though prior research on these scores shows that, regardless of the specific indicators included on the rubric in a year, the indicators are so highly intercorrelated that they can be reduced to a single underlying performance score using factor analysis (Grissom, Blissett, & Mitani, 2018). That is, the supervisors’ ratings do not differentiate areas of principal performance and instead identify a single underlying principal effectiveness construct. Given this evidence, we calculate the predicted score from this single-factor model 17 to use as a summary measure of effective leadership practice—at least from the point of view of the principal’s supervisor. We refer to this score, which we standardize, as the TEAM rating. 18
We supplement the TEAM supervisor rating with an additional measure of principal effectiveness from the perspective of teachers in the school. This measure is similar to those employed in other studies linking principal effectiveness to teacher turnover (e.g., Grissom, 2011; Ladd, 2011). We construct this measure from responses to the First to the Top (FTTT) Survey, conducted by researchers at Vanderbilt University in cooperation with TDOE as part of evaluation efforts associated with the state’s FTTT initiatives. Teachers statewide were invited to participate in the survey in the spring of the 2011–2012 through 2013–2014 school years. Response rates ranged across years from 25% to 40%. A random subset of responding teachers completed a module containing a battery of questions designed to assess their principal’s leadership. Items ask, for example, whether the school’s principal consistently monitors student academic progress, communicates a clear school vision, and sets high standards. Using factor analysis, we again found that responses captured one latent construct, which we take to be perception of principal effectiveness (Grissom et al., 2018). To obtain a principal-level score, we averaged the teacher-level factor scores at each principal’s school and standardized them. We refer to this score as the principal’s FTTT score. Approximately 19% of teacher-year observations do not have an FTTT score because none of the school’s teachers responded to the requisite survey items, and most schools have responses from only a few teachers, so results should be interpreted with caution. 19 The correlation between principal TEAM rating and FTTT score is a modest 0.20, suggesting that supervisors and teachers judge principals by related but distinct criteria.
Operationalizing Teacher Turnover
From the longitudinal administrative data files, we create both binary and categorical teacher turnover variables. The binary variable takes a value of 1 if a teacher leaves his or her school during or immediately following year t (i.e., they are not in the same school in year t+ 1). The categorical indicator includes four turnover types (not including teachers who stay in their schools): moving to another school in the same district, moving to a different district, changing positions (e.g., instructional coach, assistant principal, and leaving the education system. 20
Table 2 shows simple mean yearly turnover rates for Tennessee teachers by each measure of teacher effectiveness. Panel A shows that the yearly teacher turnover rate is 13.3%. About a third of turnover events are teachers who leave the education system (4.5%), with a somewhat higher proportion of teachers moving to another school in the same district (5.5%). In comparison to exits and within-district moves, across-district moves and position changes (e.g., becoming an instructional coach or administrator) are less common (2.3% and 1.0%).
Teacher Turnover Rates by Effectiveness Measures
Note: Panel A includes teachers from 2011–2012 to 2015–2016. Panels B and C include teachers from 2011–2012 to 2014–2015.
Unsurprisingly, less effective teachers are more likely to turn over. For each of the three measures of teacher effectiveness (average observation score, growth score, and individual growth score), there is a (generally monotonic) negative relationship between effectiveness and turnover. However, the difference in turnover rates between teachers in the highest and lowest effectiveness categories is much larger for observation scores. Teachers whose average observation scores are 1.00 to 2.75 are more than 3 times more likely to turn over than teachers scoring 4.50 to 5.00. By comparison, for growth, the turnover rate of Level 1 teachers is only 24% greater than Level 5 teachers. When constraining growth scores to teachers in tested classrooms, the turnover gap between Level 1 and Level 5 teachers increases to 49%. Even when comparing teachers with the highest observation scores to those with merely below-average observation scores (2.75–3.50), the turnover gap is larger than between Level 1 and Level 5 growth.
Method
Our main analysis examines the extent to which principal effectiveness is associated with higher or lower turnover probabilities among teachers with different performance ratings. The base model is as follows:
where the probability that teacher
The inclusion of school fixed effects
This base model estimates the adjusted correlation between principal effectiveness and turnover among all teachers in a school. To examine whether this estimated relationship changes based on a teacher’s effectiveness, we modify equation 1 to include an interaction term between principal effectiveness and teacher effectiveness:
where the linear combination of
We also investigate the extent to which the relationship between principal effectiveness and teacher turnover changes when we differentiate between categories of turnover. Here, we adjust Equations (1) and (2) to the multinomial case and estimate the probability of each category of turnover outcome (e.g., move to another school in the district, exit from the profession) relative to the same base category, teachers who stay in their positions. Estimating a series of models separately for each of the five turnover categories is preferred to other methods (e.g., multinomial logit) because we include a large number of fixed effects (Angrist & Pischke, 2009).
Results
Principal Effectiveness and Average Teacher Turnover
Table 3 displays the results from estimating versions of Equation (1) with observation and growth scores as measures of teacher effectiveness. For each measure, we estimate four models. Columns 1 and 5 include only year fixed effects, columns 2 and 6 add district fixed effects, and columns 3 and 7 replace district fixed effects with school fixed effects (our preferred specification). Additionally, columns 4 and 8 show the school fixed effects results using the FTTT score as the measure of principal effectiveness. 23
Principal Effectiveness and Teacher Turnover (Baseline Model)
Note. TEAM = Tennessee Educator Acceleration Model; FTTT = First to the Top survey; FRPL = free/reduced price lunch. School-by-year clustered standard errors in parentheses. The dependent variable is whether a teacher leaves their position in the current year. Models include year fixed effects. Degrees refer to teacher’s highest educational attainment.
p < .10. **p < .05. ***p < .01.
Across models, the estimated coefficients for principals’ TEAM ratings are relatively stable. On average, principals with higher TEAM ratings experience lower rates of average teacher turnover, which is consistent with prior findings (e.g., Boyd et al., 2011; Grissom, 2011; Ladd, 2011). Columns 3 and 7, which include school fixed effects, show that a 1 SD increase in principal TEAM rating is associated with a decrease in average teacher turnover in the school of half a percentage point, or roughly 5% of the average turnover rate. When replacing TEAM ratings with FTTT scores, the results are very similar.
Table 3 also shows that the descriptive pattern from Table 2—that less effective teachers turn over more often—holds when adjusting for other teacher and school characteristics. The predicted turnover rate of teachers who receive very low observation scores is roughly 23 percentage points greater than teachers with the highest observation scores, a very large difference. This difference drops to roughly 2 percentage points when comparing the lowest-growth and highest-growth teachers. Turning to other teacher characteristics, Table 3 shows that teachers with higher levels of education are more likely to leave their positions, and that Black teachers are less likely to leave their schools than White teachers. Similar to previous studies of teacher turnover, we find a “U-shaped” relationship between teacher age/experience and the likelihood of turnover; teachers at each end of the age and experience distributions have higher turnover rates than teachers in the middle. The bottom of Table 3 also shows results for school characteristics; these results should be interpreted with caution given that several of the measures are highly correlated (e.g., racial/ethnic and socioeconomic composition).
Principal Effectiveness and Strategic Retention
Table 3 demonstrates that more effective principals experience lower teacher turnover rates on average. Next, we allow for the possibility that schools with effective principals experience higher turnover rates among teachers at different levels of effectiveness. Table 4 shows the results for our school fixed effects model (control variables are omitted for brevity), with each column using a different measure of teacher effectiveness (observation, growth, and individual growth). The linear combination of the main effect and an interaction term yields a distinct marginal effect for teachers with different effectiveness levels. 24
The Interaction Between Principal and Teacher Effectiveness
Note. TEAM = Tennessee Educator Acceleration Model; FTTT = First to the Top survey. School-by-year clustered standard errors in parentheses. The dependent variable is whether a teacher leaves their position in the current year. All models include teacher/school controls. “Obs” indicates observation score.
p < .10. **p < .05. ***p < .01.
For each column in Table 4, the teachers with the highest effectiveness are the reference group, so the TEAM rating and FTTT score coefficients each represent a change in the predicted probability of teacher turnover among highly effective teachers. In every model, the estimated impact of principal effectiveness on the most effective teachers is negative; a 1 SD increase in principal effectiveness translates into approximately a 1 percentage point decrease in turnover for these teachers (p < .01 in each case).
The interaction terms in columns 1 and 4, however, suggest that the association between principal effectiveness and teacher turnover varies by teacher observation score. In contrast, for growth and individual growth, Table 4 shows little evidence of meaningful interaction. Table 5 makes these conclusions more readily apparent by showing the predicted change in turnover rates for a 1 SD increase in principal effectiveness (TEAM rating or FTTT score) across the teacher effectiveness distribution, based on the results in Table 4. For the observation models, the evidence is consistent with strategic retention behavior—principals who receive higher ratings on the administrator evaluation rubric are more likely to retain teachers with high observation scores but less likely to retain teachers with very low observation scores. To be more specific, a 1 SD increase in the TEAM rating is associated with a 1.3 percentage point decrease in turnover among teachers scoring 4.50 to 5.00 but a 2.3 percentage point increase in the likelihood of turnover among teachers scoring 1.00 to 2.75. Compared with the average turnover rates for high-observation and low-observation teachers, these differences correspond to a 12% decrease and 6% increase, respectively. 25
Marginal Effects of Principal Effectiveness on Teacher Turnover by Teacher Effectiveness
Note. TEAM = Tennessee Educator Acceleration Model; FTTT = First to the Top survey. Each column displays coefficients from a separate regression model interacting the principal’s TEAM rating or FTTT score with teacher observation, growth, and individual growth scores. The marginal effects correspond to the results in Table 4. The dependent variable is whether a teacher leaves their position in the current year. “Obs” indicates observation score.
p < .10. **p < .05. ***p < .01.
Figure 1 plots the predicted probability of teacher turnover by average observation score across the range of principal TEAM ratings. As noted previously, teachers with low observation scores have substantially higher turnover rates than teachers with average or above-average scores, irrespective of principal effectiveness, evidenced by the large gaps between the lines for teachers with high and low scores. However, low-scoring teachers in schools with effective principals are more likely to leave than low-scoring teachers in schools with ineffective principals, demonstrated by the upward sloping line for teachers in the lowest category. This finding is consistent with the idea that effective leaders seek to improve the quality of their school’s teaching staff by pushing out low performers. Moreover, more effective principals are better at retaining high-scoring teachers. Moving from low to high principal TEAM ratings, we see increasingly large turnover gaps between effective and ineffective teachers.

Predicted probability of teacher turnover by average observation score and principal effectiveness. TEAM = Tennessee Educator Acceleration Model.
Returning to Table 5, the FTTT score results for teacher observations show similar patterns, though the evidence of increased teacher turnover among teachers with low observation scores is weaker; the marginal effect is positively signed but not statistically distinguishable from zero. An important consideration in comparing TEAM and FTTT is that they likely reflect different aspects of principal performance. While TEAM ratings come from supervisors who evaluate principals throughout the district, FTTT scores reflect teacher perceptions of principal performance within a given school. For the sake of simplicity and to further explore the strategic retention patterns we observe for this measure, our subsequent analyses focus on TEAM ratings as the measure of principal effectiveness.
Teacher Observation Scores Drive Strategic Retention Patterns
The teacher observation score results in Tables 4 and 5 stand in contrast to those for the two growth measures. Although Table 4 shows that more effective teachers by these metrics are more likely to stay in the school, Table 5 shows no evidence of an interaction with principal effectiveness and no strategic retention pattern. Instead, marginal effects of similar magnitude suggest that teachers’ turnover probabilities decline roughly similarly as principal effectiveness increases across levels of teacher effectiveness.
Up to this point, however, we have considered teacher observation scores and growth scores separately. In reality, principals receive both signals, to some degree, for each teacher. Although growth scores are not available until the following fall, principals may monitor proxies for this growth—for example, from student benchmark or interim assessment data—throughout the year. To parse out the relative importance of different signals of teacher performance to principal human capital decision making, next we include observation and growth in the same model. We also include interactions between principal TEAM rating and both observation and growth scores. 26 We focus on teachers with individual growth scores, since principals are unlikely to take the school-level scores assigned to non-tested teachers as signals of individual performance.
Table 6 shows the estimated marginal effects. The combination of 2 interaction terms with 5 levels produces a total of 25 coefficients. To isolate the relative importance of individual growth scores, we can compare the estimated marginal effects between teachers with different individual growth scores, holding observation scores constant. For example, the top row shows the estimated change in the probability of turnover for teachers with low observation scores with Level 1 through Level 5 individual growth scores. Each of the marginal effects is positive and similar in magnitude. 27 For teachers with high observation scores, the marginal effects are negative regardless of level of growth, with no clear evidence that the marginal effects increase for teachers with higher individual growth. 28
Identifying the Relative Importance of Different Measures of Teacher Effectiveness
Note. School-by-year clustered standard errors in parentheses. Results shown are from estimating Equation (2) with interactions between principal TEAM (Tennessee Educator Acceleration Model) rating and both teacher observation scores and individual growth scores. Each cell shows the estimated marginal effect of a 1 SD change in principal TEAM rating on the probability of teacher turnover among teachers with the given observation (rows) and growth (column) scores. Models include school and year fixed effects.
p < .10. **p < .05. ***p < .01.
We interpret these results as suggesting that observation scores, rather than measures of teacher effectiveness derived from student test score growth, drive the patterns of strategic retention. To see this interpretation more easily, Figure 2 plots the predicted probability of turnover for teachers with different observation and individual growth scores, based on Table 6. For simplicity, we compare teachers in the highest and lowest observation score categories who have individual growth scores of Level 1, Level 3, or Level 5. The probability of turnover among teachers with very low observation scores differs greatly from teachers with very high observation scores, and the difference grows larger as principal effectiveness increases. Holding observation scores constant, however, turnover differences among teachers with low, medium, and high individual growth scores are minimal. There is, perhaps, some visual evidence of separation between high- and low-growth teachers in schools with highly rated principals, but these differences are small and not statistically significant. In other words, as principal effectiveness increases, low observation scores predict greater turnover, and high observation scores predict lower turnover, nearly regardless of whether growth scores are high or low.

Probability of teacher turnover by observation and individual growth scores. TEAM = Tennessee Educator Acceleration Model.
Strategic Retention or Strategic Scoring?
So far, we have discussed observation scores as an independent source of information principals use to identify high- and low-performing teachers for the purpose of strategic retention, as if they are collected by an external source. However, these measures are not externally created; although some districts incorporate observations from external evaluators, principals (or other school leaders) usually assign these scores themselves. Given the weak evidence of an interaction with growth scores, a potential alternative explanation of our findings is that effective principals systematically assign lower observation scores to teachers whom they want to push out, regardless of the actual performance of those teachers. Under this interpretation, the observation score represents a removal mechanism—either by triggering a formal administrative process, such as tenure denial, or by providing a discouraging signal to a teacher that leads them to seek employment elsewhere—rather than a source of information regarding teacher effectiveness.
We conduct a number of analyses to investigate this possibility. First, we examine the extent to which more effective principals are more likely to assign low ratings. In Supplementary Appendix Figure 1 (available in the online version of the journal), we plot histograms of the average teacher observation score across each quintile of principal TEAM rating. Overall, the distribution of teacher observation scores is similar across levels of principal effectiveness. In fact, the mean observation score increases in schools with more effective principals. This approach, however, examines both within- and between-school variation, whereas our identifying variation comes only from within-school variation in principal effectiveness. We more formally test the relationship between within-school changes in principal effectiveness and the distribution of teacher observation scores in Supplementary Appendix Table 1 (available in the online version of the journal). Here, we collapse the data into school-by-year cells and estimate the relationship between principal TEAM rating and (1) the school-by-year mean of observation scores, (2) the school-by-year standard deviation of observation scores, and (3) the proportion of teachers who fall into each of our five categories of average observation scores. We find that while within-school increases in principal TEAM rating are associated with small increases in the average teacher observation score, there is no change in the dispersion of observation scores or in the proportion of teachers who receive very low observation scores.
As an additional check, we investigate the extent to which lower scoring teachers are less effective according to other measures, and whether these patterns vary by principal TEAM rating. In Supplementary Appendix Table 2, we show descriptive statistics for teachers across the range of observation scores and by quintile of their principal’s TEAM rating. In addition to two teacher experience measures (first-year teacher and new-to-school teacher), we examine three types of observation score and value-added (TVAAS) measures: prior-year score, average of all prior scores, and career-average scores. Supplementary Appendix Table 2 illustrates two important descriptive facts. First, across each measure, there is a monotonic relationship between experience/effectiveness and a teacher’s average observation score in the current year. In other words, low-scoring teachers are the lowest performers across all measures, and high-scoring teachers are the highest performers. Second, teachers with low (high) observation scores in schools with effective principals are observationally similar to teachers with low (high) observation scores in schools with ineffective principals.
Finally, we replicate our main results using ratings provided only by raters other than the principal himself or herself. 29 In most schools, assistant principals also conduct teacher observations, and we observe a somewhat smaller number of observations conducted by teacher-leaders or someone from central office. In Appendix Table 3, we show marginal effects from turnover models interacting principal TEAM ratings with observation scores, first for observations conducted by assistant principals and then for other (non-school leader) raters. Results are consistent with those shown in Table 5; the highest-rated teachers are less likely to turn over as principal effectiveness increases, while the lowest-rated teachers’ turnover probabilities increase.
Taken together, these analyses suggest that our main results are not driven by differential or strategic scoring of teachers by more effective principals. Instead, they are consistent with the interpretation that more effective principals are more successful at retaining teachers they (and others) observe to be more effective and at encouraging turnover among less effective teachers.
Strategic Retention Across Different Teachers and School Contexts
Our third research question asks to what extent patterns of strategic retention change when looking at different types of teachers and across school contexts? Understanding differences across teachers and schools helps to provide insight into the mechanisms that underlie the descriptive patterns in our main results.
In Table 7, we examine differences in teacher retention across categories of teachers and schools. Each column shows the estimated marginal effects from a separate regression. The first two columns compare results for schools whose principal stays following year t versus schools whose principal leaves following year t. We would expect that principal turnover matters for strategic retention behavior. Principals who do not plan to remain in the school next year may not prioritize prospective human capital management. Alternatively, new-to-school principals may be less equipped to distinguish between high- and low-performing teachers and/or have the means to strategically target retention efforts. Given that most principals do not receive value-added scores prior to the start of the next school year, new principals may also simply have fewer sources of information about the effectiveness of their inherited teachers.
Exploring Heterogeneity in the Marginal Effects of Principal Effectiveness on Teacher Turnover
Note. TEAM = Tennessee Educator Acceleration Model. School-by-year clustered standard errors in parentheses. Each column displays the marginal effects from a separate regression model interacting the principal’s TEAM rating with a teacher’s average observation score. Models include school and year fixed effects and the full set of teacher/school controls. The dependent variable is whether a teacher leaves their position in the current year.
p < .10. **p < .05. ***p < .01.
The results in Table 7 (columns 1 and 2) are consistent with the idea that principal turnover disrupts strategic retention behavior. When limiting the sample to schools without a principal transition in a given year, patterns of strategic retention become stronger in comparison to the pooled models. In contrast, limiting to schools whose principals leave at the end of the year, the marginal effects across categories of observation scores are statistically similar. In other words, patterns of strategic retention are found solely in schools in which the principal returns next year, where the effects are even more pronounced than in the pooled sample.
Next, we examine whether there are differences between novice teachers (0–4 years of experience) and veteran teachers (5+ years of experience). In Tennessee, administrative means are more available for early-career teachers who have fewer due process protections, particularly in the wake of tenure reforms in 2011 (Wesson, 2012). Effective principals may be more adept at using these administrative options to remove low-performing novice teachers from their schools. Novice teachers may also be more responsive to informal means, such as “counseling out” by the principal. Columns 3 and 4 show the results for novice and veteran teachers. We find little evidence that the observed patterns differ between these two groups of teachers, which suggests that our results are not completely driven by administrative removal.
The ability or willingness of principals to strategically target teachers for retention or turnover may also vary across school contexts. Differences in school characteristics, for example, may proxy for differences in the labor market supply of teachers or in the accountability context for the school. For instance, principals who consistently face teacher shortages may not seek to “push out” ineffective teachers, but rather focus their efforts on retaining high-performers. To investigate these differences, we estimate separate models across three context measures: average school achievement, percentage of students qualifying for free/reduced price lunch, and locale type.
Columns 5 and 6 show the results for schools in the lowest and highest quintile of average school achievement, respectively. Here, we find important differences between high- and low-achievement schools. The marginal effect of increasing principal effectiveness on teacher turnover for teachers with very low observation scores is positive and significant in high-achievement schools, but virtually zero in low-achievement schools. Furthermore, the marginal effect in high achievement schools is roughly three times greater in magnitude than the pooled model estimate in Table 5. Among teachers with the highest observation scores (4.50–5.00), neither of the marginal effects is statistically significant. For the middle of the observation score distribution, we find consistent evidence of greater retention in low achievement schools with more effective principals. In comparison with the pooled models, the estimated relationship between principal effectiveness and teacher retention is substantially larger for low-achievement schools, which highlights the importance of school leadership in promoting stability in these schools. On the other hand, we find limited evidence that more effective principals in high achievement schools experience lower teacher turnover rates.
Columns 7 through 9 group schools by the percentage of students who qualify for free/reduced price lunch, a proxy for student poverty. While FRPL and school achievement are highly correlated (r = −0.80), school achievement may not fully capture differences between high- and low-poverty schools. However, our results are similar to those in columns 5 and 6; effective principals in high-poverty schools are particularly adept at retaining effective teachers, while effective principals in low-poverty schools experience greater turnover among the lowest-scoring teachers.
School locale may also be an important determinant of teacher labor market dynamics. Urban and suburban schools, through their proximity to population centers, may have greater access to replacement teachers than rural schools. They may also have greater access to data management systems (e.g., data dashboards) and other supports principals to facilitate the use of teacher effectiveness data in making talent management decisions (Goldring et al., 2014; Grissom, Rubin, et al., 2017).
Columns 10 through 12 group teachers by school locale: urban, suburban, and town/rural. 30 The results are most striking for suburban schools—the marginal effects are positive and significant for the lowest-scoring teachers. A 1 SD increase in principal TEAM rating predicts an 8.8 percentage point increase in the likelihood of turnover among very low-scoring teachers in suburban schools. Effective principals in suburban schools also experience lower rates of turnover among very highly rated teachers. Results for urban schools are similar in direction, though the marginal effect for the lowest scoring teachers is not statistically significant at conventional levels. Finally, effective principals in town/rural schools experience lower rates of turnover among high-scoring teachers, but no differences for low-scoring teachers.
Together, these results demonstrate that school context is an important aspect of the relationship between principal effectiveness and teacher turnover. In terms of pushing out low-performing teachers, the patterns are strongest in low-poverty, suburban schools, which could reflect labor market dynamics; principals in more advantaged schools may worry less about finding quality replacements for teachers who leave, leading them to focus on pushing out low performers. In contrast, principals in rural settings may instead focus on retaining teachers, since the cost or uncertainty associated with finding and hiring replacement teachers is higher.
Strategic Retention: Administrative Action or “Counseling Out?”
In Table 7, we showed that patterns of strategic retention are similar when comparing novice and veteran teachers, which suggests that formal processes may not be the primary driver of increased turnover among low-performers. To further examine this distinction, we restrict our model to teachers who received Level 3 or higher on their summative evaluation rating. Because we exclude teachers who are most vulnerable to administrative removal (Level 1 and Level 2), any remaining evidence of strategic retention would suggest that less formal means, such as counseling out, likely contribute to the patterns of strategic retention found in the main results.
We show these results in Table 8. When limiting the analysis to Level 3 through 5 teachers, effective principals’ patterns of strategic retention are consistent with the main results. For example, a 1 SD increase in principal TEAM rating predicts a 2.9 percentage point increase in the likelihood of turnover among low-scoring teachers, which is 8% of the average turnover rate for such teachers. The results are nearly identical when restricting the analysis to medium- and high-growth teachers, including when we examine teachers in tested grades and subjects.
Examining Patterns of Strategic Retention Among “Harder to Remove” Teachers
Note. LOE = level of overall effectiveness. School-by-year clustered standard errors in parentheses. Each column displays coefficients from a separate regression model interacting the principal’s TEAM (Tennessee Educator Acceleration Model) rating with a teacher’s average observation score. Models include school and year fixed effects and the full set of teacher/school controls. The dependent variable is whether a teacher leaves their position in the current year.
p < .10. **p < .05. ***p < .01.
These patterns suggest that strong principals’ strategic retention behavior—specifically, the failure to retain ineffective teachers—is not completely attributable to administrative means of removal. That is not to say that principals abstain from administrative means to remove teachers, given the substantially higher turnover rates among ineffective teachers. Rather, more effective principals may be better able to remove ineffective teachers through informal means, while also retaining effective teachers.
Strategic Retention by Types of Teacher Turnover
The prior analyses operationalize teacher turnover as a binary outcome. We next examine whether the relationship between principal effectiveness and teacher turnover changes when differentiating among types of turnover. This analysis is important for more fully understanding the consequences of strategic retention. If effective principals release low-performing teachers only to see them move to other district schools, for example, they may be contributing to a “dance of the lemons” rather than improving the overall quality of teachers in the school district.
Table 9 shows the results of the multinomial analysis. Each of the four turnover outcomes is estimated in reference to the base category (teachers who stay in their school). Panel A shows the regression results (again, with control variables omitted for brevity), while Panel B shows the estimated marginal effects of a 1 SD increase in principal effectiveness for teachers with different average observation scores.
Multinomial Teacher Turnover Results
Note. OLS = ordinary least squares; TEAM = Tennessee Educator Acceleration Model. School-by-year clustered standard errors in parentheses. Models include school and year fixed effects and the full set of teacher/school controls. The dependent variable is the type of teacher turnover listed at the top of the column.
p < .10. **p < .05. ***p < .01.
Our earlier analysis showed that low-scoring teachers are more likely to turn over than average- or above average–scoring teachers. Table 9 shows that this pattern holds for all types of turnover, but that the largest increase is in the probability of exiting the education system. Additionally, the higher overall turnover rate of Level 1 teachers with effective principals is largely explained by exits and across-district moves (Panel B). A 1 SD increase in principal TEAM rating raises the predicted probability of exiting the education system by 1.9 percentage points for low-scoring teachers and the probability of across-district transfer by a similar amount. In contrast, there is no evidence of an interaction in the within-district transfer column, and no evidence that effective principals are merely successful at shuffling teachers with lower observation scores to other schools in the district.
Among high-scoring teachers, effective principals have lower rates of within-district teacher transfers. A 1 SD increase in principal TEAM rating lowers the predicted probability of within-district transfer by 0.9 percentage points for teachers with the highest observation scores. This decrease is substantial—roughly 20% of the within-district transfer rate for these teachers.
Discussion and Conclusions
In recent years, many states have altered the policy environment around teacher evaluation and dismissal to explicitly encourage retention of effective teachers and “deselection” of ineffective teachers (Hanushek, 2009). Yet principals drive talent management (Goldring et al., 2014), and making the policy environment more conducive to strategic retention does not guarantee that it will be used to shape the teacher workforce.
Our results suggest that strategic retention is evidenced among more effective principals. As in prior work (e.g., Boyd et al., 2011), increases in principal effectiveness are associated with lower average teacher turnover rates, but, we find, focusing on turnover responses for the average teacher masks important differences between effective and ineffective teachers. High-performing teachers, measured both by classroom observation and value-added scores, are less likely to leave schools with effective principals. Yet teachers who receive low observation scores are in fact more likely to leave schools with effective principals, regardless of whether they have high or low value-added scores.
This evidence suggests that principals may rely more heavily on classroom observation scores than student test score growth to target high and low performers, which is consistent with prior qualitative investigations of principals’ human capital decisions (Goldring et al., 2015). Principals conduct observations themselves and can easily access scores assigned by themselves or other members of their leadership team throughout the school year. Moreover, observation information is available for every teacher. In contrast, although principals can monitor student progress through benchmark testing or other low-stakes assessments during the school year, teachers’ growth scores are not directly observable until well into the next school year, and, in any case, are only relevant for teachers in tested classrooms. In this light, the primacy of observation scores for strategic retention is perhaps unsurprising.
A key implication of our analysis is that the school principal is an important—and, we would argue, largely ignored—component of the theory of action linking large-scale teacher evaluation systems to the reshaping of the teacher workforce through selective teacher retention, which often is used as justification for these systems (Hanushek, 2009). Inattention to variation in the capabilities of principals to collect actionable observation data and—more to the point of our analysis—act strategically in response to those data may be undercutting states’ and districts’ investments in teacher evaluation systems. On the other hand, our analyses also speak to conclusions that because few teachers receive low overall evaluation ratings that trigger dismissal or other administrative action, investment in teacher evaluation systems are resulting in little change (see Kraft & Gilmour, 2017). In fact, effective principals who deem a teacher low performing via classroom observations systematically see those teachers leave their schools, and often the profession altogether, even when aggregation with other components of the evaluation system means that the overall level of effectiveness does not trigger administrative action.
An important addendum to these conclusions is that school context matters for strategic retention. For example, suburban schools appear better positioned than their urban and especially their rural counterparts to strategically retain teachers of different performance levels. With fewer applicants for open positions, rural schools may have fewer options for replacing low performers, meaning that it may in fact be strategic—albeit by a different definition than the one we have employed—for some rural principals to retain low performers rather than leave a position open or fill it with an undesirable alternative. Schools where the principal turns over also behave differently, with strategic retention patterns essentially driven only by schools that retain their principal from one year to the next, suggesting one potential reason that principal stability is associated with more positive school outcomes (Miller, 2013).
Attention to state context is also important. Our results come from Tennessee, a state which has invested heavily in a statewide evaluation system for teachers and principals. Such a system may be a necessary condition for strategic retention (TNTP, 2012). Meaningful teacher evaluation systems create opportunities to observe and evaluate representative teaching; the absence of such a system creates barriers to dismissal of low-performing teachers and to targeting resources, such as useful feedback or opportunities for teacher leadership, toward retention of high performers. Our results may not be generalizable to other state contexts, particularly those without such developed educator evaluation systems. An additional limitation concerns the simultaneity of principal evaluation and teacher retention decisions. Our results potentially could be driven by principal supervisors who observe strategic retention behaviors, thereby informing higher principal ratings. While Tennessee’s administrator rubric does not explicitly evaluate strategic retention practices, it does include some indicators related to human capital and resource management. However, we believe it unlikely that any such bias is very large, given that district evaluators do not differentiate among aspects of principal performance on the observation rubric, which covers many aspects of principal leadership beyond talent management (Grissom et al., 2018).
Understanding the mechanisms of strategic retention is an important avenue for future work. While we present evidence that effective principals pursue such strategies, we cannot speak directly to their specific strategic behaviors because we cannot observe them. As we show, increased turnover among ineffective teachers under effective principals likely cannot be fully explained by administrative means (e.g., contract nonrenewal), but we need to know more about how principals use informal means to encourage (directly or indirectly) low-performing teachers to leave. At the other end of the distribution, future work might also investigate the strategies through which effective principals influence the retention of high-performing teachers. Such strategies may be particularly helpful to principals in low-achieving and high-poverty schools that have substantial challenges in retaining their most effective teachers.
More generally, our findings speak to the importance of achieving a better understanding of principals’ human resources or talent management roles in their schools. Future work might explore the teacher hiring, assignment, and development strategies of effective principals. Prior research shows, for example, that high-growth schools hire teachers and place them across tested and untested classrooms differently than other schools (e.g., Grissom, Kalogrides, & Loeb, 2017; Loeb et al., 2012), but research has only begun to examine the specific ways principals—especially effective principals—shape and engage in their schools’ talent management processes.
Supplemental Material
DS_10.3102_0002831218797931 – Supplemental material for Strategic Retention: Principal Effectiveness and Teacher Turnover in Multiple-Measure Teacher Evaluation Systems
Supplemental material, DS_10.3102_0002831218797931 for Strategic Retention: Principal Effectiveness and Teacher Turnover in Multiple-Measure Teacher Evaluation Systems by Jason A. Grissom and Brendan Bartanen in American Educational Research Journal
Footnotes
Notes
J
B
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
