Abstract
This paper describes a process to develop and trial new metrics in South Africa to quantify school leadership and management practices or processes that are considered theoretically related to literacy outcomes. The predictive validity of these measures is assessed in challenging contexts, including 60 township and rural primary schools in South Africa. We observe a randomness to how better leadership and management practices are distributed across better and worse performing schools. Regression analyses confirm weak and inconsistent linkages between measured leadership and management dimensions and literacy outcomes across the sample. However, we find evidence of stronger linkages with intermediate outcomes, including monitoring curriculum coverage. This research contributes to a burgeoning, yet underdeveloped literature on educational management and leadership in Africa and the challenges of measurement in this context.
Introduction
Since the 1960s, there has been significant growth in the global knowledge base on school leadership and management (SLM). However there is an acknowledgement of the limits of this knowledge, with ‘far less systematic knowledge on how school leaders carry out their roles in developing nations throughout the world’ (Hallinger, 2017: 363). In Africa, educational management and leadership literature is in an emergent phase, with contributions dominated by one or two countries. Even South Africa, as the largest country contributor to the African literature, derives its knowledge base predominantly from qualitative research studies, with case-study locations limited to a few schools (Bush and Glover, 2016; Hallinger, 2017). There is limited understanding of the empirical linkages between SLM and learning outcomes in the country and Africa more broadly (Hallinger, 2017; Hoadley et al., 2009).
Yet there has been a long-held view that much of the unexplained variation in learner performance in South Africa, particularly across historically disadvantaged school contexts, may be attributed to differences in SLM (Crouch and Mabogoane, 1998, 2001). Subsequent to this finding, school effectiveness studies began to include proxy indicators of SLM into education production function models of learning (see Wills, 2019 for a review). These studies suggest there is educational value in protecting and extending learning time (Gustafsson, 2007; Van der Berg and Louw, 2006); managing the procurement and inventory of books and stationery (Kotze, 2017; Shepherd, 2016; Spaull, 2013; Van der Berg, 2008); quality assuring tests; monitoring learner results; and, importantly, managing and monitoring time-on-task, opportunity-to-learn and curriculum coverage (Carnoy et al., 2015; Taylor, 2011; Taylor and Prinsloo, 2005).
These are useful insights, but the inclusion of these indicator variables in production function models has been far from sufficient in accounting for large unexplained variation in learning initially attributed to management. Furthermore, many SLM indicators that one would assume to matter for learning have been found to be insignificant (Gustafsson, 2005; Van Staden and Howie, 2014). Rather than operating through strong direct effects, this raises the possibility that SLM may interact with learning in indirect ways, with small direct associations (Hallinger and Heck, 1998). However, a question arises as to whether this null hypothesis holds if more rigorous efforts are applied to quantifying SLM, including the development of new tools and centring measurement on clearer conceptual frameworks.
In this paper we describe a process to develop and trial new metrics for codifying SLM practices or processes that may be linked to better literacy in poor South African schools. This is centred on a ‘leadership for literacy’ conceptual framework (Hoadley, 2018) where we prioritise six categories of resources available to school leaders in shaping a literacy learning environment in South Africa. Our first objective is to quantify the availability, use and deployment of these resources by school leaders. The second objective is to assess the reliability and validity of derived ‘leadership for literacy’ resource indices. The data for this process were gathered from 60 township and rural public schools in three South African provinces. In addition to SLM data, we collected reading and literacy scores for grade 3 and 6 students to establish whether our ‘leadership for literacy’ indices are predictive of literacy levels and gains using multivariate regression modelling. At this outset, we acknowledge that we can only identify correlational rather than causal relationships in the analysis. The qualitative case-study component of the wider mixed-methods study drilled further into the generative mechanisms underlying the quantitative findings (as discussed extensively in Taylor et al. (2019)). This paper aims to give specific treatment to the quantitative research process, analytical strategy and quantitative results.
The paper contributes to a knowledge base on leadership and management in challenging primary school contexts in Africa. It highlights that, despite efforts to measure SLM more rigorously, weak and inconsistent associations with literacy are still found in poor primary school contexts. Nevertheless, somewhat stronger associations are evident with intermediate outcomes such as monitoring curriculum coverage.
The next section provides background on measuring SLM and its linkages with learning. We consider three different approaches to instrument development and identify the implications of existing study findings for our methodological approach. We then provide more details on our methodology and data collected. This is followed by a discussion of the reliability and validity of the six ‘leadership for literacy’ indices, particularly focusing on predictive validity, before the final section concludes.
Background
Internationally, qualitative studies yield support for the educational value of leadership, particularly when framed from an instructional leadership perspective (Leithwood et al., 2004; Robinson et al., 2008). Paradoxical results are however found in quantitative studies. A meta-analysis by Robinson et al. (2008) and Marzano et al. (2005) find moderate to strong correlations with learning. However, in a well-cited meta-analysis of 37 multi-national studies by Witziers et al. (2003), correlations between leadership and student outcomes are considered to be weak. This echoes earlier reflections by Hallinger and Heck (1998) that the effect of SLM on learning is small and indirect, mediated through alternate mechanisms such as the work of teachers.
Robinson et al. (2008), in attempting to explain these paradoxical results, argue that they should be examined against various other possible explanations for the predominance of null or weak associations. These include the use of narrow theoretical frameworks in conceptualising SLM; small sample sizes; and limited detectable variation in SLM practice that compromises the identification of direct effects (Robinson et al., 2008). Instruments may also be inadequate to measure SLM, particularly in low-income settings (Lemos and Scur, 2017). The limits of existing instruments to capture leadership and management quality have been suggested as econometricians begin to isolate out large causal ‘principal effects’ (Branch et al., 2012; Grissom et al., 2015a) or identify significant and large positive causal impacts of SLM interventions on learning (Fryer, 2017; Tavares, 2015).
Despite burgeoning empirical work on the determinants of learning outcomes in South Africa, studies have not aimed to capture SLM as a fuller construct (one exception is a small qualitative study by Taylor et al. (2013)). It has not been possible to determine whether the weak explanatory power of SLM indicators in predicting learning outcomes in historically disadvantaged schools is due to poor measurement of SLM or a lack of variation in existing SLM proficiencies or learning outcomes across schools. In general, too little attention is given to research methodology and the development of effective assessment tools in establishing a valid knowledge base underlying the practice of educational leadership and management (Heck and Hallinger, 2005: 232).
Approaches to quantifying SLM
Three broad sets of instrument-based approaches are typically used in existing studies to quantify SLM, including: detecting leadership behaviours or effectiveness with self-administered questionnaires; quantifying school leaders’ allocations of time to different tasks; and assessing SLM competencies using scoring rubrics.
Here we identify the advantages and disadvantages of each approach and how findings from existing studies adopting these approaches informed our study design.
Self-administered questionnaires are commonly used to evaluate leadership behaviours and effectiveness (for a list of instruments see Condon and Clifford (2012)). Instruments may vary in terms of their reliability and validity (Goldring et al., 2009) and even rigorously developed instruments have shortcomings. Yet there is still evidence that many of these instruments provide measures associated with learning.
Two key reflections from these studies have implications for our design. First, ratings tend to be inflated, even when provided by supervisors or peers, so that less than desirable variation is detected in leadership across schools (Grissom et al., 2015b: 22; McCullough et al., 2016). But data collected from teachers are typically more reliable compared with principal self-reports or supervisor reports (Hallinger, 2008: 16). Collecting responses from teachers in addition to school leaders is therefore a preferred approach. Second, the strength of associations with learning may vary depending on the subject and grade level in question (Grissom et al., 2015b; McCullough et al., 2016). This calls for learning-centred leadership theories where leadership is conceptualised with a subject-specific and grade-specific end in mind (Stein and Nelson, 2003).
Existing studies also quantify the work of school leaders in terms of time allocations to different tasks measured through work activity analyses, structured observations or self-reported activity logs (Camburn et al., 2010; Grissom et al., 2015b; Horng et al., 2009; Lee and Hallinger, 2012). These studies highlight that SLM tasks comprise a diverse set of functions, but allocating more time to some tasks over others may yield higher returns to learning (Grissom et al., 2015b). Time use and allocations to tasks may also vary within leaders in the same school, within the same leaders over time (Camburn et al., 2010) and across societies as a result of differential economic, socio-cultural and institutional dynamics (Lee and Hallinger, 2012). In response, our SLM assessment was designed to be contextually appropriate for the South African public schooling context. Data are collected about more than one leader or manager in a school and at two different time points.
But how does one combine data from different times, several respondents and different questionnaires? A predetermined scoring rubric was developed for this purpose to facilitate data triangulation. Rubrics are increasingly used internationally to quantify competencies in education management, assessment, or systems technologies (Arcia et al., 2015; Bloom and Van Reenen, 2010; Lemos and Scur, 2017). Guided by rubric descriptions, different data sources can be combined to assess how institutional practices or processes compare to described proficiency standards.
A well-known scoring rubric is the World Management Survey (WMS). Its application in large studies reveals that higher levels of formalisation of school management practices and processes are associated with better learning outcomes and teacher practices, even in developing countries (Bloom et al., 2015; Crawfurd, 2017; Scur, 2017). But this approach depends on using high-level researchers to conduct interviews with head teachers and in turn establishing SLM proficiency scores. This may be costly and infeasible in environments with few available researchers speaking indigenous languages. We aimed to use a rubric approach but to limit costs and improve scalability by using fieldworkers rather than experienced researchers to collect data. Close-ended questions are relied on more heavily in our approach to reduce the need for value-judgements.
Method and data
SLM measurement approach
Our SLM measurement approach was operationalised in four phases as shown in Figure 1, including the development of a theoretical framework, its empirical application across interconnected qualitative and quantitative fieldwork components, and final index construction.

Four phases of a mixed-methods study to develop indices of leadership and management.
Phase 1 – Theoretical framework
A ‘leadership for literacy’ framework underpinned our study design (Hoadley, 2018), where its literacy focus is informed through two considerations. The first is the notion that leadership effectiveness is deeply connected to leaders’ subject-specific knowledge resources (Burch and Spillane, 2003; Stein and Nelson, 2003). Second, the literacy focus is contextually appropriate where an inability to read for meaning pervades the South African schooling system, compromising individual and economic advancement. Almost eight of every ten South African grade 4 students cannot read for meaning in any language, as revealed in the 2016 Progress in International Reading and Literacy Study (PIRLS) (Howie et al., 2017). South Africa was ranked last of the 50 countries participating in PIRLS 2016 (Mullis et al., 2017). As Murphy (2004) emphasises, in low literacy contexts leadership should be leveraged to move new knowledge on literacy into classrooms and schools. In this vein, our framework – informed through a review of school effectiveness and leading for reading literature (see Hoadley, 2018) – identifies six clusters of resources available to leaders in promoting literacy in disadvantaged South African schools: material resources (time): the allocation of time for teaching language and reading, as well as protecting and maximising the use of time for learning; material resources (presence of text): the prioritisation and procurement of textbooks,
1
graded readers and library resources to support a programme of reading; material resources (use of text): the optimal utilisation of these text resources for reading; knowledge resources: within-school expertise in understanding the value of reading and the methods to teach it effectively, as well as the extent to which this understanding is shared by school leaders and teachers; human resources: the effective recruitment, utilisation and development of teaching expertise, particularly in reading and language; and strategic resources: the mobilisation of available resources to drive a coherent literacy programme.
Each of these resources is expanded on in Table 1. This is by no means a comprehensive list of factors that may be important for literacy development. Murphy (2004) for example considers ten sets of resources in his ‘leadership for literacy’ meta-analysis. But in developed countries, with different schooling systems and persistent shortages of resources and skills, prioritising a few factors that may promote baseline levels of functionality and learning is instructive. Furthermore, where much of the research on leading for literacy emanates from developed countries, the issue of basic resources tends to be downplayed (Lee and Zuze, 2011). Pretorius and Machet (2004) for example draw attention to three obvious resources that define the context for literacy accomplishment in disadvantaged schools: instructional time, books and good teachers.
Six ‘leadership for literacy’ dimensions with sub-dimension descriptions.
We were interested in quantifying both leadership and management functions despite these being distinct concepts. The theoretical framework assumes these functions are shared across individuals in the school organisation (Spillane et al., 2004). This is in line with South African school policy which envisages SLM as distributed across school management team members comprising the principal and middle managers (deputy principal and heads of department) (Department of Basic Education (DBE), 2016).
Phase 2 – Qualitative process
While we do not report on the qualitative findings in this paper, this process was critical to aligning the quantitative instrument development process with the theoretical framework. Case studies were conducted in four pairs of high and lower achieving schools selected from the larger 60-school sample in township and rural areas. Three to four days were spent in each case-study school, where visits were led by a team of four South African-based education specialists. Semi-structured interviews with principals, teachers and middle managers were conducted in these schools. Interview findings were augmented with classroom and school observations, focusing on identifying factors that support or hinder a literacy-learning environment. This case-study process generated thick descriptions to augment the development of rubric descriptions as described in the next phase, and in turn the writing of questions incorporated into quantitative instruments.
Phase 3 – Quantitative process
Developing the scoring rubric involved mapping each of the six resource dimensions from the theoretical framework into descriptions of SLM proficiency. These dimensions were further broken down into sub-dimensions which comprise individual rubric elements (see Table 1). In total, scoring descriptions were written for 114 rubric elements. As an example, Table 2 illustrates how rubric descriptions relate to quantitative scores of 1 (low) to 5 (high) for three elements under the human resource dimension.
Example of scoring rubric descriptions for three elements under the human resource dimension.
Close-ended questions were written to generate variables necessary to evaluate how a school should be scored on each rubric element. Relying on close-ended rather than open-ended questions reduces the need for probing and research judgements from fieldworkers. The questions were allocated across six questionnaires, which can be administered during one school day. These include interviews with the principal, deputy principal, a grade 3 teacher and grade 6 teacher; a self-administered survey for all teachers; and a school observational instrument. Observational questions required fieldworkers to verify the existence of factors such as book resources in classrooms, the state or existence of school libraries, the availability of management documents (e.g. timetables, assessment plans and financial documents) and the presence of teachers in classrooms.
Before finalising instruments, they were subjected to multiple stages of piloting. We initially trialled instruments in four schools located across three provinces, then revisions were made and we re-piloted instruments in another three schools. Additionally, all instruments were reviewed by four former principals and two district officials working in rural school contexts. We also engaged in cognitive piloting of the teacher survey in two peri-urban schools in KwaZulu-Natal.
A dedicated fieldworker, trained over a four-day process on the instruments, was responsible for conducting SLM interviews and collecting observation data for each school. The training of fieldworkers was supported through an in-school simulation day. Senior project researchers were present during the school simulation to monitor the quality of interviews and data collection process.
Phase 4 – Index construction
Once data were collected, a data-coding process combined variables to ‘objectively’ score each rubric element. The scoring is objective in the sense that the data determine each school’s score on a rubric element rather than a researcher using a value-judgement. In total, over 500 variables collected across the six instruments were used to code 114 rubric elements. Finally, the six ‘leadership for literacy’ dimensions were obtained using principal components analysis (PCA) to weight each element in terms of the variation each element explains in an underlying unobserved factor.
Reliability
Reliability was embedded into the measurement process by collecting data from different sources and respondents. Almost half of the elements are coded using data that are triangulated in some way; for example, using responses from multiple respondents. Of the 114 elements, 42 rely on data that require the fieldworker to observe evidence of practice. Due to potential inaccuracies in self-reported data from teachers and school leaders, we also constructed a second set of combined indices only using elements coded from observational or ‘evidence-based’ data. (The supplementary materials provide distributional plots of these two sets of ‘leadership for literacy’ indices.)
Although we collected data at two points in time, we did not administer two rounds of the same instruments, so that evaluating the test–retest reliability is not possible. However, we do test whether the indices discriminate in ways that we would expect, such as observing better practices in urban as opposed to rural schools or in the least poor schools in the sample.
Our theoretical framework is multi-dimensional in its conceptualisation. Conducting tests of internal consistency, such as Cronbach’s alpha, which assume construct uni-dimensionality, are not appropriate (Peters, 2014).
Content and face validity
Methods to establish the validity of the measurement approach were incorporated into each of the four phases in Figure 1. The literature review, its qualitative application in schools, the rubric development process and several rounds of instrument piloting were critical to establishing content validity. The mixed methods design also allowed for face-validity checks (i.e. at face value, do our metrics measure what they claim to measure?). We also evaluated the predictive validity of the SLM measures. The method and analytical strategy to do this are now discussed.
School sampling
A possible reason for weak associations between SLM and learning is that study samples lack detectable variation in both student learning and SLM practices. In addressing this concern, the 60 schools were purposively selected to artificially add as much student performance variation as exists in the sampling frame of public schools reaching poorer populations in three of nine South African provinces (KwaZulu-Natal, Gauteng and Limpopo). As described in Wills (2017), we initiated an intensive search for the 30 best performing schools in these provinces that charge minimal or no school fees. 2 This involved identifying top-performing schools in system-wide testing data – namely the Annual National Assessments. We corroborated these data against several recommendations of ‘good’ no-fee schools collected from various sources. Then 30 lower performing schools, in similar geographic locations, were matched to the higher performing schools. School selection was also informed by whether the dominant student language in the school was Zulu, Sepedi or Xitsonga.
Literacy data
Grade 3 and 6 students were tested in these 60 schools at the beginning of 2017. Pre-testing was necessary to verify school literacy performance levels at project inception and to guide the selection of matched school pairs for the qualitative case-study work. We then re-tested the same learners towards the end of the same school year (i.e. post-test).
The grade 6 English literacy test, administered to an entire class of grade 6 students in each school, consisted of two silent reading comprehension passages (released items from grade 4 PIRLS assessments 3 ) and a vocabulary test. From the original sample of 2656 grade 6 students, 2379 also wrote the post-test, implying a low attrition rate of 11%.
Additionally, pre- and post-test English Oral Reading Fluency (ORF) scores are available for 599 grade 6 students (drawn from roughly 10 randomly selected students in each school). We have African language ORF scores for these students but only for the end of 2017. A battery of reading and literacy tasks in English and African language were also administered to 10 randomly selected grade 3 students in each school. Pre- and post-test scores are available for 631 grade 3 students.
Analytical strategy
The predictive validity of the ‘leadership for literacy’ indices is assessed using an education production function framework. Literacy outcomes are expressed as a function of each ‘leadership for literacy’ index,
Here
Equation (1) estimates levels of learning and is unlikely to control for unobserved factors such as student ability or historical exposure to different factors which may contribute more to the learning process than observable school factors. For this reason, a value-added model which includes a pre-test score
Finally, we also explore whether associations exist between our indices and intermediate outcomes.
In equation 3, intermediate variables vary at the school level (not the student level), so we also average teacher and student characteristics (Ss
) to the school level. The four intermediate variables considered include: coverage of work in the best grade 6 learner’s exercise or workbook (expressed in centiles); the percentage of classrooms with a present teacher and students engaged in a learning activity; the percentage of teachers indicating their head of department conducts at least biweekly checks of their curriculum coverage; and an index of average teacher ‘employee engagement’. An engagement score is generated for teachers in each school, averaged to the school level and normalised. The score is constructed from eight questions related to work enjoyment, satisfaction with the school management team, sense of safety, stress levels, receiving praise or recognition, and feeling valued.
Results
Reliability
One way to assess whether the ‘leadership for literacy’ indices are reliably measuring what we intend them to is to compare index scores across rural and urban schools, and across the poorest 20% of schools and the least poor 20% of schools. In general, we would expect higher scores for urban and wealthier schools compared to rural and poorer schools.
The expected patterns hold for four of six indices, as seen in Table 3. However, the human resources and strategic resources indices are higher in poorer than wealthier schools – a pattern we would not expect. Limiting the indices to being constructed from ‘evidence-based’ elements yields patterns on the strategic resources index that are somewhat more aligned with expectations.
‘Leadership for literacy’ indices by location and school wealth.
Note: Differences are statistically significantly different at *** p < 0.01, ** p < 0.05, * p < 0.1.
Face validity
In a mixed-methods paper we examine convergences and divergences between the quantitative data collected through our instruments and the qualitative findings for eight topics that were probed in the case-study schools (Taylor et al., 2019 ). Qualitative and quantitative findings only converged for two of eight topics compared. The convergence on these two topics was not surprising as they could be assessed using observational data. Measuring the other six topics requires collecting self-reported recall or perception-based responses as they cannot be directly observed. This opens the door for less reliable socially acceptable responses. Given these face-validity concerns, we assess whether predictive validity improves if we only construct indices from observational data (i.e. excluding elements derived from self-reported or recall responses).
Predictive validity
Descriptive findings
Overall, systematically better SLM practices are not detected in better performing schools in our sample. This is initially seen in Figure 2, which plots the raw correlation between the six ‘leadership for literacy’ dimensions and grade 6 English literacy test scores in reading comprehension and vocabulary (expressed in z-scores) at the 50th and 90th percentile. The lack of any systematic relationship for at least five of the six dimensions is evident. While the material presence of text in the school shows a positive association, in the multivariate estimations this is absorbed by controls for socio-economic status.

Median and 90th percentile grade 6 English literacy scores plotted against six ‘leadership for literacy’ indices for 60 schools.
Multivariate estimation of literacy outcomes
The overall impression of no or weak associations between the ‘leadership for literacy’ indices and learning is confirmed in the multivariate analyses, seen in Tables 4 and 5. If positive and significant results do emerge, these are inconsistent across grades and different test outcomes.
Estimating grade 6 literacy outcomes.
Source: Leadership for literacy dataset, 2017.
Notes: Standard errors are in parentheses and clustered at the school level. Significant at *10% level, **5% level, ***1% level.
Each cell represents a regression of each individual index on the outcome variable.
The pre-test control in estimating A) English literacy scores and B) English ORF scores is the students’ pre-test results on the same tests. The pre-test control in estimating C) African language ORF is the student’s pre-test English ORF score because no pre-test is available in African language ORF. In estimation C) a control is also added to reflect the African language in question. For a description of individual, home and school controls see the discussion under ‘analytical strategy’.
WCPM = words read correctly per minute.
Estimating grade 3 literacy outcomes.
Source and notes: See Table 4. In estimations of African language outcomes, we control for the language in question.
In Table 4 we estimate three grade 6 post-test outcomes: English literacy test scores expressed in z-scores; English ORF scores expressed in z-scores; and African language ORF scores expressed as the percentage of words read correctly per minute (% of WCPM). Table 5 shows estimates for two grade 3 literacy post-test outcomes: English and African language ORF (% of WCPM).
In each table we only show the coefficient estimate on each index entered individually in the regressions. We also show coefficients on pre-test literacy outcomes in the value-added models. Model 1 reflects the raw correlation between each index and literacy outcomes (i.e. excludes controls) while model 2 includes individual, classroom and school-level controls.
The limited explanatory power of the indices in estimating grade 6 literacy outcomes is evident in the low R-squared values in Table 4, model 1. Most of the time, no significant coefficients are observed on any of the indices. An exception is the significant and positive coefficient of 0.1 standard deviations on the human resources index in the fully controlled, value-added estimations of grade 6 literacy outcomes.
Returning to the issue of reliability, the exclusion of self-reported data in index construction does not significantly improve the predictive validity of the indices. We tested this by re-running all estimations in Tables 4 and 5 but using indices constructed from observational or evidence-based data only (results are shown in supplementary tables). We also find no evidence that the effect of the indices is mediated through the presence of knowledge resources.
Each rubric element is also included individually in estimations as they may be predictive of outcomes as stand-alone measures rather than in aggregation. A striking result is how few of the 114 rubric elements are individually predictive of selected literacy outcomes. Only 13% of the rubric elements are positively and significantly associated with grade 6 English literacy outcomes in the fully controlled model – only 12% if we include pre-test scores. Roughly 10% of all elements are statistically significantly negatively associated with each of the grade 3 and 6 literacy outcomes (see the supplementary tables for results). While unexpected, the identification of negative associations is not uncommon in the literature on ‘leadership effects’ (Robinson et al., 2008: 655).
Multivariate estimation of intermediate outcomes
Our ‘leadership for literacy’ indices yield stronger and more consistent associations with intermediate outcomes than with student outcomes, as seen in Table 6. The human resources as well as the strategic resources indices are positively associated with indicators of curriculum coverage. A one standard deviation increase in the strategic resources index, for example, is associated with an 8% point increase in the percentage of teachers who report their curriculum coverage is checked biweekly by a head of department (middle manager), and a 15% point jump in the amount of work in students’ exercise books and workbooks (expressed in centiles).
Estimating intermediate outcomes.
Notes: N = 60 for all regressions. Each cell represents a regression of each individual index and the outcome variable.
Average student characteristics of grade 6 class include % overage, % who attended grade R, % who always or almost always speak English at home, % whose parents are not employed, % with own story books at home. School controls include average school wealth, grade 6 class size, English LOLT and low-fee paying. Cells are highlighted where p-values are less than 0.1.
There is also evidence that the allocation and deployment of the six resources may matter for the extent to which teachers enjoy their work and feel connected to their jobs and the school organisation. However, these findings are treated with caution because one would expect higher teacher engagement levels to be met with evidence of more teaching in the classroom. This is not the case.
Discussion
This paper described a process to construct composite measures of school leadership and management that may be linked to literacy outcomes in township and rural South African schools. Centred on a ‘leadership for literacy’ theoretical framework and in-depth qualitative work, a pre-determined rubric guided the SLM assessment approach. Despite giving more attention to quantifying SLM than has previously been attempted in South Africa, we were unable to reject a null hypothesis that SLM has a direct connection with literacy outcomes.
Only one of six ‘leadership for literacy’ dimensions was significantly positively associated with grade 6 literacy outcomes. That is, the human resource dimension, which quantifies the extent to which leaders develop, recognise and deploy expertise to build a literacy environment. Similarly, at the grade 3 level, only one resource dimension – the efficient use and allocation of time for reading and language – has a significant but small positive association with grade 3 reading outcomes. But somewhat stronger linkages are found with intermediate outcomes. Schools that more effectively deploy and utilise human or strategic resources have more evidence of curriculum management and teacher work engagement. But the overall finding is one of few associations between six ‘leadership for literacy’ dimensions and literacy outcomes.
Most of the time, the expected relationships are not found. This echoes earlier reflections in South African school effectiveness research (Gustafsson, 2007; Van Staden and Howie, 2014), and challenges prior beliefs that leadership and management accounts for the lion’s share of unexplained variation in learner performance across historically disadvantaged South African schools (see Crouch and Mabogoane, 1998).
Despite these results, the study highlights the limits and challenges of quantifying SLM in these contexts which may constrain the identification of effects. Reliability and validity concerns still taint our quantitative data collection process. Close-ended questions were used to limit high-level judgements that characterise semi-structured interviews. While this improves the scalability of the approach – reducing the need for experienced researchers – it reduces the opportunity to probe through socially desirable responses. The case studies revealed numerous instances where initial responses in interviews were biased through social desirability concerns, with respondents telling the interviewer what they believed the latter wanted to hear, or what they perceived to be accurate (Mertler, 2019). Without effective probing, these socially acceptable responses bias the data and reduce the detectability of variation in SLM proficiency across schools. Incorporating triangulation into the measurement approach was insufficient to compensate for these reliability concerns. Even observational data did not significantly improve predictive validity over measures derived from self-reported data. We argue that substituting fieldworkers for experienced researchers to collect quantitative (not just qualitative) data on SLM is a sub-optimal approach, limiting the identification of variation in SLM outcomes.
Additionally, limited variation in literacy outcomes across the rural and township school sample may constrain the detection of associations. This occurs despite exhausting available opportunities to add more variation through a purposeful sampling approach. In future studies, it may help to extend sample sizes both vertically and horizontally – adding more schools and classes or grades within schools. Notably, we found that teachers’ perceptions and experiences of their managers and school leaders varied more within than across schools (Taylor et al., 2019 ). More predictive power may also be gained through conceptualising SLM with additional resource dimensions or using another subject lens, such as mathematics.
Notwithstanding the merits of advancing research efforts to improve the detectability of SLM effects in South African schools, this is subsidiary to the real issue of building SLM capacity and leadership content knowledge. Particular attention needs to be given to developing the literacy knowledge of leaders to aggressively advance the teaching of reading in schools and support the implementation of coherent reading programmes (Murphy, 2004). In the qualitative findings from eight case study schools, weak practices in all ‘leadership for literacy’ domains were typically observed even in the best performing schools in the sample (Taylor et al., 2019). Where good leadership and management practices were discerned in the deployment of one resource area, this was juxtaposed against weaknesses in how one or more of the other resources were deployed. Furthermore, where material, human and strategic resources were effectively deployed this was strongly mediated through the presence of knowledge resources of not only leaders, but teachers. Addressing the South African literacy crisis is unlikely to happen in isolation of hiring and developing leaders and teachers with the requisite skills to advance literacy knowledge in schools.
Supplemental material
Supplementary_Tables_EMAL_SLM_in_SA - Measuring school leadership and management and linkages with literacy: Evidence from rural and township primary schools in South Africa
Supplementary_Tables_EMAL_SLM_in_SA for Measuring school leadership and management and linkages with literacy: Evidence from rural and township primary schools in South Africa by Gabrielle Wills and Servaas van der Berg in Educational Management Administration & Leadership
Footnotes
Acknowledgements
We are grateful for the collaborative work of the ‘Leadership for literacy’ study team including Nick Taylor, Ursula Hoadley, Jaamia Galant, Francine de Clercq, Nic Spaull, Nompumelelo Mohohlwane and Lilli Pretorius, and especially David Carel for overseeing quantitative data collection. Thank you to Marie-Louise Schreve, Carine Brunsdon and Silke Rothkegl-Van Velden for their administrative support. We also acknowledge the efforts of fieldworkers, data capturers and test markers.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research is funded by the Economic and Social Research Council [grant ES/N01023X/1].
Supplemental material
Supplemental material for this article is available online.
Notes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
