Abstract
Selection practices in education, such as tracking, may represent a structural obstacle that contributes to the social class achievement gap. We hypothesized that school’s function of selection leads evaluators to reproduce social inequalities in tracking decisions, even when performance is equal. In two studies, participants (students playing the role of teachers, N = 99, or preservice and in-service teachers, N = 70) decided which school track was suitable for a pupil whose socioeconomic status (SES) was manipulated. Although pupils’ achievement was identical, participants considered a lower track more suitable for lower SES than higher SES pupils, and the higher track more suitable for higher SES than lower SES pupils. A third study (N = 160) revealed that when the selection function of school was salient, rather than its educational function, the gap in tracking between social classes was larger. The selection function of tracking appears to encourage evaluators to artificially create social class inequalities.
Students from disadvantaged social classes are consistently underrepresented in higher education in all industrialized countries (Organisation for Economic Co-operation and Development [OECD], 2011). The present research addresses social class inequalities in education by adopting an approach that considers how societal structures shape individual experience (Adams, Biernat, Branscombe, Crandall, & Wrightsman, 2008; Kraus & Park, 2017; Markus & Stephens, 2017; Smeding, Darnon, Souchal, Toczeck-Capelle, & Butera, 2013). In particular, we study the educational practice of tracking—grouping students based on their achievement level—which is in use in most OECD countries (OECD, 2013). Tracking is traditionally used because “a good fit between a student’s ability and the level of instruction is believed to maximize the efficiency and effectiveness of the instructional process” (Hallinan, 1994, p. 79), but research has shown that tracking is linked to increased social class inequalities in educational attainment and earnings (e.g., Brunello & Checchi, 2007). Drawing on research in social psychology and sociology of education that depicted the educational system as a contributor to the reproduction of social classes (Bourdieu & Passeron, 1990; Stephens, Fryberg, Markus, Johnson, & Covarrubias, 2012), we contend that tracking is a structural obstacle that contributes to actively exclude lower social class pupils from higher education. In particular, we propose that tracking shapes the decisions of evaluators in a way that reproduces social class inequalities, even when achievement is equivalent, because it epitomizes a specific aspect of the educational system: its function of selecting the most deserving students.
Social psychology has recently produced a number of studies that experimentally manipulate the cultural and structural factors that might affect the results of students from different social classes (Goudeau, Autin, & Croizet, 2017). Here, we present the first experimental investigation of the impact of students’ social class on tracking decisions of evaluators.
Tracking as Selection
The practice of grouping students by achievement levels into differentiated curricula can be found in all OECD countries (OECD, 2013). Some countries, such as Austria, Germany, Hungary, the Czech Republic, and Switzerland, track students into separate academic/vocational programs or schools for their full curriculum (i.e., full-curriculum tracking). This separation has lasting consequences, as students in the lower tracks are not allowed to directly access higher education. Other countries (e.g., the United States, Australia, Canada, the United Kingdom, and New Zealand) practice various forms of within-school “ability grouping/streaming” or “course-by-course tracking” in their curriculum (Chmielewski, 2014). In this case, students are tracked for certain courses, not programs, and every student is presumably eligible to go to University. Students are usually tracked (into courses, programs, or schools) during secondary schools at the age of 15 or 16 years. Nonetheless, early tracking (10-12 years old) occurs in European countries such as Germany, Austria, Switzerland, Belgium, and the Netherlands (OECD, 2013).
Several authors have noted that tracking systems should be described in terms of degree of selection (countries/schools have a more or less restrictive tracking system), rather than consider tracking as a binary variable (tracking vs. no tracking; Bol, Witschge, Van de Werfhorst, & Dronkers, 2014; Chmielewski, 2014). Despite the above differences in application, these selection practices (tracking and ability grouping) have been argued to fulfill the same function, namely, differentiating students by achievement levels and tailoring the curriculum to fit students’ academic needs and abilities (Chmielewski, 2014).
Tracking and Learning
However, the support for school tracking has been controversial. The arguments put forward in the controversy often revolve around a perceived trade-off between efficiency and equality, as educational systems aim for both a high level of achievement and equality of opportunities (Felouzis & Charmillot, 2013). Those who favor tracking have argued that homogeneous classes contribute to a more focused curriculum, leading to an optimal learning context for all students (Hallinan, 1994). Others argue against tracking because in heterogeneous classes weaker students profit from the presence of stronger students, without hindering the latter’s performance (Kerckhoff, 1986), or even favoring it (Konan, Chatard, Selimbegović, & Mugny, 2010).
Empirical findings on the benefits of tracking on mean performance have been mixed. Whereas there is some evidence that the effects of tracking can be slightly beneficial for high-ability groups and detrimental for low-ability groups (Hattie, 2002; Huang, 2009), cross-national studies suggest that the overall impact of tracking on student performance is very small and most often negative (Betts & Shkolnik, 2000; Hattie, 2002; Huang, 2009). Consequently, some authors question the existence of the potential trade-off between efficiency and equality, as the evidence for an increase in performance is limited, yet the impact on the achievement gap between students of different levels is more robust (Schütz, Ursprung, & Wößmann, 2008; Van de Werfhorst & Mijs, 2010).
Tracking and Social Inequalities
Indeed, students’ social background has been shown to yield a greater influence on students’ chance of success in countries that practice tracking than in countries with comprehensive school systems (Brunello & Checchi, 2007; Chmielewski, 2014; Hanushek & Wößmann, 2006; Schütz et al., 2008). Even when tracking is not operationalized as a binary variable but as a scale taking into account the age of first selection, the length of the tracked curriculum, and number of tracks, results show that an achievement gap due to the students’ socioeconomic background is larger in educational systems that are strongly tracked compared with more comprehensive systems (Van de Werfhorst & Mijs, 2010). Another study carried out with schools using academic/vocational tracking and schools using “ability grouping” concluded that they display rather similar achievement gaps due to students’ socioeconomic status (SES; Chmielewski, 2014). In sum, international surveys suggest that the implementation of selection practices of various forms can impact the actual distribution of opportunities across social classes.
The Structure of Educational Institutions and the Social Class Achievement Gap
We contend that tracking is not just a device that allows the educational system to sort students as a function of their objective performance, but that it is a practice that paradoxically perpetuates existing social class inequalities. This contention builds on sociopsychological and sociological analyses of the role of educational institutions in reproducing social inequalities. In sociology of education, Bourdieu and Passeron (1990) argued that the school system is in some measure an active contributor to the social class achievement gap because it is structured and organized by the upper class culture. Using an experimental approach, research in social psychology has provided evidence for the causal role of educational settings (e.g., institutional norms, classroom settings, and educational practices) in restraining the success of lower social class students (Batruch, Autin, & Butera, 2017; Croizet, Goudeau, Marot, & Millet, 2017; Smeding et al., 2013). The literature on cultural mismatch demonstrated that the middle/upper class norms and values promoted by higher education institutions interfere with the achievement of first-generation students (who are the first members of the family who go to college; Stephens et al., 2012). Moreover, the literature on stereotype threat suggests that the evaluative situations put in practice in universities produce a fear to confirm negative stereotypes among lower status students, which contributes to the academic difficulties experienced by low social class students (Croizet & Claire, 1998; Croizet & Millet, 2012). Recently, a set of experiments showed that making social comparison salient can be detrimental for the performance of working-class pupils. When pupils were asked to raise their hand upon finishing with a task, the performance of lower social class pupils decreased (Goudeau & Croizet, 2017). These studies indicate that the structure of the educational system can shape students’ behavior and achievement, but we suggest that it could also shape the behavior of the agents implementing its institutional practices, notably teachers.
Evaluators’ Decisions and Inequalities
It has long been proposed that teachers can affect student experience by holding biased beliefs and differential expectations of students (Dusek & Joseph, 1983; Riegle-Crumb & Humphries, 2012). Indeed, several studies have provided empirical support for discrimination based on factors such as nationality, gender, race, or even SES in school contexts (Farkas, 2003; Hinnerich, Höglin, & Johannesson, 2015; Kiss, 2013). Regarding tracking, teachers are less likely to promote students with an immigrant background than native students to a higher track (Glock, Krolak-Schwerdt, Klapproth, & Böhmer, 2013). The results of studies on teachers’ discrimination have largely been interpreted as documenting biases in individuals. We rather contend that discrimination in tracking could be the by-product of the institutional logic underlying the implementation of selective practices. Let us see why.
The Two Functions of Educational Institutions
The ongoing debate over tracking policies, between efficiency and equality, occurs at two different levels: educational and societal. Previous authors noted that the potential trade-off of tracking might not reside on its beneficial effect on student’s performance but rather on its efficiency to provide the labor market with differentially skilled workers (Bol & Van de Werfhorst, 2013). More generally, this debate reflects the ambivalence between two functions assigned to the school system: the educational function and the function of selection (Autin, Batruch, & Butera, 2015, in press; Darnon, Dompnier, Delmas, Pulfrey, & Butera, 2009; Jury, Smeding, & Darnon, 2015). Since the development of compulsory education, the school system has been required to pursue democratic objectives and provide education to all pupils by transmitting knowledge, skills, and culture. In this regard, the school system fulfills an educational function. Nonetheless, the educational system is also expected to prepare students for their future positions in the social hierarchy (Darnon et al., 2009; Dornbusch, Glasgow, & Lin, 1996). The certifications of competence (i.e., grades or diplomas) awarded by the educational institutions are used to compare and rank students to determine which are worthy of following the most prestigious paths, thereby fulfilling the function of selection of the educational system.
Previous studies in social psychology have found that the function of selection could have detrimental effects on the performance of low-SES students. A study demonstrated that when academic assessment was presented to students as fulfilling a selective rather than an educational purpose, the achievement gap between low- and high-SES students was higher (Smeding et al., 2013). The mere activation of the function of selection of the university system has been shown to impair the performance of first-generation students compared with continuing-generation students (i.e., one or both parents finished high school; Jury et al., 2015). These studies conclude that the mechanism responsible for the impairment of performance is rooted in the structural functioning of the institution (i.e., its function of selection).
The Present Research
In the present research, we move beyond the study of how low-SES students underperform in a selective environment and investigate the active involvement of selection in education: We propose to study the link between the function of selection and the social class achievement gap from the perspective of the evaluators, to test whether a selective institutional context can lead evaluators to create the difficulties/advantages encountered by students of different social classes. Importantly, in this study, we experimentally manipulate the pupils’ SES, but we keep the pupils’ performance constant to overcome the limitations of previous correlational studies, which could not conclude on causality in the relationship between tracking, SES, and past performance.
In the case of selection practices such as tracking, decisions are meant to be primarily based on prior achievement, which is believed to be a fair reflection of the student’s individual merit. This ideology, meritocracy, proposes that only individual inputs should be considered, and group-based considerations ignored, to determine one’s social position (Sabbagh, Resh, Mor, & Vanhuysse, 2006). Paradoxically, even though meritocracy as a justice principle is perceived as being relatively bias-free (Son Hing et al., 2011), research has shown that meritocratic principles can have the opposite psychological effects. For instance, studies conducted by Son Hing, Bobocel, and Zanna (2002) have shown that endorsement of the merit principle was associated with increased opposition to policies challenging the status quo in favor of disadvantaged group members. Meritocracy is often used as a justification to oppose affirmative action (Faniko, Lorenzi-Cioldi, Buschini, & Chatard, 2012). In the field of education, Darnon, Smeding, and Redersdorff (2018) showed that beliefs in school meritocracy were negatively associated with interest or behavioral intention to implement an equalizing pedagogical method. Taking the perspective of evaluators, another study showed that an organizational culture emphasizing meritocratic principles led managers to favor men over equally competent women (Castilla & Benard, 2010).
Contrary to the function of selection that can be linked to a meritocratic principle, the educational function seems to be related to an egalitarian principle as it aims at equalizing individuals’ level of instruction. Egalitarian principles have been associated with positive outcomes for disadvantaged group members. For instance, in an engineering college, implementing egalitarian social norms led to stronger intentions to speak out against racist behaviors, and more positive attitudes toward diversity in engineering (Bennett & Sekaquaptewa, 2014).
In light of these results, we propose that merit-based selective practices, like tracking, are structural contexts that encourage agents of the educational system, namely, teachers, to maintain status inequalities between pupils of advantaged and disadvantaged social background, even when achievement is equal. Conversely, focusing agents on the educational function of the educational system could reduce such a tendency to maintain status inequalities.
Hypotheses and Overview
We tested the hypothesis that tracking decisions will be influenced by the evaluators’ awareness of the pupils’ SES, even if pupils’ achievement is kept identical and will result in considering a lower track more suitable for a low-SES pupil than for a high-SES pupil, and the reverse for a higher track (Hypothesis 1). Two experiments tested this hypothesis, one with university students playing the role of teachers, and another with preservice and in-service teachers. Furthermore, a third experiment tested that, if it is true that such discriminatory behavior in tracking judgments is to some extent the result of an institutional logic of selection, rather than purely individual prejudice, then the social class gap in tracking decisions will be stronger when the educational system’s function is portrayed as being selective rather than educational (Hypothesis 2).
Experiment 1
Method
Participants
A total of 104 psychology students enrolled in a French-speaking Swiss university participated in this study in exchange for partial credit. A sensitivity analysis on G*Power using a negative correlation between the tracking measures (r = –.50) and an 80% power returned a required effect size of f = .24, Cohen’s d = 0.48 for the expected interaction between track and pupil’s SES (Faul, Erdfelder, Buchner, & Lang, 2009). All participants have been randomly assigned to one of the four between-participants experimental conditions: target’s SES (low vs. high) × target’s Gender (girl vs. boy). The target’s gender was included as a control variable. Data from five participants were excluded because they were suspicious of the aim of the study (final sample: N = 99, 85 women, 14 men, Mage = 21.43, SDage = 2.53).
Material and procedure
Switzerland is a federal state, divided in relatively autonomous cantons (like in Germany or the United States); in most cantons, performance (grades) at age 12 determines whether a pupil will follow a vocational or academic track (i.e., different curricular programs). Some schools allow for borderline cases (i.e., pupils whose grades are above the passing grade but only slightly below the usual requirements for the highest track) to be reviewed by teachers and the school director, to decide whether the pupil should be allowed to pursue the academic track (an average of 40% of pupils enter the academic track). For the present study, participants were asked to take the role of a teacher in this context and help decide which secondary school track was most suitable for a particular borderline case (the target pupil). Participants received a booklet 1 informing them about (a) the future consequences of sending the pupils to the higher or lower track (e.g., higher track: leading to high school and university, lower track: leading to an apprenticeship or general knowledge schools), (b) the grade requirements for tracks, and (c) the target’s description including the pupil file (similar to the one actually in use), grades, and a brief description of his or her extracurricular activities. Relevant information about the target’s SES and gender were presented among neutral information (e.g., date of birth, address, nationality—all targets were presented as native pupils).
Manipulation of target’s SES and gender
Gender was manipulated by changing the pupil’s first name and the reported gender in the pupil file. SES was manipulated by altering the pupil’s first name (using stereotypical names of high vs. low social class girls and boys: “Charlotte/Louis” vs. “Cindy/Bryan”; cf., Coulmont, 2011), parental occupation (mother: director of marketing vs. waitress; father: architect vs. construction workman), and extracurricular activities (e.g., traveling to London vs. local amusement park). Importantly, every target was presented with the same grades.
Measures of suitability of school tracks
We included two sets of measures 2 to estimate what participants thought was the appropriate track for the bogus pupil. Participants were to evaluate separately on 7-point scales the suitability of each track for the target (from 1 = totally disagree to 7 = totally agree), once according to their own opinion as a teacher and again according to what they thought other teachers’ opinion was (supposedly included in the decision). The first two measures (i.e., “In your opinion, to what extent the pupil should be sent to the lower/higher track?”) were meant as an indicator of the participants’ own decision, were they to be in this position. We included the two other measures to investigate participant’s perception of the school system functioning (i.e., “In your opinion, to what extent the other teachers making the decision would send the pupil to the lower/higher track?”; the analyses on these measures are presented in the Supplemental Materials).
The questionnaire ended with a manipulation check to make sure that participants had correctly perceived the target’s background—participants were asked to recall information about the pupil they had seen and were told to report on a 7-point scale to what extent the pupil seemed to come from a highly disadvantaged (1) versus highly advantaged (7) background—and sociodemographic questions, among which information pertaining to the participants’ parental level of education and employment. Finally, participants were thanked and debriefed.
Results
Manipulation check
We analyzed the rating of the perceived socioeconomic background of the target in a 2 (target’s gender: girl vs. boy) × 2 (target’s SES: high vs. low) ANOVA. As expected, we found an effect of the target’s SES indicating that low-SES targets were perceived as coming from a less-advantaged background (M = 3.88, SD = 0.94, 95% confidence interval [CI] = [3.65, 4.10]) than high-SES targets (M = 5.96, SD = 0.61, 95% CI = [5.74, 6.19]), F(1, 95) = 169.11, p < .001,
Suitability of school tracks
We examined the participants’ assessment of the suitability of each track in a 2 (target’s gender: girl vs. boy) × 2 (target’s SES: high vs. low) × 2 (track: high vs. low) mixed ANOVA with the last factor as repeated measures. The analysis revealed no main effect of target’s SES, F(1, 95) = 1.74, p = .19,

Mean suitability from the participants’ point of view of lower and higher secondary school tracks as a function of target’s SES in Experiment 1.
Target’s gender was initially included as a control variable and the results indicated no significant main effect of gender, or interaction between gender and SES (Fs < 1; ps > .71). However, the analysis revealed an unexpected gender by track interaction, F(1, 95) = 9.13, p = .003,
Participants’ background
One might argue that the participant’s own social class may have an impact on their tracking judgment, under the form of an in-group bias. Participants were considered as first-generation (e.g., none of the parents has a university degree) or continuing-generation (e.g., at least one parent has a university degree), following the procedure used in many social psychology studies on social class in educational context (Sirin, 2005; Snibbe & Markus, 2005; Stephens et al., 2012).
We performed a 2 (participant’s generation: first vs. continuing-generation) × 2 (target’s gender: girl vs. boy) × 2 (target’s SES: high vs. low) × 2 (track: high vs. low) ANOVA with the last factor as repeated measures. The analysis showed an effect of track, F(1, 90) = 148.91, p < .001,
Discussion
We designed this first experiment to examine whether individuals acting as agents of the educational system would reproduce existing social disparities when placed in the strategic position of having to actively influence pupils’ prospects, namely, when asked to make a tracking decision. Results showed that pupils’ SES was an influential factor in conceding academic opportunities. Although grades were the same for all targets, participants imagined that as teachers they would consider a lower track more suitable for a low-SES pupil when compared with a high-SES pupil, and reciprocally a higher track more suitable for a high-SES pupil than for a low-SES pupil.
To preserve the ecological validity of the study, the scenario was made to resemble actual tracking dilemmas in the Swiss system where teachers and the principal can offer a second chance to pupils who are borderline cases for the higher track (i.e., slightly below official standards). This specific characteristic is important for the interpretation of the effect, as most research in the discrimination literature focuses on targets from disadvantaged groups who meet the official criteria for an opportunity (e.g., for a job application) but are less likely to benefit from it in comparison with targets from advantaged groups (Quillian, Pager, Hexel, & Midtbøen, 2017). Allowing students who do not meet the criteria to follow an academic path, on the contrary, could be interpreted as offering these students a privilege. In the case of this study, this privilege appears to be more readily offered to high-SES students than to low-SES students. This result is consistent with the theorizing of researchers such as DiTomaso (2015) who suggest that the reproduction of inequalities is also a by-product of favoritism processes, where advantaged group members are extended the benefit of the doubt in case of mistakes and might be rewarded for promise over performance.
We also observed that the effects were stronger for male than for female targets. This unexpected effect might be linked to different attributions to academic success depending on gender. According to Fennema, Peterson, Carpenter, and Lubinski (1990), teachers perceive boys’ performance as being more related to essential qualities such as abilities, whereas girls’ performance are seen as being more the results of effort. In the case of girls, it may be that the impact of the target’s SES was reduced by the gender-based assumption regarding the effort involved in the academic path. However, this effect might also be explained by the fact that at this level of education, girls tend to generally outperform boys academically in many countries (OECD, 2011), and some research suggests that teachers and young children perceive behavioral and academic stereotypes in early education as being more favorable toward girls than boys (Hartley & Sutton, 2013; Jones & Myhill, 2004). Indeed, prior experimental work suggested that stereotypical boys are seen by teachers as less academically engaged when compared with nonstereotypical boys and girls (Heyder & Kessels, 2015). Although this effect of gender should be replicated before reaching any definitive conclusion, we think it possible that the advantages/disadvantages of girls’ SES could be less impactful for school decisions if their performance is seen as being more linked to efforts and if girls are seen by teachers as less of an academic risk.
Experiment 2
To ensure that the above results were not biased by participant’s lack of teaching and selecting experience, we conducted a replication with a sample of preservice and in-service teachers.
Method
Participants
Based on the observed effect size and the correlation between the tracking measures in Experiment 1 (
Material and procedure
The descriptions of the target and the general procedure were identical to those used in Experiment 1. The cover story was however adapted to fit with our sample. The study was presented as a comparison between individual and collective expert decision-making processes. All participants were told they were to answer individually and were asked to decide which track was most suitable for a borderline pupil (i.e., whose grades are just below the usual requirement to be tracked in the higher track). As agents of the Swiss school system, teachers are familiar with tracking decisions, but they received nonetheless the booklet informing them about the outcomes of tracking for pupils. They read the administrative file presenting the pupil’s SES and grades and were then asked to indicate the suitability of each (low and high) track for the pupil. 3 Finally, they reported demographic information including their gender, age, and number of years of teaching experience. At the end of the study, participants were thanked and debriefed.
Results
Manipulation checks
An ANOVA involving the target’s SES (low vs. high) revealed that participants did indeed perceive the low-SES pupil (M = 3.73, SD = 0.82, 95% CI = [3.49, 3.99]) as being less advantaged than his high-SES counterpart (M = 6.00, SD = 0.68, 95% CI = [5.73, 6.28]), F(1, 67) = 148.82, p < .001,
Suitability of school tracks
We performed a 2 (target’s SES) × 2 (track) ANOVA with the last factor as a repeated measure. First, we observed no main effect of target’s SES, F < 1, ns, and a significant main effect of track, F(1, 68) = 11.82, p = .001,

Mean suitability from the participants’ point of view of lower and higher secondary school tracks as a function of target’s SES in Experiment 2.
Teachers’ experience (in years)
To test whether the documented gap in personal assessment of track suitability was affected by participants’ experience in teaching, we performed a regression analysis with the target’s SES (low coded −0.5, high coded 0.5), the teacher’s experience (centered; N = 66; four missing values), and the interaction as predictors of the difference score between the higher track and the lower track. Analyzing the difference score allows to include the within-participants “Track” factor in the analysis (Judd, McClelland, & Ryan, 2011). The previously reported interaction between SES and track remained similar, t(62) = 2.22, p = .03,
Experiment 3
Our hypothesis was that it is the selective role of educational institutions that drives the discriminatory behavior. In the two previous studies, the observed social class gap in tracking was affected neither by the respondent’s social class nor by the extent of their actual practice as teachers. The third study aimed to test whether it is indeed a structural mechanism that drives the social class gap in tracking: If the observed behavior is driven by a compliance with the function of selection of the system, rather than an individual’s propensity to discriminate, then manipulating such function (i.e., either to select or to educate) should impact the tracking decisions. We hypothesized that the gap between the targets in the tracking decision should be wider in the condition where the function of the school system is presented as being selection rather than education. We did not expect the effect of SES on tracking between the “education” condition and the “selection” condition to disappear because tracking in itself is a selective practice; therefore, using tracking with an educational mind-set might reduce the discriminatory behavior, but not completely eliminate it.
Method
Participants
The sample consisted of 160 first-year life science and technology students from a Swiss polytechnical university (82 women, 71 men, and seven unspecified; Mage = 20.08, SD = 1.19). An a priori analysis using the correlation between the tracking measures (r = –.70) and the more conservative effect size observed in Experiment 2 (
Material and procedure
Participants were presented with the same scenario, in which they were asked to imagine they were teachers, and the same targets used in Experiment 1. Again, all targets were presented as having the same grades, which were slightly below the usual requirement for the higher track. Participants were also informed of the consequences of tracking (i.e., possibility for the student to pursue postsecondary education or not).
Manipulation of school’s function
Before seeing the information about the target’s case and the implications of tracking, participants read a paragraph about the characteristics of the Swiss school system, which served to render the function of the educational system (selection vs. educational) salient. In the educational function paragraph, the main goal of the educational system was described as imparting all students with knowledge and skills and to help everyone develop their competence. In the selection function paragraph, the main goal of the educational system was described as identifying and rewarding the most deserving students. For instance, in the educational function condition, they were told that the Swiss school system’s goal was “to develop skills and give students the opportunity to progress in the mastery of their knowledge.” In the function of selection condition, the school’s goal was “to orient students according to academic abilities and lead everyone to the maximum of their potential.”
Results
Manipulation checks
A 2 (target’s SES) × 2 (target’s gender) × 2 (school’s function) ANOVA was conducted to analyze participants’ perception of the target’s SES. The results showed a marginal effect of target’s gender, F(1, 152) = 3.52, p = .06,
We included a manipulation check to ensure that participants had correctly perceived the school system as being selective or educational. The vast majority (78% of participants—that is, 128) answered correctly. We tested our final model with samples including and excluding those participants and we found similar results. We decided to keep the full sample.
Suitability of school tracks. 4
Given the specific direction of the expected results, we recoded the experimental conditions into three orthogonal contrasts (Judd & McClelland, 1989): the planned contrast testing our hypothesis (coded: +1, –1, +3; –3, respectively, for the high SES/education, low SES/education, high SES/selection, and low SES/selection conditions), and two orthogonal contrasts testing the remaining variance (+1, +1, –1, –1 and −3, +3, +1, –1). The planned contrast predicts that the means are linearly ordered in the following direction: For the higher track, the high-SES pupil should be considered the most suitable in the selection condition, followed by the high-SES pupil in the educational condition, next the low-SES pupil in the educational condition, and finally the low-SES pupil in the selection condition. If the data support our hypothesis, the first contrast should be significant, but not the others (Brauer & McClelland, 2005; Judd & McClelland, 1989).
It is worth noting that the social class gap is in opposite directions in the two tracks: For the lower track, the high-SES pupil should be considered the least suitable in the selection condition, followed by the high-SES pupil in the educational condition, next the low-SES pupil in the educational condition, and finally the low-SES pupil in the selection condition. We thus expected an interaction between the track and the contrast. The suitability of tracks was analyzed with the contrast-coded conditions as a between-participants factor and the tracks (higher vs. lower) as a repeated measure. The target’s gender was originally included in the analysis, but as it did not result in any significant main or interaction effects, we did not include the variable in the final model. The analysis showed no main effect of the contrast-coded conditions, Fs < 1, ns, and a main effect of track, F(1, 156) = 103.63, p < .001,
The interaction between track and the planned contrast, shown in Figure 3, was further decomposed by track. In both the lower and the higher track, the linear planned contrast was significant, F(1, 156) = 11.69, p = .001,

Mean suitability from the participants’ point of view of lower and higher secondary school tracks as a function of target’s SES and school’s function in Experiment 3.
Participants’ background
We also tested the effect of participants’ parental education to see whether their social class had an effect on their tracking decisions. The analysis showed a significant effect of track, F(1, 149) = 72.13, p < .001,
Discussion
The order of the means in both the lower and the higher track is congruent with our hypothesis that tracking in a selective institutional context, more than in an educational context, is associated to SES inequalities, even when achievement is kept constant. The results suggest that selection can affect the agents’ behavior, shaping and even creating the difficulties/advantages encountered by students of different social class in a manner that reproduces social inequalities at school.
In the present study, we had a specific hypothesis and thus we followed the recommendation of many scholars who suggest to use theoretically focused a priori planned contrasts, for their increased power and specificity, as an alternative to less theoretically focused omnibus tests (e.g., interaction) when the researchers’ goal is to investigate an overall pattern of means instead of several specific effects (Judd, McClelland, & Culhane, 1995; Levin & Neumann, 1999; Van Wesel, 2011). It should be noted, however, that some scholars favor the use of a more traditional ANOVA strategy, as some contrasts may be sensitive to effects not captured by the predicted contrast when the pattern include two or more local effects, leading to increased family-wise error rate (e.g., see the debate between Abelson, 1996; Petty, Fabrigar, Wegener, & Priester, 1996; Rosnow & Rosenthal, 1996). In our case, none of the residual contrasts included in the analyses of Experiment 3 specifically tested the main effect of target’s SES. In other words, there is a non-negligible chance that the contrast remains significant in the presence of the main effect of SES alone. Therefore, the results of Experiment 3 should be interpreted with caution. As a robustness check, we conducted additional analyses using the R package “Restriktor” (http://www.restriktor.ugent.be/index.html), which tests an inequality constrained linear mean order hypothesis. We performed the analysis on the lower track, higher track, and the difference between higher and lower track. All the results confirmed a linear order of the means in the direction of our hypothesis. 5
General Discussion
A meritocratic educational system, by rewarding individual performance, is supposed to circumvent group-based considerations and therefore offer every student the same opportunities. Yet, in reality, socioeconomic background remains one of the main predictors of future academic achievements in most countries (OECD, 2011). When comparing the effects of tracking between countries, reports present evidence for a link between tracking and social class inequalities in education (Brunello & Checchi, 2007; OECD, 2011; Schütz et al., 2008). The correlational relationships examined in past work were studied experimentally in the three studies presented here, to test that selection based on tracking can paradoxically favor social class inequalities. In other words, we aimed to uncover whether the social class gap in tracking that has been observed in numerous studies could partly be a by-product of an institutional logic of selection.
The results of our first experiment confirmed our first prediction: Participants recreated the disparities observed in school when they were placed in the strategic position to actively influence pupils’ prospects. These results were replicated with a sample of preservice and in-service teachers. In all experiments, no individual factors (such as participant’s own social class, and years of practice as a teacher) significantly moderated this effect. Finally, the third experiment was devised to test our second hypothesis that the artificial creation of a social class gap in tracking would be at least partly explained by the institutional function of selection. The results are in line with this interpretation and seem to indicate that the function of selection of the educational system is linked to the creation of social class inequalities at school.
Contributions
This research sheds light on a new mechanism informing the debate over the link between tracking policies and social inequalities: selective educational contexts can lead agents to produce biased academic decisions. Until now, this debate focused primarily on the cost versus benefits for students to be in classrooms structured around the students’ academic potential, presuming that pupils’ achievement level was adequately assessed by the agents of the education system. Yet, the results of our research provide evidence of the contrary. Regardless of their actual merit, lower SES pupils were considered more suited for a lower track and less suited for a higher track comparatively to their higher SES counterparts with the same grades. Proponents of tracking policies generally suggest that having differentiated curricula or classes based on the students’ level of achievement can serve the improvement of learning for all students (Hallinan, 1994). But if, as we propose, teacher’s biases might be prompted by selective educational contexts, then tracking policies could instead lead teachers to artificially create academic differences among students. More generally, the implications of our research suggest that efforts toward raising lower social class students’ performance may not be sufficient to combat social class inequalities at school. Attention should also be devoted to the intent underlying institutional practices. Particularly, in a context where schools are often under political pressure to provide external proof of their productivity and of their high standards, increasing selection can seem a reasonable solution to achieve excellence. In light of our results, we would suggest to carefully consider the consequences of such strategies, as drawbacks on equality of opportunity may outweigh the potential benefits.
Moreover, the present research builds upon a growing literature in social psychology that has been directed toward examining cultural and structural factors involved in the perpetuation of the social class achievement gap (e.g., Adams et al., 2008; Croizet et al., 2017; Jury et al., 2017; Kraus & Park, 2017). These studies have consistently demonstrated the importance of looking into the role of institutional contexts to fully understand the reasons why low-SES students or high-SES students are more susceptible to fail/succeed at school. However, by focusing on student’s contextual ability to perform, previous studies did not directly address how performance could also be constructed by evaluators and, by extension, the institution. In this research, we went beyond individual performance by investigating the structural obstacles that contribute to actively exclude low social class pupils from higher education. Our research provides initial empirical support to the validity of this approach, as the results were consistent with the hypothesis that structural factors, such as the intended function of educational institutions, are related to the creation of social class inequalities at school.
Increased societal inequality has been identified as a social problem requiring urgent attention by the World Economic Forum report on the 2015 Global Agenda. In fact, their report also indicates that improved education is considered to be one of the top solutions to resolve the issue of deepening inequalities (World Economic Forum, 2015). A major contribution of our research is the experimental investigation of one specific structural factor in the educational system that could hinder the potentially beneficial effect of the democratization of the educational system. The present results provide preliminary support to the hypothesis that the education system may encourage the reproduction of social class inequalities because its selective structure prompts evaluators to make decisions that hinder the chances of success of lower SES students and favor those of higher SES students. In this respect, our findings provide initial experimental evidence supporting a long-standing line of research that has linked social class inequalities to the functioning of the educational system (Bourdieu & Passeron, 1990; Bowles & Gintis, 1976; Marks, Cresswell, & Ainley, 2006; Oakes, 1985; Van de Werfhorst & Mijs, 2010). This research used experimental designs to study the structural causes of social class discrimination in tracking. To the best of our knowledge, it is the first to investigate the causal relationship between the function of the school system and unequal treatment of students of different social classes by agents of the system in the case of tracking.
Limitations and Conclusion
Some limitations should be mentioned. First, the aim of our research was to investigate a structural mechanism (i.e., institutional function) to examine the social class gap in tracking. As it was beyond the scope of this research, the results did not address the individual-level mechanisms at stake in this specific phenomenon. Further research could build on our conclusions to examine how selective institutional contexts affect institutional agents’ specific psychological processes. Second, Experiment 1 revealed an interaction between SES and gender. Although unexpected, this result is consistent with prior literature describing opposite academic gender stereotypes of teenagers, which could lead to social class disadvantages targeting boys more than girls. However, as this pattern of result is not replicated in Experiment 3, we think that further intersectional research is needed to test the robustness of this effect. Third, results of Experiment 3 provide initial support for the role of institutional function, but future research should use larger samples to provide a direct replicative test of our hypothesis.
In sum, the SES achievement gap due to selection practices such as tracking may not entirely result from the lower ability of low-SES students or from some particularly prejudiced teachers. It appears that the very principle of institutional selection could play a central role in the reproduction of social inequalities observed in the use of tracking. An OECD (2011) report suggests that tracking policies may have a general negative impact on equal opportunity for all in education and propose to delay selection to reduce its consequences. Our findings provide convergent experimental evidence in favor of this position by highlighting the paradoxical effects of selection-based practices on equality of treatment for students.
Supplemental Material
ABatruch_Supplemental_Materials_Final071218 – Supplemental material for School Selection and the Social Class Divide: How Tracking Contributes to the Reproduction of Inequalities
Supplemental material, ABatruch_Supplemental_Materials_Final071218 for School Selection and the Social Class Divide: How Tracking Contributes to the Reproduction of Inequalities by Anatolia Batruch, Frédérique Autin, Fabienne Bataillard and Fabrizio Butera in Personality and Social Psychology Bulletin
Footnotes
Acknowledgements
We wish to express our gratitude to Jean-Philippe Antonietti, Benoît Dompnier, and Nicolas Sommet for invaluable comments on an earlier draft, and Philemon Griesmar, Vincent Pillaud, Philippe Sperisen and Sophia Stark for help in data collection.
Authors’ Note
Anatolia Batruch is now at University of Amsterdam, Amsterdam, The Netherlands. Frédérique Autin is now at Université de Poitiers, CeRCA, Poitiers, France.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Swiss National Science Foundation (grant CRSII1_141872), and was part of Anatolia Batruch’s doctoral dissertation at University of Lausanne under the supervision of Fabrizio Butera.
Notes
Supplemental Material
Supplemental material is available online with this article.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
