Abstract
People automatically infer others’ personality (e.g., trustworthiness) based on facial appearance, and such facial stereotype biases predict real-world consequences across political, legal, and business domains. The present research tested whether these biases can be mitigated through counterstereotype training aimed at reconfiguring the associations between specific facial appearances and social traits. Across six studies and a replication, a behavioral counterstereotype training consistently reduced or eliminated facial stereotype biases for White male faces in the context of economic trust games, hiring decisions, and even automatic evaluations assessed via evaluative priming. Together, the results demonstrate a fundamental malleability in facial stereotyping related to trustworthiness, with a minimal training able to mitigate the tendency to activate and apply long-held, highly automatized facial stereotypes. These findings suggest that face impressions are more flexible than typically appreciated, and they provide a potential inroad toward combating our ingrained biases based on facial appearance.
At a glance, people quickly evaluate others’ faces based on personality traits such as trustworthiness. People make these trait inferences as quickly as 100 ms (Bar et al., 2006) and do so automatically and without conscious awareness (Engell et al., 2007; Freeman et al., 2014). Although rarely diagnostic of a target’s actual personality (Rule et al., 2013), trait impressions evoked by faces tend to be reliable across perceivers (Oosterhof & Todorov, 2008). These trait impressions can be consequential, as facial trustworthiness predicts outcomes from electoral success (Todorov et al., 2005) to criminal-sentencing decisions (Wilson & Rule, 2015). Here, we investigate the malleability of such social judgments, testing whether learning can reduce the tendency to spontaneously judge faces.
To date, face impression research has focused on bottom-up structural aspects, highlighting specific arrangements of features serving as “facial stereotypes” for particular traits. Ecological accounts of person perception have long emphasized evolutionarily important arrangements of facial features, with trait impressions reflecting overgeneralizations of functionally adaptive cues, such as emotional expressions (Mcarthur & Baron, 1983). As such, a face’s physical resemblance to emotional expressions conveys specific trait impressions, for example, joyful cues (e.g., upturned mouth) convey trustworthiness while angry cues (e.g., furrowed brow) convey untrustworthiness (Oosterhof & Todorov, 2009; Zebrowitz et al., 2003). Data-driven models have identified specific facial features associated with countless perceived traits (Oosterhof & Todorov, 2008). Even 3-year-old children reliably evaluate traits from faces, leading some to argue that such trait associations may be congenitally determined (Cogsdill et al., 2014).
However, experience and learning could also play a role. Although perceivers are unlikely to form trait associations with facial appearance based on targets’ own behavior (as facial appearance does not reliably correspond to one’s personality traits; Rule et al., 2013), they may implicitly learn regularities between certain facial features (e.g., furrowed brow) and how others judge and react to individuals with those features (e.g., untrustworthy), as these face-based judgments and reactions tend to be highly consistent (Oosterhof & Todorov, 2008). Accordingly, such trait associations with facial appearance would come to reflect preconceived notions derived from the social environment. Indeed, researchers have increasingly documented the effects of learning and experience on trait associations of faces (e.g., Dotsch et al., 2016; Hehman et al., 2017; Sofer et al., 2017; Stolier et al., 2018, 2020).
In the present research, we leverage such sensitivity to learning to explore an intervention to reduce facial stereotyping. Given face impressions’ lack of correspondence with actual personality and their impact on real-world outcomes (Olivola et al., 2014), it is important to examine whether facial stereotype biases can be ameliorated. A rich literature has leveraged counterstereotype interventions to reduce social biases like racial or gender bias (Lai et al., 2016; Paluck & Green, 2009), but trait impressions have not typically been conceptualized as a bias requiring intervention (despite operating as facial stereotypes). Counterstereotype interventions have typically had participants engage in counterstereotypical imagery or be presented with counterstereotypical exemplars or pairings of exemplars and traits (e.g., a female name and “strong”; for reviews, Blair, 2002, Forscher et al., 2019; Gawronski & Bodenhausen, 2006). In the face impressions literature, studies have long examined how trait impressions of a face are updated based on newly learned behaviors related to that trait (Todorov & Uleman, 2002, Bliss-Moreau et al., 2008). For instance, face identities paired with positive or negative behaviors are subsequently judged as more or less trustworthy, respectively (Todorov & Olson, 2008). This impression updating occurs because behavioral information spontaneously triggers trait inferences that alter impressions (Uleman et al., 1996). As learned behaviors can affect trait associations for individual face identities, it may be possible to leverage such behavior-based learning to flexibly reassociate trait associations with more general facial appearances, that is, facial stereotypes. Indeed, previous work found that facial features become rapidly associated with a trait if those features are paired with the trait’s related behaviors (Lick et al., 2018).
Here, we integrate the bias intervention and impression updating literatures to investigate whether a counterstereotype training paradigm, which pairs facial features and behaviors, has the potential to reduce facial stereotyping. A training task was used to either reverse (trained group) or maintain (control group) the mapping between untrustworthy/trustworthy facial features and untrustworthy/trustworthy behaviors. We focus on trustworthiness because it is the primary dimension by which faces are judged and is a proxy for general face valence (Oosterhof & Todorov, 2008). In the trained group, untrustworthy faces were associated with trustworthy behaviors 80% of the time and trustworthy faces were associated with untrustworthy behaviors 80% of the time. In the control group, the same faces were instead presented with a name label that provided similar individuating information without countering or affirming facial stereotypes, allowing this group to act as a baseline for trustworthiness evaluations. Overall, we found that the behavioral counterstereotype training reduced facial stereotype biases in an economic trust game (Studies 1A/1B), hiring decisions (Studies 2A/2B), and even in automatic evaluations assessed via evaluative priming (Studies 3A/3B), thereby showing that even long-held facial stereotype associations can be mitigated through associative learning mechanisms.
Study 1
We test whether training reduces facial stereotype bias in an economic trust game, assessed via payments to computer-generated (Study 1A) or real (Study 1B) faces.
Method
Participants
Without a direct precedent, for all studies, we used a target sample size of 200 participants (100 participants/group), the sample necessary to detect a small-to-medium effect size (d = .3) at 80% power. All studies were performed on Mechanical Turk. Participants received monetary compensation.
Two hundred twenty-five total participants performed Study 1A. For final analyses, 101 participants were in the control group (age: M = 33.7 years, SD = 10.3 years; 43 male; race: 77 White, nine Black, eight Asian, seven Other; Hispanic/Latino ethnicity: 10), and 100 participants were in the trained group (age: M = 35.5, SD = 11.1; sex: 39 male; race: 80 White, nine Black, eight Asian, three Other; Hispanic/Latino ethnicity: 6). Two hundred twenty total participants completed Study 1B; 102 participants were in the final control group (age: M = 33.3, SD = 10.3; 57 male; 80 White, 15 Black, seven Asian; Hispanic/Latino ethnicity: 12), and 100 participants were in the final trained group (age: M = 34.7, SD = 10.0; sex: 58 male; race: 79 White, 13 Black, eight Asian; Hispanic/Latino ethnicity: 10).
Procedure
Participants engaged in a two-part task. The first task involved a learning phase purporting to test face memory. For control participants, the learning phase involved 20 different faces paired with a name label. The trained participants viewed 20 faces paired with one-sentence trustworthy/untrustworthy behavioral descriptions (see Supplementary Material for details). Participants were instructed to memorize the face–behavior or face–name pairings for a later test of face memory.
In Study 1A, the target faces were computer-generated White male faces generated via FaceGen (Blanz & Vetter, 1999). The faces were manipulated along the trustworthiness trait dimension (Oosterhof & Todorov, 2008), such that they were +2 SD or −2 SD in trustworthiness. Faces were cropped such that only internal face features were visible. Participants only saw one variant of each facial identity in the study (trustworthy or untrustworthy). One set of faces was used during training, and a different set of faces was used for the evaluation phase.
In Study 1B, the stimuli were natural White male faces from the Basel Face Database (Walker et al., 2018). These faces were systematically manipulated on the communion dimension, which is virtually identical to the trustworthiness dimension in facial feature space, acting as a semantic analogue to trustworthiness (Stolier et al., 2018). The stimuli consisted of White male faces whose features were increased +2 SD or decreased −2 SD in trustworthiness/communion, making them a functional analogue to the Study 1A faces (Figure 1).

Examples of computer-generated faces (A) and real faces (B) that systematically vary in trustworthiness. Note. The faces on the left columns are 2 SD below the population mean in trustworthiness and faces on the right are 2 SD above the population mean.
Both control and trained participants saw 10 unique trustworthy faces and 10 unique untrustworthy faces during the learning phase. For the trained group, the 10 trustworthy faces were paired with untrustworthy behaviors 80% of the time and the 10 untrustworthy faces were paired with trustworthy behaviors 80% of the time. The face–behavior pairings were randomized for each subject. Each trial was self-paced and presented in a randomized order. A time-out warning (meant to ensure participants carefully read the face–behavior/face–name pairings) remained on screen for 2,000 ms if participants finished a trial in under 500 ms. Each face–behavior pairing was repeated three times, resulting in 60 learning trials.
Participants then performed a second task, ostensibly unrelated to the learning phase. This task was an economic trust game involving decisions to allocate money to another player, similar to previous studies (Bonnefon et al., 2013; Rezlescu et al., 2012; van’t Wout & Sanfey, 2008). To improve the task’s realism, participants were told they were playing for real money, and they were urged to maximize profits. Participants selected a face avatar to represent themselves, and they were told other human players were represented by similar avatars. The time between each round of the trust game began with a delay of random duration (1,000–8,000 ms) to simulate the latency in matching with another player. In actuality, there were no human players and participants were presented with a fixed set of faces in a randomized order. Additional data suggested that participants believed the cover story (see Supplementary Material).
Before the game began, participants were told they were assigned as Player 1, the “giver.” Consistent with previous studies (van’t Wout & Sanfey, 2008), for each trial, they received US$1.00 and could give a portion to another player (options: US$0.00, US$0.25, US$0.50, US$0.75, and US$1.00). Whatever amount given was tripled, and their partner could return as much or as little back to the participant. For example, if the participant gave the full US$1.00, the other player would have US$3.00 and could return whatever portion of US$3.00 back to the participant. However, if the participant found the player’s avatar untrustworthy, they could give less money and keep the remainder.
The trust game involved 32 total rounds. For 16 rounds, participants were paired with 16 unique partners. These 16 partners were either high or low in trustworthiness (+2 and −2 SDs), resulting in eight trustworthy faces and eight untrustworthy faces. The remaining 16 rounds consisted of female avatars with no systematic variability in trustworthiness, included to reduce the task’s transparency. Critically, the faces in the trust game were distinct from those in the learning phase so we could test whether training generalized to novel target faces that shared trustworthy and untrustworthy face features (but differed in identity).
Five attention checks were interspersed between trust game rounds to ensure participants maintained attention. The participants were instructed to press a number (1–5). After the trust game, participants learned there would be no test of face memory and were debriefed about the study’s aims.
Results and Discussion
Participants who triggered timeout errors on >50% of learning phase trials (indicating they were not carefully encoding face–behavior/face–name pairings) were excluded (Study 1A: 14 participants; Study 1B: six participants), as were participants who failed any attention check (Study 1A: 10 participants; Study 1B: 12 participants). Data for all studies are available at https://osf.io/3x5qn/
In Study 1A with computer-generated faces, a 2 (group: control vs. trained) × 2 (facial trustworthiness: trustworthy vs. untrustworthy) mixed-model analysis of variance (ANOVA) on payment elicited a main effect of trustworthiness, F(1, 199) = 9.79, p = .002,

Violin plots representing probability densities for payment allocated to trustworthy and untrustworthy faces, separately for the control and trained groups for Study 1A (A) and Study 1B (B). Note. Error bars denote standard error of the mean. Note that values may exceed the possible 0–100 cents range because probability densities are depicted.
Similarly, in Study 1B with real faces, a 2 (group: control vs. trained) × 2 (trustworthiness: trustworthy vs. untrustworthy) mixed-model ANOVA elicited a main effect of facial trustworthiness, F(1, 200) = 32.54, p < .0001,
Thus, a brief learning phase was sufficient to abolish the effects of facial appearance on trust game payments. While control participants were influenced by facial trustworthiness, trained participants paid trustworthy and untrustworthy faces equally. Moreover, because the training generalized to novel targets, these results show that the associations between specific facial appearances and trustworthy/untrustworthy traits were flexibly updated rather than simply updating impressions of specific individuals.
Study 2
In Study 2, we aim to extend the extinction of facial stereotype biases to downstream behaviors with important social consequences. Participants engaged in identical training as Study 1 and performed a mock hiring process in accordance with studies examining hiring bias (Rooth, 2010). Participants evaluated candidates for a job and were given biographical information alongside a candidate’s face. They rated candidates’ suitability for a job and the likelihood that they would interview them. Candidates’ faces were either computer-generated (Study 2A) or real (Study 2B). Prior work has shown that facial stereotype biases persist even when more diagnostic cues are available (Jaeger et al., 2019; Olivola & Todorov, 2010). In Study 2, we test whether training reduces facial stereotype bias even in contexts where more diagnostic information is provided.
Method
Participants
Two hundred nineteen total participants completed Study 2A. About 104 participants were in the final control group (age: M = 35.0, SD = 9.0; sex: 60 male; race: 79 White, 16 Black, nine Other; Hispanic/Latino ethnicity: 15), and 102 participants were in the final trained group (age: M = 36.3, SD = 9.8; sex: 53 male; race: 82 White, 12 Black, eight Asian; Hispanic/Latino ethnicity: 15). 235 total participants completed Study 2B. Around 105 participants were in the control group (age: M = 36.8, SD = 11.0; sex: 48 male; race: 87 White, five Black, 10 Asian, three Other; Hispanic/Latino ethnicity: 15), and 112 participants were in the trained group (age: M = 35.0, SD = 10.0; sex: 57 male; race: 82 White, 17 Black, 11 Asian, two Other; Hispanic/Latino ethnicity: 14).
Procedure
Following the same training as Study 1, participants performed a task purportedly helping a company make hiring decisions. They were shown a picture of a candidate as well as their age, years of relevant work experience, and college attended. Ages were randomized from 24 to 26 years in 1-year increments. Relevant work experience was randomized from 2.5 to 3.5 years in half-year increments. Colleges were taken from a ranking of top 25 public universities (U.S. News and World Report, 2019). The combination of face, age, education, and work experience was randomized for each participant. Biographical information was presented in accordance with studies examining bias in hiring (Rooth, 2010).
The hiring task involved making two judgments about each candidate: rating how suitable they were for the job (1 = not suitable at all to 7 = very suitable) and how likely they were to invite them for an interview (1 = not at all likely to interview to 7 = very likely to interview). The candidates consisted of 10 trustworthy and 10 untrustworthy faces, yielding 20 candidates total. In Study 2A, candidates were computer-generated faces, and in Study 2B, candidates were real faces. Interspersed between trials were attention checks, wherein participants typed a specific number (1–7). After rating the candidates, participants were debriefed.
Results and Discussion
Participants were removed for >50% time-out trials in the learning phase (Study 2A: five participants; Study 2B: six participants), failing any attention checks (Study 2A: six participants; Study 2B: five participants), or for having fast mean reaction times (RT; <500 ms) during the hiring task (Study 2A: two participants; Study 2B: seven participants).
In Study 2A, the two measures were highly correlated (r = .89, p < .0001, 95% CI [0.863, 0.918]), so they were averaged together into an overall job suitability metric and submitted to a 2 (group: control vs. trained) × 2 (trustworthiness: trustworthy vs. untrustworthy) mixed-model ANOVA. There was a main effect of trustworthiness, F(1, 204) = 81.48, p < .0001,

Violin plots representing probability densities for the job suitability metric, separately for trustworthy and untrustworthy faces and for the control and trained groups in Study 2A (A) and Study 2B (B). Note. Error bars denote standard error of the mean. Note that values may exceed the possible 1–7 ratings range because probability densities are depicted.
For Study 2B (real faces), the two measures were again highly correlated (r = .87, p < .0001, 95% CI [0.84, 0.90]), so were averaged together and submitted to a 2 (group) × 2 (trustworthiness) mixed-model ANOVA. There was a main effect of trustworthiness, F(1, 215) = 36.25, p < .0001,
Extending the results of Study 1, the brief training reduced the effect of facial appearance in the domain of hiring decisions. Importantly, the findings also show that this bias reduction persists even in contexts where more diagnostic, relevant information is provided rather than a task where only faces are provided.
Study 3
In the previous studies, it is possible that participants were merely responding to learned information in a deliberate manner rather than genuinely shifting their evaluations. In Study 3, we use an evaluative priming task to test whether training reduces facial stereotype biases even in automatic face evaluations. Evaluative priming measures automatic evaluations in an indirect manner (Fazio et al., 1986), including for faces (Olson & Fazio, 2003) and social biases (Olson & Fazio, 2006). Participants were shown trustworthy or untrustworthy face primes followed by a positive or negatively valenced word (e.g., “happy,” “poison”). Given the automaticity of face evaluations (e.g., Freeman et al., 2014; Todorov et al., 2009), in the control group, we expect RT facilitation effects, whereby evaluatively congruent face–target pairs (trustworthy face/positive word and untrustworthy face/negative word) elicit faster RTs than incongruent pairs (trustworthy face/negative word and untrustworthy face/positive word). If the training is successful in remapping these associations and reducing facial stereotypes, we would expect reduced RT facilitation in the trained group.
Method
Participants
Two hundred forty-four total participants completed Study 3A on Mechanical Turk. Previous studies have successfully conducted evaluative priming tasks in online samples (e.g., Mattan et al., 2019; Green et al., 2019). About 110 participants were in the final control group (age: M = 36.9, SD = 11.3; sex: 51 male; race: 87 White, 13 Black, 10 Other; Hispanic/Latino ethnicity: 9), and 110 participants were in the final trained group (age: M = 37.0, SD = 10.5; sex: 55 male; race: 88 White, 15 Black, four Asian, three Other; Hispanic/Latino ethnicity: 8). Two hundred twenty-six total participants completed Study 3B, and 97 participants were in the final control group (age: M = 38.4, SD = 12.3; sex: 46 male; race: 75 White, 13 Black, six Asian, three Other; Hispanic/Latino ethnicity: 5), and 108 participants were in the final trained group (age: M = 36.0, SD = 9.7; 60 male; 79 White, 16 Black, seven Asian, six Other; Hispanic/Latino ethnicity: 13).
Procedure
Following the same training as previous studies, participants were told they would perform a language test. On each trial, a fixation cross was presented (500 ms), followed by a prime face (200 ms), followed by a blank screen (100 ms), followed by target word that appeared until a response. The timing was based on recommendations from previous studies to achieve facilitation effects (Fazio et al., 1986; Hermans et al., 1994). The task was to classify the target word as positive or negative as quickly and accurately as possible by key press. The prime faces consisted of new trustworthy and untrustworthy faces that were not encountered during training, with eight unique identities for each condition, resulting in 16 prime faces that were computer-generated (Study 3A) or real (Study 3B). The target words were adjectives that had positive (e.g., “Good,” “Kind”) or negative (“Bad,” “Mean”) valences. There were 10 target words for each valence. Each prime face (16 faces) was paired with each target word once, resulting in 320 trials total.
Results and Discussion
Participants were removed for having >50% of learning trials register as time-outs (Study 3A: 11 participants; Study 3B: eight participants) or for at-chance performance (50% accuracy) in the evaluative priming task (Study 3A: 13 participants; Study 3B: 13 participants). For final analysis, we removed incorrect responses (Study 3A: 5% of trials; Study 3B: 6% of trials) and trials with RTs faster than 250 ms and slower than 3,000 ms (Study 3A: 6% of trials; Study 3B: 7% of trials).
For each participant, RT difference scores were computed for (negative–positive) words. In this calculation, positive values (greater than 0) denote facilitation for positive words and difference scores smaller than 0 denote facilitation for negative words. Separate RT difference scores were computed for trustworthy and untrustworthy faces.
For Study 3A, RT difference scores were submitted to a 2 (group) × 2 (trustworthiness) mixed-model ANOVA, eliciting a main effect of facial trustworthiness, F(1, 218) = 44.87, p < .0001,

Violin plots representing probability densities for reaction time (RT) facilitation effects (negative RT–positive RT), separately for trustworthy and untrustworthy faces in the control and trained groups in Study 3A (A) and Study 3B (B). Note. The dashed line represents zero facilitation in either direction. Positive values indicate facilitation for positive words and negative values indicate facilitation for negative words. Error bars denote standard error of the mean.
For Study 3B, RT difference scores were submitted to a 2 (group) × 2 (trustworthiness) mixed-model ANOVA, eliciting a main effect of trustworthiness, F(1, 203) = 13.34, p < .0001,
Extending the results of the previous studies, we found training reduced not only payment and hiring decisions, but even participants’ automatically generated evaluations for trustworthy and untrustworthy targets, assessed in a more implicit manner.
Meta-Analysis and Replication
To assess the overall strength of the training effects, we meta-analyzed the six studies using fixed effects, with effect size weighted by sample size (Goh et al., 2016). The effect size for the Trustworthiness × Group interaction in each study was converted to Cohen’s d. The meta-analytic effect was significant, with a conventionally medium effect size, mean d = .53, Z = 9.30, p < .00001. The aggregated meta-analytic framework also permitted additional analyses described in the Supplementary Material. Participant race did not influence any results, and the training effects held when controlling for other facial dimensions such as attractiveness and competence. Finally, the training effects were replicated in an additional study (n = 199) that used a different set of training behaviors (see Supplementary Material).
General Discussion
Across six studies and a replication, we demonstrate flexibility in facial stereotyping, with a brief training reducing or extinguishing the activation and application of facial stereotypes. Expectedly, untrained control participants were strongly biased by facial appearance. When targets bore untrustworthy appearances, control participants allocated less money, found them to be less suitable for jobs, and harbored automatic evaluative biases against them. When associations between facial appearance and trustworthiness were reversed in trained participants, facial stereotype biases were reduced or eliminated. Critically, the test phase involved novel targets, so these results reflect a generalized bias reduction for untrustworthy versus trustworthy facial appearances rather than merely effects of impression updating for specific targets. This bias reduction was observed for both computer-generated and real faces and demonstrated across a wide set of contexts: when facial appearance was the only information provided (Study 1), when diagnostic, decision-relevant information was also provided (Study 2), and even when automatic evaluative responses were assessed in a more implicit manner (Study 3).
Previous work has shown that face impressions update when new behavioral information about specific identities is encountered (Bliss-Moreau et al., 2008; Todorov & Olson, 2008; Todorov & Uleman, 2002). The present results demonstrate a distinct, more fundamental form of updating underlying face impressions, whereby specific facial appearances themselves can be flexibly reassociated with new trait information. In the context of counterstereotype training paradigms used to reduce racial or gender bias (Blair, 2002; Forscher et al., 2019; Gawronski & Bodenhausen, 2006; Lai et al., 2016; Paluck & Green, 2009), it is noteworthy that our training paired exemplars with behaviors rather than traits (as is common in these paradigms), and yet a brief training period yielded particularly strong effects. Because behavioral information spontaneously triggers trait inferences (Uleman et al., 1996), it is likely that these inferred traits may alter stereotypical associations through similar mechanisms as presenting the traits directly (e.g., Gawronski & Bodenhausen, 2006; Gawronski et al., 2008). However, in theory, behavioral pairings in a counterstereotype training context may operate doubly as individuating information, leading to greater individuation of targets than would standard trait pairings, thereby aiding in bias reduction (Brewer, 1988; Fiske & Neuberg, 1990). If true, such behavioral counterstereotype training may capitalize on a synergy between counterstereotypical learning and individuation processes and represent a promising paradigm for bias reduction research more generally. It is also possible that facial stereotype bias is simply easier to reduce than other biases. We believe this is unlikely, as previous attempts to reduce facial stereotyping have failed (Jaeger et al., 2019).
Previous attempts to reduce facial stereotyping have focused on either educating participants to avoid facial stereotypes or “nudging” them to use more diagnostic cues, which proved unsuccessful (Jaeger et al., 2019). Unlike the present work, this previous study attempted to reduce bias by raising participants’ awareness of their general stereotypes. However, raising awareness of a general rather than specific bias can often be ineffective (Axt et al., 2019). Moreover, while interventions focusing on participants’ goals to reduce bias can be effective (Fazio & Towles-Schwen, 1999; Forscher et al., 2019), counterstereotype interventions like those used in the present research (aimed at changing underlying associations) tend to elicit stronger bias-reduction effects (Lai et al., 2016). It is also likely that the former interventions rely more on deliberate, propositional processes and the latter on more automatic, associative processes, and this distinction may have implications for successful bias reduction (Gawronski & Bodenhausen, 2006). Future research could directly compare multiple candidate interventions in the novel context of facial stereotyping as well as test their boundary conditions.
A critical question is whether the mitigation of facial stereotype biases persists over time. Lab-based interventions have successfully reduced implicit racial bias, but these effects tend to vanish after 3–4 days (Lai et al., 2016). Given a lifetime of acquired facial stereotype associations, it might prove difficult for single-shot interventions to result in long-term evaluative shifts. In the context of racial bias, researchers have incorporated similar kinds of lab-based counterstereotype training into longer, more intensive interventions under the framework of bias habit-breaking, which have demonstrated long-term bias reductions (Devine et al., 2012; McNulty et al., 2017). Our findings provide a critical first demonstration that facial stereotype bias is indeed malleable rather than fixed and susceptible to counterstereotype interventions. Theoretically, this is important because while other social biases like racial bias are widely acknowledged to be malleable, as they reflect cultural learning (which interventions seek to reverse), facial stereotype bias is generally assumed to be relatively fixed due to evolutionary adaptation. Thus, demonstrating a generalized bias reduction in activating and applying facial stereotypes even in the short-term represents an important advance. Future research could build on the present work to test whether the present training, in isolation or in conjunction with more temporally extended interventions, could have demonstrable long-term bias reduction effects.
There are several limitations of this work. Our stimuli were limited to White male faces so as to avoid potential confounds such as individual differences in gender and racial bias. Previous work has shown that target race and gender affect facial trustworthiness evaluations; however, substantial variance in these evaluations is still driven by trustworthiness-related features that are independent of race and gender (Hehman et al., 2019; Xie et al., 2019). Future studies could build on this work to explore how the training effects generalize to faces of other social groups and potentially interact with other social dimensions. Moreover, an alternative interpretation of our overall results is that rather than the training weakening associations between specific features and traits, it violated participants’ expectations and decreased their general reliance on facial stereotypes. Future research could directly examine the specificity versus generality of these training effects, and this could have implications for different intervention strategies.
In summary, the present work provides evidence for an associative learning mechanism that shapes our evaluations of others’ faces and that this can be exploited to reduce the activation and application of facial stereotypes. These findings not only show that face impressions are more flexible than typically appreciated, but they also provide an inroad toward combating ingrained biases based on facial appearance.
Supplemental Material
Supplemental Material, Kao_SPPS_Supplemental_final - Facial Stereotype Bias Is Mitigated by Training
Supplemental Material, Kao_SPPS_Supplemental_final for Facial Stereotype Bias Is Mitigated by Training by Kao-Wei Chua and Jonathan B. Freeman in Social Psychological and Personality Science
Footnotes
Acknowledgments
We thank Michael Berkebile and Maryam bin Meshar for their help with the studies and stimulus preparation. We thank Dr. Chu Chang Chua for her continued guidance.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported by NSF BCS-1654731 (J.B.F) and SBE-1911860 (K.C.) grants.
Supplemental Material
The supplemental material is available in the online version of the article.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
