Does the Linguistic Expectancy Bias Extend to a Second Language?

Abstract

The linguistic expectancy bias (LEB) reflects the tendency to describe expectancy-consistent behavior more abstractly than expectancy-inconsistent behavior. The current studies replicate the LEB in Portuguese and examine it in a second language (English). Earlier studies found differences in processing a first language (L1) and a second language (L2) shaping affective and cognitive processes. We did not expect these differences to shape the LEB because controlled lexical decisions (e.g., use of verbs and adjectives) are unlikely, even when using L2. Participants wrote stereotypically male or female behavioral descriptions for male and female targets. A new group of participants read those descriptions and was asked about their causes. Expectancy-consistent behavior was described more abstractly and shaped more dispositional inferences in L1 and L2. Aside from replicating the LEB in a different language, these studies indicate that structural features of language preserve a linguistic bias with implications for social perception even when using a second language.

Keywords

linguistic expectancy bias, second language, language use, language abstraction, interpersonal communication, social attribution

Language is a powerful social tool that is a vehicle to pass on a message and an instrument to shape the message. Indeed, this makes the communicative process susceptible to subtle linguistic biases, such as those involved in transmitting and maintaining stereotypes (e.g., Maass et al., 1989; Wigboldus et al., 2000).

It has long been established that the language people use to describe ingroup and outgroup behavior varies systematically, a phenomenon known as the linguistic intergroup bias (LIB; Maass et al., 1989). People use abstract terms (e.g., adjectives [ADJ], nouns) to describe ingroup members' desirable behavior and outgroup members' undesirable behavior (e.g., the ingroup member is helpful; the outgroup member is aggressive). In contrast, an ingroup member showing undesirable behavior and an outgroup member engaged in desirable behavior are both described with concrete terms (e.g., the ingroup member pushes someone; the outgroup member opens the door to someone).

Concrete or abstract linguistic representations of behavior convey different types of information. Although more abstract descriptions lead to generalizations across situations about the targets of such messages, concrete messages refer to the here and now of a behavior and suggest that the behavior in question is situated. This systematic difference in abstraction has been consistently replicated across different languages such as Italian (e.g., Maass et al., 1989), Dutch (e.g., Werkman et al., 1999), Japanese (e.g., Tanabe & Oka, 2001), or French (e.g., Assilaméhou & Testé, 2013), and successfully applied to the study of stereotypes and intergroup relations (e.g., Gorham, 2006; Maass, 1999; Maass & Arcuri, 1996; Maass et al., 1996, 1998; Rubini & Semin, 1994).

The LIB was initially explained based on ingroup protective motives (Tajfel & Turner, 1979): when the ingroup is threatened, the LIB is used to maintain a positive image even in the presence of contrary evidence (Maass & Arcuri, 1996; Maass et al., 1996). A more recent interpretation suggested that the LIB relies on representing typical and stable knowledge in more abstract terms: expectancy-consistent behavior is described with more abstract predicates than expectancy-inconsistent behavior regardless of valence (e.g., Maass et al., 1995). However, it has also been argued that the informational distinction between expected and unexpected behaviors can be socially motivated as well (e.g., Fiedler et al., 2003).

Research on the LIB shows that the use of language contributes in a subtle but powerful way to the representation of stereotypes, that is, to positive perceptions of ingroup members and negative perceptions of outgroup members by receivers of these messages. However, Wigboldus et al. (2000) have a broader take on this by providing additional support that expectancy-consistent behavior is described at a higher level of abstraction than expectancy-inconsistent behavior, regardless of the target group membership—the linguistic expectancy bias (LEB). Moreover, by communicating expectancy-consistent behavior more abstractly, a sender should lead receivers to infer that the behavior in question is due to the target's character (i.e., dispositional inferences). In contrast, more concrete descriptions (made for expectancy-inconsistent behavior) are likely to lead receivers to infer that the target's behavior is driven by situational constraints (i.e., situational inferences). These predictions were experimentally examined by Wigboldus et al. (2000). Participants were asked to describe events in which a female or male friend revealed stereotypical male and stereotypical female behavior. These descriptions were then randomly distributed to the participants, who were asked about the causes of the behavior described. To determine the level of linguistic abstraction in the behavioral descriptions, the authors used the linguistic category model (LCM; Semin & Fiedler, 1988), a powerful tool to analyze how people use interpersonal terms when representing social events in communication (Semin, 2012) and, therefore, a useful model for research on stereotype communication (Maass et al., 1989). 
This model distinguishes four types of predicates, ordered from concrete to abstract: descriptive action verbs (DAV) are the most concrete, with the verb corresponding unequivocally to the behavior in question (e.g., John hits Mary); interpretive action verbs (IAV) provide an interpretive frame for the behavior, and the same verb can cover different concrete actions depending on context (e.g., John hurts Mary); state verbs (SV) describe a state of the target with no verifiable behavior (e.g., John hates Mary); and ADJ describe dispositional properties of a target (e.g., John is aggressive).

Wigboldus et al.'s (2000) results confirmed that expectancy-consistent behavior is communicated with more abstract predicates than expectancy-inconsistent behavior. Critically, more abstract descriptions subsequently led to stronger dispositional inferences than the less abstract ones produced for expectancy-inconsistent behavior, thereby endorsing stereotypical beliefs (Wigboldus & Douglas, 2007). The LEB was further replicated in German (Fiedler et al., 2003) and Dutch (Wigboldus et al., 2006). These studies uncovered important moderators, such as interpersonal communication goals, and established the validity of the LEB effect at the individual level.

However, to our knowledge, the LEB has always been examined in participants using their native language (L1). There is no evidence on whether this effect can be observed with participants using a second language (L2).

Due to professional, educational, and social demands, mastering an L2 is critical nowadays. However, several studies have shown that communicating in L1 or L2 shapes affective and cognitive processes differently. For example, speakers' perceptions of language emotionality are higher in L1 than in L2 (e.g., Dewaele, 2004, 2008; Dewaele & Nakano, 2013; Garrido & Prada, 2018; see Caldwell-Harris, 2015; Pavlenko, 2012, for reviews). Other research examining the psychophysiological markers of somatic and autonomic activity has also shown that emotional words produce higher physiological arousal when presented in L1 (see Harris, 2004). Memory performance for emotional words (e.g., Anooshian & Hertel, 1994; Marmolejo et al., 2009) or words encoded in emotional scenarios (e.g., Saraiva et al., 2021) presented in L1 is also higher than in L2. A different line of studies also showed that using an L2 reduces decision-making bias and fosters more utilitarian decisions in moral dilemmas (e.g., Costa et al., 2014; Hayakawa et al., 2017), suggesting that in L2, decision-making is more deliberate and less intuitive than in L1 (for reviews, see Costa et al., 2017; Hayakawa et al., 2016). These differences were further documented in studies showing that, in contrast to L1, information processing in L2 recruits more brain areas related to control processes (Branzi et al., 2016).

One of the most prominent accounts for the observed differences in processing an L1 and an L2 suggests that L2 engages emotions less than L1 does. These differences arguably result from L1 being acquired and used in an emotionally rich context (e.g., family, friends), whereas L2 is often learned and used in more emotionally detached contexts (e.g., school, work; Keysar et al., 2012). The reduced emotional processing engaged in L2 could reduce the impact of affective states on people's decisions and enhance deliberative processing (Costa et al., 2017; Hayakawa et al., 2016), namely by allowing people to exert higher control over their linguistic choices in L2. Using L2 is also likely to enhance psychological distance (Costa et al., 2017; Hayakawa et al., 2016), leading to a more abstract construal level (e.g., Trope & Liberman, 2010) and a more objective perspective of the situation. Finally, the increased difficulty in processing a more disfluent language may also signal the need for more careful processing (Costa et al., 2017; Hayakawa et al., 2016) and trigger more deliberative thinking (e.g., Oppenheimer, 2008).

The literature has already shown how language and its systematic biases can shape social communication and influence social perception (e.g., Maass et al., 1989; Semin, 2000; Wigboldus et al., 2000). There are also a few studies indicating that in bilinguals, the native language enhances cultural biases, namely, more favorable implicit attitudes toward the social group associated with the language of the test (Danziger & Ward, 2010; Ellis et al., 2015, 2018; Ogunnaike et al., 2010). However, research on how processing differences in L1 and L2 might contribute to the communication of stereotypes has not yet been reported.

The first goal of the current work is to examine the LEB in Portuguese (Study 1), one of the most spoken languages in the world, with more than 200 million native speakers (Lewis, 2009). This replication will further ascertain the LEB as a reliable phenomenon. The second and main goal of this work is to examine the LEB in an L2 (Study 2).

Given earlier work suggesting reduced emotionality, an increased construal level, or perceived disfluency in an L2, the LEB might not be observed in L2. An alternative prediction emerges from the “Architecture of Linguistic Behavior” (Semin, 2006), which distinguishes four different levels of language use. At the utterance or surface level, thematic or topical choices are driven consciously by explicit goals and their situated relevancies (Sperber & Wilson, 1995). This surface level of language use is scaffolded by the lower layers of language use, namely phonemes, as constituents at the primary level of organization, with morphemes at the second level, and the phrase structure at the third level. These three levels escape conscious access. The proposed automaticity of lexical decisions finds empirical support in the recurrent finding that people use a biased selection of predicates (verbs and ADJ) even when explicitly instructed not to do so (e.g., Douglas & Sutton, 2006). This suggests that controlled lexical decisions are highly unlikely, even when using an L2. Highly automated lexical decisions about the use of verbs and ADJ are likely to be driven by L1 habits. Since the function they fulfill is identical across Portuguese and English, the LEB should also be observed in L2.

To test our predictions, we conceptually replicated Wigboldus et al.’s (2000) study in L1 (Studies 1a and 1b) and in L2 (Studies 2a and 2b). We expected to observe the LEB in L1 (i.e., European Portuguese) and sought to determine whether this linguistic bias extends to an L2 (i.e., English).

Studies 1a and 1b

In Study 1a, participants were asked to describe (desirable or undesirable) stereotypically male and female behaviors for a female or a male target. We expected that behavior consistent with the target stereotype (e.g., male target performing a stereotypically male behavior) would be communicated with a higher level of abstraction than behavior inconsistent with the target stereotype (e.g., male target performing a stereotypically female behavior), thereby replicating the LEB (Hypothesis 1). In Study 1b, a new group of participants received a sample of the descriptions of the target generated in the first study. They had to use these descriptions to make inferences about the targets. We expected that expectancy-consistent behavioral descriptions (more abstract) would lead to more dispositional inferences compared to expectancy-inconsistent (more concrete) ones (Hypothesis 2).

Method

Participants

In the original study that we were replicating, the sample included 33 participants. Since the main purpose of Study 1a was to obtain descriptions for the second study, we approximated the original sample size (N = 35). However, six participants did not comply with the instructions and were excluded from the data analysis. The final sample consisted of 29 Portuguese native speakers (M_age = 29.62; standard deviation [SD] = 8.72; 18 female).

Given the changes introduced in the original procedure, namely running two separate studies instead of using a within-participants design, we calculated a new sample size for Study 1b with an a priori power analysis (G*Power). Using as reference a medium effect size (η_p² = .06; Cohen, 1988) and a power of 1 − β = .80 to detect the interaction between target gender (female vs. male) and behavior stereotypicality (female vs. male), the analysis indicated a required sample of 126 participants. A total of 125 Portuguese native speakers volunteered for the study (96 female; M_age = 28.82; SD = 9.35).
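The relation between the assumed effect size and the required sample can be illustrated with a rough sketch. The code below converts η_p² to Cohen's f and approximates the total N for a single-degree-of-freedom effect. It is not the G*Power computation used in the study: it assumes a large-sample normal approximation in place of the exact noncentral F distribution, and the function names `cohens_f` and `approx_total_n` are hypothetical.

```python
import math
from statistics import NormalDist


def cohens_f(eta_p2: float) -> float:
    """Convert partial eta squared to Cohen's f."""
    return math.sqrt(eta_p2 / (1.0 - eta_p2))


def approx_total_n(eta_p2: float, alpha: float = 0.05, power: float = 0.80) -> int:
    """Approximate total N for a 1-df effect.

    Assumption: the required noncentrality is (z_{1-alpha/2} + z_{power})^2
    and equals f^2 * N. This ignores the finite error degrees of freedom
    that an exact noncentral-F calculation takes into account.
    """
    z = NormalDist()
    noncentrality = (z.inv_cdf(1 - alpha / 2) + z.inv_cdf(power)) ** 2
    return math.ceil(noncentrality / cohens_f(eta_p2) ** 2)


print(round(cohens_f(0.06), 3))  # 0.253
print(approx_total_n(0.06))      # 123
```

Under these assumptions the approximation lands a few participants below the 126 obtained with G*Power, which is the expected direction because the approximation ignores the error degrees of freedom.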

Design

The overall design of the two studies was similar to the original study: a 2 (participant gender: male vs. female) × 2 (target gender: male vs. female) × 2 (behavior desirability: desirable vs. undesirable) × 2 (behavior stereotypicality: male vs. female) mixed design. The variables of behavior desirability and behavior stereotypicality were manipulated within participants. In Study 1a, the dependent variable was the linguistic abstraction level calculated using the LCM (Semin & Fiedler, 1989). In Study 1b, the standardized mean of four dispositional inference questions was the dependent variable.

Procedure

All procedures were conducted following the ethical guidelines of the host institution. The studies were programmed in the Qualtrics online platform, and participants were invited to participate through social network websites. The procedure was similar to that of the original study with two exceptions: data were collected online, and the participants in Study 1a were different from those participating in Study 1b. In both studies, after reading the informed consent and agreeing to participate, participants provided sociodemographic information (i.e., native language, age, and gender).

In Study 1a, participants were asked to think of a good male or female friend (target manipulation, random order) and provide background information about this friend (e.g., when they first met him/her) to ensure they were actually thinking about someone. Then, they were asked to write down four short descriptions of behaviors of this friend that they had witnessed: a desirable stereotypically male behavior, a desirable stereotypically female behavior, an undesirable stereotypically male behavior, and an undesirable stereotypically female behavior. The order of the descriptions was randomized between participants. After writing the fourth description, participants were thanked and debriefed.

Study 1b presented 16 descriptions (eight for each target gender), selected from Study 1a, based on their different degrees of linguistic abstraction. We selected eight descriptions with low to mid abstraction (coded as 1 and 2) and eight with mid to high abstraction (coded as 3 and 4); see the “Data Analysis” section under the “Study 1a” section for further details on the coding. For stimuli generalizability purposes, each set of eight descriptions for each target gender was divided into two equivalent blocks of four: desirable stereotypically male (e.g., X loves sports [SV]), undesirable stereotypically male (e.g., X made a sex joke at the party [IAV]), desirable stereotypically female (e.g., X was always very kind to a friend's children [ADJ]), and undesirable stereotypically female (e.g., X began to cry for no reason [DAV]). Each participant was presented with a random block of four behavioral descriptions of a male or a female target. Within each block, the descriptions were randomly presented, one at a time. After reading each description, two sets of questions were presented, as in the original study. The first set assessed participants' dispositional inferences. Participants were asked to (a) estimate the likelihood of the target repeating the described behavior in the future (as a percentage); estimate the extent to which the behavior described was due to (b) the situation in which the target was (situation attribution) or (c) his/her personality (person attribution), each on a scale from 1 (not at all) to 7 (very much); and (d) indicate whether the behavior described was due to the situation (1) or to the personality (100). These questions were presented in random order. The second set of four questions constituted a manipulation check. Participants were asked to indicate for each description, on a scale from 1 (not at all) to 7 (very much), to what extent they considered the behavior described as desirable, undesirable, stereotypically male, and stereotypically female. These questions were also presented randomly. After reading the four descriptions and answering the two sets of questions for each one, participants were thanked and debriefed.

Results¹

Study 1a

Data analysis

The descriptions generated were categorized according to the LCM (Semin & Fiedler, 1989) by four independent raters, two of them blind to the goals of the study, and all of them blind to the experimental conditions. All verbs and ADJ of each description were identified, and the scoring established by the model was applied. DAV (1 point; the most concrete level of description), IAV (2 points), SV (3 points), and ADJ (4 points; the most abstract level of description) were counted. A general abstraction score (between 1 and 4) was obtained by dividing the total score for all predicates by the total number of predicates. Interrater agreement was 91%, and disagreements between raters were resolved jointly by two of the raters. In Study 1b, the answers to the “situation attribution” question were reversed, and the standardized average of the four dispositional inference questions for each description was calculated. Higher values on this scale mean more dispositional inferences.
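The two scoring rules described above can be sketched in Python as follows. This is an illustration rather than the analysis script used in the studies. The function names are hypothetical, and the composite assumes that each of the four inference items is z-scored across respondents (using the population SD) before averaging, a detail the text does not specify.

```python
from statistics import mean, pstdev

# Points per LCM category, from most concrete to most abstract.
LCM_POINTS = {"DAV": 1, "IAV": 2, "SV": 3, "ADJ": 4}


def abstraction_score(predicate_codes):
    """Total LCM points divided by the number of coded predicates."""
    if not predicate_codes:
        raise ValueError("a description needs at least one coded predicate")
    return sum(LCM_POINTS[code] for code in predicate_codes) / len(predicate_codes)


def zscores(values):
    """Standardize one item across respondents (assumed: population SD, nonzero variance)."""
    m, s = mean(values), pstdev(values)
    return [(v - m) / s for v in values]


def dispositional_index(likelihood, situation, person, bipolar):
    """Per-respondent mean of four z-scored items.

    The 1-7 situation attribution item is reversed (8 - x) first, so that
    higher values always indicate more dispositional inferences.
    """
    reversed_situation = [8 - v for v in situation]
    items = [zscores(col) for col in (likelihood, reversed_situation, person, bipolar)]
    return [mean(row) for row in zip(*items)]


# Hypothetical description coded as one IAV and one ADJ: (2 + 4) / 2
print(abstraction_score(["IAV", "ADJ"]))  # 3.0
```

Standardizing before averaging puts the percentage item, the two 7-point items, and the 100-point item on a common scale, so that no single item dominates the composite.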

Level of abstraction

First, we calculated the level of abstraction of the descriptions obtained in Study 1a (see Table 1). A 2 (participant gender: male vs. female) × 2 (target gender: male vs. female) × 2 (behavior desirability: desirable vs. undesirable) × 2 (behavior stereotypicality: male vs. female) analysis of variance (ANOVA) revealed the expected significant interaction between target gender and behavior stereotypicality, F(1, 25) = 8.92, p = .006, η_p² = .263, 90% CI (.05, .45).

Table 1.

Mean Level of Abstraction as a Function of Target Gender and Behavior Stereotypicality (L1).

	Behavior stereotypicality
Target gender	Stereotypically male		Stereotypically female
	M	SE	M	SE
Male	2.60	.25	1.85	.20
Female	2.07	.27	2.24	.21

Note. L1 = first language; M = mean; SE = standard error.

Planned comparisons indicated that the behavior of male targets was described more abstractly when stereotypically male (mean [M] = 2.60, standard error [SE] = .25) than stereotypically female (M = 1.85, SE = .20), F(1, 25) = 12.82, p = .001, η_p² = .339, 90% CI (.10, .52). The level of abstraction of female targets’ behavior was higher when this behavior was stereotypically female (M = 2.24, SE = .21) than stereotypically male (M = 2.07, SE = .27), but this difference was not significant, F(1, 25) = 0.528, p = .474. Notably, replicating the original study, stereotype-consistent descriptions (M = 2.43, SE = .17) were communicated more abstractly than stereotype-inconsistent descriptions (M = 1.96, SE = .14), t(28) = 3.01, p = .005, d = .55, 90% CI (.22, .87).

A three-way interaction between participant gender, target gender, and behavior desirability was also observed, F(1, 25) = 5.51, p = .027, η_p² = .181, 90% CI (.01, .38). Planned comparisons revealed that the level of abstraction was higher when female participants described female undesirable behavior than desirable behavior, F(1, 25) = 4.81, p = .038, η_p² = .161, 90% CI (.01, .36). The remaining differences were not statistically significant (all ps > .100).
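For readers who wish to check the reported effect sizes, partial eta squared can be recovered from an F statistic and its degrees of freedom through the standard identity η_p² = (F × df_effect) / (F × df_effect + df_error). The short Python sketch below applies it to the F tests reported above. The function name `partial_eta_squared` is illustrative only.

```python
def partial_eta_squared(f_value: float, df_effect: int, df_error: int) -> float:
    """Recover partial eta squared from an F statistic and its degrees of freedom."""
    return (f_value * df_effect) / (f_value * df_effect + df_error)


# Target gender x behavior stereotypicality interaction in Study 1a: F(1, 25) = 8.92
print(round(partial_eta_squared(8.92, 1, 25), 3))  # 0.263
```

The same call with F(1, 25) = 12.82 and F(1, 25) = 5.51 returns .339 and .181, matching the values reported for the planned comparison and the three-way interaction.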

Study 1b

Dispositional inferences

To confirm the effectiveness of the manipulation of the stereotypicality and desirability of the targets' behavior, a separate 2 (participant gender: male vs. female) × 2 (target gender: male vs. female) × 2 (behavior desirability: desirable vs. undesirable) × 2 (behavior stereotypicality: male vs. female) ANOVA was conducted for each of the four manipulation check ratings. As expected, stereotypically male behavioral descriptions were considered more typically male (M = 3.68, SE = .24) than typically female (M = 2.88, SE = .17), F(1, 121) = 12.19, p < .001, η_p² = .092, 90% CI (.03, .18). Likewise, stereotypically female behavioral descriptions were considered more typically female (M = 4.30, SE = .23) than typically male (M = 2.60, SE = .17), F(1, 121) = 58.25, p < .001, η_p² = .325, 90% CI (.21, .42). Desirable behavioral descriptions were considered significantly more desirable (M = 5.47, SE = .15) than undesirable (M = 2.76, SE = .16), F(1, 121) = 142.33, p < .001, η_p² = .540, 90% CI (.44, .61), and undesirable behavioral descriptions were considered more undesirable (M = 4.45, SE = .17) than desirable (M = 2.50, SE = .16), F(1, 121) = 62.58, p < .001, η_p² = .341, 90% CI (.23, .44). These results indicated that the manipulations had worked as intended.

To analyze the inferences made, we conducted another 2 (participant gender: male vs. female) × 2 (target gender: male vs. female) × 2 (behavior desirability: desirable vs. undesirable) × 2 (behavior stereotypicality: male vs. female) ANOVA with the dispositional inferences scale as a dependent variable (α = .68). This analysis showed the expected target gender and behavior stereotypicality interaction, F(1, 121) = 12.88, p < .001, η_p² = .096, 90% CI (.03, .18).

Planned comparisons revealed that the behavior of female targets led to more dispositional inferences when the behavior was stereotypically female (M = .15, SE = .11) than stereotypically male (M = −.15, SE = .09), F(1, 121) = 8.16, p = .005, η_p² = .063, 90% CI (.01, .14). Likewise, the behavior of male targets led to more dispositional inferences when stereotypically male (M = .06, SE = .05) than stereotypically female (M = −.08, SE = .07), F(1, 121) = 4.92, p = .028, η_p² = .039, 90% CI (.00, .11). No other significant main or interaction effects were observed.

Taken together, these results confirmed that when the described behavior is stereotypically consistent with the target gender, the causes of such behavior are more likely to be attributed to the targets' personality (M = .11, SE = .08) than when the behavior is stereotypically inconsistent (M = −.12, SE = .08).

Studies 2a and 2b

In Study 1, the LEB was replicated. Expectancy-consistent behaviors were described more abstractly, and these more abstract descriptions prompted more dispositional inferences. In Study 2, we examined whether this linguistic bias would generalize to L2. To this end, native Portuguese speakers performed the same tasks as in Study 1 but in English. If the L1 habits that drive highly automated lexical decisions about verb and adjective use are identical across a native language and an L2, then the LEB effect should be observed in L2. We would expect that stereotype-consistent behavior would be described more abstractly and that this higher abstraction would lead to more dispositional inferences. If, however, the described differences in cognitive and affective processes involved in L1 and L2 were to affect language use and the type of inferences made, we would not expect the LEB to generalize to L2.

Participants

As in Study 1a, 35 participants were required for Study 2a. Because data collection was set to stop at the end of a sampling day, the sample was somewhat larger (N = 43; 28 female; M_age = 24.72; SD = 6.94). All participants were Portuguese native speakers and were proficient in English² (M_EnglishTest = 20.51; SD = 2.46).

For Study 2b, a sample of 76 participants was determined by an a priori power analysis (G*Power), using as reference the effect size observed in Study 1b (η_p² = .096) and a power of 1 − β = .80 to detect the interaction between target gender (female vs. male) and behavior stereotypicality (female vs. male). Because data collection was set to stop at the end of the day on which the required number of participants was reached, the final sample was larger than predetermined (N = 91; 65 female; M_age = 27.38; SD = 8.87). All participants were Portuguese native speakers and proficient in English (M_EnglishTest = 20.00; SD = 2.37).

Design and Procedure

The design and procedure were the same as in Study 1³. The only difference was that participants performed the tasks in their L2 (English) and were asked to complete an English diagnostic test (Cambridge English assessment) at the beginning of each study.

Results

Study 2a

Level of abstraction

First, we calculated the level of abstraction of the descriptions obtained (see Table 2). A 2 (participant gender: male vs. female) × 2 (target gender: male vs. female) × 2 (behavior desirability: desirable vs. undesirable) × 2 (behavior stereotypicality: male vs. female) ANOVA revealed a significant interaction between target gender and behavior stereotypicality, F(1, 39) = 6.20, p = .017, η_p² = .137, 90% CI (.01, .30).

Table 2.

Mean Level of Abstraction as a Function of Target Gender and Behavior Stereotypicality (L2).

	Behavior stereotypicality
Target gender	Stereotypically male		Stereotypically female
	M	SE	M	SE
Male	2.55	.19	1.97	.15
Female	2.10	.19	2.33	.14

Note. L2 = second language; M = mean; SE = standard error.

Planned comparisons indicated that the behavior of female targets was described more abstractly when stereotypically female (M = 2.33, SE = .14) than stereotypically male (M = 1.97, SE = .15). Likewise, the behavior of male targets was described more abstractly when stereotypically male (M = 2.55, SE = .19) than stereotypically female (M = 2.10, SE = .19). However, these differences did not reach statistical significance, F(1, 39) = 3.27, p = .078, and F(1, 39) = 3.08, p = .087, respectively. Nevertheless, as in Study 1, stereotype-consistent descriptions (M = 2.46, SE = .17) were communicated more abstractly than stereotype-inconsistent ones (M = 2.00, SE = .17), t(42) = 3.15, p = .003, d = .48, 90% CI (.21, .74).

To further examine the role of L2 proficiency (e.g., Costa et al., 2017; Pavlenko, 2012) in the abstraction level of the described behavior, we conducted a regression analysis, using the English test score as an independent variable and the mean abstraction level as the dependent variable. The results were not significant, β = −.005, p = .975, suggesting that L2 proficiency did not affect the abstraction level of the descriptions.

Study 2b

Dispositional inferences

To confirm the effectiveness of the manipulation of the stereotypicality and desirability of the targets' behavior, a separate 2 (participant gender: male vs. female) × 2 (target gender: male vs. female) × 2 (behavior desirability: desirable vs. undesirable) × 2 (behavior stereotypicality: male vs. female) ANOVA was conducted for each of the four manipulation check ratings. The results revealed that stereotypically male behavioral descriptions were considered more typically male (M = 3.90, SE = .20) than typically female (M = 3.06, SE = .14), F(1, 87) = 20.50, p < .001, η_p² = .191, 90% CI (.08, .35). Likewise, stereotypically female descriptions were considered more typically female (M = 4.15, SE = .19) than typically male (M = 2.94, SE = .16), F(1, 87) = 29.58, p < .001, η_p² = .254, 90% CI (.13, .37). The desirable descriptions were considered significantly more desirable (M = 5.32, SE = .17) than undesirable (M = 2.61, SE = .16), F(1, 87) = 120.58, p < .001, η_p² = .581, 90% CI (.47, .66), and the undesirable descriptions were considered more undesirable (M = 4.75, SE = .17) than desirable (M = 2.12, SE = .15), F(1, 87) = 107.17, p < .001, η_p² = .552, 90% CI (.43, .63). These results confirm the effectiveness of the manipulations.

To analyze the inferences made by the participants, we conducted the same 2 (participant gender: male vs. female) × 2 (target gender: male vs. female) × 2 (behavior desirability: desirable vs. undesirable) × 2 (behavior stereotypicality: male vs. female) mixed ANOVA with the dispositional inferences scale as the dependent variable (α = .59). The interaction effect between target gender and behavior stereotypicality was significant, F(1, 87) = 31.90, p < .001, η_p² = .268, 90% CI (.14, .38). Planned comparisons further showed that when the behavioral descriptions of male targets were stereotypically male, more dispositional inferences were made (M = .31, SE = .07) than when the descriptions were stereotypically female (M = −.10, SE = .08), F(1, 87) = 21.98, p < .001, η_p² = .202, 90% CI (.09, .32). Likewise, the behaviors of female targets led to more dispositional inferences when the described behavior was stereotypically female (M = .06, SE = .09) than stereotypically male (M = −.30, SE = .08), F(1, 87) = 11.86, p = .001, η_p² = .120, 90% CI (.03, .23). These results replicate the LEB, suggesting that when the described behavior is stereotypically consistent with the target gender, the causes of such behavior are more likely to be attributed to the target's personality (M = .18, SE = .08) than when the behavior is stereotypically inconsistent (M = −.20, SE = .08), even when L2 is used.

A significant main effect of target gender was also observed, F(1, 87) = 6.51, p = .012, η_p² = .070, 90% CI (.01, .17), revealing more dispositional inferences for behavioral descriptions of male (M = .10, SE = .06) than female targets (M = −.12, SE = .07).

To further examine the role of L2 proficiency in the dispositional inferences made by participants, we conducted two regression analyses, using the English test score as the independent variable and the mean dispositional inference scores for stereotypically consistent and stereotypically inconsistent behaviors as the dependent variables. The results were significant for stereotypically consistent behaviors, R²_adj = .06, β = .257, p = .014. This analysis suggests that as L2 proficiency increases, more dispositional inferences are made for stereotypically consistent behaviors. The effect of L2 proficiency on the dispositional inferences made for stereotypically inconsistent behaviors was not significant, R²_adj = −.006, β = .071, p = .506.

Discussion

Linguistic biases are known to influence social perception (e.g., Maass et al., 1989; Wigboldus et al., 2000). These biases have been systematically observed in an L1, but little is known about their emergence and consequences in an L2.

Communicating in an L2 is increasingly relevant. However, information processing in L1 and L2 seems to be different. The present study explored whether these differences extend to social perception, namely to the consequences of linguistically biased information in the communication and maintenance of social stereotypes. Although previous studies suggest that in L2, people engage in more deliberate processes reducing biases in moral judgments and decisions (e.g., Costa et al., 2017; Hayakawa et al., 2017), we argued that the structural properties of language should perpetuate the LEB, even in an L2. Specifically, while at the utterance level, language is accessed consciously, at the lexical level, the different layers of language are highly habitualized and automatic (Semin, 2006). These highly automated lexical decisions about the use of verbs and ADJ are driven by L1 habits.

To examine our predictions, we conceptually replicated the work by Wigboldus et al. (2000) in a previously unexamined L1 (Portuguese, Studies 1a and 1b) and, for the first time, in an L2 (English, Studies 2a and 2b). The results from the two studies revealed that, in both L1 and L2, expectancy-consistent behavior was communicated more abstractly than expectancy-inconsistent behavior (although the difference was not always significant for both female and male targets).

In both studies, we also examined whether the differences in linguistic abstraction resulting from the consistency between the expectation about the target and the target's behavior influenced the types of inferences made by the participants. The results clearly showed that the more abstract descriptions of expectancy-consistent behavior led to stronger dispositional inferences than the descriptions of expectancy-inconsistent behavior. Moreover, these results were observed in both L1 and L2, suggesting that linguistic variations associated with the abstraction level play an important role in transmitting and maintaining stereotypes even when using an L2.

Study 1 constitutes an important replication of the LEB in a different language and a different culture. Replication studies help establish the veracity of previous findings and examine whether the results generalize to other domains and contexts (Diener & Biswas-Diener, 2021; Godinho & Garrido, 2016; Godinho et al., 2019; Ijzerman et al., 2013; Pashler & Wagenmakers, 2012). The observation of the LEB with European Portuguese native speakers therefore further supports the robustness of this linguistic bias.

The findings in Study 2, where the LEB was observed in an L2, are particularly relevant considering previously reported processing differences between L1 and L2, and especially the benefits of using L2 in reducing biases in several contexts (Costa et al., 2017; Favreau & Segalowitz, 1983; Hayakawa et al., 2017; Keysar et al., 2012). These studies suggest that L2 prompts more deliberate processes, with the potential to reduce the emergence of the LEB. The results of Study 2 suggest, however, that L2 is not immune to linguistic bias and provide convergent evidence that, while the situated meaning of utterances may be monitored, the choice of words (predicates) may escape intentional monitoring (Semin, 2006). In other words, possibly due to lexical automaticity, communicating in L2 does not seem to impede or attenuate the communication of stereotypes through language bias or the type of inferences that this biased communication induces. Finally, the LEB was observed in L2 independently of L2 proficiency. Likewise, although participants who were more proficient in L2 made more dispositional inferences, the pattern of inferences as a function of linguistic abstraction converged across the two languages. Although the literature points to the role of L2 proficiency in observed differences between L1 and L2 (e.g., Eilola et al., 2007; Ferré et al., 2010), these differences did not seem to emerge, at least when automated lexical decisions are involved.

A possible limitation of the current studies is that, in Studies 1a and 2a, the LEB was only observed for overall stereotypically consistent and inconsistent behavior (and was not always significant for both female and male targets). Moreover, for convenience, given the online data collection procedure, Studies 1b and 2b used a selected sample of the behavioral descriptions obtained in Studies 1a and 2a (instead of all the descriptions produced). Although this procedure might have boosted the magnitude of the LEB observed in Studies 1b and 2b, it does not undermine the finding that more abstract descriptions lead to more dispositional inferences. Nevertheless, future studies should directly replicate the original paradigm in an L2, retaining all the behavioral descriptions obtained in Studies 1a and 2a as input for the inferential tasks required in Studies 1b and 2b. A single study (using the same participants) with two tasks, as used in the original study, would also constitute an interesting contribution.

To the best of our knowledge, this study represents the first demonstration of the LEB in Portuguese. Importantly, it also represents the first attempt to extend the LEB to an L2. Taken together, these two contributions reinforce the robustness of previous results, the generalizability of the LEB, and the notion that biases in predicate use seem to escape conscious monitoring. Nevertheless, the present study does not exhaust the differences between communicating in L1 and L2 in the context of social perception, a topic that requires further research with different paradigms, different languages, and the examination of moderators and boundary conditions.

Footnotes

Acknowledgments

The authors would like to thank Katherine Lopes and Carina Freitas for their help in data collection and coding. We also thank Klaus Fiedler, an anonymous reviewer, and the editor for their helpful feedback on an earlier version of this article.

Declaration of Conflicting Interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The authors received no financial support for the research, authorship, and/or publication of this article.

ORCID iD

Magda Saraiva

Author Biographies

Margarida Vaz Garrido is an associate professor at Iscte-Instituto Universitário de Lisboa. Her research focuses on situated cognition, collaborative memory, false memories, and person perception.

Magda Saraiva is currently a researcher and invited assistant professor at Iscte-Instituto Universitário de Lisboa. Her main research interests are focused on memory processes, particularly collaborative memory and false memories.

Gün R. Semin is currently a professor of psychology at ISPA-Instituto Universitário and the founding Director of the William James Center for Research at the same institution. His main research interests include the communication of emotions via chemosignals, embodied social cognition, communication, language, and interspecies communication via chemosignals.

References

Anooshian, L. J., & Hertel, P. T. (1994). Emotionality in free recall: Language specificity in bilingual memory. Cognition and Emotion, 8(6), 503–514. https://doi.org/10.1080/02699939408408956

Assilaméhou & Testé (2013). How you describe a group shows how biased you are: Language abstraction and inferences about a speaker’s communicative intentions and attitudes toward a group. Journal of Language and Social Psychology, 32(2), 202–211. https://doi.org/10.1177/0261927X12456382

Branzi, F. M., Calabria, Gade, Fuentes, L. J., & Costa (2016). On the bilingualism effect in task switching. Bilingualism: Language and Cognition, 21(1), 195–208. https://doi.org/10.1017/S136672891600119X

Caldwell-Harris, C. L. (2015). Emotionality differences between a native and foreign language: Implications for everyday life. Current Directions in Psychological Science, 24(3), 214–219. https://doi.org/10.1177/0963721414566268

Cohen (1988). Statistical power analysis (2nd ed.). Erlbaum.

Costa, Foucart, Hayakawa, Aparici, Apesteguia, Heafner, & Keysar (2014). Your morals depend on language. PLoS ONE, 9(4), e94842. https://doi.org/10.1371/journal.pone.0094842

Costa, Vives, & Corey, J. D. (2017). On language processing shaping decision making. Current Directions in Psychological Science, 26(2), 146–151. https://doi.org/10.1177/0963721416680263

Danziger & Ward (2010). Language changes implicit associations between ethnic groups and evaluation in bilinguals. Psychological Science, 21(6), 799–800. https://doi.org/10.1177/0956797610371344

Dewaele, J.-M. (2004). The emotional force of swearwords and taboo words in the speech of multilinguals. Journal of Multilingual and Multicultural Development, 25(2–3), 204–222. https://doi.org/10.1080/01434630408666529

Dewaele, J.-M. (2008). The emotional weight of I love you in multilinguals’ languages. Journal of Pragmatics, 40(10), 1753–1780. https://doi.org/10.1016/j.pragma.2008.03.002

Dewaele, J.-M., & Nakano (2013). Multilinguals’ perceptions of feeling different when switching languages. Journal of Multilingual and Multicultural Development, 34(2), 107–120. https://doi.org/10.1080/01434632.2012.712133

Diener & Biswas-Diener (2021). The replication crisis in psychology. In Biswas-Diener & Diener (Eds.), Noba textbook series: Psychology. Champaign, IL: DEF Publishers. Retrieved from http://noba.to/q4cvydeh

Douglas, K. M., & Sutton, R. M. (2006). When what you say about others says something about you: Language abstraction and inferences about describers’ attitudes and goals. Journal of Experimental Social Psychology, 42(4), 500–508. https://doi.org/10.1016/j.jesp.2005.06.001

Eilola, T. M., Havelka, & Sharma (2007). Emotional activation in the first and second language. Cognition and Emotion, 21(5), 1064–1076. https://doi.org/10.1080/02699930601054109

Ellis, Kuipers, J. R., Thierry, Lovett, Turnbull, & Jones, M. W. (2015). Language and culture modulate online semantic processing. Social Cognitive and Affective Neuroscience, 10(10), 1392–1396. https://doi.org/10.1093/scan/nsv028

Ellis, Thierry, Vaughan-Evans, & Jones, M. W. (2018). Languages flex cultural thinking. Bilingualism: Language and Cognition, 21(2), 219–227. https://doi.org/10.1017/S1366728917000190

Favreau & Segalowitz, N. S. (1983). Automatic and controlled processes in the first- and second-language reading of fluent bilinguals. Memory & Cognition, 11(6), 565–574. https://doi.org/10.3758/BF03198281

Ferré, García, Fraga, Sánchez-Casas, & Molero (2010). Memory for emotional words in bilinguals: Do words have the same emotional intensity in the first and in the second language? Cognition and Emotion, 24(5), 760–785. https://doi.org/10.1080/02699930902985779

Fiedler, Bluemke, Friese, & Hofmann (2003). On the different uses of linguistic abstractness: From LIB to LEB and beyond. European Journal of Social Psychology, 33(4), 441–453. https://doi.org/10.1002/ejsp.158

Garrido, M. V., & Prada (2018). Comparing the valence, emotionality and subjective familiarity of words in a first and a second language. International Journal of Bilingual Education and Bilingualism, 24(2), 275–291. https://doi.org/10.1080/13670050.2018.1456514

Godinho & Garrido, M. V. (2016). Oral approach-avoidance: A replication and extension for European–Portuguese phonation. European Journal of Social Psychology, 46(2), 260–264. https://doi.org/10.1002/ejsp.2172

Godinho, Garrido, M. V., & Horchak, O. V. (2019). Oral approach avoidance: A replication and extension for Slavic and Turkic phonations. Experimental Psychology, 66(5), 355–360. https://doi.org/10.1027/1618-3169/a000458

Gorham, B. W. (2006). News media’s relationship with stereotyping: The linguistic intergroup bias in response to crime news. Journal of Communication, 56(2), 289–308. https://doi.org/10.1111/j.1460-2466.2006.00020.x

Harris, C. L. (2004). Bilingual speakers in the lab: Psychophysiological measures of emotional reactivity. Journal of Multilingual and Multicultural Development, 25(2–3), 223–247. https://doi.org/10.1080/01434630408666530

Hayakawa, Costa, Foucart, & Keysar (2016). Using a foreign language changes our choices. Trends in Cognitive Sciences, 20(11), 791–793. https://doi.org/10.1016/j.tics.2016.08.004

Hayakawa, Tannenbaum, Costa, Corey, J. D., & Keysar (2017). Thinking more or feeling less? Explaining the foreign-language effect on moral judgment. Psychological Science, 28(10), 1387–1397. https://doi.org/10.1177/0956797617720944

Ijzerman, Brandt, M. J., & van Wolferen (2013). Rejoice! In replication. European Journal of Personality, 27(2), 128–129.

Keysar, Hayakawa, S. L., & An, S. G. (2012). The foreign-language effect: Thinking in a foreign tongue reduces decision biases. Psychological Science, 23(6), 661–668. https://doi.org/10.1177/0956797611432178

Lewis, P. M. (2009). Ethnologue: Languages of the world (16th ed.). SIL International.

Maass (1999). Linguistic intergroup bias: Stereotype perpetuation through language. Advances in Experimental Social Psychology, 31, 79–121. https://doi.org/10.1016/S0065-2601(08)60272-5

Maass & Arcuri (1996). Language and stereotyping. In Macrae, C. N., Stangor, & Hewstone (Eds.), Stereotypes and stereotyping (pp. 193–226). Guilford Press.

Maass, Ceccarelli, & Rudin (1996). Linguistic intergroup bias: Evidence for in-group-protective motivation. Journal of Personality and Social Psychology, 71(3), 512–526. https://doi.org/10.1037/0022-3514.71.3.512

Maass, Milesi, Zabbini, & Stahlberg (1995). Linguistic intergroup bias: Differential expectancies or in-group protection? Journal of Personality and Social Psychology, 68(1), 116–126. https://doi.org/10.1037/0022-3514.68.1.116

Maass, Montalcini, & Biciotti (1998). On the (dis-)confirmability of stereotypic attributes. European Journal of Social Psychology, 28(3), 383–402. https://doi.org/10.1002/(SICI)1099-0992(199805/06)28:3<383::AID-EJSP870>3.0.CO;2-Q

Maass, Salvi, Arcuri, & Semin (1989). Language use in intergroup contexts: The linguistic intergroup bias. Journal of Personality and Social Psychology, 57(6), 981–993. https://doi.org/10.1037//0022-3514.57.6.981

Marmolejo, Diliberto-Macaluso, K. A., & Altarriba, J. E. (2009). False memory in bilinguals: Does switching languages increase false memories? The American Journal of Psychology, 122(1), 1–16.

Ogunnaike, Dunham, & Banaji, M. R. (2010). The language of implicit preferences. Journal of Experimental Social Psychology, 46(6), 999–1003. https://doi.org/10.1016/j.jesp.2010.07.006

Oppenheimer, D. M. (2008). The secret life of fluency. Trends in Cognitive Sciences, 12(6), 237–241. https://doi.org/10.1016/j.tics.2008.02.014

Pashler & Wagenmakers (2012). Editors’ Introduction to the special section on replicability in psychological science: A crisis of confidence? Perspectives on Psychological Science, 7(6), 528–530. https://doi.org/10.1177/1745691612465253

Pavlenko (2012). Affective processing in bilingual speakers: Disembodied cognition? International Journal of Psychology, 47(6), 405–428. https://doi.org/10.1080/00207594.2012.743665

Rubini & Semin, G. R. (1994). Language use in the context of congruent and incongruent in-group behaviours. British Journal of Social Psychology, 33(3), 355–362. https://doi.org/10.1111/j.2044-8309.1994.tb01031.x

Saraiva, Garrido, M. V., & Pandeirada, J. N. S. (2021). Surviving in a second language: Survival processing effect in memory of bilinguals. Cognition and Emotion, 35(2), 417–424. https://doi.org/10.1080/02699931.2020.1840336

Semin, G. R. (2000). Agenda 2000—communication: Language as an implementational device for cognition. European Journal of Social Psychology, 30(5), 595–612. https://doi.org/10.1002/1099-0992(200009/10)30:5<595::AID-EJSP23>3.0.CO;2-A

Semin, G. R. (2006). Modeling the architecture of linguistic behavior: Linguistic compositionality, automaticity, and control. Psychological Inquiry, 17(3), 246–255.

Semin, G. R. (2012). The linguistic category model. In Van Lange, P. A. M., Kruglanski, & Higgins, E. T. (Eds.), Handbook of theories of social psychology (pp. 309–326). Sage. https://doi.org/10.4135/9781446249215.n16

Semin, G. R., & Fiedler (1988). The cognitive functions of linguistic categories in describing persons: Social cognition and language. Journal of Personality and Social Psychology, 54(4), 558–568. https://doi.org/10.1037/0022-3514.54.4.558

Semin, G. R., & Fiedler (1989). Relocating attributional phenomena within a language-cognition interface: The case of actors’ and observers’ perspectives. European Journal of Social Psychology, 19(6), 491–508. https://doi.org/10.1002/ejsp.2420190602

Sperber & Wilson (1995). Relevance: Communication and cognition. Blackwell.

Tajfel & Turner (1979). An integrative theory of intergroup conflict. In Austin, W. S., & Worchel (Eds.), The social psychology of intergroup relations (pp. 33–47). Brooks/Cole.

Tanabe & Oka (2001). Linguistic intergroup bias in Japan. Japanese Psychological Research, 43(2), 104–111. https://doi.org/10.1111/1468-5884.00166

Trope & Liberman (2010). Construal-level theory of psychological distance. Psychological Review, 117(2), 440–463. https://doi.org/10.1037/a0018963

Werkman, W. M., Wigboldus, D. H. J., & Semin, G. R. (1999). Children’s communication of the linguistic intergroup bias and its impact upon cognitive inferences. European Journal of Social Psychology, 29(1), 95–104. https://doi.org/10.1002/(SICI)1099-0992(199902)29:1<95::AID-EJSP898>3.0.CO;2-Z

Wigboldus, D. H., & Douglas (2007). Language, stereotypes, and intergroup relations. In Fiedler (Ed.), Social communication (pp. 79–106). Psychology Press.

Wigboldus, D. H., Semin, G. R., & Spears (2000). How do we communicate stereotypes? Linguistic bases and inferential consequences. Journal of Personality and Social Psychology, 78(1), 5–18. https://doi.org/10.1037//0022-3514.78.1.5

Wigboldus, D. H., Semin, G. R., & Spears (2006). Communicating expectancies about others. European Journal of Social Psychology, 36(6), 815–824. https://doi.org/10.1002/ejsp.323
