Abstract
Implicit measures of the gender-science stereotype are often better than explicit measures in predicting relevant outcomes. This finding could reflect a discrepancy between implicit and explicit stereotypes, but an alternative is that the implicit measure is sensitive to constructs other than the stereotype. Analyzing an archival data set (total N = 478,550), we found that self-reported liking of science versus liberal arts was the best predictor of the gender-science implicit association test (IAT). In a reanalysis of a previous study and a replication of another study, we found that evidence for the IAT’s advantage over explicit stereotypes in predicting relevant outcomes disappeared when controlling for self-reported liking. Therefore, perhaps the IAT has often outperformed the explicit stereotype because the gender-science IAT captures personal attraction, whereas the explicit stereotype does not. It is premature to conclude that implicit constructs are superior to explicit constructs in predicting science-related plans and behavior.
According to current theory and research, the gender-science stereotype has a central role in the underrepresentation of women in occupations related to science. The basic premise is that people expect women to be unskillful or uninterested in science, and those expectations influence judgment and behavior toward women (e.g., by teachers and parents, Gunderson, Ramirez, Levine, & Beilok, 2012; by potential employers, Reuben, Sapienza, & Zingales, 2014), as well as women’s self-concepts (e.g., math ability self-concept, Sáinz & Eccles, 2012) and aspirations (e.g., career intentions, Schmader, Johns, & Barquissau, 2004). Of special interest are implicit stereotypes—“social category associations that become activated without the perceiver’s intention or awareness when […] presented with a category cue” (Blair, Ma, & Lenton, 2001, p. 828). Even people who do not endorse the stereotype explicitly might still have those mental associations from exposure to the prevalent beliefs that constitute the stereotype. Those associations might influence behavior and judgment automatically, without intention and awareness (Gawronski, Geschke, & Banse, 2003; Gawronski, Hofmann, & Wilbur, 2006; Greenwald & Banaji, 1995; Jost et al., 2009).
Previous studies have found that the implicit gender-science stereotype predicts judgment and behavior that contribute to women’s underrepresentation in science-related activities and occupations. Appendix A includes a summary of all the studies that we found that measured implicit gender-science stereotypes and explicit gender-science stereotypes or other explicit beliefs. Those studies found that implicit stereotypes predicted participants’ math engagement (Nosek & Smyth, 2011), performance and achievement (Ramsey & Sekaquaptewa, 2011), intentions to pursue science-related majors, academic programs (Lane, Goh, & Driver-Linn, 2012; Smyth, Greenwald, & Nosek, 2009), and career (Cundiff, Vescio, Loken, & Lo, 2013). These relations were usually moderated by gender. Among women, stronger implicit stereotypes predicted worse math performance and achievement, and weaker identification with math and science. Among men, the implicit stereotypes sometimes had no predictive value, and on other studies, stronger implicit stereotypes predicted better performance, achievements, and stronger identification with math and science.
Seventeen of the studies listed in Appendix A compared the relation of a third outcome measure with implicit versus explicit stereotypes. Fifteen of those studies found that the implicit stereotype had a stronger relation with an outcome measure than the explicit stereotype. The superiority of the implicit stereotypes could reflect a unique role for automatic activation of stereotypes in judgment and behavior. Lane, Goh, and Driver-Linn (2012) argued that “sincere and conscious beliefs that men and women are equally well suited for science, technology, engineering, and mathematics (STEM) fields do not preclude internalization of these beliefs at a less conscious level” (p. 222). Likewise, Muzzatti and Agnoli (2007) speculated that the implicit gender-science stereotype is present even when “participants are not aware of (or deny) the stereotype” (p. 758). It was further speculated that implicit stereotypes “shape choices by subtly constraining preferences without the individual’s awareness or conscious exertion of choice” (Nosek, Banaji, & Greenwald, 2002, p. 50). Moreover, Nosek and Smyth (2011) argued that implicit stereotypes can shape certain outcomes (e.g., math engagement and achievement) through mechanisms that operate “under the surface.”
Thus, it is common to interpret the advantage of implicit measures of the gender-science stereotype over their explicit counterparts as revealing the important role of automatic activation of stereotypes in those areas. In this article, we suggest that an alternative account is as likely. According to that alternative, the measure used so far for measuring the implicit gender-science stereotype is not the implicit counterpart of the common explicit stereotype measures. In addition to stereotypes, the implicit measure taps other constructs linked to science-related behavior and intentions. Those constructs, rather than implicit processes or constructs, might be the reason for the superiority of the implicit measure over the explicit stereotype measures in predicting important outcomes.
Implicit stereotypes are almost exclusively measured with indirect measures that are considered sensitive to mental associations, mainly the implicit association test (IAT; Greenwald, McGhee, & Schwartz, 1998). In all the studies that we found, the implicit gender-science stereotype was measured with the IAT (or an IAT variant) as the association between nouns representing science and nonscience (math/humanities, science/arts, mathematics/language, math words/reading words, science/liberal arts, math/english, scientific/humanistic) and nouns representing gender (boys/girls, girls names/boys names, female/male, masculine/feminine, men/women). It is not obvious that the implicit/explicit distinction is the only difference between such an IAT and a measure of the belief that, in comparison to women, men are better or are more interested in science. Many other beliefs could map into the gender–science associations. Based on that notion, the original goal of the present research was to test a simple hypothesis: Self-reported associations would be more strongly related to the IAT than self-reported gender-science beliefs. Such a result would cast doubt on the common interpretation of previous findings that the IAT was better than self-reported beliefs in predicting important outcomes. Perhaps those results reflected a superiority of associations over beliefs, not the superiority of implicit constructs over explicit constructs.
In the present research, we analyzed a large sample of participants (N = 478,550) to test whether self-reported associations are related to the IAT more than self-reported beliefs pertaining to the gender-science stereotype. Although the analyses confirmed our hypothesis, they also found that self-reported liking of science was related to the IAT even stronger than self-reported associations. This finding suggests that the advantage of the gender-science IAT over explicit gender-science beliefs in predicting relevant outcomes might reflect only the advantage of personal attraction over gender-science beliefs in predicting those outcomes, not the advantage of implicit constructs. In the second part of the present investigation, we searched for evidence that the IAT has any advantage over self-reported stereotypes, after controlling for self-reported liking.
Method
Participants
Participants were volunteers who completed the gender-science IAT demonstration task in the Project Implicit website (implicit.harvard.edu; Nosek, 2005) between January 13, 2003 and December 31, 2013. We excluded participants who did not indicate their gender. We separated the data set to 11 studies, one for each year, because the self-report measures changed over time (see Table 1 for details).
Sample Size, Women Rate, and Mean Age, in Each Year (Study).
Note. Because many of the self-report measures changed on December 7, 2006, we added the sessions that followed that date (until the end of 2006) to the 2007 sample.
Measures
Complete information about the data set, methods, and measures is available online at https://osf.io/f7jzb.
Implicit association test
The categories were male (items: man, boy, father, male, grandpa, husband, son, uncle), female (girl, female, aunt, daughter, wife, woman, mother, grandma), science (biology, physics, chemistry, astronomy, engineering, neuroscience, biochemistry; the last two were replaced with math and geology at 2007), and liberal arts (philosophy, humanities, arts, English, music, history, Latin; Latin was replaced with literature at 2007). The IAT consisted of seven trial blocks and was scored with the D1 algorithm (Greenwald, Nosek, & Banaji, 2003). Positive scores indicated faster performance when words related to males and science shared the same key than when words related to females and science shared the same key.
Self-report measures
Participants answered direct questions related to their own attitudes about science and liberal arts, and about their beliefs regarding gender differences in those subjects. We analyzed only questions relevant to the present investigation.
Self-reported associations
Participants reported how much they associated science with males versus with females, and how much they associated liberal arts with males versus females. The response scales changed over the years but always ranged from strongly female to strongly male. The self-reported association score was the difference between these two items, larger numbers indicating stronger association of science with males and liberal arts with females.
Beliefs about natural ability
In 2003–2006, participants reported their level of agreement with the statement “Males perform better than females in science because of greater natural ability” on a 7-point scale. In 2007–2013, participants rated factors explaining why “Women hold a smaller portion of the science and engineering faculty positions at top research universities than do men.” One factor pertained to ability: “Different proportions of men and women are found among people with the very highest levels of math ability.” Participants rated how important that factor was in explaining this frequency difference on a 5-point scale.
Beliefs about natural interest
In 2007–2013 participants rated, on a 5-point scale, the importance of the factor “On average, men and women differ naturally in their scientific interest” in explaining the abovementioned frequency difference.
Beliefs about prevalence
In years 2007–2013, participants estimated how many out of 10 men at U.S. universities graduate with a scientific major and answered the same question about women. The difference between the two responses was the prevalence score.
Personal liking
Participants reported, on a 5-point scale, how much they like science and how much they like liberal arts. We computed a preference for the topic stereotypically associated with the participant’s gender.
Personal importance
In years 2007–2013, participants rated, on a 5-point scale, how important it was for them to become knowledgeable in science, math, and liberal arts. We averaged the importance of science and math together, and computed a difference score indicating preference of becoming knowledgeable in the topic stereotypically associated with the participant’s gender.
Results
The scores were stable over the years (see Appendix B), with the IAT showing a positive score (M min = 0.34, M max = 0.38). Figure 1 shows highly consistent rank order of the correlations of the IAT with the different self-report measures. In all years, the IAT was more strongly related to self-reported associations (r min = .198, r max = .218, minimum and maximum values are from the 11 correlations computed for the eleven samples) than reported beliefs about natural differences between the genders in math ability (r min = .035, r max = .119), in interest in science (r min = .056, r max = .087), and in estimated prevalence of students who major in science (r min = .137, r max = .161). These results were replicated among women and among men (Figure 2a and b).

Correlation of the implicit association test with the six relevant self-report measures, by year.

(a) Female participants: Correlations of the implicit association test (IAT) with the six relevant self-report measures, by year. (b) Male participants: Correlations of the IAT with the six relevant self-report measures, by year.
Unexpectedly, two sets of questions were related to the IAT more strongly than self-reported associations (Figure 1 and Table 2). These were self-reported liking (r min = .217, r max = .290), and self-reported importance (r min = .228, r max = .246). These relations indicated that stronger men/science and women/liberal arts associations predicted stronger preference for science among men and stronger preference for liberal arts among women. Self-reported liking had the strongest relations to the IAT, and as Figures 2a and 2b show, this superiority was more pronounced among women than among men (even among men, personal liking had the strongest correlation with the IAT in 10 of the 11 studies). Self-reported importance and self-reported liking were strongly related (r min = .607, r max = .621). In all the years, self-reported liking and importance were related to the IAT significantly more than to self-reported beliefs about natural differences between the genders in math ability (liking: r max = .135; importance: r max = .053), in interest in science (liking: r max = .111; importance: r max = .080), and in estimated prevalence of students in science majors (liking: r max = .094; importance: r max = .080).
Correlation of the Implicit Association Test With the Six Relevant Self-Report Measures, by Year.
Note. On each row, different subscripts indicate significant difference (p < .05).
We also used multiple regression analyses to predict the IAT score, in each year, from self-reported associations, ability stereotype beliefs, interest stereotype beliefs, prevalence stereotype beliefs, and personal liking. In all years, reported liking shared the largest unique variance with the IAT, and reported associations was always the second best predictor (Figure 3a and 3 show separate results for men and women). The consistency of the ranking of predictors attests for their statistical reliability. The chances of one predictor being stronger than another predictor in 11 studies, when there is actually no difference between the two, is p = .0009765625 (2 × (1/211)).

(a) Female participants: Unique variance of the five relevant self-report measures in predicting the implicit association test (IAT) score, by year. (b). Male participants: Unique variance of the five relevant self-report measures in predicting the IAT score, by year.
Part 2: Can Liking Explain the IAT’s Advantage?
The results so far show that direct reports about mental associations were related to the IAT more than self-reported beliefs about stereotypes. Unexpectedly, the IAT and self-reported preference for the topic stereotypically associated with the participant’s gender were related to each other more than each of these measures was related to other self-reported beliefs and associations. That unexpected result suggested a novel account for the findings that the IAT is better than self-report stereotype measures in predicting important outcomes. According to that account, the IAT was a better predictor than self-reported gender–science beliefs because the IAT was more strongly related to personal liking, which, in turn, was a better predictor of science-related outcomes than gender–science beliefs. To refute that possibility, we returned to previous research that found an advantage of the IAT and tested whether the evidence for this advantage persists even when controlling for self-reported liking.
Predicting Math Performance
Nosek and Smyth (2011) compared the gender–science IAT and the explicit stereotype (measured with two self-reported items: men are better at math than women are and women can achieve as much as men in math) in predicting math-related outcomes. We reanalyzed that data and found only one variable, the difference between math and verbal SAT scores, that had reliably stronger relation with the IAT (r = .193, p < .001) than the explicit stereotype (r = .090, p = .004), Williams’ t(1,040) = 2.590, p = .010.
In Nosek and Smyth’s study, participants rated the warmth of their feelings toward math and a contrast category and reported a preference between the two. With those measures, we computed a preference for the topic stereotypically associated with the participant’s gender over the other topic. Consistent with our findings, the IAT/attitudes relation (r = .281, p < .001) was stronger than the IAT/explicit stereotype relation (r = .136, p < .001), and the explicit stereotype/attitudes relation (r = .145, p < .001), Williams’ ts(2,918) = 6.199, 5.790, respectively, ps < .001.
We used PROCESS macro for simple mediation (Model 4) for SAS (Hayes, 2013) to find unstandardized estimates with 95% confidence intervals (CIs) of the reduction in the effect of each stereotype measure on the SAT difference, due to controlling for attitudes. We entered the IAT as the independent variable, attitudes as a mediator, SAT as the outcome, and explicit stereotypes as a covariate. We replaced the roles of the IAT and the explicit stereotypes when testing the explicit stereotype. In the present context, rather than mediation effects, this analysis tested whether the relation between each stereotype measure and the SAT was significantly reduced when controlling for attitudes. Bootstrap tests with 10,000 resamples showed a significant reduction in the IAT’s effect, b = .137, SE = .017, 95% CI [0.104, 0.171], and in the explicit stereotype’s effect, b = .052, SE = .015, 95% CI [0.023, 0.081], when controlling for attitudes. The regression analysis provided by the PROCESS macro showed that after the reduction due to controlling for attitudes, the IAT’s effect, b = .044, SE = .028, t(1,040) = 1.55, p = .121, 95% CI [–0.011, 0.100], and the explicit stereotype’s effect, b = .009, SE = .027, t(1,040) < 1, p = .731, 95% CI [–0.044, 0.063], were no longer significant. Importantly, when we computed partial correlations between each measure and the SAT, partialling out shared variance with attitudes, there was no longer reliable evidence for an IAT advantage: The IAT/SAT relation (r =.049, p = .115) was not significantly better than the explicit stereotype/SAT relation (r = .017, p = .584), Williams’ t(1,040) < 1, p = .442.
Predicting Plans to Pursue Science
Lane et al. (2012) found that the gender–science IAT predicted students’ plans to pursue science versus humanities (r = .34, p < .0001) better than the self-reported stereotype (r = .12, ns), Williams’ t(150) = 2.158, p = .03. We repeated that study with similar materials and procedure, adding attitude and importance measures, measured, and scored identically to our main present study (full details about the replication are in Appendix C and at https://osf.io/vc68r). We repeated the same analysis strategy as before. Replicating Lane et al., we found significant advantage for the IAT (r = .245, p < .001) over explicit stereotypes (r = .111, p = .012), Williams’ t(511) = 2.304, p = .022, in predicting intentions to pursue the topic stereotypically associated with the participant’s gender. The bootstrap tests in the mediation-like analyses found that controlling for attitudes significantly reduced the IAT’s effect, b = .168, SE = .028, 95% CI [0.114, 0.224], and the explicit stereotype’s effect, b = .080, SE = .030, 95% CI [0.019, 0.138]. The regression analyses showed that although the IAT’s effect was reduced, it remained significant, b = .067, SE = .032, t(511) = 2.132, p = .034, 95% CI [0.005, 0.130], suggesting that attitudes might not be the only reason for the IAT/pursuit relations. The explicit stereotype’s effect was reduced to being nonsignificant, b = .010, SE = .031, t(511) < 1, p = .755, 95% CI [–0.051, 0.070]. Importantly, we did not find evidence that the IAT maintained its advantage over the explicit stereotypes after controlling for attitudes. When we partialled out shared variance with attitudes, the IAT/pursuit relation (r = .095, p = .032) was no longer reliably stronger than the explicit stereotype/pursuit relation (r = .019, p = .662), Williams’ t(511) = 1.251, p = .212.
General Discussion
Research about gender–science stereotypes has often found that implicit measures of the gender–science stereotype are better than explicit measures in predicting performance, motivation, intentions, self-concept, and decision-making related to math and science. It is common to interpret such findings as revealing the important role that automatic activation of stereotypes plays in those areas. In this article, we challenge that interpretation. Had previous research used an IAT with the concepts science/liberal arts, pleasant/unpleasant to measure implicit gender–science stereotype, many would have doubted a claim that discrepancies between implicit and explicit gender–science stereotype reflect discrepancies between implicit and explicit constructs or processes. It would not seem that the only difference between the implicit measure and self-reported gender–science beliefs is their sensitivity to automatic versus deliberate processes. We suggest that this threat also applies to the actual measure that has been used so far to assess the implicit gender–science stereotype, an IAT with the concepts science/liberal arts, male/female. Perhaps that IAT taps into different constructs than those tapped by the explicit measures used in research on the gender–science stereotype.
We suspected that previous findings about discrepancies between the implicit and the explicit gender–science stereotype might have reflected discrepancies between associations and beliefs, rather than between implicit and explicit constructs. Indeed, we found that people’s direct report on their mental associations between gender and science had a stronger correlation with the IAT than any self-reported belief. That finding favors previous research that measured self-reported associations (e.g., Nosek et al., 2009) over research that measured only beliefs (e.g., Ramsey & Sekaquaptewa, 2011) as an investigation of implicit/explicit discrepancies rather than associations/beliefs discrepancies.
Unexpectedly, our research also found that the IAT’s strongest relation was not with self-reported associations but with self-reported personal liking of science in comparison to liberal arts. Importantly, that self-report measure was related to the IAT more than to self-reported stereotypic beliefs. Thus, whereas the explicit gender–science stereotype has very little to do with people’s self-reported liking of science, the gender–science IAT is related to self-reported liking more than to any other belief.
A cross-study overview of previous research (see Appendix A) finds evidence compatible with our present findings. First, across a variety of direct measures (not including reported associations), previous research found weak relations of explicit gender–science stereotype with the IAT score (r min = .01, r max = .191). Such implicit/explicit correlations are weaker than what is usually found between implicit and explicit measures of attitudes (Bar-Anan & Nosek, 2014; Nosek, 2005) and stereotypes (Hofmann, Gawronski, Gschwendner, Le, & Schmitt, 2005). Further, across studies, among women participants, these implicit/explicit relations were weaker than the relations observed between the IAT and self-reported attitudes toward science (r min = .15, r max = .35), and self-reported identification with science (r min = .17, r max = .36). Among men, previous results were less conclusive than our present findings (IAT/liking: r min = .01, r max =.35; IAT/identity: r min = .01, r max = .24).
Our results, mostly supported by a cross-study overview of previous research, are compatible with the possibility that the gender–science IAT and self-reported gender–science stereotypes (or associations) are different not only in automaticity/controllability of the processes that influence them or the implicitness/explicitness of the constructs that they reflect. These measures are also different in the specific beliefs or attitudes that they capture. Whereas the explicit measure captures people’s beliefs about gender and science, the IAT is also related to personal attraction to science (versus nonscience topics). Therefore, a difference between specific beliefs and attitudes, rather than a difference in controllability or implicitness, might explain previous findings that implicit gender–science stereotypes are better than explicit gender–science stereotypes in predicting important outcome variables.
An overview of previous research finds that, indeed, most of the outcomes that were predicted better by the gender–science IAT than by the explicit gender–science measures are linked to liking math and science. Among those outcomes were plans and intentions to pursue science (Cundiff et al., 2013; Lane et al., 2012), science aspirations (Lane et al., 2012; Phelan, 2010), choice of major (Smyth et al., 2009), math engagement and achievements (Nosek et el., 2002; Nosek & Smyth, 2011), math performance, and the desire to pursue math-related careers (Kiefer & Sekaquaptewa, 2007b; Ramsey & Sekaquaptewa, 2011), math self-perceived ability and math participation (Nosek & Smyth, 2011), and sensitivity to stereotype threat (Galdi, Cadinu, & Tomasetto, 2014; Kiefer & Sekaquaptewa, 2007a). People who like science are more likely to perceive science abilities as important, plan to pursue science, choose a science major, engage in a related activity, and reach more successful achievements in that activity. Regarding sensitivity to stereotype threat, women with lower grades in math and those who perceive math-related abilities as relatively unimportant are affected to a lesser extent by stereotype threat (Cadinu, Maass, Frigerio, Impagliazzo, & Latinotti, 2003; Steinberg, Okun, & Aiken, 2012). Therefore, liking math should predict sensitivity to stereotype threat.
The alternative account that we consider here does not argue that the gender–science IAT is not a measure of automatic processes or implicit constructs. Our argument pertains only to the reason for the IAT’s superiority over explicit stereotype measures in predicting science-related outcomes. We argue that the predictive advantage that previous research found for the IAT might reflect a stronger relation of the outcome variable with personal attraction to science than with gender–science beliefs, rather than a stronger relation of the outcome variable with implicit than with explicit constructs or processes.
One course of action to refute the argument proposed in the present article is to show that the IAT is a superior predictor of science-related behavior and cognition even when controlling for self-reported liking of science. Following that logic, we reanalyzed data from one previous study (Nosek & Smyth, 2011) and replicated another (Lane et al., 2012) to examine what happens to the advantage of the gender–science IAT over self-report measures in predicting a science-relevant outcome, when attitudes are added to the model. We found that shared variance with attitudes explains much of the variance the IAT shared with the outcome measure (when we controlled for attitudes, the IAT’s effect decreased significantly). We also found that the IAT’s advantage over explicit stereotypes was no longer significant when controlling for attitudes. Unfortunately, we did not find a statistical method to test whether the IAT’s advantage over the explicit measure was significantly reduced when attitudes were added to the model. 1 Therefore, our findings only failed to refute the alternative account we proposed here, rather than provide more empirical support for that account.
Why would an IAT with the nouns male/female and science/liberal arts as category names capture one’s attitudes toward science and liberal arts? Perhaps women tend to map male/female to not-me/me (and men show the opposite mapping). The self-concept IAT and the attitude IAT are strongly related (e.g., Nosek et al., 2002, r = .58; Nosek & Smyth, 2011: r among women = .53, r among men = .39). Therefore, perhaps IATs with gender categories are related to people’s self-concepts more than to people’s beliefs about the genders. In turn, self-concepts are strongly related to attitudes (e.g., Nosek & Smyth, 2011: r among women = .84, r among men = .88; Young, Rudman, Buettner, & McLean, 2013: r among women = .53, r among men = .54). For that reason, the IAT used so far to measure the gender–science stereotype was sensitive to people’s attitudes toward science more than to beliefs about gender differences. Compatible with that hypothesis are previous findings that on the IAT, people tend to show an association between their gender and favorable concepts (Rudman, Greenwald, & McGhee, 2001).
Limitations and Future Research
The most obvious challenge to our alternative account is the possibility that our findings are just another example for the superiority of the implicit over the explicit gender–science stereotypes in predicting important psychological variables related to math and science (in this case, science-related attitudes). Perhaps the predictive power of the IAT diminishes when controlling for self-reported liking due to shared variance between three distinct constructs: implicit stereotypes, self-reported liking, and the predicted outcome. That shared variance could reflect various causal relations, and some of them would suggest an important role for implicit constructs and processes. For instance, perhaps the automatic activation of the gender–science stereotype affects attraction to science, which further influences aspirations and skills in science. Indeed, we have not ruled out the possibility that an implicit construct is responsible for the IAT’s superiority documented in previous studies, and for our present findings. What we have done is to propose an alternative account for the IAT’s superiority in the gender–science domain that is as likely as the common account. The only argument in support of the common account is that in other domains, there is good evidence that the IAT reflects implicit constructs. That is not sufficient evidence that implicit constructs are responsible for the IAT’s superiority over explicit stereotypes in predicting important science-related outcomes.
To investigate what contributes to the IAT’s superiority in the gender–science domain, we recommend three future directions. First, as we have done in the present reanalysis and replication, future research on the relation between the implicit gender–science stereotype and relevant outcomes should control for participants’ attitudes toward science (versus a nonscience concept). Unique variance between the IAT and the target outcome measure, not shared with any of the self-report measures, could help establish the gender–science IAT as a measure of a psychological construct that has an important role in judgment and behavior related to people’s pursuit of math and science.
Second, it is necessary to test whether the gender–science IAT predicts automatic behavior and judgment related to gender and science. For instance, research should test whether the gender–science IAT is a better predictor of the choice to pursue science when people choose under conditions that reduce controllability (e.g., time pressure and cognitive load) than under conditions that allow control. It is also important to test whether the gender–science IAT is related to people’s feelings about stereotypical gender–science beliefs more than to their cognitions on those beliefs. Such evidence has helped to establish the IAT as a measure of automatic evaluation and to document the unique role that automatic evaluation plays (e.g., Friese, Hofmann, & Wänke, 2008; Gawronski & LeBel, 2008; Hofmann, Rauch, & Gawronski, 2007).
Third, and perhaps most importantly, so far, the research we reviewed and our present research all used correlational designs. Research on the role of implicit processes in science-related behaviors and goals is doomed to remain limited without experimental studies. It would be important to test whether a direct manipulation of mental associations between gender and science affects important outcomes related to math and science. Such an effect could increase the confidence that the predictive advantage of implicit over explicit measures reflects a causal link, and is not only due to the fact that the outcome variables and the IAT are both sensitive to variance in personal attraction to science. Similarly, research that would manipulate personal attraction to science and find changes in the gender–science IAT might support the alternative account proposed here.
Summary
The present research found that whereas the gender–science IAT is hardly related to explicit beliefs about gender and science, it is related to personal attitudes and goals pertaining to science. This finding points to the possibility that the IAT’s advantage over explicit measures of the gender–science stereotype is not only due to the automaticity versus controllability of the processes that influence each measure or to the implicitness versus explicitness of the constructs captured by each measure. Rather, perhaps it is due to discrepancy in the explicit beliefs and attitudes captured by each measure. The present findings emphasize that much evidence is still missing for understanding the theoretical implications of previous findings about implicit gender–science stereotypes. In order to examine the unique role of implicit gender–science stereotypes, one must measure not only explicit stereotypes, but also self-reported associations and self-reported attraction to math and science. Further, research must examine whether the gender–science IAT predicts automatic processes that influence science-related behavior and judgment. Finally, it is essential to conduct experiments that directly manipulate the automatic gender–science stereotype and examine its effect on relevant behavior and judgment.
Footnotes
Appendix A
Appendix B
Appendix C
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This project was supported by grants from the Israeli Science Foundation [779/16] and from the United States – Israel Binational Science Foundation [2013214] to Y. B.-A.
