Abstract

In the corrected version of their 2018 article, Stoet and Geary (Corrigendum issued 2019) responded to our identification of a mismatch between their numbers for women in science, technology, engineering, and math (STEM) with tertiary degrees and the United Nations Educational, Scientific and Cultural Organization (UNESCO, 2015) data they sourced. They clarified that their numbers do not represent the percentage of women among STEM graduates, as they had originally stated. Rather, their numbers represent a ratio, which they claim measures the “propensity” for women compared with men to earn a tertiary degree in STEM in a given country (p. 584). The use of this measure in combination with the Global Gender Gap Index (GGGI), a contested (Else-Quest & Hamilton, 2018; Hawken & Munck, 2013) composite measure of nation-level gender equality increasingly employed in similar studies advancing the hypothesis of a gender-equality paradox (e.g., Falk & Hermle, 2018), raises methodological and empirical questions about their claims that there is a gender-equality paradox in STEM and that a larger gender gap in STEM achievement in high gender-equality countries is evidence of baseline sex differences in career and educational preferences.
Propensity to Graduate in STEM
Women’s share of STEM degrees relative to men’s share of STEM degrees is a logical and interpretable statistic standardly used in scholarship on gender, STEM achievement, and economic development (Ceci, Ginther, Kahn, & Williams, 2014; Ceci, Williams, & Barnett, 2009; Charles & Bradley, 2009; Cheryan, Ziegler, Montoya, & Jiang, 2017; Miller, Eagly, & Linn, 2015) Stoet and Geary instead operationalized nation-level women’s participation in STEM as a/(a + b), “where a is the percentage of women who graduate with STEM degrees (relative to all women graduating) and b is the percentage of men who graduate with STEM degrees (relative to all men graduating)” (p. 584).
What does Stoet and Geary’s propensity ratio measure? Worldwide, women earn more tertiary degrees than men. In Algeria, 62.7% of tertiary graduates, and 53.55% of STEM graduates, are women. Yet Stoet and Geary reported a value of 40.7 for Algeria. In Poland, 43.63% of STEM graduates are women, which would place it 5th out of the 45 UNESCO countries with Programme for International Student Assessment (PISA) data included in Stoet and Geary’s analysis. Yet Stoet and Geary’s measure yields a value of 26.9, which ranks Poland 20th in their data set. Conversely, 21.83% of Luxembourg’s STEM graduates are women, ranking it 44th among 45 countries, yet Stoet and Geary reported a value of 28.7 for Luxembourg, ranking it 15th.
In their corrected article, Stoet and Geary argued that they intended this measure to capture women’s “propensity” to graduate in STEM. We assume that they mean propensity in the sense of tendency, inclination, or predisposition, consistent with their hypothesis that men and women have different baseline preferences for STEM study (e.g., Lippa, Collaer, & Peters, 2010; Pinker, 2008; Schmitt, 2015), which can be “exaggerate[d]” and “abated or overridden” (Stoet & Geary, 2018, p. 591) depending on the context. But individual predispositions should be measured by a psychometrically sound scale of individuals’ perceived (or actual) gender inequality in relation to their STEM preferences. Both the UNESCO tertiary-degree statistics and GGGI are inappropriate for these purposes because they tell us nothing about individual STEM preferences or individual experiences of gender inequality.
Setting this aside, a propensity variable is conceptually discordant with the GGGI. The GGGI measures achieved outcomes, not propensities (World Economic Forum, 2018, p. 4). The more appropriate measure for analysis of gender equality in STEM tertiary degrees alongside the GGGI is gross completion rates—for example, Algeria’s 53.55% or Luxembourg’s 21.83%. However, adopting this measure would not fully resolve problems in Stoet and Geary’s use of the GGGI, for reasons elaborated below.
Gender Equality
Stoet and Geary characterized the GGGI as a measurement instrument of the degree of gender equality at the nation level, claiming that a high composite index score captures “more educational and empowerment opportunities” for girls and women, including promotion of “girls’ and women’s engagement in STEM fields” (Stoet & Geary, 2018, p. 591). But the GGGI does not measure opportunity, empowerment, or STEM encouragement. The GGGI is an index for ranking countries by the gap in parity between women and men on select indicators for which there are global, quantitative data available since 2006.
The World Economic Forum (2018) designed GGGI as a “benchmarking tool” to “track a country’s progress over time” (p. 32). GGGI provides information about gender equality within countries and ranks countries relative to each other. It is unclear whether a tool designed for ranking, benchmarking, and tracking can be included in correlations without consideration of country-specific parameters and validation of its composite scale as a measure appropriate for correlation with the variable of interest (Else-Quest & Hamilton, 2018; Hawken & Munck, 2013).
By design, the GGGI is not intended to be used to causally explain outcomes, and gender-equal outcomes cannot be interpreted as providing information on causal context within countries. For example, among nations with perfect parity in the health and survival domain are lower-income countries in which both women and men have short life expectancies as well as higher-income countries in which both women and men live longer. Similarly, GGGI rank does not distinguish between top-performing countries in the GGGI political-empowerment subindex, such as France, Ireland, and Slovenia, which have candidate quota laws mandating gender balance, and Germany, Iceland, Norway, which do not (World Economic Forum, 2015; International Institute for Democracy and Electoral Assistance, 2019). In short, the GGGI is neutral with respect to how outcomes of parity are achieved (Thompson, 2017).
Gender equality is a multidimensional construct. Just as a nation’s high GGGI score does not mean that gender-equal outcomes are due to gender equality, a low score does not predict gender inequality or gender-unequal outcomes in all domains. Algeria, 128th on the 2015 GGGI (World Economic Forum, 2015), has the world’s second-highest rate of STEM tertiary degrees for women. Algeria’s extraordinary achievements for women in STEM could result from societal investment in women’s education in STEM, distinct cultural beliefs about women’s capacity and affinity for STEM, vast overenrollment of women compared with men in tertiary-degree programs, disproportionate outflow of men compared with women to STEM tertiary-degree programs in other Francophone countries, a limited range of tertiary degrees on offer that channel both more women and men into STEM, or other factors (Charles & Bradley, 2009; Thébaud & Charles, 2018).
Results Change Depending on Measures Used
We have argued that the measures of STEM achievement and gender equality that Stoet and Geary used are inappropriate. Does the negative association that they reported persist when measures change?
Method
We hypothesized that the construct of gender equality, the measure of women’s achievement in STEM, and countries included in the analysis all affect the correlation between gender equality and women’s representation in STEM. As an example, we offer an alternative analysis, using a different measure of gender equality, that proposed by Stoet and Geary (2019), the Basic Index of Gender Inequality (BIGI), to test the robustness of Stoet and Geary’s operationalization of gender equality and women’s STEM achievement. We changed the measure of women’s achievement in STEM to the percentage of women among tertiary STEM graduates as reported by UNESCO, a measure consonant with the focus on achieved outcomes in gender-equality indices. We included all countries with available BIGI and UNESCO STEM data for the year 2015 (N = 77) rather than limiting the analysis to countries with PISA data (n = 45). This made possible the inclusion of regions not represented or underrepresented by Stoet and Geary (2018), including sub-Saharan Africa (e.g., Angola, Benin, Burkina Faso), Brazil, and Central and East Asia (e.g., Azerbaijan, Bhutan, Cambodia).
Results
Stoet and Geary (2018) reported a significant correlation between the GGGI and their propensity measure, rs = –.47, 95% CI = [−.66, −.22], p < .001, n = 50. We conducted a two-tailed bivariate Spearman correlation analysis to assess the relationship between the BIGI and the percentage of women among STEM graduates. The correlation was not significant, rs = –.075, p = .518 (Fig. 1). Restricting the analysis to the 45 countries for which BIGI and PISA data are available yielded similar results, rs = .070, p = .647. Analysis of the relationship between the BIGI and Stoet and Geary’s propensity measure yielded a modest correlation, rs = .266, p = .021 (n = 75), but this correlation became nonsignificant when restricted to PISA countries, rs = .240, p = .117, n = 44.

Scatterplot showing the relationship between the Basic Index of Gender Inequality score (Stoet & Geary, 2019) and the percentage of women among all tertiary graduates in science, technology, engineering, and math (STEM), separately for each of the 77 countries from the United Nations Educational, Scientific and Cultural Organization (UNESCO) 2015 data set.
Discussion
The nonsignificant correlation between nation-level gender equality and women in STEM when measures of gender equality and STEM achievement were changed indicates that the association is sensitive to choice of measures of gender equality, range of countries, and STEM achievement. We maintain, however, that these patterns tell us little about global, causal relationships between nation-level measures of gender equality and women in STEM. Substantiating causal relationships between nation-level gender equality and women’s STEM achievement would necessitate a longitudinal study design tracking change over time. Further, one would need to account for the nonindependence of countries that share common history, language, and cultures—ideally with a multilevel model that builds in correlations among countries. As argued above, such an analysis would also require a valid measure of experiences of gender inequality in relation to concordant measures of women’s STEM achievement and a global range of country-level data.
Footnotes
Transparency
Action Editor: Timothy J. Pleskac
Editor: D. Stephen Lindsay
Author Contributions
S. S. Richardson drafted the manuscript. M. W. Reiches and J. Bruch analyzed the data. All authors provided critical revisions and approved the final version of the manuscript for submission.
