Abstract
Analyzing the test scores of more than 10,000,000 students who participated in the Advanced Placement (AP) math exams from 1997 to 2019, this study examined the direction and magnitude of the trend in gender disparity by race in participation in and top achievement on AP Calculus AB, Calculus BC, and Statistics exams. The results of this study indicated that, in general, females’ representation in all three AP exams increased significantly. Although the findings indicated that the female-to-male ratios (FMRs) in participation in the AP math exams increased significantly from 1997 to 2019 and favored females for all races, the gender disparities among top achievers for all math exams are still substantial. The relationships between the FMRs in participation and top achievement for all AP math exams were also analyzed within races, and the possible impacts of these findings within the context of the underrepresentation of women in science, technology, engineering, and mathematics (STEM) fields were also discussed.
Moving through the 21st century, the daily lives of human beings evolve with the influence of science and technology at a dizzying speed. In 2011, at the Hannover Fair, German government and industry leaders announced “Industry 4.0,” referring primarily to developments in automation and digitalization of industry with the integration of technological advances such as Big Data and analytics, augmented reality, autonomous robots, and cloud computing (Tay et al., 2018). As nations strive to integrate these technological advances, they also seek opportunities to cultivate talented individuals equipped with strong skills and competencies in science, technology, engineering, and mathematics (STEM) to sustain their advancements in these fields (Bahar & Maker, 2020).
Many developed countries continue to experience wide gender gaps in STEM fields. For example, in the United States, females make up only 15% of the engineering occupations and 26% of the computer workforce (National Science Board, 2018). According to a report published by The American Association of University Women, “More than ever before in history, girls are studying and excelling in science and mathematics” (Corbett & Hill, 2015, p. 2). Although the number of girls and women studying STEM subjects has increased dramatically at all levels (Corbett & Hill, 2015), women’s share of degrees at the bachelor’s, master’s, and doctoral levels continues to lag behind that of men. For example, engineering degrees earned by women in 2016 only represented 12%, 18%, and 18% of degrees at the bachelor’s, master’s, and doctoral levels, respectively, in the United States (National Science Foundation & National Center for Science and Engineering Statistics, 2019).
Understanding the roots of gender disparities in STEM careers has been a long-investigated journey among researchers from economics to psychology. A variety of factors (e.g., PK–12 school experiences, occupational discriminations, gender stereotypes, cultural settings, psychosocial factors, education policies, and biological traits) associate with these disparities (Morris, 2013). Contemporary educational research explains this complicated issue of occupational disparities via differences in educational attainments and opportunities, particularly in PK–12 level. For example, a nationwide call from the National Council of Supervisors of Mathematics (NCSM) and the National Council of Teachers of Mathematics (NCTM) highlighted that gender differences in school mathematics account for significant disparities in many STEM-related careers (NCSM & NCTM, 2018). Furthermore, several researchers have argued that the gender-related occupational differences might have resulted from disparities among top performing students who flowed into the field from major STEM pipelines, including but not limited to advanced programs (AP, IB, and dual-enrollment classes), talent development programs, and gifted education programs (Lazear & Rosen, 1990; Makel et al., 2016; Wai et al., 2010). The researchers argued that many of the current STEM workers were prepared through these pipelines and suggested that the earlier disparities among the students possibly reflected the later gender disparities in STEM-related occupations.
Reis and Park (2001) suggested that gender disparities in PK–12 settings were due to a range of factors and helped to explain the underrepresentation in later STEM pipelines. These factors included, but are not limited to, gender differences of talented females in mathematical self-efficacy; self-confidence in STEM subjects; teacher and parent perceptions about STEM aptitudes, and gender stereotyping (Callahan et al., 1994; Ceci & Williams, 2007; Dweck, 2007; Fennema et al., 1990; Halpern, 2007; Hyde, 2007; Lubinski & Benbow, 2007; Reis & Callahan, 1989; Stumpf & Stanley, 1996).
Gender Disparities Among High Achievers in Mathematics
Large-scale meta-analytic studies investigating gender-related disparities in mathematical achievement provided researchers and educators with trend data over the span of 30 years. Hyde et al. (1990) analyzed more than 100 studies between 1960 and 1990 on gender differences in math performance. The authors found these differences became nonexistent during the following three decades. Later studies (Hyde, 2005; Hyde et al., 2008; Hyde & Linn, 2006; Lindberg et al., 2010; J. P. Robinson & Lubienski, 2011) confirmed the findings of Hyde et al. (1990) that gender differences in overall K–12 math achievement have been relatively small since the 1990s.
Contrary to the minimal gender differences in mathematics performance among the general population, many studies have reported significant disparities among high-achieving students that favor males across all grades (Andreescu et al., 2008; Benbow & Stanley, 1980, 1983; Desjarlais, 2009; Ellison & Swanson, 2010; Hedges & Nowell, 1995; Hyde et al., 1990; Makel et al., 2016; Niederle & Vesterlund, 2010; Olszewski-Kubilius & Lee, 2011; Reis & Park, 2001; Wai et al., 2010, 2018; Xie & Shauman, 2003). In particular, studies that have investigated gender disparities in mathematics among high achievers suggest that the males significantly outnumbered females significantly in the high-score ranges. Furthermore, the gender gap was dramatically widened in the upper limits of the achievement distribution (Ellison & Swanson, 2010). For example, Hedges and Nowell’s (1995) analysis of high-achieving individuals’ test scores recorded in large nationwide databases found that the female-to-male ratio (FMR) in the top 3% participants was around 0.20 to 1 and even smaller in highest levels of the achievement distribution.
Studies that explored the gender disparity in math achievement have consistently documented more males than females among top achievers (Benbow & Stanley, 1980; Ceci & Williams, 2010; Makel et al., 2016; Olszewski-Kubilius & Lee, 2011; Wai et al., 2010, 2012). In one of the earliest studies investigating gender disparities among mathematically talented students, Benbow and Stanley (1980) reported that the FMR on the SAT-Math was approximately 1 to 13 in the 700 to 800 score range. In a follow-up study of a similar student population, Makel et al. (2016) found that there were approximately 2.5 males for every female in the same score range among 320,000 talented students in 2015. These findings illustrate that the FMRs in math achievement among top achievers have increased rapidly over the past four decades.
What Is the Advanced Placement Program?
The College Entrance Examination Board (College Board), a nonprofit education organization, created the Advanced Placement (AP) program in the mid-1950s to provide college-level coursework for high-achieving high school students (Christiansen, 2009). In the first year of the program, 12 AP courses, including AP Calculus, were offered to more than 1,000 students across 110 schools (Demaree, 2016). As of 2019, 2.8 million students at 20,000 high schools across the United States took more than 5 million AP exams (College Board, 2020a). In 2010, approximately 90% of U.S. high schools offer at least one AP course for their students (College Board, 2020c).
The AP program currently offers 38 courses in 22 different subjects ranging from mathematics to foreign languages (College Board, 2019). Although schools and teachers are responsible for designing their own AP course materials, College Board requires schools to fulfill some curricular requirements before a school can label a course “AP” (College Board, 2019). Thus, schools wishing to offer AP courses must go through a formal audit process conducted by College Board to review course syllabi and resources to ensure that each AP course content meets the equivalency of a college-level course (College Board, 2019).
Almost all AP courses are year-long courses followed by end-of-year exams, designed by teams of AP teachers and college professors in a standardized format, to assess the content and skill mastery of the course (College Board, 2020a). AP exams are criterion-referenced tests, with students’ exam scores reported on a 5-point scale, with 5 representing the highest possible score (Shaw et al., 2013). According to College Board (2019), guidelines recommend that a score of 5 is a college-grade equivalent of “A,” while a score of 4 is an “A−, B+, or B,” and a score of 3 equates to a “B−, C+, or C.” In 2019, students reported their exam scores to more than 3,500 colleges to seek course credit. Although each college or university has a credit policy, most award course credit for AP scores of 3 or higher (College Board, 2019).
Potential Benefits of AP Participation and Performance
The AP program has been one of the most popular programs in high schools for exposing students to advanced academic content (Warne, 2017). Rapid growth and program prevalence have attracted interest from many researchers. Most researchers who investigated potential benefits of the program have noted that the program has essential college-related outcomes. One benefit of the program is that the successful completion of an AP course increases the possibility that a student will enroll in college (Ackerman et al., 2013; Prong, 2018). For example, many high schools assign more weight to honors and advanced classes, including dual-enrollment or AP courses than they do for typical high school courses, so completing these advanced courses successfully provides students with an opportunity to achieve higher grade point averages (GPAs).
Furthermore, successful completion of an AP course typically provides students with an opportunity to reduce college costs by earning college credit while in high school, as well as completing a college degree in a shorter time (Callahan, 2003). A passing AP exam score does not guarantee that every college or university will grant credit. However, students who earn a 3 or higher may receive college credit at many universities or colleges (Hertberg-Davis et al., 2006; Moore & Slate, 2011; Prong, 2018; Shaw et al., 2013). In addition to college admission benefits, students’ AP exam performances influence their college performance as well. Regardless of gender and ethnicity, students who earn a 3 or higher on an AP exam earn higher grades in the associated college course (Hargrove et al., 2008; Murphy & Dodd, 2009; Shaw et al., 2013).
Participation in the AP program can increase participants’ college degree persistence and STEM career interest (Adelman, 1999; Bahar & Adiguzel, 2016; Kokkelenberg & Sinha, 2010; Lubinski & Benbow, 2007; Morgan & Klaric, 2007; Sadler et al., 2014). Students’ who participated in AP program courses were more likely to graduate within 4 years when compared to students who did not participate in the AP program (Mattern et al., 2009, 2013; Shaw et al., 2013). Sadler et al. (2014) examined the relationship between taking advanced high school courses and students’ interest in pursuing a STEM career. The authors used data from 4,691 college students and found that taking calculus, 2 years of chemistry or 1 or 2 years of physics in high school significantly predicted increases in STEM career interest when compared to students who do not take these courses (Sadler et al., 2014).
Advanced classes offered at high schools, such as AP courses, have also served as a natural curricular option for gifted and talented students because most secondary schools cannot offer beyond these options (Sytsma, 2000). In the field of gifted education, AP courses have been well regarded by researchers and educators as an appropriate option to accelerate talented students through advanced resources (Feldhusen, 1995; Hertberg-Davis et al., 2006; Kettler & Hurst, 2017; A. Robinson et al., 2007). Previous research recommended that rigorous curriculum is essential to talent development (Bahar, 2013; Bahar & Maker, 2015; Bloom, 1985; Callahan, 2006; Hertberg-Davis & Callahan, 2008; Kettler & Hurst, 2017; Subotnik et al., 2011; VanTassel-Baska, 2000).
Gender Disparities for AP Mathematics Exams
Although AP programs continue to serve as a major pipeline that provides students with advanced academic opportunities, there are questions as to whether these opportunities extend to all students equitably. Researchers investigating gender differences in mathematics have raised concerns about disparities in participation to and performance in AP math exams (Amelink, 2009; Brookhart, 2009; Buck et al., 2002; Moore et al., 2012; Morris, 2013; Morris et al., 2015; Morris & Slate, 2012; Prong, 2018; Stumpf & Stanley, 1996). In one of the earliest studies investigating gender disparities in AP math exams, Stumpf and Stanley (1996) explored differences in participation in and performances on AP exams (Calculus AB and Calculus BC) and compared these differences from 1984 to 1992. Over 8 years, female student participation increased from 40% to 45% for Calculus AB and from 30% to 35% for Calculus BC. Despite the increases in female representation in both exams, the FMRs among the top-achieving students who earned a score of five continued to decline and favored males for both exams. For the Calculus AB exam, the FMR dropped from 0.72:1 to 0.67:1 and dropped from 0.76:1 to 0.69:1 for Calculus BC.
Amelink (2009) explored math participation and performance over a variety of high-stake tests, including AP Calculus AB, Calculus BC, and Statistics exams. The findings revealed that roughly equal proportions of male and female high school students took the exams in Calculus AB and Statistics in 2007 (the FMRs were 0.95 and 1.01 for Calculus AB and Statistics, respectively). However, male students outnumbered female test-takers for the Calculus BC tests (the FMR was 0.70). In terms of the number of students who scored a 5, males outperformed females on all exams. While the percentages of males who scored a 5 on Calculus AB, Calculus BC, and Statistics exams were 24%, 47%, and 15%, respectively; the percentage of females were 17%, 37%, and 9%. In a similar study, Morris and Slate (2012) investigated the relationship between gender and student performance on the AP Calculus AB, Calculus BC, and Statistics exams over three examinations years 2000, 2005, and 2010. The findings revealed a significant relationship between gender and exam scores, with males tending to score higher than females.
Gender Disparities by Race Among High Achievers in Mathematics
The literature investigating gender disparities among high achievers in mathematics is well-established. Concerning disparities, gender gaps and race, there are a limited number of studies. One possible reason for the scarcity of the literature on this topic might be due to researchers’ assumption that gender disparity would be same among racial groups (Baird, 2019).
In one of the few existing studies, Catsambis (1994) compared the development of gender gaps in learning opportunities and achievement in mathematics among White, Black, and Hispanic students. Using data from a nationally representative sample of approximately 24,500 eighth-grade students who were again surveyed in 10th grade, female students were found to perform equally well, or even better, in mathematics tests than do male students in eighth grade across all racial groups. Concerning learning opportunities, female and male students were equally likely to enroll in average-ability mathematics classes and a higher proportion of girls were enrolled in high-ability classes. However, Catsambis found that all female students tended to have less interest in mathematics and less confidence in their mathematics abilities when they were resurveyed in 10th grade. She also found that gender gaps in interest and confidence in their mathematics abilities were the largest among Hispanic and the smallest among African Americans (Catsambis, 1994).
Examining the results of the Education Longitudinal Study (ELS), Baird and Keene (2019) investigated gender disparities in mathematical ability high-achieving students’ confidence in all races. The study sample included students who recently graduated from high school with 4.0 GPAs for their high school math coursework. Using ordinary least squares (OLS) regression analysis, females of all races had lower confidence in their mathematical abilities than males, even when they had the same level of mathematical achievement. Moreover, female Asian students were found to have far less math confidence than did all other students.
Present Study
Gaps in participation and achievement in STEM pipelines not only threaten educational opportunities (Darling-Hammond, 2010; Prong, 2018), but may also explain later gender disparities in STEM careers. The following research questions guided this study:
What is the trend in gender disparity by race in participation in AP math exams?
What is the trend in gender disparity by race in top achievement in AP math exams?
To what extend are the trends in gender disparity by race in participation and top achievement related?
Methods
Participants and Setting
Data were obtained from the annually published College Board’s AP National Summary reports. These reports consist of detailed data on AP Exam participation, volume, and performance separated into categories, including exam type, grade level, and gender (College Board, 2020c). From the 38 different AP exams, only data related to three mathematics exams (Calculus AB, Calculus BC, and Statistics) were processed for the analyses. The data analyzed in this study included test scores of more than 10,000,000 students who participated in the AP mathematics exams from 1997 to 2019. The gender and racial profiles of participants in each of these three exams appear in Tables 1 to 3, respectively. Likewise, Tables 4 to 6 show the gender and racial breakdown of top achievers who scored five on the exams during this same time period.
The Number of Participants in the AP Calculus AB Exam and the FMRs Across Races.
Note. AP = advanced placement; FMR = female-to-male ratio; F = female; M = male.
The Number of Participants in the AP Calculus BC Exam and the FMRs Across Races.
Note. AP = advanced placement; FMR = female-to-male ratio; F = female; M = male.
The Number of Participants in the AP Statistics Exam and the FMRs Across Races.
Note. AP = advanced placement; FMR = female-to-male ratio; F = female; M = male.
The Number of Top Scorers in the AP Calculus AB Exam and the FMRs Across Races.
Note. AP = advanced placement; FMR = female-to-male ratio; F = female; M = male.
The Number of Top Scorers in the AP Calculus BC Exam and the FMRs Across Races.
Note. AP = advanced placement; FMR = female-to-male ratio; F = female; M = male.
The Number of Top Scorers in the AP Statistics Exam and the FMRs Across Races.
Note. AP = advanced placement; FMR = female-to-male ratio; F = female; M = male.
Testing Procedures
Like many other AP exams, all three of AP mathematics exams, Calculus AB, Calculus BC, and Statistics, are proctored on specific dates, usually in the first 2 weeks of May at thousands of high schools and designated exam centers across the country. Although students can select their testing locations, most students choose to take these exams at their school during regular school hours. After the designated school personnel proctor the exams, they are mailed back to the College Board centers.
All three AP mathematics exams, Calculus AB, Calculus BC, and Statistics, are similar in terms of testing procedures and structure. The exams include a multiple-choice section and a free-response section. These two sections are graded in separate forms due to their different structures. For example, the multiple-choice section is scored by computer by scanning the answer sheet. In contrast, the free-response section is scored by appointed educators, including experienced high school AP teachers and college professors during the summer at a large convention (Sundquist, 2016). The raw scores from each section are added to form a composite score, which is later translated into a 5-point scale using statistical processes to hold consistency across years. Like many other criterion-referenced tests, AP scores are not graded on a curve but instead calculated based on mastery and of the content and skills of a specific AP course (College Board, 2020a; Sundquist, 2016). Research that investigated the predictive validity of AP mathematics exams have found significant positive relationship between AP Exam performance and college course placement (Mattern et al., 2009; Patterson & Ewing, 2013). The interrater reliability statistics for different sections of Calculus AB and BC exams were found to range from high 0.70s to low 0.90s (Bridgeman et al., 1996). Course-specific testing procedures are described below.
AP Calculus AB
More than 300,000 students take the AP Calculus AB exam annually, and fewer than 20% achieve a score of 5. Students traditionally take the AP Calculus AB course after Precalculus. It is approximately the equivalent of a first-semester college calculus course, which covers topics in differential and integral calculus (College Board, 2020a). The examination lasts 3 hours and 15 minutes and consists of a section of 45 multiple-choice items and another section of six free-response questions, with each section having equal weight. A graphing calculator is permitted for some parts of the exam (College Board, 2020a). The gender and racial breakdown of participants in the AP Calculus AB exam, from 1997 to 2019, appears in Table 1. Table 4 shows the gender and racial breakdown of top achievers who scored a 5.
AP Calculus BC
Approximately 130,000 students take the AP Calculus BC exam every year, and roughly 40% of the participants can score a 5. Although successful completion of a Calculus AB course is not a prerequisite for Calculus BC course enrollment, most students take BC after completing AB, and they are not allowed to take both exams within the same year. Similar to AP Calculus AB, the examination lasts 3 hours and 15 minutes and consists of a section of 45 multiple-choice items and another section of six free-response questions, with each section having equal weight. Likewise, a graphing calculator is permitted for some parts of the exam. The gender and racial breakdown of participants in the AP Calculus BC exam, from 1997 to 2019, is given in Table 2. Table 5 shows the gender and racial breakdown of top achievers who scored a 5 out of 5 possible points.
AP Statistics
The AP Statistics program is the newest of the AP program’s math offerings. The first exam of the program was administered in 1997. Because the course readiness requires less advanced course content than do Calculus AB and BC, the number of participants in AP Statistics exams grew faster than other AP exams. In 2019, more than 200,000 students took the exam, and almost 15% of the examinees scored a 5. Similar to the other two AP math exams, students have 40 multiple-choice and six free-response questions in 3 hours, and they can use a graphing calculator for some parts of the exam. The gender and racial breakdown of participants in the AP Calculus AB exam, from 1997 to 2019, appear in Table 3. Table 6 shows the gender and racial breakdown of top achievers who scored a 5 on the exam.
Data Analysis
Operational definitions of some significant concepts covered in this study and related index calculations are listed below. The variables were defined to identify trends in gender disparity in participation and top achievement in AP mathematics exams by race. Furthermore, it was possible to examine the relationship between the trends in gender disparities in AP participation and top performance.
Disparity index calculations
Many prior studies have used disparity indexes to examine gender gaps in mathematics performances (Bahar, 2021; Ellison & Swanson, 2010; Hyde et al., 1990, 2008; Lindberg et al., 2010; Olszewski-Kubilius & Lee, 2011; J. P. Robinson & Lubienski, 2011). In this study, a simple FMR was reported as a measure of gender disparity and denoted with the FMR index. The FMR index is calculated as
When the numbers of male and female students are equal, the value of the FMR function should be equal to 1, which indicates parity. As the number of female students exceeds that of males, the value of the function becomes larger than 1, and the disparity favors female students. As the number of male students exceeds that of females, the value of the FMR function becomes smaller than 1, and disparity favors male students (Bahar, 2021).
Index of gender disparity in participation
AP participation was represented by the number of students who participated in the corresponding AP exam. Gender disparity in exam participation is denoted with the FMR-P index and is calculated as
Index of gender disparity in top achievement
In prior studies (Bahar, 2021; Benbow & Stanley, 1983; Makel et al., 2016; Wai et al., 2010, 2018), which investigated the gender disparities among top-achieving students, researchers examined the top performances in the extreme right tail of curves including the top 0.01%, 0.1%, 1%, and 5% (Bahar, 2021). However, different from many other norm-referenced high-stake tests, AP exams are criterion-referenced measures and students’ scores range from 1 to 5, with 5 being the highest possible score. According to AP exam score distributions published on College Board (College Board, 2020b), varying from exam to exam, 10% to 45% of the participants can make the top score. In this study, a score of 5 is the top achievement. The AP Score scale table published by College Board also recommends a score of 5 as exceptionally well qualified to receive college credit and placement, which is a college course grade equivalent of A+ or A (College Board, 2020b). Gender disparity among top achievers is denoted with the FMR-P index, and it is calculated as
Analysis of research questions
Upon obtaining the data from College Board’s database that includes the number of participants and top scorers, the FMRs for participation and top achievement was calculated for all three of AP exams across races for each year from 1997 to 2019. The calculated FMRs were later employed in the analyses described below to answer the research questions.
Research questions 1 and 2
A Mann–Kendall (MK) trend test was employed to explore the trends in gender disparity in participation and top achievement in AP mathematics exams across races. As a nonparametric test, MK is a powerful exploratory analysis method to identify the presence of monotonic trends in data collected over time (Kendall, 1975; Mann, 1945). The MK test has been commonly used to evaluate trends in hydrological and climatic data (Jaiswal et al., 2015), though it is not prevalent in educational research (Bahar, 2021). However, MK trend tests are useful, especially for researchers exploring the flow of changes in educational settings and outcomes. Moreover, because the MK is a non-parametric procedure, no assumption of normality is required, and the test can be used with even small data sets.
In addition to the MK trend analysis, Sen’s nonparametric procedures were used. Sen’s slope was calculated for each AP exam across races to identify the magnitude of the slope of the trend over time (Sen, 1968). Sen’s slope has been used widely to identify the slope of trend lines (Bahar, 2021), and according to Yu et al. (2002), it is a robust measure of a trend’s magnitude. Both the MK trend test and Sen’s slope were performed using R software.
Research question 3
Correlation analysis measured the relationship between the trends in gender disparity in participation and top achievement across races. Using SPSS 25, Pearson product-moment correlation coefficients among female-to-male ratios in participation (FMR-P) and female-to-male ratios in top achievement (FMR-TA) were calculated for each AP exam.
Results
Trends in Gender Disparity in AP Math Exam Participation
Tables 1 to 3 depict the number of students who participated in each AP math exam; Calculus AB, Calculus BC, and Statistics, as well as the FMRs among the participants, respectively, for years between 1997 and 2019, separated by race and gender. As seen in Table 1, the number of female and male students who took the AP Calculus AB exam increased rapidly over the years for all students across races. MK trend analyses (see Table 7) determined whether a trend was present in gender parity/disparity in participation. For the entire student population, the FMR values showed an upward trend across the years. They increased from 0.90:1 to 0.98:1, meaning that the percentage of female students who enrolled in the Calculus AB exam grew and reached parity as of 2019. The MK test indicated that the trend was statistically significant and favored females (p < .001). According to Sen’s slope analysis, the magnitude of the change in the FMR values was 0.004 per year (see Figure 1). Similar to the entire population, the FMR values for Hispanic, Asian American, and White students showed upward trends, which were statistically significant and favored female participants. However, the FMR values for Black students showed a downward trend, which was statistically significant and favored male participants. According to the MK test, no trend existed for Native American students.
Mann–Kendall Two-Tailed Trend Test and Sen’s Slope Values for Female to Male Ratios in Participation (FMR-P).
Note. FMR-P = female-to-male ratios in participation; AP = advanced placement; M = mean; SD = standard deviation; K’s Tau = Kendall’s Tau; SLB = slope lower bound (95%); SUB = slope upper bound (95%); IV = intercept value; ILB = intercept lower bound (95%); IUP = intercept upper bound (95%).

Trends in participation in AP exams.
Table 2 depicts the number of female and male students who participated in the AP Calculus BC exam. The number of participants grew almost six times, exceeding 120,000 as of 2019. From 1997 to 2019, female students taking the exam increased faster than it did for males; accordingly, the FMRs moved from 0.59:1 to 0.72:1 (see Table 7). The MK test indicated that the FMR trend was upward and statistically significant and favored females (p < .001). Sen’s slope analysis indicated a 0.004-unit change in the magnitude of the FMR trend per year (see Figure 1). In addition, the FMR values for all races showed upward trends and favored female participants. According to the MK test, the trends were statistically significant, except for the Black population (p = .249).
Similar to AP Calculus AB and BC exams, the number of female and male students who participated in the AP Statistics exam increased for all races from 1997 to 2019 (see Table 3). Among all exam participants, the FMR values increased from 0.81:1 to 1.11:1 during the stated period. The MK test indicated that the FMR trend was upward, statistically significant, and favored females (p = .000). Thus, the percentage of female students who enrolled in the AP Statistics exam increased rapidly and surpassed that of male students. According to Sen’s slope analysis, the magnitude of the change in the FMR trend was 0.008 per year (see Figure 1). Likewise, the FMR values for Hispanic, Asian American, and White students showed upward trends that were statistically significant and favored female participants. The FMR trend for the Native American population increased, favoring females, though not significantly (p = .082). Different from other groups, the FMR trend for Black students was downward and favored male participants; however, it was not statistically significant (p = .958).
Trends in Gender Disparity Among Top Achievers on AP Math Exams
Tables 4 to 6 depict the number of students who scored highest (a 5) on each AP math exam; Calculus AB, Calculus BC, and Statistics, as well as the FMRs among the top scorers, respectively, for years between 1997 and 2019, separated by race and gender. As seen in Table 4, the number of female and male students who scored a five on the AP Calculus AB exam increased for all students from 1997 to 2017. MK trend analyses (see Table 8) determined whether a trend was present in gender parity/disparity among top scorers. For the entire student population who scored a 5, the FMR values showed an upward trend over the years and increased from 0.59:1 to 0.79:1. The MK test indicated that the trend was statistically significant and favored females (p < .001). Sen’s slope analysis indicated a 0.008-unit change in the magnitude of the FMR trend per year (see Figure 2). In addition, the FMR values for all races showed upward trends that favored top-scoring females. The trends were also statistically significant, except for the Native American students (p = .208).
Mann–Kendall Two-Tailed Trend Test and Sen’s Slope Values for Female to Male Ratios Among Top Scorers (FMR-TA).
Note. AP = advanced placement; FMR-TA = female-to-male ratio in top achievement; M = mean; SD = standard deviation; K’s Tau = Kendall’s Tau; SLB = slope lower bound (95%); SUB = slope upper bound (95%); IV = intercept value; ILB = intercept lower bound (95%); IUP = intercept upper bound (95%).

Trends in top achievement in AP mathematics exams.
Table 5 presents the number of female and male students who scored a five on the AP Calculus BC exam between 1997 and 2019 over the years. The MK test indicated that the FMR trend in the entire top scorers was upward and statistically significant (p < .001). From 1997 to 2019, the percentage of female students continued to increase, and the FMR value moved from 0.44 to 0.59. Likewise, the FMR values for each race showed upward trends that favored female participants. According to Sen’s slope analysis, the magnitude of the change in the FMR trend was 0.007 per year (see Figure 2). The trends were also statistically significant, except for Hispanic (p = .190) and Black (p = .082) students (see Table 8).
Table 6 depicts the number of top-scoring students and the FMRs on the AP Statistics exams. Similar to Calculus AB and BC, the number of top scorers increased rapidly for all races on the AP Statistics exam. The male-to-female ratio among top scorers continued to grow over the years, increasing from 0.34 to 0.73. The MK test indicated that the FMR trend was upward and statistically significant for all populations (p < .001). Sen’s slope analysis indicated a 0.013-unit change in the magnitude of the FMR trend per year (see Figure 2). Likewise, the FMR values for each racial group showed upward trends that favored female participants. As seen in Table 8, the trends were also statistically significant except for Hispanic students (p = .125).
Relation of Trends in Participation and Top Achievement
Basic descriptive statistics and Pearson product–moment correlation coefficients that show the relationship between the FMR-P and FMR-TA across races are presented in Table 9. As seen in the table, the results of Pearson correlation analysis indicated that there was a significant positive association between the FMRs in participation and top achievement on the AP Calculus AB exam for the entire population (r = .83, p < .001). The correlation coefficients were also significantly positive for Hispanic (r = .69, p < .001), White (r = .77, p < .001), and Asian American (r = .72, p < .001) students. However, there were no associations between the FMRs in participation and top achievement on AP Calculus AB exam for Native American (r = .08, p = .696) and Black (r = −.001, p = .995) students.
Correlation Values Between FMR-P and FMR-TA Across Races for Each AP Exams.
Note. FMR-P = female-to-male ratio in exam participation; FMR-TA = female-to-male ratio in top achievement; AP = advanced placement.
The correlation analysis for the AP Calculus BC exam showed a significant positive association between the FMRs in participation and top achievement for the entire population (r = .91, p < .001). The correlation coefficients were also significantly positive for Hispanic (r = .51, p = .013), White (r = .92, p < .001), and Asian American (r = .90, p < .001) students. Similar to Calculus AB exam, the findings indicated no associations between the FMRs in participation and top achievement on Calculus BC exam for Native American (r = .22, p = .315) and Black (r = .24, p = .267) students (see Table 9). Finally, results of Pearson correlation analysis for AP Statistics exam indicated that there was a significant positive association between the FMRs in participation and top achievement for the entire population (r = .91, p < .001). The correlation coefficients were also significantly positive for White (r = .89, p < .001) and Asian American (r = .88, p < .001) students. However, the findings indicated no associations between the FMRs in participation and top achievement on AP Statistics exam for Native American (r = .24, p = .264), Black (r = −.04, p = .868), and Hispanic (r = .26, p < .228) students.
Discussion
Trends in Gender Disparity in AP Math Exam Participation
One of the goals of this study was to explore the trends in gender parity participation in AP mathematics exams across races. In general, the findings indicated that females’ representation in all three AP exams increased significantly from 1997 to 2019, as the MK tests and Sen’s slope indicated significant and upward trends for all exams that favored females.
The findings related to the AP Calculus AB exam showed that females reached parity in participation in the entire population over the years, as the FMR among all participants increased from 0.90:1 to 0.98:1 between 1997 and 2019. Furthermore, if the trend continues, Sen’s slope analysis indicates that males most likely will be minority in exam classrooms year by year. When the data are segregated by race, significant findings appeared as well. Similar to the entire population, the FMRs in participation for Hispanic and Asian American students was already in parity. Furthermore, the significant and upward trend that favored females might emerge as a sign for Hispanic and Asian American males’ underrepresentation in AP Calculus AB exams over the years. For White students, although the FMR trend was upward and close to parity, Sen’s slope shows that it might take a few more decades for females to reach parity with male students. Among all AP Calculus AB exam participants, Black males were the only underrepresented male group among all participants. As seen in Table 7, the mean Black FMR over the years was 1.36:1, while the next highest FMR that belonged to Asian American students was only 1.00:1. Although the FMR trend for Black students was significantly downward and favored male students, the Sen’s slope magnitude showed that it might take several decades to reach parity between male and female participants. For the Native American students, the mean FMR value was close to parity, but the MK test failed to identify a trend for the population, which might be a result of large deviations and outliers in the FMR data that were led by a small number of Native American participants over the years.
Similar to the Calculus AB exam, the findings related to participation in the AP Calculus BC exam showed that the FMR trend was significantly upward and favored females in the entire population over the years. The MK trend analysis showed that the FMR among Calculus BC participants increased from 0.59:1 to 0.72:1 between 1997 and 2019. However, different from the Calculus AB exam, the female participants were largely underrepresented in the Calculus BC exam (see Table 2). According to Sen’s slope analysis, reaching parity in the number of female and male participants would take up to a half-century if the trend continues. When the data are segregated by race, the FMR trend in participation for all races was similar to the trend in the entire population, except Black students. The FMR values were in parity for Black students, while females from Hispanic, White, Asian American, and Native American students were widely underrepresented among the Calculus BC exam participants. Reaching parity in the number of female and male participants might be expected to take several decades for each race, other than Black students, as the MK test indicated a stagnant FMR trend for Black students in AP Calculus BC participation. Namely, without implementing any further culturally and gender-responsive policies and practices, the present trends suggest that closing the gaps in AP Calculus BC participation would not be happening soon.
The AP Statistics exam was the only AP math exam for which female participants were overrepresented. When the first AP Statistics exam became available in 1997, the FMR was 0.81:1 among the participants. However, the FMR value continued to increase and finally reached 1:11:1 as of 2019. Furthermore, the MK test analysis showed that the FMR trend was upward, which means the gap keeps widening in favor of females. For all racial groups, the female participants were in parity with males or majority among the participants. For Black and Native American students, the trends were stagnant and increased for Asian American, Hispanic, and White students. Taken together, these findings mean that, except Native American students, the male students will remain unrepresented, and their share will decline year by year among the AP Statistics exam participants.
Different from the studies (Stumpf & Stanley, 1996) that found significant disparities among AP Calculus AB and BC participants, in favor of male students, the current findings indicate parity in AP Calculus AB and even overrepresentation of females in AP Statistics exams. These results, in general, might explain the increase in females’ interest in AP math exams, and such an increase in interest might positively influence parity among top achievement in these exams. Besides, although the findings indicated that females are still underrepresented among AP Calculus BC exam participants, their share among the participants increased significantly over the years.
Given that mathematics and related fields have long been identified as masculine disciplines (Fennema & Sherman, 1977), the increase in female students’ interests in all AP math exams over the years is welcome. However, underrepresentation of males in recent AP Statistics exams and future AP Calculus AB exams could also be considered precarious. The findings indicate that males from certain races, specifically Black males, are largely underrepresented in AP Statistics and AP Calculus AB exam rooms. Although investigating the factors that contributed to the Black male students’ participation in advanced mathematics programs and courses is beyond the scope of this study, it is crucial to identify and remove the barriers that prevent Black male students from participating in advanced mathematics courses. As the decision or ability to enroll in any AP math course usually was determined by the level of mathematics that students take in Grade 8 (Finkelstein et al., 2012; Lubienski et al., 2004; Stevenson et al., 1994), Black males’ underrepresentation in AP courses might result from their underrepresentation in advanced middle school math courses and gifted programs. This assumption might hold as many prior studies already identified Black boys underrepresented in elementary and middle school gifted programs (Berry, 2008; Davis, 2014; Wright & Counsell, 2018). Policies and research-based practices that increase Black students’ representation in advanced elementary and middle school math courses and gifted programs would also support Black male students’ participation in advanced high school mathematics courses.
Trends in Gender Disparity Among Top Achievers on AP Math Exams
Another primary goal of this study was to explore trends by gender among top achievers on AP mathematics exams. Disparities in top achievement can account for subsequent disparities in STEM achievement and outcomes. These disparities include earning higher degrees, publications, and patents (Makel et al., 2016; Wai et al., 2010), the findings of this study might partially shed light on predicting future gender disparities in STEM fields.
The findings related to the AP Calculus AB exam indicated that females were underrepresented among the students who scored a 5. As of 2019, roughly 43% of the top scorers in the Calculus AB exam were female students. However, the findings related to the AP Calculus AB exam indicated that the share of females among the students who scored a 5 increased significantly over the years. The MK trend analysis showed that the FMR among top scorers increased significantly from 0.59:1 to 0.79:1 over the years. However, according to Sen’s slope analysis, reaching parity in the number of female and male top scorers would take at least three decades if the trends remained similar.
When the data are segregated by race, the trend in FMR among the top-scoring students for all races was upward, significant, and favored females in the entire population, except Native American students as the MK test indicated a stagnant FMR trend. The FMR values were closest to parity for Black and Asian American students with a mean FMR value of 0.83:1 and 0.80:1, respectively. FMR ratios among top achievers were lowest for Native American and Hispanic students (0.49:1 and 0.54:1, respectively). As the trend in FMR was also stagnant for the Native American students, the small number of representations among the top achievers is a significant disparity issue for female Native American students.
The findings related to top achievement in AP Calculus BC showed that the female students were widely underrepresented among the students who scored a 5. Despite vast differences in the numbers of female and male top scorers, FMR value trends indicated the gap was diminishing in favor of females between 1997 and 2019. The MK trend analysis showed that the FMR among top scorers on the Calculus BC exam increased significantly from 0.44:1 to 0.59:1 over the years. Likewise, when the data are segregated by race, the trend in FMR among the top-scoring students for all races was upward, significant, and favored females in the entire population, except Black and Hispanic students; the MK test indicated stagnant FMR trends for both groups. The FMR values for Black and Asian students were the highest among all racial groups with a mean FMR value of 0.62:1 and 0.65:1, respectively, while the lowest FMRs were for Native American and Hispanic students (0.34:1 and 0.42:1, respectively).
AP Statistics exam was another AP math exam that female participants were widely underrepresented among the students who scored a 5. However, similar to the other two exams, the trend in FMR values among the students who scored a 5 was significantly upward and favored females. The MK trend analysis showed that the FMR among top scorers increased from 0.34:1 to 0.73:1 over the years. Compared to the other two math exams, the magnitude of the change in the FMR values was more significant. As Sen’s slope analysis indicated in Table 8, females could reach parity among top achievers in AP Statistics in less than two decades. However, this rapid change was not consistent for Hispanic students as the MK test indicated a stagnant FMR trend for the group.
In general, different from the findings that indicated a high representation of female students in participation in all AP math exams, especially in Calculus AB and Statistics, the findings showed lower female representation in top achievement for all exams. This finding has been consistent with prior studies that have documented higher male-to-female ratios in the highest score ranges (Andreescu et al., 2008; Benbow & Stanley, 1980, 1983; Desjarlais, 2009; Ellison & Swanson, 2010; Hedges & Nowell, 1995; Hyde et al., 1990; Linn, 2010; Makel et al., 2016; Niederle & Vesterlund, 2010; Olszewski-Kubilius & Lee, 2011; Reis & Park, 2001; Wai et al., 2010, 2018; Xie & Shauman, 2003).
Relation of Trends in Participation and Top Achievement
As seen in Table 9, the results of Pearson correlation analysis indicated that a significant positive correlation exists between the FMRs in participation and top achievement for all AP math exams for the entire population. The author interprets this finding as a strong association between participation and performance, which means when we achieve to lower the gender disparities in participation, we might get closer to increasing parities among top-achieving female and male students. Although reaching parity in participation positively benefits parity in top achievement, a 1:1 FMR value in participation does not necessarily lead to a 1:1 FMR in top achievement. For example, the FMR value in participation in AP Statistics was 1.11:1 for the entire population in 2019 while it was 0.73:1 among top achievers in the same year. This situation also holds for most mathematics exams and tests, including the ACT/SAT, AP Calculus test, Graduate Record Exam (GRE), American Mathematics Competitions (AMC), and National Assessment of Educational Progress (NAEP). Prior studies report disparities among high achievers favoring male students despite parity in participation (Niederle & Vesterlund, 2010).
When the data are segregated by race, while the relationship between FMRs in participation and top achievement was significantly positive for Asian American, Hispanic, and White students for all exams, the findings indicated no associations between the FMRs in participation and top achievement for Native American and Black students in any of the exams. This result is most probably due to the small number of top scorers from Native American and Black students that led to significant variations in the mean FMR values. Although this study only focused on gender disparities as disaggregated by race, analyzing the racial disparities was beyond the goal of this study. However, the findings showed significant racial disparities not only in participation in the exams but also in top achievement between racial groups. For example, only 694 Black students scored a 5 on the AP Calculus BC exam in 2019 while the number of Asian American students was more than 20,000. Thus, Asian American students were overrepresented almost 100 times more than Black students among the top scorers when considering their share in the total population. Significant racial disparities in precollege achievement are already perilous. Disparities will also lead to enormous differences in later outcomes and achievements; future research should explore the trend in racial disparities in AP math exams as an important area.
Among all three AP math exams, Calculus BC is a platform for which gender disparities in both participation and top achievement are most apparent. This finding is consistent with the findings of Amelink (2009), who identified the AP Calculus BC as the exam in which females were underrepresented most. Generally, students enroll in AP Calculus BC courses after successful completion of AP Calculus AB exam. In other words, most of the participants of Calculus BC are the students who already scored a 3, 4, or 5 on the Calculus AB exam, which is also evident when the mean FMR value (0.678:1) in top achievement for AP Calculus AB participation is compared with the mean FMR (0.671:1) in participation for AP Calculus BC. This fact may imply that the smaller number of females appear in the more advanced level of mathematics, which is consistent with the findings of Bahar (2021), who found higher proportions of males in the higher levels of the AMC competitions.
This study presented encouraging findings in terms of reaching parity between females and males. However, there is no evidence as to whether these advancements are due to effective policies and practices that encourage females and males for STEM fields or some unobtrusive factors such as ceiling effect of the exams. Especially given that 43% of the entire Calculus BC participants scored a 5 in 2019, this suspicion is not rootless. Although the present findings did not test the significance of the change over the years, the number of students who scored five on the Calculus BC exam was only 30% in 1997. As AP exams are designed as criterion-referenced tests, it is not possible to investigate disparities beyond a score of five. However, we could have expected to see lower FMRs if we had the chance to move through the upper percentages in the achievement distribution of the AP Calculus BC exam. As prior studies that examined gender disparities in other high-stake tests, such as the SAT, ACT, GRE, and AMC, found larger underrepresentation of females in top achievement levels, this assumption might also hold but needs validation through further investigations.
The number of female and male participants stopped increasing for all AP math exams recently. For example, as depicted in Table 1, the number of female and male participants in Calculus AB started to decline as of 2017 for the entire population. This picture is almost similar to AP Calculus BC and Statistics (see Tables 2 and 3). Because this study focused on the trend analysis in gender disparities rather than general trends in participation in AP math exams, analyzing the factors that contribute to the decline in the number of the participants is beyond the scope of this study. However, a further investigation would be needed to understand the reasons leading to the decline in the interest in AP math exams. More specifically, if such a decline stems from a decline in students’ interest in STEM fields or subjects, then this might be a significant threat to future STEM labor outcomes. Perhaps such an exploration might also reveal whether the AP program is currently at a breakout point that just started to lose its popularity or reached a saturation point in terms of student participation. Investigation of the matter would be an additional area for future research.
A limitation of this study was that, every year, a substantial number of participants either chose not to report their racial background or belonged to two or more than two races. Although these students were included in the total number of participants, they were not represented under any racial groups. For example, in 2019, the total number of students who chose not to report their racial background or belonged to two or more than two races exceeded 30,000 in total for all three AP math exams. The number of students who participated in one of these exams exceeded 600,000; the percentage of such a student population constituted only 5% of the entire population. However, because Native American and Black students are represented with a smaller size of participants and top achievers on these exams, such unreported numbers might be impacting their trends more than that were of overrepresented groups such as White and Asian American students. As no further information about these participants was accessible, it is reasonable to assume that the number of students who chose not to report their racial background or belonged to two or more than two races was represented equally weighted in all racial groups. However, this assumption needs further validation.
Conclusion
This study investigated the trends in gender disparities among participants and top achievers in AP math exams. Although the findings indicated that the FMRs in participation in the AP math exams increased significantly from 1997 to 2019 and favored females, the gender disparities among top achievers for all math exams are still substantial. Furthermore, the MK test and Sen’s slope analysis indicated that reaching parity in high achievement may take up to 60 years for the Calculus BC exam and roughly 30 years for Calculus AB and Statistics exams. Monitoring these differences through robust and scientifically sound analysis tools is crucial because the lack of parity in participation in and top achievement at advanced academics will not only lead to later gender disparities in STEM outcomes and achievements (Eccles, 2007; Makel et al., 2016; Wai et al., 2010), but they also threaten social justice initiatives and capitalizing on human potential (Darling-Hammond, 2010; Ma & Johnson, 2008; Prong, 2018; Sells, 1980; Shaw & Barbuti, 2010; Trusty, 2002).
Footnotes
Declaration of Conflicting Interests
The author declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author received no financial support for the research and/or authorship of this article.
