Abstract
Background. Prevention of both school dropout and teen pregnancy represent clear public health priorities for South Africa, yet their complex and potentially cyclical relationship has not been fully explored. Objective. To further understand how this relationship operates, we analyzed data from a randomized trial of young women aged 13 to 20 years enrolled in school in rural South Africa to estimate the association between pregnancy and subsequent dropout and between dropout and subsequent pregnancy. Method. We examined inverse probability (IP) of exposure-weighted survival curves for school dropout by pregnancy and for pregnancy by school dropout. We used weighted curves to calculate 1-, 2-, and 3-year risk differences and risk ratios. Additionally, we used an IP-weighted marginal structural cox model to estimate a hazard ratio (HR) for each relationship. Results. Dropout from school was associated with subsequent pregnancy (HR 3.58; 95% confidence interval [CI] [2.04, 6.28]) and pregnancy was associated with subsequent school dropout (HR 2.36; 95% CI [1.29, 4.31]). Young women who attended school but attended fewer days had a higher hazard of pregnancy than those who attended more school (HR 3.64; 95% CI [2.27, 5.84]). Conclusion. Pregnancy is both a cause and a consequence of school dropout. Consideration of school attendance and academic performance could ultimately enhance pregnancy prevention efforts in this population. Programs should be tailored differently for (1) girls who have dropped out of school, (2) those who are in school and at risk for pregnancy, and (3) those who are in school and become pregnant.
In South Africa, pregnancy among adolescent girls and young women is extremely common. A third of young women have had their first child before the age of 20 years, and roughly two thirds of births among teenage mothers are reported as unplanned or unwanted (MacPhail, Pettifor, Pascoe, & Rees, 2007; National Department of Health, Medical Research Council, & OrcMacro, 2007; Rosenberg et al., 2015). Early age at first birth has been associated with medical complications for the mother and the infant as well as adverse outcomes later in life such as reduced educational achievement and economic instability (Gibbs, Wendt, Peters, & Hogue, 2012; Panday, Makiwane, Ranchod, & Letsoalo, 2009). Teen pregnancy has been shown to lead to school dropout (Ardington, Menendez, & Mutevedzi, 2015; Clark & Mathur, 2012; M. J. Grant & Hallman, 2008; Marteleto, Lam, & Ranchhod, 2008; Timæus & Moultrie, 2015), and, conversely, school dropout among nonpregnant teens has been shown to lead to subsequent pregnancy (Manlove, 1998; Rosenberg et al., 2015).
School dropout rates in South Africa are low (<5%) between the ages of 6 and 15 years as per South African law mandating school attendance (Timæus, Simelane, & Letsoalo, 2012), but the rates begin to rise (15% to >50%) after the age of 16 years. Dropout rates are highest in poor, majority Black schools (Timæus et al., 2012) and may be exacerbated by poor quality of education characterized by low literacy and numeracy levels, resource shortages, underqualified teachers, and high failure rates (Modisaotsile, 2012). School dropout has also been associated with several negative health outcomes among young women beyond pregnancy, including increased sexual risk behaviors (Hargreaves et al., 2008; Stroeken et al., 2012) and almost three times higher risk for HIV infection (Stoner, Pettifor, et al., 2017).
Prevention of both school dropout and teen pregnancy represent clear public health priorities for South Africa, yet their complex and potentially cyclical relationship has not been fully explored. Understanding the extent to which pregnancy affects school dropout and to which school dropout affects pregnancy could guide targeted interventions or programs to prevent one or both of these outcomes. Here, we estimated the association between pregnancy and subsequent dropout and between dropout and subsequent pregnancy in a cohort of adolescent girls and young women in rural South Africa.
Method
Study Sample and Procedures
We analyzed data from the HIV Prevention Trials Network (HPTN) 068 study, a randomized trial to determine if providing a conditional cash transfer intervention would reduce young women’s risk for HIV acquisition (Pettifor, MacPhail, Hughes, et al., 2016; Pettifor, MacPhail, Selin, et al., 2016). The intervention provided a transfer of cash to young women and their households, conditional on young women attending 80% of school days. The study enrolled 2,533 young women (all Black South African) aged 13 to 20 years who were not pregnant or married at baseline and were attending high school grades 8 to 11 in the rural Bushbuckridge subdistrict of Mpumalanga province, South Africa. All young women were enrolled in school at study enrollment but were retained in the cohort if they dropped out of school during the study. Our analysis excluded young women without at least one follow-up visit after enrollment and those for whom pregnancy information was not available.
Institutional review board approval for this study was obtained from the University of North Carolina at Chapel Hill and the University of the Witwatersrand Human Research Ethics Committee as well as the Provincial Department of Health’s Research Ethics Committee.
Up to four annual assessments of the young women were conducted from 2011 to 2015. Assessments occurred at enrollment and roughly every 12 months thereafter. Each annual study visit included an Audio Computer-Assisted Self-Interview with self-reported information on pregnancy, sexual behaviors, and demographics. Additionally, information on school attendance and dropout was collected directly from high school attendance registers, which were closely monitored by both school staff and study staff to ensure accuracy. School attendance data were collected in the months of February, May, and August because these months were most representative of normal attendance due to lack of holidays/exams.
We examined the relationship between pregnancy and school dropout in both directions (with each as an exposure and the other as the outcome) using time-to-event methods to account for the different lengths of time that the girls contributed to the study.
Pregnancy and Subsequent School Dropout
To estimate the association between pregnancy and subsequent school dropout, we constructed an analytic cohort in which each young woman was followed from enrollment in continuous months until date of school dropout, or date of censoring if she moved, died, was lost to follow-up, graduated, or reached the end of the study time period. We used continuous time until the actual month of dropout between visits. A woman was considered to never have been pregnant until the first survey, including baseline, at which she reported being pregnant within the past 12 months (yes/no). After this time, she was considered “exposed” to pregnancy throughout the remainder of follow-up. We used exposure information only from the visits prior to outcome ascertainment to ensure that exposure occurred before the outcome. The outcome of school dropout was defined as the first report of school dropout in the school attendance registers for any reason.
We used inverse probability (IP) of exposure weights to estimate absolute 3-year risks as well as the 3-year risk difference, risk ratio, and hazard ratio (HR) comparing risk of dropout had all women been “exposed” to pregnancy throughout the study (e.g., had been pregnant at enrollment) with the risk if all women had never become pregnant during the study. Risks, risk differences, and risk ratios show risk at specific time points over the study period, while the HR is a measure of the effect over the entire time period.
School Dropout and Subsequent Pregnancy
To estimate the association between school dropout and pregnancy, we constructed an analytic cohort in which each young woman was followed from enrollment in yearly intervals until the follow-up visit with a reported pregnancy, or censoring if she moved, was lost to follow up, graduated, or reached the end of the study period. We used yearly intervals as pregnancy was only reported at each annual survey. A young woman was considered to be in school until the first visit after which she was recorded dropping out of school. She was then defined as having dropped out for the remainder of the study period. We also examined low attendance in school (<80% of school days vs. 80% or more) at each survey as another measure of schooling, as defined in previous studies using these data (Pettifor, MacPhail, Hughes, et al., 2016; Stoner, Pettifor, et al., 2017). The outcome was defined as the first self-reported pregnancy in the past 12 months during the study. We used time to first pregnancy as very few girls had more than one child by the end of the study (N = 29, 1.23%). Again, we used exposure information only from the visits prior to outcome ascertainment to ensure that exposure occurred before the outcome. In this case, dropout was assessed using school attendance status at two follow-up visits before the outcome to ensure no overlap. We estimated absolute 3-year risks as well as the 3-year risk difference, risk ratio, and HR comparing risk for pregnancy had all women been “exposed” to dropout throughout the study with the risk if all women had never dropped out during the study.
We did not exclude young women who had a pregnancy prior to the study but did include prior pregnancy in our weights to control for confounding. An additional sensitivity analysis was done excluding young women who had a previous pregnancy at baseline (N = 197) and, for analyses using two time periods prior, those pregnant at baseline or the first visit (N = 260). However, results remained similar, and therefore, young women with a prior pregnancy at baseline were retained in the analysis cohort (Supplemental Table S1, available in the online version of this article).
Statistical Methods
Risks under each exposure were estimated using the complement of the extended Nelson-Aalen estimator of the survival function weighted to account for time-varying confounders of each exposure-outcome relationship (Cole & Hernán, 2004; Westreich et al., 2010; Xie & Liu, 2005). These risk functions were used to calculate risk ratios and risk differences comparing exposures. Confidence intervals around weighted risks, risk differences, and risk ratios were calculated using a nonparametric bootstrap calculated from 200 full samples with replacement from the observed data.
To examine the relationship between pregnancy and school dropout, we additionally estimated an IP-weighted HR using weighted Cox proportional hazards regression with robust standard errors. To estimate the association between dropout and subsequent pregnancy (in yearly intervals), we estimated an IP-weighted HR using pooled logistic regression. Confidence intervals around the HRs were created using the robust sandwich variance estimator (Cole, Hernán, Anastos, Jamieson, & Robins, 2007).
Inverse Probability Weights for Both Relationships and Covariate Definitions
We used a causal directed acyclic graph (DAG) to identify a minimally sufficient adjustment set of covariates and identified the same time-fixed and time-varying covariates for both outcomes. Time-fixed covariates included age at baseline (categorized as 13-14, 15-16, 17-18, and 19+ years), prior pregnancy, depression, and cash transfer intervention arm (Pettifor, MacPhail, Hughes, et al., 2016). Depression was defined as a baseline children’s depression inventory score equal to or above seven (Cluver, Gardner, & Operario, 2007; Kovacs, 1985). Time-varying covariates included alcohol use more than once per month, older partner (≥5 years), wealth in quartiles according to household assets, and single or double orphan status. Based on the literature, we used ≥5 years as the uniform cutoff for all age-disparate relationships (Harling et al., 2014; Schaefer et al., 2017; Stoner et al., 2019). We used wealth quartiles rather than parental educational attainment as they were highly correlated and parental education had more missing information.
Other covariates that were descriptively examined at baseline but were not included in the minimally sufficient adjustment set in our DAG were physical violence by a partner, grade repetition, hope for the future score (Abler et al., 2017), risky sexual behaviors (any unprotected sex, transactional sex, number of partners, contraception use), any report of violence from teachers or students at school, parent/guardian beliefs that education is more important for boys than girls, and revised children’s manifest anxiety score (Boyes & Cluver, 2013).
We accounted for confounders set using stabilized IP of exposure weights (Cole & Hernán, 2008). Weights had the form
where
Results
Of the 2,533 young women included in the original randomized trial, we excluded 163 young women with no follow-up Audio Computer-Assisted Self-Interview survey after baseline and 5 who were missing pregnancy information for the duration of the study. Of the 2,365 young women in our final analytic cohort, 136 (5.7%) dropped out of school, and almost 16% (N = 372, 15.7%) had an incident pregnancy between enrollment and the end of the study period.
Young women who reported ever being pregnant at baseline (N = 197, 8.4%) were older and were more likely to be depressed, anxious, an orphan, have a lower hope for the future score, and/or ever have experienced physical violence from a partner or reported violence from teachers or students at school (Table 1). They also had worse educational outcomes—they were more likely to have ever repeated a grade and less likely to have high attendance in school. We did not find differences by randomization arm, alcohol use, or having an older partner or a belief that education was more important for boys than girls.
Baseline Characteristics of Young Women Aged 13 to 20 Years by Ever Been Pregnant in Agincourt, South Africa, from March 2011 to December 2012 (N = 2365). a
Note. Mdn = median; IQR = interquartile range.
Missing at baseline: Ever been pregnant n = 22; school attendance n = 10; attendance n = 10; wealth n = 4; alcohol use n = 3; depression n = 107; anxiety n = 33; ever had sex n = 3; unprotected sex n = 15; single or double orphan n = 20; parental monitoring n = 40; mother’s educational level n = 191; exchange sex n = 89; intimate partner violence n = 48, older partner n = 15. Boys more educated 1; modern contraception 7.
Young women who reported having a pregnancy at baseline or during follow-up were more likely to drop out of school compared with young women who never had a pregnancy (Figure 1). The risk differences and risk ratios comparing those who were ever pregnant versus those who were never pregnant increased over the study period for a 3-year risk difference of 2.3% (95% CI [−1.5%, 6.0%]) (Table 2). The weighted hazard of school dropout for young women who ever had a pregnancy was 2.36 (95% CI [1.29, 4.31]) times the hazard in young women who never had a pregnancy (Table 3).

Cumulative incidence curves for the association between pregnancy in the past 12 months and incident school dropout for 2,360 young women aged 13 to 23 years participating in HPTN 068 (2011-2015). (A) Unweighted. (B) Weighted. Treatment weighted curves accounted for the following covariates: age at baseline, used alcohol in the past 12 months, baseline depression, having any partner 5 years or older, wealth, intervention assignment, orphan status, and pregnancy at baseline.
Weighted Risk Differences and Risk Ratios at 1, 2, and 3 Years for the Effect of Pregnancy in Past 12 Months on Incident Dropout Among Young Women Aged 13 to 23 years, Enrolled in HPTN 068 Between 2011 and 2015 at Different Time Intervals. a
Note. HPTN = HIV Prevention Trials Network; CI = bootstrap confidence interval.
Weighted models conditioned on the following covariates: age at baseline, alcohol use, baseline depression, having any partner 5 years or older, wealth, intervention assignment, orphan status, prior pregnancy at baseline.
Hazard Ratios for the Effect of Pregnancy on Incident School Dropout and Dropout on Incident Pregnancy Among Young Women Aged 13 to 23 Years, Enrolled in HPTN 068 from 2011-2015. a
Note. HPTN = HIV Prevention Trials Network; CI = robust confidence intervals; HR = hazard ratio.
Weighted models conditioned on the following covariates: age at baseline, alcohol use, baseline depression, having any partner 5 years or older, wealth, intervention assignment, orphan status, prior pregnancy at baseline.
Young women who dropped out of school during the study period had a higher weighted risk for incident pregnancy compared with young women who never dropped out (Figure 2). The risk difference comparing those who ever dropped out versus those who never dropped out at 3 years was 21.7% (−6.5%, 49.9%) (Table 2). The weighted HR for the association between dropout and incident pregnancy was 3.58 (95% CI [2.04, 6.28]) (Table 3). Low attendance in school (<80% of school days) was also associated with a higher weighted hazard of pregnancy compared with high attendance (≥80%) in school (HR 3.64; 95% CI [2.27, 5.84]). The number of young women who became pregnant and dropped out of school was low over the study period, leading to large confidence intervals around all estimates (limited precision).

Cumulative incidence curves for the association between dropout and incident pregnancy two time periods later for young women aged 13 to 23 years participating in HPTN 068 (2011-2015). (A) Unweighted. (B) Weighted. Treatment weighted curves accounted for the following covariates: time-varying age, used alcohol in the past 12 months, time-varying depression, having a partner 5 years or older, Wealth, intervention assignment, orphan status, previous pregnancy at baseline.
Discussion
In this bidirectional assessment of pregnancy and school dropout among adolescent girls and young women in rural South Africa, dropout and pregnancy were highly associated. Girls who were pregnant either before or during the study period were more likely to drop out than girls who did not become pregnant (HR 2.36; 95% CI [1.29, 4.31]), and girls who dropped out of school were more likely to experience a pregnancy compared with girls who were retained in school (HR 3.58; 95% CI [2.04, 6.28]). Even among young women who remained in school, those who attended fewer days had a higher hazard of pregnancy than those with greater attendance (HR 3.64; 95% CI [2.27, 5.84]). Together, our findings suggest that pregnancy is both a driver and a consequence of school dropout among South African adolescent girls.
For young women who are in school, risk factors for pregnancy are complex and are likely to vary by context and population (Christofides et al., 2014; M. J. Grant & Hallman, 2008; Jewkes, Morrell, & Christofides, 2009; Panday et al., 2009; Stroeken et al., 2012; Willian, 2013). In our study, we found that girls who became pregnant were more likely to drop out of school and that violence, limited future aspirations, and poor educational performance (school attendance and grade repetition) were all associated with ever having been pregnant at baseline. Gender inequity is a major driver of teen pregnancy, and girls who have experienced physical violence are more likely to become pregnant and to have an unwanted pregnancy (Christofides et al., 2014; Jewkes et al., 2009; Jewkes, Vundule, Maforah, & Jordaan, 2001). Young women who exhibit poor scholastic performance and those who have repeated grades are also more likely to get pregnant and are more likely to drop out following pregnancy (M. J. Grant & Hallman, 2008; Marteleto et al., 2008; Timæus & Moultrie, 2013).
Inconsistent and incorrect use of effective contraception likely accounts for a nontrivial proportion of teen pregnancy in South Africa. More than half of sexually active young women in South Africa report that they do not use contraception to prevent pregnancy (MacPhail et al., 2007). In our study, girls who experienced a prior pregnancy were significantly more likely to be using modern contraception at baseline compared with girls who had never been pregnant. Poor contraceptive uptake is associated with lack of adolescent friendly health services and purported stigma and discrimination by health care workers, friends, families, and other institutions (MacPhail et al., 2007; Panday et al., 2009; Zwang & Garenne, 2008). Furthermore, some studies have shown that along with this stigma, societal norms preclude girls from engaging with sexual health and contraceptive services until after they experience their first pregnancy (MacPhail et al., 2007). Therefore, interventions to prevent pregnancy in girls who are in school should include components to address gender inequity (i.e., gender-based violence) and provide comprehensive sexual health education and contraception services.
Young women who become pregnant and leave school infrequently return to school following childbirth (Jewkes et al., 2009; Willian, 2013), and interventions are needed to promote continued enrollment or reentry after pregnancy. Some studies have shown that young women who have performed well academically are more likely to return to school, further highlighting the need to intervene early in order to improve school performance before young women drop out (Ardington et al., 2015; M. J. Grant & Hallman, 2008). In our study population, only 14% of young women who dropped out for any reason eventually returned to school (Stoner, Edwards, et al., 2017). South African law mandates that pregnant girls be allowed to stay in school during pregnancy and return following childbirth. However, these policies are often not implemented due to attitudes of school staff and peers. Furthermore, limited resources among families, including lack of childcare, may prevent girls from returning to school (Jewkes et al., 2009; Panday et al., 2009; Willian, 2013). At present, South Africa does not provide childcare services for young women with children returning to school following pregnancy. Subsequently, efforts that encourage girls to return and remain in school following childbirth are critical. Future interventions or programs should focus on improving indicators of schooling before dropout occurs; building support among school staff, families, and peers; and providing services to support the parenting needs of girls returning to school.
Few studies have examined the relationship between dropout and subsequent pregnancy or considered interventions to prevent pregnancy in adolescents who have dropped out of school (Manlove, 1998; Rosenberg et al., 2015). Our results are similar to another study in the same region of South Africa, which found that girls who were enrolled in school had a lower hazard of pregnancy than girls who had dropped out, and that pregnancy rates were lower during school term than summer holiday (Rosenberg et al., 2015). School attendance may prevent pregnancy by providing periods of structure and supervision in young adults’ lives, which reduce opportunities for sexual activity (Barnes, Hoffman, Welte, Farrell, & Dintcheff, 2007; Rosenberg et al., 2015; Stoner, Edwards, et al., 2017). Similarly, young women out of school in settings where unemployment and poverty are high may be ambivalent about preventing pregnancy or encouraged to have children in order to increase their social status (K. Grant et al., 2002; Jewkes et al., 2009). Therefore, interventions to prevent pregnancy among adolescents out of school should help girls return to school and/or provide an environment where they are engaged in meaningful activities and supported by peers.
There are some limitations to this analysis. First, information on pregnancy in our study was self-reported and may be underreported. Young women who misreport pregnancy may also be more likely to misreport other sensitive information such as gender-based violence or depression. However, our assessment of school dropout was derived using school attendance registers and is likely unrelated to potential mismeasurement of other variables in this analysis. We expect our estimates to be biased toward the null and thus attenuated. Furthermore, we did not have information about the outcome of the pregnancy and would expect the association to be stronger in girls who have a child rather than those who do not carry their pregnancy to term. Third, there may be common causes for both school dropout and pregnancy or the potential for unmeasured confounding in our estimates. However, we did include the most important confounders identified by our DAG, including prior pregnancy, and explored models with other covariates. Estimates were generally similar for models with alternative adjustment sets. Fourth, we used depression information from baseline only as a different measure of depression was collected over study follow-up, and depression experience may have changed over time. Last, we did not capture the exact date of pregnancy, and we were unable to distinguish timing of pregnancies and school dropouts that occurred close together. However, our analytic framework ensures temporality for most events by defining our exposure in the time period before outcome ascertainment.
Our study is unique in that we use longitudinal school dropout and attendance information collected directly from high schools rather than from national census data, as reported in other studies. Our information is likely more accurate as it was monitored and collected from schools during a randomized trial and we are better able to finely capture characteristics that vary over time. Additionally, we were able to incorporate temporality in our analysis and control for both individual- and household-level characteristics. We contribute stronger causal evidence for the bidirectional relationship between pregnancy and dropout.
Overall, our findings provide important insights into the relationship between pregnancy and school dropout among young women in South Africa. Furthermore, our results posit that while pregnancy is both a driver and a consequence of school dropout, consideration of school attendance and academic performance could ultimately enhance pregnancy prevention efforts in this population. Programs should be tailored differently for (1) girls who have dropped out of school, (2) those who are in school and at risk for pregnancy, and (3) those who are in school and do become pregnant. Programs to prevent pregnancy among adolescents who have dropped out of school should encourage girls to return to school and/or focus on creating supportive and structured environments. For girls in school at risk for pregnancy, prevention programs should address gender inequity, provide comprehensive sexual health education, and improve access to contraception services. For young women in school who have recently had a child, programs should build support from school staff and families or focus on improving indicators of schooling before dropout occurs.
Supplemental Material
HEB831755_Supplemental_Material – Supplemental material for The Relationship Between School Dropout and Pregnancy Among Adolescent Girls and Young Women in South Africa: A HPTN 068 Analysis
Supplemental material, HEB831755_Supplemental_Material for The Relationship Between School Dropout and Pregnancy Among Adolescent Girls and Young Women in South Africa: A HPTN 068 Analysis by Marie C. D. Stoner, Katherine B. Rucinski, Jessie K. Edwards, Amanda Selin, James P. Hughes, Jing Wang, Yaw Agyei, F. Xavier Gomez-Olive, Catherine MacPhail, Kathleen Kahn and Audrey Pettifor in Health Education & Behavior
Footnotes
Acknowledgements
We thank the HPTN 068 study team and all trial participants.
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This study was funded by the National Institutes of Health (R01 MH110186, 2T32AI102623-06) and by Award Numbers UM1 AI068619 (HPTN Leadership and Operations Center), UM1AI068617 (HPTN Statistical and Data Management Center), and UM1AI068613 (HPTN Laboratory Center) from the National Institute of Allergy and Infectious Diseases, the National Institute of Mental Health, and the National Institute on Drug Abuse of the National Institutes of Health. This work was also supported by NIMH R01 (R01MH087118) and the Carolina Population Center and its NIH Center (Grant P2C HD050924). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. The MRC/Wits Rural Public Health and Health Transitions Research Unit and Agincourt Health and Socio-Demographic Surveillance System have been supported by the University of the Witwatersrand, the Medical Research Council, South Africa, and the Wellcome Trust, UK (Grants 058893/Z/99/A; 069683/Z/02/Z; 085477/Z/08/Z; 085477/B/08/Z).
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
