Abstract
In this study, we examine race, sex, and self-reported arrest histories (excluding arrests for minor traffic violations) from the National Longitudinal Survey of Youth 1997 (NLSY97; N = 7,335) for the period 1997 through 2008 covering cumulative arrest histories through ages 18 and 23. The analysis produces three key findings: (a) males have higher cumulative prevalence of arrest than females and (b) there are important race differences in the probability of arrest for males but not for females. Assuming that the missing cases are missing at random (MAR), about 30% of Black males have experienced at least one arrest by age 18 (vs. about 22% for White males); by age 23 about 49% of Black males have been arrested (vs. about 38% for White males). Earlier research using the NLSY97 showed that the risk of arrest by age 23 was 30%, with nonresponse bounds [25.3%, 41.4%]. This study indicates that the risk of arrest is not evenly distributed across the population. Future research should focus on the identification and management of collateral risks that often accompany arrest experiences.
Surprisingly, not much is known about an important social indicator in America: the cumulative probability that someone will have an arrest record by a given age. Brame, Turner, Paternoster, and Bushway (2012) recently investigated the arrest experience of a national sample of American youth and found that 25% to 41% of those youth reported having been arrested or taken into custody for a nontraffic offense by age 23. This study was the first in a generation (Christensen, 1967; see also Blumstein, 2010) to examine progressive cumulative population arrest patterns and the first ever to do so using survey data based on a nationally representative youth sample.
There is substantial research showing that arrested youth are not only more likely to experience immediate negative consequences such as contact with the justice system, school failure and dropout, and family difficulties, but these problems are likely to reverberate long down the life course in terms of additional arrests, job instability, lower wages, longer bouts with unemployment, more relationship troubles, and long-term health problems including premature death (Bernburg & Krohn, 2003; Caspi, Wright, Moffitt, & Silva, 1998; Kirk & Sampson, 2013; Lattimore, Linster, & MacDonald, 1997; Laub & Vaillant, 2000; Sampson & Laub, 1993; Teplin, McClelland, Abram, & Mileusnic, 2005). In spite of its importance, while we have estimates of the cumulative probability of incarceration (Bonczar, 2003; Pettit & Western, 2004), prior to Brame et al. (2012) we lack a similar estimate for a criminal justice sanction that is far more pervasive—arrest. Furthermore, understanding the prevalence of arrest cannot be diminished by the claim that many arrests do not result in a criminal conviction as previous research has suggested that the negative consequences of arrest (at least with respect to employment) occur even when arrest does not lead to conviction (Pager, 2003; Schwartz & Skolnick, 1962).
While the Brame et al. (2012) estimate is informative, it is well documented that at any fixed age or within any age range, there are large differences in aggregate arrest rates by race and sex in the United States. For example, in 2010, the Federal Bureau of Investigation (FBI) reported that the arrest rate of men was 3 times higher than that of women, and the arrest rate for Blacks was 2.2 times higher than that of Whites (FBI, 2011). Aggregate arrest statistics from the FBI like these, however, are not broken down by race and sex groups (i.e., Black males, Black females, etc.; Snyder, 2011). Furthermore, aggregate arrest rates conflate two statistics—the prevalence of an arrest and the frequency of arrests for those who have at least one arrest (Blumstein & Cohen, 1987; Blumstein, Cohen, Roth, & Visher, 1986). Hence, an arrest rate that is 2.2 times higher for Blacks relative to Whites does not mean that Blacks have a cumulative prevalence of arrest that is 2.2 times that of Whites. While social scientists and policy makers are already well aware of the race and sex differentials in the FBI statistics (Snyder, 2011), little is known about race and sex variation in the cumulative prevalence of arrest. This study attempts to fill this gap by describing what can be learned about race and sex differences in the cumulative risk of arrest by ages 18 and 23 using a contemporary nationally representative sample.
Method
The analysis reported in this article is based on the cross-sectional National Longitudinal Survey of Youth 1997 (NLSY97). All of the data used in this study are publicly available from the United States Bureau of Labor Statistics. The initial effort in conducting the cross-sectional NLSY97 involved the identification of a self-weighting representative sample of U.S. households with youths between the ages 12 and 16 years at the end of 1996. A total of 7,335 survey-eligible youths screened into the NLSY97, and the interviewing staff were successful in conducting a first-round interview with 6,748 of these youths in 1997 and were unsuccessful in conducting first-round interviews with the remaining 587 youths. At the time of the first interview in 1997, all surveyed youths were between the ages of 12 and 17 (Moore, Pedlow, Krishnamurty, & Wolter, 2000).
According to the NLSY97 data set, the 6,748 first-round respondents include 921 Hispanics, 1,081 Non-Hispanic Blacks, 81 Non-Hispanic Mixed races, and 4,665 Non-Hispanic, Non-Blacks (consistent with Moore et al., 2000, we consider the last two groups to be non-Hispanic, non-Black, or—more simply—White). Table 1 presents a summary of the race and sex distribution of the 7,335 survey-eligible NLSY97 youth and the subsample of 6,748 youths who actually participated in the first round of the NLSY97. Based on survey questions about arrest experiences, we compiled historical information about arrest experiences through ages 18 and 23 for each person. An age 23 (276 months old) cutoff was used because the youngest NLSY97 participants were at least 23 years old by the time of the 2008 interview.
Race and Sex Distribution of NLSY97-Eligible Youth (N = 7,335).
Note. NLSY97 = National Longitudinal Survey of Youth 1997; P = first-round NLSY97 participants; NP = first-round NLSY97 nonparticipants.
In addition to examining the arrest information for the NLSY97 participants, we pay considerable attention to (a) the uncertainty created by the 587 youths who were targeted for NLSY97 participation but did not actually participate and (b) the persons who participated in the first round of the NLSY97 but provided insufficient information for us to discern whether they had been arrested by age 23. Researchers commonly assume that the missing cases are missing at random (MAR; Brame et al., 2012; Little & Rubin, 1987; Manski, 1995, 2003). For purposes of this article, the MAR assumption asserts that individuals with missing information in a particular race and sex cell have the same arrest rate as the individuals who provided information on their arrest experiences. While convenient for statistical analysis, this is a strong and untestable assumption (Manski, 2003).
An alternative to MAR is based on what we refer to as the minimal assumptions (MA) estimator. The MA involved are those that would be invoked in any large-scale survey: (a) that survey respondents accurately report their experiences; (b) that missing survey respondents would have accurately reported their experiences if they had been interviewed; and (c) that the entire sample (observed and unobserved cases) is a nationally representative sample. It produces two estimates: (a) a lower bound assuming that none of the missing persons had been arrested and (b) an upper bound assuming that all of the missing persons had been arrested (Manski, 2003).
The MAR and MA analyses could be viewed as two extremes: one treats the missing data problem as trivial and the other considers the maximum uncertainty. It is also useful to consider a series of middle-ground (MG) analyses which not only take seriously the ambiguities created by the missing cases but also highlight some reasonable possibilities for inference with assumptions that are stronger than MA but weaker than MAR (Manski, 2003). We view our MG estimators as the most credible estimates of arrest prevalence patterns in the NLSY97.
The key parameter in the MG estimators is the conditional prevalence of arrest for the missing cases divided by the conditional prevalence of arrest for the observed cases, or the relative risk (RR). The RR cannot be estimated as the numerator—the conditional prevalence of arrest for the missing cases—is unknown. We do know that the numerator lies between 0 and 1, so we can form bounds on the RR parameter for the sample as a whole and for any identifiable subsample of the NLSY97. When RR is 1, the two conditional prevalence estimates match and the data are MAR (no correlation between a person being missing and arrested). When RR < 1, the missing cases have a lower likelihood of arrest than the observed cases (a negative correlation between a person being missing and arrested). Symmetrically, when RR > 1, the missing cases have a greater likelihood of arrest than the observed cases (a positive correlation between a person being missing and arrested). Viewed in this light, it becomes apparent that MAR is a special case of a wide range of possibilities. Within the statistical literature on missing data, both our MA and MG estimators fall within the set of models that allow for nonignorable nonresponse (Little & Rubin, 1987).
Although most of the uncertainty described in this article revolves around the missing cases, we also follow standard practice and account for sampling error with confidence intervals (CIs). The NLSY97 cross-sectional household sample is based on a self-weighting design which leads to valid point and interval estimates of cumulative arrest prevalence. Due to clustering, however, sampling error estimated under the assumption of simple random sampling will be too small. NLSY97 research staff have recommended adopting a standard error design effect multiplier (DEFT) of 1.22 to calibrate sampling uncertainty in the self-reported arrest measures due to clustering. The CIs estimated in this article are based on DEFT-adjusted Bonferroni-asymptotic joint 95% confidence regions and implemented with the bootstrap (Manski, 2003). Statistical analyses were conducted with the R/S+ programming language (Chambers, 1998).
Results
The NLSY97 data allow us to measure the accumulation of arrest experiences as people age from 8 to 23 (Brame et al., 2012). Tables 2 and 3 present the frequency distributions upon which all of the analyses reported in this article are based; Table 2 counts the persons in each group at age 18 while Table 3 counts the persons in each group at age 23. As our analysis focuses on cumulative arrest experience, the number of persons in the “never arrested” group declines over time, while the number in the “ever arrested” and “missing” groups increases over time. The number of persons who did not participate in the first round is constant throughout the 11 rounds of the NLSY97.
Age 18 Distribution of Race, Sex, and Arrest Experience.
Note. NP = first-round NLSY97 nonparticipants, persons in the “NP” column (never participated) were missing throughout the study; II = people who participated but provided insufficient information to measure arrest experiences; NH = non-Hispanic; NLSY97 = National Longitudinal Survey of Youth 1997.
Age 23 Distribution of Race, Sex, and Arrest Experience.
Note. NP = first-round NLSY97 nonparticipants, persons in the “NP” column (never participated) were missing throughout the study; II = people who participated but provided insufficient information to measure arrest experiences; NH = non-Hispanic; NLSY97 = National Longitudinal Survey of Youth 1997.
Basic Analysis
Figure 1 summarizes our findings for the age 18 analysis (Panel A) and the age 23 analysis (Panel B). The far-left-hand panels in Figure 1 display the cumulative arrest prevalence rates assuming that the missing cases are MAR; the fences at the end of the intervals represent the 95% confidence limits. Based on the age 18 MAR analysis, it appears that non-Hispanic, Black males have the highest risk of arrest by age 18 (29.6%), followed by Hispanic males (26.2%) and non-Hispanic, non-Black (White) males (21.5%). The MAR estimates for the females are much more tightly clustered (all three race groups between 11.8% and 12.0%). Although the MAR arrest rates are higher at age 23 (Panel B), the between-group patterns are the same (e.g., the Black males have an arrest rate of 48.9%, Hispanic males, 43.8%, and White males, 37.9%). The MAR CIs tell us that the males as a whole are significantly different from the females as a whole and that the Black males have a significantly higher arrest rate than the White males. Neither the White nor the Black males differ significantly from the Hispanic males.

Ever arrested status by demographic group.
The right-hand sides of Figure 1 describe the uncertainty created by the missing cases using the MA estimator. The inner fences represent the in-sample uncertainty and the outer fences reveal the 95% confidence limits of our interval estimates for the prevalence of arrest. The bounds of these interval estimates are wide and reflect the inferential consequences of the missing observations in the NLSY97. Of course, all large-scale nationally representative longitudinal surveys confront this sort of uncertainty (Manski, 1995, 2003; Pepper, 2001). Our analysis has the virtue of formally transmitting this uncertainty to readers, hence the ambiguity created by the missing data is well understood. As all of the intervals for all of the groups overlap with each other, it is not possible to draw definitive conclusions about group differences in arrest experiences without making some untestable assumptions.
Two clear conclusions can be drawn from Figure 1. First, there is no credible evidence that the Hispanic males differ significantly from either the White or the Black males. Second, there are no discernible differences in female arrest rates among the different race groups. Even the MAR estimates for these two comparisons do not identify a clear difference. The analysis in Figure 1 is less clear on whether (a) males are different from the females and (b) White males are different from the Black males.
Sex Comparisons
We turn first to the question of whether the males and females in the NLSY97 have different cumulative arrest prevalence rates. The MAR analysis suggests that the answer is “yes,” while the MA analysis is inconclusive. It turns out, however, that there is a strong basis in these data for concluding that males do have higher arrest rates than females at both ages 18 and 23. As a starting point, we define the parameter, δ, as the conditional prevalence of arrest for the males minus the conditional prevalence of arrest for the females. If δ > 0, males have a higher prevalence of arrest than the females; if δ < 0, the females have a higher arrest rate; and if δ = 0, the arrest rates for males and females are equal. Figure 2 reveals how δ varies depending on different missing data patterns that might exist in the NLSY97 data.

Sensitivity analysis for difference between male and female arrest rates.
Panel A of Figure 2 displays the different values of δ that are possible in the age 18 male–female comparisons. The analysis reveals that males and females have approximately the same arrest rate (i.e., δ ≈ 0) only under a peculiar set of circumstances: (a) the arrest rate for the missing females is higher than that for the observed females and (b) the arrest rate for the missing males is lower than that for the observed males. The horizontal and vertical axes show the approximate bounds on the RR parameter for the males and the females. Darker shades of gray in the plotspace are indicative of smaller δ estimates.
The MAR estimate of δ (0.115 with 95% confidence limits of [0.093, 0.137]) is in the lower-left-hand corner of the plotspace indicating that the MAR arrest rate for the males exceeds the MAR arrest rate for the females. In other words, even if the MAR assumption is wrong, only an extreme difference in the missing data patterns for the two sexes could overcome the difference we see in the observed data. Based on this evidence, we conclude that the males have higher arrest rates than the females at ages 18 and 23 in the NLSY97 sample. We also conclude that the relative difference between the MAR rates is essentially the same at both ages, with the male arrest rate ~2.0 times larger than the female arrest rate at age 18 and ~2.1 times larger at age 23. Apparently, the pattern of differential risk did not change over this time period, and its origins almost certainly precede it.
Race Comparisons among the Males
We now turn to the question of whether the Black males and White males have different arrest rates. For this analysis, we define δ as the conditional prevalence of arrest for White males minus the conditional prevalence of arrest for Black males. Hence, when δ > 0, the arrest rate for the White males exceeds the arrest rate for the Black males; when δ < 0, the arrest rate for the Black males exceeds the rate for the White males. The MAR estimates of δ are highlighted in the lower-left-hand quadrant of both panels of Figure 3. At age 18 (Panel A), the MAR estimate of δ is −0.081 (95% CI = [−0.133, −0.029]), while at age 23 (Panel B), the MAR estimate is −0.111 (95% CI = [−0.169, −0.052]).

Sensitivity analysis for difference between White and Black male arrest rates.
Based on the MAR analysis, we would conclude that the arrest rate for the Black males significantly exceeds the arrest rate for the White males. In fact, the White male rate can equal or exceed the Black male rate only when the RR for both groups is in the area to the right of the dark diagonal line in the bottom right quadrants of Figure 3: a point where the correlation between a case being missing and arrested is strong and positive for the White males but negative for the Black males. As with the aforementioned sex comparison, this would seem to be a highly unlikely scenario. We are, therefore, willing to make the relatively weak assumption that the RRs are either at or to the left of the diagonal line and conclude that the arrest rate for the Black males exceeds that for the White males. We also conclude that the ratio of MAR arrest rates for Black males over White males is similar at both ages 18 (~1.4) and 23 (~1.3). Despite the escalation in cumulative prevalence over the period from ages 18 to 23, the ratio of the rates between groups is relatively constant. Whatever process accounts for this differential pattern probably originates from childhood or adolescence.
Discussion
Cumulative arrest prevalence rates are high (Brame et al., 2012), and there are good reasons for suspecting that these rates are not equally distributed throughout the population (Blumstein, 2010; Blumstein & Graddy, 1982; Christensen, 1967). There are also good reasons to be concerned about this, given the fact that an arrest, even if it results in an acquittal, can have very damaging consequences for the subsequent life chances of youth. The measurement of cumulative arrest prevalence using official data, however, presents difficult challenges (Brame et al., 2012). The best option currently available is to rely on self-reported survey data. Using the NLSY97, our analysis attempts to fill a gap in our understanding of contemporary American arrest patterns by ages 18 and 23.
A major earlier finding from Brame et al. (2012) was that contemporary cumulative ever-arrest experiences—while not normative—are certainly disturbingly prevalent among American youth. In this analysis, we find that males exhibit significantly higher cumulative arrest prevalence than females and that there seems to be important differences between Black and White males. We are not able to discern clear differences between the Hispanic males and other race groups. Nor were we able to measure any clear race differences among the females. Interestingly, the between-group differences in cumulative arrest prevalence observed in the NLSY97 are smaller than the measurable differences between groups in the aggregate FBI arrest rate statistics. For example, the 2010 FBI statistics suggest that Blacks have an arrest rate that is on the order of 2.2 times higher than Whites (FBI, 2011). In terms of MAR estimates of cumulative arrest prevalence, however, we see that the Black and White females are not different and that the ratio of Black male prevalence to White male prevalence at age 23 is on the order of 48.9 / 37.9 ≈ 1.3, which is much smaller than the FBI difference. It follows that the distributions of cumulative prevalence estimates do not necessarily correspond to easily measured differences between race and sex groups in aggregate arrest rates. This is not surprising as many individuals who have been arrested at least once persist in criminal behavior at different rates (Blumstein & Cohen, 1987; Blumstein, Cohen, Roth, & Visher, 1986; Piquero, Farrington, & Blumstein, 2003). Taken as a whole, our findings inform social scientists and policy makers that many young people have had (and will have) the experience of being arrested and that there is a real variation in that risk within the population.
A high priority for future research is to document the temporal sequencing of arrest and other adverse events during childhood, adolescence, and early adulthood. It will also be constructive to place this information in its proper geographic and historical context, as suggested in recent research conducted by Stevens and her colleagues (Stevens, 2013; Stevens, Morash, & Chesney-Lind, 2011). The NLSY97 data are particularly useful for this purpose as a good deal of relevant information is recorded on a near-annual basis in a calendar format and data suitable for subnational analysis are available (see, for example, Apel, Bushway, Paternoster, Brame, & Sweeten, 2008). It will be important to continue following the NLSY97 survey participants as they move further into adulthood to reach an improved understanding of the long-term health, legal, and social consequences that flow from formal contact with the criminal justice system. Our results also suggest that the relationship between demographic characteristics and arrest experiences is possibly quite complex and not yet well understood. For example, it would not be reasonable to speak about an overall “race correlation” because the correlation between race and arrest experiences is different for males than for females—at least in the NLSY97. Similarly, our study makes no claims about discrimination or racial profiling as there was no effort to control or adjust for the effects of other factors that might be correlated with both demographic characteristics and criminal involvement. Our focus is simply on documenting the basic demographic patterns of arrest as measured by the NLSY97. We hope this work will serve as a point of departure for careful consideration of key measurement, theoretical, and policy issues surrounding the arrest experiences for American youth.
Footnotes
Authors’ Note
Most of the work on this project was completed when Dr. Brame was at the University of North Carolina at Charlotte.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
