Abstract
The predictive validity of the Static-99 measures with ethnic minorities in the United States has only recently been assessed with mixed results. We assessed the predictive validity of the Static-99 and Static-99R with a sample of Latino sex offenders (N = 483) as well as with two subsamples (U.S.-born, including Puerto Rico, and non-U.S.-born). The overall sexual recidivism rate was very low (1.9%). Both the Static-99 measures were able to predict sexual recidivism for offenders born in the United States and Puerto Rico, but neither was effective in doing so for other Latino immigrants. Calibration analyses (N = 303) of the Static-99R were consistent with the literature and provided support for the potential use of the measure with Latinos born in the United States and Puerto Rico. These findings and their implications are discussed as they pertain to the assessment of Latino sex offenders.
Actuarial measures of sex offense recidivism risk are broadly used by researchers, evaluators, and clinicians in the United States (Babchishin, Hanson, & Blais, 2016; Claussen-Schulz, Pearce, & Schopp, 2004; Jackson & Hess, 2007). These measures have taken on an important role in the assessment of sex offenders and can play a crucial role in decisions ranging from the intensity of treatment received, to the release and/or supervision of sex offenders (Falk, 1999) to determination of the need for commitment as a sexually violent predator (Calkins, Jeglic, Beattey, Zeidman, & Perillo, 2014; Hamilton, 2011).
Actuarial risk assessment involves the measurement of the likelihood of an offender’s target behavior on a limited set of factors, using a numerical weighting system; usually, the factors are derived from empirical research, and the total score of the factors is used as a recidivism risk score (Hanson & Thornton, 2000). This method has been found to be consistently more accurate than unstructured clinical judgment in predicting sexual and violent recidivism (Hanson & Morton-Bourgon, 2004, 2009).
The Static-99 (Hanson & Thornton, 2000) remains the most utilized sex offense risk assessment measure (Hanson, Thornton, Helmus, & Babchishin, 2015; Viljoen, McLachlan, & Vincent, 2010). A revised version of the measure, the Static-99R (Helmus, Thornton, Hanson, & Babchishin, 2012) was created by modifying one item from the Static-99 to account for a decrease in reoffending as offenders’ age. Both measures have been found to have strong interrater reliability (e.g., Barbaree, Seto, Langton, & Peacock, 2001; de Vogel, de Ruiter, van Beek, & Mead, 2004; Hanson, Lunetta, Phenix, Neeley, & Epperson, 2014).
The Static-99 has exhibited at least moderate predictive validity (i.e., discriminating between recidivists and non-recidivists) for sexual recidivism using a variety of samples, with area under the curve (AUC) values ranging from 0.62 in a U.S. sample (Sreenivasan et al., 2007) to 0.78 for an Australian sample (Allan, Dawson, & Allan, 2006). The Static-99R has shown improved performance over the Static-99, with reported AUC values of 0.73 for a Canadian sample (Brouillette-Alarie & Prouxl, 2012) and 0.74 for a Dutch sample (Smid, Kamphuis, Wever, & van Beek, 2014).
However, discrimination alone provides an incomplete picture of a measure’s predictive validity (Singh, 2013). The extent to which a measure’s predictions of risk correspond to actual observed risk must also be determined (i.e., calibration; Singh, 2013). Hanson, Lunetta et al. (2014) described two approaches to assess the calibration properties of the Static-99R. The first approach involves the assessment of the fit between the expected (E) and observed (O) recidivism rates, by dividing the number of expected recidivists by the number of observed recidivists (E/O). The number of expected recidivists for a specific sample is obtained by using the percentage of expected recidivists in the normative sample for the measure. A 95% confidence interval (CI) for the ratio is calculated using the formula included provided by Rockhill, Byrne, Rosner, Louie, and Colditz (2003):
Perfect calibration is denoted by an E/O index value of 1.0 (Hanson, Lunetta et al., 2014). Significant differences between the Expected and Observed rates are inferred from a 95% CI that does not contain the value of 1.0.
The second approach described by Hanson, Lunetta, and colleagues (2014) involves the comparison of logistic regression coefficients obtained in a study to those previously reported in the literature. Hanson, Lunetta et al. (2014) suggested centering the logistic regression equation on the value of two, which corresponds to the score of the median offender. For example, when studying an ethnically diverse California sample, Hanson Lunetta et al. (2014) used the regression parameters based on a follow-up period of at least 5 years reported previously (Hanson, Babchishin, Helmus, & Thornton, 2013, Appendix Table B1: B02 = −2.941, standard error [SE] = 0.238; B1 = 0.331, SE 0.044) for comparison with the regression parameters they derived in their study. As Hanson, Lunetta et al. (2014) explained, the first regression coefficient (B02) “. . . represents the expected recidivism rate for a Static-99R score of 2 (p2) in logit units (ln[p2/{1-p2}])” (p. 16). Hanson, Lunetta, and colleagues (2014) also determined the relative increase in risk of each increase in score on the Static-99R using the formula eB1, where B1 is the model’s regression coefficient and the adjusted 5-year recidivism rate for a Static-99R score of 2 using the formula 1/(1 + eB02), where B02 is the intercept’s regression coefficient when scores on the measure are centered on the score of 2.
Use of the Static-99 and Static-99R With Non-Whites
The predictive accuracy of Static-99 and Static-99R needs be supported empirically with ethnically diverse samples before these measures can be used with confidence, particularly in the United States where racially and ethnically diverse correctional populations are the norm. Latinos are one of the fastest growing populations in U.S. prison and currently comprise approximately 22% of those incarcerated in state and federal prisons (Carson, 2014). The Static-99 development sample relied primarily on a Canadian offender sample, where less than 1% of the Canadian correctional population identifies as Latino (Trevethan & Rastin, 2004). Although recent Static-99R norms rely more heavily on U.S. data (Helmus, 2009), U.S. samples still made up a minority (approximately 25%) of the study samples used in development of the Static-99R norms. Most of the studies used in the re-norming were from Commonwealth and European countries, places with a very low number of Latinos. Previous authors have remarked that the Static-99 may have limited applicability to ethnically diverse people in the United States (Sreenivasan et al., 2007). Findings across the few studies that have assessed the predictive validity of Static-99 and Static-99R across different ethnicities have been inconsistent (see Table 1). Therefore, the applicability of the Static-99 measures with diverse offender populations has not been well established, particularly with respect to U.S. Latinos.
Predictive Validity of the Static-99 and Static-99R for Sex Offense Recidivism With Non-European Samples.
Note. AUC = area under the curve.
Mean score was significantly higher than European/White groups.
Mean score was significantly lower than comparison group(s).
Although the Hanson et al. (2014) reported a significant overall difference in mean scores among groups, they did not include information about post hoc analyses.
Some authors have found the Static-99 and/or Static-99R to work well with non-European samples (Babchishin, Blais, & Helmus, 2012; Hanson, Lunetta et al., 2014; Smallbone & Rallings, 2013). For example, Smallbone and Rallings (2013) reported that the Static-99 performed well with a small sample (n = 67) of indigenous Australians, but noted that the same was not the case for the Static-99R and urged more research using that measure. Babchishin et al. (2012) conducted a meta-analysis based on five studies that included indigenous Canadian participants and found that both the Static-99 and Static-99R successfully discriminated recidivists from non-recidivists with fairly strong effect sizes (AUCs of 0.698 for random effects and 0.718 for fixed effects). Hanson, Lunetta and colleagues (2014) reported similar findings with respect to African Americans in a field study of diverse (White, African American, and Latino) sex offenders in California. However, although the AUC values for Latinos were higher than 0.73 on both Static-99 measures, they did not reach statistical significance. Hanson and his colleagues noted that the low number of recidivists per group limited statistical power. Nonetheless, the Static-99R performed well with African Americans and Latinos in terms of relative risk, based on part of their calibration analyses. Using the entire sample of Whites, African Americans, and Latinos, Hanson et al.’s (2014) comparison of expected versus observed recidivism rates yielded only one significant difference, which centered on the Low-Moderate Risk group, with the observed recidivism rate for their entire sample being lower than the expected rate. In addition, regression analyses using the entire sample revealed that relative risk increased along with increments on the measure’s total score (Hanson et al., 2014). However, regression coefficients for their entire sample, as well as for Latinos in particular, were somewhat different from those obtained from pooled samples by Hanson et al. (2013). The expected recidivism rate for a Static-99R score of two was somewhat lower, but the relative risk somewhat higher. These studies provide some support for the use of the measures with indigenous populations in Canada and Australia as well as with African Americans and Latinos in California; nonetheless, Hanson, Lunetta et al. (2014) noted the potential need to validate the measures in specific localities.
However, Långström (2004) and Varela, Boccaccini, Murrie, Caperton, and Gonzalez (2013) found the measure(s) to be unable to predict sexual recidivism in their samples. Långström (2004) reported that the Static-99 performed no better than chance in predicting sexual or violent recidivism among African/Asian sexual offenders in Sweden. Varela and colleagues (2013) were the first to assess the cross-cultural validity of the Static-99 and Static-99R among U.S. ethnic minority sex offenders using a large sample (N = 1,911) from Texas. Despite having large sample size, they found the recidivism rates for African Americans and Latinos to be rather low (less than 3.2%). Moreover, even Latinos assessed by the Static-99 measures to be at high risk to reoffend sexually recidivated at a rate lower than 6%. They reported that neither measure predicted sexual recidivism for White, African American, or Latino sex offenders. Although these authors noted recidivism rates by risk category and race, they did not conduct calibration analyses. They concluded that the Static-99 and Static-99R
. . . may not work well as well with Latino offenders as White and Black offenders . . . we do not assert that the Static-99 instruments are clearly inappropriate for use with Latino offenders. Rather, we suggest only that the Static-99 instruments’ appropriateness for use with Latino sex offenders has not yet been clearly established . . . [emphasis in the original]. (p. 238)
Varela and his colleagues (2013) identified two general issues that may have accounted for the poor predictive validity of the Static-99 measures with Latinos. First, they raised concerns about possible measurement error based on the measures’ reliance on historical information, which would not be available for sex offenders who are immigrants. Rice, Boccaccini, and Varela (2013, as cited in Varela et al., 2013) found that Latino sex offenders in Texas who were born outside the United States were assigned significantly lower total scores on the Static-99 than did those born in the United States. Moreover, they noted that cultural attitudes about sex and rape may also lead to a significant under-reporting of sexual offenses for this population, which could affect the accuracy of available recidivism rates. Second, they raised the possibility that the items on the Static measures may lack functional equivalence. In other words, items that are deemed to suggest sexual deviance (e.g., the lack of a long-term live-in partner, having unrelated, stranger, or male victims) may not be indicative of sexual deviance for Latinos. Neither the exploration of potential differences in the predictive validity of the Static-99 measure between Latinos born in the United States or its territories and those born in the rest of Latin America nor differences in individual item scores have been undertaken in the literature.
Current Study
The present study focused on assessing the predictive validity of the Static-99 and Static-99R when used with Latino sex offenders. Latinos constitute the fastest growing group in the United States (Passel, Cohn, & Lopez, 2011). They comprise 12% and 29% of the total number of inmates involved in rape and sexual assault (unwanted sexual contact that does not include penetration) crimes respectively (Carson & Sabol, 2012) and have been found to be overrepresented among Level 3 sex offenders in some states, such as New York (Smith-Socaris, Perry, & Fox-Mullen, 2006). However, despite these figures, only two studies have evaluated the predictive validity of the Static-99 and Static-99R with Latino sex offenders in the United States (Hanson, Lunetta et al., 2014; Varela et al., 2013), with mixed results, and only the former included calibration analyses. Therefore, given that these instruments are integral in decision making regarding sex offender placement, supervision, and potentially post sentence civil commitment, it is essential that the predictive validity (both terms of discrimination and calibration) of the Static-99 measures with Latino sex offenders be further explored.
The main hypothesis is somewhat exploratory in nature given the contradictory findings about the Static-99 measures’ performance with Latinos reported in the literature so far. Hence, we sought to assess the predictive validity of the Static-99 and Static-99R with this sample of Latino sex offenders, both in terms of discrimination and calibration. Moreover, we aimed to explore the performance of the measure with subgroups of Latinos. Researchers have typically divided Latinos based on their country of origin and representativeness in their samples (e.g., Vaughan, Kratz, Escobar, & Middendorf, 2013, who divided their sample accordingly: Mexican, Puerto Rican, Cuban, and Central/South American and Other Latino). However, given the issues raised by Varela et al. (2013), we opted to divide our sample based on availability of historical and follow-up information. The first group was composed of participants born in the United States or its territories (i.e., Puerto Rico). The second group was composed of participants born elsewhere in Latin America (i.e., Mexico, the Caribbean, Central, and South America). Given the considerable diversity within the Latino population in the United States (Ennis, Ríos-Vargas, & Albert, 2011), one of our primary aims was also to examine within-group differences on the Static-99 and Static-99R between Latinos born in the United States or U.S. territories (i.e., Puerto Ricans and U.S.-born Latinos) and Latinos born out of the United States (i.e., those born in Mexico, Central America, or South America).
Method
Participants
Data from the archival files of 483 Latino male sex offenders who had been incarcerated in New Jersey state prisons were utilized in this study. The data were gathered as part of a larger project funded by the National Institute of Justice (NIJ; Mercado, Jeglic, & Markus, 2011). The majority of the sample (n = 368, 76.2%) was composed of sex offenders who were incarcerated in a general prison population, whereas 115 (23.8%) were incarcerated in a specialized treatment facility for sex offenders deemed to be compulsive and repetitive, but amenable to treatment.
The overall sample was divided into two groups: those born in the United States and Puerto Rico (n = 268, 55.5%) and those born elsewhere in Latin America (n = 215, 44.5%). The characteristics of the samples are presented in Table 2. The groups did not differ significantly with respect to victim age category in their index offense (child under 13, teenager/adult, or mixed), χ2(2) = 2.61, p = .271. In addition, the groups did not differ significantly from one another with respect to victim gender, χ2(1) = .36, p = .55. Of note, two participants in the U.S.-born/Puerto Rican group offended against both males and females, whereas none in the other group did. These two participants were excluded from the preceding analysis. The two groups, however, differed with respect to aspects of their criminal histories. Participants born in the United States or Puerto Rico were significantly more likely than those born elsewhere in Latin America to have past charges for sexual offenses, χ2(1) = 9.38, p = .002 (2.55 times more likely), non-sexual offenses, χ2(1) = 29.04, p < .001 (2.76 times more likely), and to have charges as juveniles, χ2(1) = 25.16, p < .001 (7.24 times more likely). Moreover, 10 participants from the U.S.-born/Puerto Rican group were charged with sexual offenses when they were juveniles, whereas none were in the other group.
Characteristics of the Sample.
p < .01. **p < .001.
Measures
Static-99
The Static-99 (Hanson & Thornton, 2000) is a widely researched and used actuarial measure of recidivism risk. The Static-99 is composed of 10 items: (a) young, (b) single, (c) index non-sexual violence, (d) prior non-sexual violence, (e) prior sex offenses, (f) prior sentencing dates (excluding index), (g) any convictions for non-contact sex offenses, (h) any unrelated victims, (i) any stranger victims, and (j) any male victims. Total scores for the Static-99 can range from 0 to 12 and converted into four risk categories: the scores of 0 and 1 as “Low” risk, 2 and 3 as “Moderate-Low” risk, 4 and 5 as “Moderate-High” risk, and 6 and above as “High” risk.
Static-99R
The only difference between the original Static-99 and the Static-99R is the way in which the age (at time of release) item is coded 1 for ages 18 to 34.9, 0 for ages 35 to 39.9, −1 for ages 40 to 59.9, and −3 for ages 60 and older (Helmus, Thornton et al., 2012). The total scores for the Static-99R range from −3 to 12. The risk category for the Static-99R is the same as that of the Static-99, with the exception of the “Low” risk category, which can include negative total scores.
Data collection tool
Demographic, criminal, and other data were gathered from the offender files as part of a larger study (Mercado et al., 2011). For the purposes of this study, only demographic information, Static-99 scores, and recidivism data were used to conduct the analyses.
Procedure
The Static-99 data were scored by clinicians (n = 378; as contained in the participants’ record) and/or by trained graduate research assistants (n = 391). The level of agreement in Static-99 scores based on rater was measured using the intraclass correlation coefficient (ICC) for the 287 cases that were rated by both clinicians and research assistants. The ICC obtained was .884, which suggests excellent agreement (Cicchetti, 1994) and is consistent with previously reported interrater reliability for the Static-99 measures (e.g., Hanson, Lunetta et al., 2014, ICC of .85 for experienced raters; Quesada, Calkins & Jeglic, 2014, ICC of .89). When both sets of scores were available, the ratings from research assistants were used. The Static-99R total was subsequently obtained by the first author by recoding the first item using the participants’ age at time of their release from custody and substituting this score for the Age item on the Static-99, according to the guidelines provided by Helmus, Thornton et al. (2012).
Recidivism data were obtained via the New Jersey State Police criminal records database for those offenders released from the New Jersey Department of Corrections (DOC) between the years 1996 and 2007 following an incarceration for a sexual offense. These records were obtained via the New Jersey State Police Computerized Criminal History System and the National Crime Information Center’s Interstate Identification Unit; thus, the recidivism data account for criminal offenses across all U.S. jurisdictions, not just New Jersey. Recidivism data were collected on the number and nature of sexual and non-sexual. The average period of follow-up was 6.14 years (SD = 2.60, range = 1.63-12.11).
In order to replicate Hanson, Lunetta et al.’s (2014) calibration analyses using a follow-up period of at least 5 years, participants in our sample with shorter periods of time in the community were excluded from those analyses. The final sample for the calibration analyses was composed of 303 participants, with an average follow-up period of 7.62 years (SD = 1.83, range = 5.0-13.11). Both U.S.-born/Puerto Rican and other Latin American participants had comparable ranges of time in the community for the purpose of follow-up analyses (n = 174, M = 7.65, SD = 1.87 vs. n = 129, M = 7.58, SD = 1.77, respectively), t(301) = .32, p = .749, ns. To compare observed recidivism rates with expected rates, we used the 5-year estimates provided by Phenix, Helmus, and Hanson (2012) for a routine sample because the majority of our participants (76.2%) came from a general prison population. The only difference in analytic strategy between the present study and Hanson, Lunetta et al.’s (2014) was the way in which logistic regression coefficients were compared. Whereas Hanson, Lunetta and colleagues (2014) used fixed-effect meta-analysis, we used the formula proposed by Paternoster, Brame, Mazerolle, and Piquero (1998) to test the equality of regression coefficients:
where z is the standardized residual of the difference between coefficients. The coefficients being compared are represented by b1 and b2, and their SE by SEb1 and SEb2. The difference between coefficients is significant if the z score is smaller than −1.96 or larger than 1.96.
Results
Scores and Level of Risk
Static-99
The mean score on the Static-99 for the overall sample was 1.86 (median = 2.0, SD = 1.40, range = 0-8). There was a significant difference on average Static-99 total score between participants born in the United States/Puerto Rico (n = 268) and non-U.S.-born (n = 215; M = 2.03, SD = 1.46 vs. M = 1.64, SD = 1.30, respectively), t(481) = 3.11 p = .002, which represented a small effect size, Cohen’s d = .28.
Static-99R
The mean score on the Static-99R for the overall sample was 1.64 (median = 2.0, SD = 1.88, range = −3-8). There was a significant difference on average Static-99R total score between participants born in the United States/Puerto Rico (n = 268) and those who were non-U.S.-born (n = 215; M = 1.81, SD = 1.91 vs. M = 1.43, SD = 1.81, respectively), t(481) = 2.23 p = .026, which represented a small effect size, Cohen’s d = .20.
Comparison of item scores
A crosstabs analysis was performed using the Static-99 items, with the addition of the age-related item of the Static-99R, to assess potential differences between the two subgroups. Of note, the prior sex offenses item, which can have a value ranging from 0 to 3, was recoded as dichotomous to conduct a χ2 analysis because only one participant in the non-U.S.-born group scored above a 1 on that item. Results are presented in Table 3. A specific analysis of the Age at the Time of Release item for the Static-99R is presented in Table 4. Significant differences between the two groups centered on the Static-99 items related to participants’ criminal history. Specifically, based on odds ratio analyses, we found that U.S.-Born Latinos/Puerto Ricans were 2.46 times more likely than non-U.S.-born Latinos to have prior non-sexual charges/convictions, more than 4 times (4.15) more likely to have prior sentencing dates, and almost 4 times (3.83) more likely to have prior sexual offenses.
Static-99 Item Analysis.
p < .01. **p < .001.
Static-99R Age at Time of Release Item Analysis.
Risk levels
With respect to assigned risk categories, we attempted to conduct chi-square analyses, but the ensuing crosstabs tables for risk categories for both Static-99 measures had expected counts less than five for more than 20% of the cells. We subsequently collapsed the risk categories into Low/Moderate-Low, and Moderate-High/High. There was no significant difference between the two groups on the collapsed Static-99 risk categories, but there was a trend, χ2(1) = 3.688, p = .055. However, there was a significant difference between the two groups with respect to the collapsed Static-99R risk categories, χ2(1) = 7.639, p = .006. Based on an odds ratio analysis, we determined that Latinos born in the United States or Puerto Rico were about twice as likely (2.05) than non-U.S.-born Latinos to fall in the Moderate-High/High collapsed risk category. Results are summarized in Tables 5 and 6.
Static-99 and Static-99R Risk Categories Based on Total Score.
Static-99 and Static-99R Collapsed Risk Categories Based on Total Score.
p < .01.***p = .055.
Predictive Validity Analysis of the Static-99 and Static-99R
Rates of sexual offense recidivism
In the overall sample of 483 sex offenders, nine participants reoffended sexually (1.9%). Of the 268 U.S.-born Latinos/Puerto Ricans, six participants recidivated sexually (2.2%) versus three of the 215 non-U.S.-born Latinos (1.4%). However, this difference was not statistically significant (Fisher’s Exact Test, p = .737).
Discrimination analyses
Receiver operating characteristic (ROC) analyses were performed to assess the ability of the Static-99 and Static-99R measures to discriminate between recidivists and non-recidivists. Analyses were conducted with the entire sample as well as for both subgroups. The magnitude of the ensuing AUC values was interpreted using the guidelines proposed by Rice and Harris (2005) for violence predictions studies (0.56 is small, 0.64 is moderate, and 0.71 is large).
For the entire sample, the AUC value for the Static-99 was moderate-large (0.683), although it was not statistically significant. In addition, the CI around the AUC value was wide, ranging from 0.483 to 0.883. However, the AUC value for the Static-99R, based on the entire sample, was large (0.719) and statistically significant. Once again, the corresponding CI around the AUC value was also wide, ranging from 0.529 to 0.909.
Results once the entire sample was divided into U.S.-born/Puerto Ricans and non-U.S.-born offenders were mixed. With respect to the U.S.-born/Puerto Rican group, the AUC values for both measures were large (0.774 for the Static-99 and 0.815 for the Static-99R) and statistically significant. Nonetheless, their corresponding CIs were also wide (0.611-0.938 for the Static-99 and 0.625-0.995 for the Static-99R). In contrast, for the non-U.S.-born group, the AUC values for both measures were small (0.469 for the Static-99 and 0.524 for the Static-99R) and not statistically significant. Moreover, their corresponding CIs were very wide (0.066-0.873 for the Static-99 and 0.191 to 0.856 for the Static-99R). Results are presented in Table 7.
Predictive Validity (Discrimination) of the Static-99 and Static-99R for Sex Offense Recidivism.
Note. AUC = area under the curve; CI = confidence interval.
Additional analyses were performed to assess whether or not group membership affected the Static-99 measures’ prediction of sexual recidivism. We conducted binary logistic regressions using each Static-99 measure as the predictor for sexual recidivism alone as well as with group membership as a covariate. The Static-99 measures predicted sexual recidivism, whereas group membership did not. Results are presented in Table 8.
Prediction of Sexual Reoffending by Static-99 Measures and Group Membership.
Note. CI = confidence interval.
Calibration analyses
Calibration analyses focused solely on the Static-99R based on the recommendation from Hanson, Lunetta et al. (2014). The sensitivity of the measure was assessed using both calibration methods proposed by Hanson, Lunetta et al. (2014): through a comparison of observed versus expected recidivism rates (for our sample) based on the literature and comparison of regression coefficients with an intercept centered on a score of two. In general, our rates of observed sexual offense recidivism increased with Static-99R risk categories, as expected. The E/O indices for the Low and Moderate-High risk categories were close to 1.0, suggesting good calibration. However, the E/O indices for the Moderate-Low and High risk categories differed from 1.0, but not significantly given that their corresponding 95% CIs contained the value of 1.0. Results are presented in Table 9.
Comparison of Expected Versus Observed Rates of Sexual Recidivism for Static-99R Risk Categories (With at Least 5 Years in the Community Post Release).
Note. E/O = expected/observed; CI = confidence interval.
The difference between the Observed and Expected is statistically significant.
We subsequently performed a binary logistic regression to assess the Static-99R’s ability to determine relative risk and compared our regression coefficients with those reported by Hanson et al. (2013) for the normative sample and by Hanson, Lunetta et al. (2014) with respect to their California Latino sample. The relationship between the centered (on the score of 2) Static-99R total scores and sexual offense recidivism fit a logistic distribution because the Hosmer–Lemeshow test was not significant, χ2(5) = 5.73, p = .333; B02 = −3.75, SE = 0.43; B1 = 0.496, SE = 0.22. The ensuing logistic equation showed a relative increase in risk of 1.64 for each increase in Static-99R score (e0.382 = 1.64) and an adjusted 5-year sexual offense recidivism rate of 2.3% for a Static-99R score of 2 (1/{1 + e-(-3.79)} = 2.3) The adjusted base rate for our sample was not significantly different from that reported by Hanson et al. (2013), B02 of −3.75 versus −2.941, z = −1.64, and was very similar to the one reported for Latinos in California by Hanson et al. (2014), B02 of −3.75 versus −3.953, z = 0.47. With respect to the change in relative risk, our findings were consistent to those reported both by Hanson et al. (2013; B1 of 0.496 vs. 0.331, z = 0.74) and Hanson, Lunetta et al. (2014) for Latinos (B1 of 0.496 vs. 0.383, z = 0.51).
Discussion
The main purpose of the current study was to assess the utility of the Static-99 measures for the assessment of risk among Latino sex offenders. First, we sought to evaluate the predictive validity of Static-99 and Static-99R with a sample of Latino sex offenders as a whole; then, we conducted separate analyses for those who were U.S.-born/Puerto Rican and those who were non-U.S.-born. Predictive validity was assessed both in terms of discrimination and calibration properties.
Overall, we found that the Latino sex offenders in the present study received fairly low average scores on the Static-99 and Static-99R when compared with the general sex offender literature (e.g., Helmus, 2009; Helmus, Hanson, Thornton, Babchishin, & Harris 2012). However, our findings were consistent with, but slightly lower than, other studies involving Latino sex offenders in Texas (Varela et al., 2013) and California (Hanson, Lunetta et al., 2014). The scores we obtained for our sample could be influenced by a number of factors. It is very possible that a number of Latino (except Puerto Rican) participants immigrated into the United States as adults from Mexico and other Latin American countries given the high influx of immigrants from those regions (Camarota, 2012). Participants’ immigrant status could influence the amount of information available about them when scoring the Static-99 measures. Varela and colleagues (2013) raised concerns regarding measurement error of this type, which arises from the lack of documented criminal histories for immigrant sex offenders. Our findings corroborate those concerns. Moreover, even if immigrant sex offenders were forthcoming about their criminal history if interviewed, they may not have considered certain types of behaviors crimes, such as sexual contact between a young adult and a teenager, if there are no laws prohibiting that type of contact in their countries of origin or if those laws are not enforced. Consequently, measures that contain items about criminal history, such as the Static-99 and Static-99R, could provide somewhat depressed total scores.
The above appears to be a possibility for our sample, as evidenced by the fact that U.S.-born/Puerto Rican sample was 2½ times more likely than non-U.S.-born Latinos to get a score of one on the prior non-sexual charges/convictions item, more than 4 times more likely on the prior sentencing dates item, and almost 4 times more likely on the prior sexual offenses item. However, both groups received similar scores for the remaining items. These findings support the theory proposed by Varela and colleagues (2013) that immigrant status affects the accuracy of scores on the Static-99 measures.
Our analysis of risk-level assignment on both Static-99 measures was affected by the fact that very few participants (four based on each measure) were assessed to fall in the High Risk category. Once we collapsed risk categories into two (Low/Low-Moderate and a Moderate-High/High), we found a strong trend that suggests U.S.-born/Puerto Ricans are more likely to fall in the Moderate-High/High risk category based on Static-99 total score as compared with non-U.S.-born participants. However, we found that the U.S.-born/Puerto Rican group was twice as likely than the non-U.S.-born group to fall in the same higher risk category, based on Static-99R total score. However, the fact that changes in risk-level assignment from the Static-99 to the Static-99R are solely based on the change in the item pertaining to age suggests that differences in the age category between U.S.-born and non-U.S.-born participants also affect differences in the assessment of risk between these two groups.
In addition to receiving lower scores on the Static-99 measures than normative samples, the sex offenders in our sample had significantly lower rates of documented sexual offense recidivism (based on convictions) than those reported in the literature based on meta analyses (Helmus, 2009). Low sexual offense recidivism rates have also been reported for Latino sex offenders by Varela et al. (2013) and Hanson, Lunetta et al. (2014). Moreover, we found that U.S.-born/Puerto Rican and other Latin American sex offenders (i.e., non-U.S.-born) in our sample did not differ significantly from one another with regard to their respective sexual recidivism rates.
Although lower documented recidivism rates for sexual offenses could be expected of immigrant sex offenders who would be likely to face deportation on the completion of a prison sentence, there are a number of other potential explanations for the low rates found for our overall sample. Varela and his colleagues (2013) have argued that if Latinos follow similar patterns of sexual offending as Whites and African Americans, a substantial number of them would have Latino/a victims (i.e., offend against individuals for their same ethnicity). However, reporting of sexual offenses may be depressed among Latinas in the United States based on negative beliefs about victims and endorsement of rape myths in this population (Jimenez & Abreu, 2003). In addition, cultural taboos around talking about sex and rape, the culture’s focus on the family over the individual, and the lack of available community resources could prevent victims from reporting abuse (Jimenez & Abreu, 2003). In addition, Rennison (2010) identified fears related to immigration status and language barriers as obstacles to reporting sexual victimization among Latinas.
With regard to the ability of the original Static-99 to discriminate between sexual recidivists and non-recidivists, our findings were mixed. Although its performance with the entire sample did not reach formal statistical significance, the AUC value was consistent with that found in previous studies (e.g., Brouillette-Alarie & Proulx, 2012). It is striking that the measure was able to predict recidivism for the U.S.-born/Puerto Rican sample fairly well despite the very low number of recidivists in our sample. However, its performance was poor when applied to the non-U.S.-born Latino sample. In addition, the Static-99’s predictive validity with the entire sample may have been hindered by the lack of accurate historical information for the non-U.S.-born Latino sample, as well as other related factors, such as age at time of immigration into the United States, and the extent of travel between the United States and country of origin (transmigration).
We found a somewhat similar pattern with the Static-99R. However, this newer measure was able to discriminate recidivists from non-recidivists for the entire sample as well as the U.S.-born/Puerto Rican group. The AUCs for the Static-99R were large for the entire sample and the U.S.-Born/Puerto Rican group. However, the Static-99R was also not able to predict recidivism for the non-U.S.-born group, likely for the same reasons as the original measure.
The calibration analyses of the Static-99R were generally consistent with the existing literature (Hanson et al., 2013; Hanson, Lunetta et al., 2014). However, the total observed rate of recidivism was significantly lower than expected when compared with the norms provided by Phenix et al. (2012). The rest of our findings were similar to those reported by Hanson et al. (2013) based on a large sample of U.S., Canadian, and West European sex offenders, as well as findings pertaining specifically to a sample of Latino sex offenders in California reported by Hanson, Lunetta et al. (2014). Although our sample sizes for each risk level were small, as was the number of recidivists in each category, the recidivism rates increased consistently along risk levels, with the exception of the Low-Moderate risk category, which was also true in Hanson, Lunetta et al.’s (2014) study.
Moreover, our assessment of relative risk using logistic regression yielded similar results to those based on a normative sample (Hanson et al., 2013) and on a Latino sample (Hanson, Lunetta et al., 2014), both with respect to the adjusted base rate and adjusted 5-year expected recidivism rate regression coefficients. Of note, our adjusted base rate coefficient did not differ significantly from Hanson et al.’s (2013) when the standardized residual of the difference was examined, whereas Hanson, Lunetta et al. (2014) found a significant difference between their coefficient and Hanson et al.’s (2013) using meta-analytic methods, which are more sensitive. In all, our findings provide further evidence of the predictive validity of the Static-99R with respect to the assessment of offenders’ relative risk.
Our findings provide some support for the use of the Static-99 measures with Latinos. Overall, the measures performed relatively well both in terms of discrimination and, in the case of the Static-99R, calibration, when applied to our entire sample of Latino sex offenders. Although the measures performed poorly with Latinos born outside the United States or Puerto Rico, more research is needed to come to stronger conclusions with respect to the utility of the measures with those individuals.
Limitations
Given the archival nature of this study, we were able to discern neither participants’ age at the time of immigration to the United States nor their history of sexual aggression in their countries of origin. This lack of historical information affects the scores offenders obtained, as well as their assignment of level of risk on the Static-99 measures, and likely other actuarial measures that require such data. In addition, obtaining effective post-release follow-up information for Latinos can be rather difficult given the group’s tendency to travel back and forth between the United States and their country of origin (e.g., Guarnizo, 1997; Massey, Goldring, & Durand, 1994). It is very possible that a number of individuals in our overall sample, even those born in the United States of Latino heritage, may have left the United States subsequent to their release from prison, which would render effective follow-up difficult (if the person went to Puerto Rico, which is American territory) or practically impossible. In sum, problems obtaining accurate historical and follow-up information may have affected the measures’ performance with non-U.S.-born Latinos.
Another limitation is the lack of data about immigration status. Some of the individuals in the foreign-born sample may not have legal status in the United States, although others are likely visa holders, permanent residents, or citizens. Those who arrived to the United States legally would have already been vetted through a selective immigration process, one that would likely deny entry for individuals with any kind of documented criminal history. Thus, at least some portion of our non-U.S.-born sample is likely to have, by nature of their having gained legal entry into the United States, little to no criminal history or other serious red flags in their background.
Finally, although we found the Static-99 measures to perform well with U.S.-born and Puerto Rican offenders, our observed recidivism rates were rather low. Some of the calibration analyses for the Static-99R were affected by these low base rates. Although not necessarily a limitation of the methods used in this study, it is certainly an issue that affected our assessment of the sensitivity of the measure.
Conclusion and Future Directions
Our findings underscore the need for multicultural research in the sex offender field, particularly with respect to the assessment of recidivism risk. Moreover, they provide further evidence that different ethnic groups may vary considerably in their rate of sexual reoffending. In addition to recent studies by Varela et al. (2013) and Hanson, Lunetta et al. (2014), our findings suggest that Latinos’ reoffending rates are rather low. These findings could be considered when Latino sex offenders are assessed post release to determine at what level they should be supervised (e.g., by parole or sex offender registry boards). However, more accurate assessment of recidivism is needed. Although it may be impossible to obtain follow-up information from foreign official sources, any assessment of an immigrant sex offender, or those with closer ties with a foreign territory or county (e.g., Puerto Ricans and Mexican Americans who may travel back and forth to the United States), should include questions about potential sexually aggressive behavior in which they may have engaged while outside the United States. In addition, researchers may wish to consider grouping samples of immigrant Latinos based on their age of immigration into the United States, because those who were younger (i.e., prepubescent) at time of entry into the United States may be more culturally similar to Latinos born in the United States, and historical information about them may be more readily available.
Although our findings suggest that the Static-99 measures exhibit predictive validity with Latinos of U.S. or Puerto Rican origin, solid conclusions about its cross-cultural validity would be premature given the impact low base rates had on the statistical power of our analyses. In addition, although our discrimination and calibration analyses suggest that the Static-99R in particular may be useful in the assessment of risk among U.S.-born Latinos, our findings should be considered within the context of the low base rate of recidivism.
It is clear that additional research is needed to provide more extensive evidence of the predictive validity of the Static-99R, both in terms of discrimination and calibration, with minority groups in the United States, as well as its cross-cultural validity with immigrants. Although Hanson, Lunetta et al. (2014) suggested using a localized approach to the validation of the measure, the pooling of resources and data to conduct validation analyses with more ample sample sizes could also be considered.
Finally, if Latinos indeed reoffend at such low rates, it would be valuable to determine what factors may contribute to their desistance from sex offending. Although a number of potential factors have been proposed with regard to sex offenders in general by Ward and Laws (2010), potential cultural factors that may predict desistance for specific ethnic or cultural groups should also be explored.
Footnotes
Acknowledgements
The authors thank the New Jersey Department of Corrections and New Jersey Department of Human Services for their support in providing access to files.
Authors’ Note
The opinions, findings, conclusions, and recommendations expressed are those of the authors and do not necessarily reflect those of the institutions that supported this research.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported in part by Grant NIJ 2007-IJ-CX-0037 from the National Institute of Justice, Office of Justice Programs, U.S. Department of Justice.
