Abstract
Although Aboriginal offenders are overrepresented in Canadian prisons, there is limited research examining the extent to which commonly used risk factors and risk scales are applicable to Aboriginals. Aboriginal (n = 88) and non-Aboriginal (n = 509) sex offenders on community supervision were compared on the dynamic risk factors of STABLE-2007. Data on sexual, violent, any crime, and any recidivism (including breaches) were collected with an average follow-up of 3.4 years. Aboriginal offenders scored significantly higher than non-Aboriginal offenders on STABLE-2007 total scores and on several items measuring general criminality. STABLE-2007 did not significantly predict recidivism with Aboriginal offenders (although it did for non-Aboriginals). The general antisociality items were generally significantly less predictive for Aboriginals than non-Aboriginals, whereas items assessing sexual self-regulation and relationship stability predicted similarly for both groups. These exploratory results suggest that Aboriginal sex offenders are a higher-needs group but that some STABLE-2007 items are not predictive with this population.
Aboriginal offenders are overrepresented in the Canadian criminal justice system, despite the attempts of both the government and the Supreme Court to address this serious problem (Canadian Criminal Code § 718.2(e); R. v. Gladue, 1999). Currently, Aboriginals represent 3.7% of the adult male population (Statistics Canada, 2008) but 19.2% of the incarcerated male population (Public Safety Canada, 2009). Compared with non-Aboriginal offenders, Aboriginal offenders are also more likely to be incarcerated for violent offences (Trevethan, Moore, & Rastin, 2002) and to be sent to maximum security institutions (Public Safety Canada, 2009). Despite this overrepresentation, there has been insufficient research comparing Aboriginal and non-Aboriginal offenders on risk factors and examining the extent to which assessment, treatment, and management strategies are equally effective for both groups.
Risk Assessment
Most decisions throughout an offender’s progression in the criminal justice system involve risk assessment, including sentencing, security classification, parole decisions, treatment needs, and supervision intensity. Many of the most commonly used risk assessment instruments focus on static risk factors that are historical and unchanging. Dynamic risk factors are features related to recidivism, which can change, and when changed, should alter the likelihood of recidivism (Andrews & Bonta, 2006). For general offenders, empirically supported dynamic risk factors include antisocial personality pattern, procriminal attitudes, procriminal associates, work/school problems, family/marital problems, poor use of leisure/recreation time, and substance abuse (along with the static factor of criminal history, these are referred to as the “Central 8” risk factors; Andrews & Bonta, 2006). Important dynamic risk factors for sex offenders include the “Central 8” as well as deviant sexual interests, sexual preoccupations (Hanson & Morton-Bourgon, 2005), intimacy deficits, and emotional congruence with children (Mann, Hanson, & Thornton, 2010).
Dynamic risk factors have been found to add incremental predictive validity to static risk factors (e.g., Allan, Grace, Rutherford, & Hudson, 2007; Olver, Wong, Nicholaichuk, & Gordon, 2007). In addition, a recent meta-analysis found that dynamic risk scales predicted sexual recidivism with greater predictive accuracy than static scales (Hanson & Morton-Bourgon, 2009). Although there are several ways to identify and combine dynamic items into a global structured risk assessment (for a review, see Hanson & Morton-Bourgon, 2009), this article will focus on an actuarial risk assessment scale (i.e., STABLE-2007). Actuarial scales involve explicit rules to combine prespecified items into total scores, and they include empirically derived estimates of recidivism probability linked to each total score (Meehl, 1954).
Risk Assessment With Aboriginal Offenders
Although there is ample empirical support for static and dynamic risk factors and risk assessment scales, virtually all of this research is conducted with samples of primarily Caucasian offenders. Given the large number of Aboriginal offenders in the Canadian correctional system, knowing whether risk scales developed and validated on primarily non-Aboriginal offenders can generalize to Aboriginal offenders is imperative.
There is considerable debate regarding risk assessment with Aboriginal offenders, with some critics arguing that current assessment scales fail to consider important differences between Aboriginal and non-Aboriginal offenders and the possibility of unique risk factors for Aboriginal offenders (Maynard, Coebergh, Anstiss, Bakker, & Huriwai, 1999; Webb, 2003). These individuals advocate the development of risk scales specific to Aboriginal offenders.
The assertion of differences between Aboriginal and non-Aboriginal offenders is empirically well supported. On average, Aboriginal offenders are younger, less educated, and have higher rates of unemployment compared with non-Aboriginal offenders (Statistics Canada, 2006). In addition, they have lengthier criminal histories (Dell & Boe, 2000; Holsinger, Lowenkamp, & Latessa, 2003) and report more negative childhood histories (Johnston, 1997; Trevethan, Auger, Moore, MacDonald, & Sinclair, 2002). In adulthood, Aboriginal offenders are also slightly more likely to be single and are rated as higher need in the domain of family and/or marital problems (Trevethan, Moore, & Rastin, 2002). Not surprisingly based on the above-noted differences, Aboriginal offenders also have higher recidivism rates than non-Aboriginal offenders (Bonta, Rugge, & Dauvergne, 2003; Sioui & Thibault, 2002).
The fact that Aboriginal offenders exhibit more risk factors than non-Aboriginal offenders does not mean that risk factors predict differently between these groups. Although limited, research conducted thus far has found that history of antisocial behaviour, antisocial attitudes, and antisocial peers are equally predictive of future criminal behaviour for Aboriginal and non-Aboriginal offenders and that substance abuse may be a better predictor for Aboriginals compared with non-Aboriginal offenders (Bonta, 1989; Bonta, LaPrairie, & Wallace-Capretta, 1997; British Columbia Public Safety and Solicitor General, 2004). Psychopathy has also been found to predict recidivism similarly for Aboriginal and non-Aboriginal offenders (Boer, Couture, Geddes, & Ritchie, 2004; Schmidt, McKinnon, Chattha, & Brownlee, 2006). However, family/marital problems and school/work problems were not found to predict recidivism with Aboriginal offenders (Bonta, 1989; Bonta et al., 1997).
In terms of the predictive validity of risk assessment tools, several studies have demonstrated that scales such as the Level of Service Inventory (LSI) and its subsequent adaptations predict recidivism equally well for Aboriginal and non-Aboriginal general offenders (Bonta, 1989; Brews, Wormith, & Guzzo, 2009; Gossner & Wormith, 2007; Gross & Sroga, 2008; Tanasichuk, Wormith, & Guzzo, 2009). Similarly, positive results have been reported for the Statistical Information on Recidivism Scale (SIR; Bonta & Rugge, 2004; Hann & Harman, 1993; Nafekh & Motiuk, 2002). However, it is important to note that these findings have not always been replicated outside Canada. In a study in the United States, the LSI:R did not predict recidivism for Native American general offenders (Holsinger, Lowenkamp, & Latessa, 2006).
In general, given the limited research on Aboriginal offenders, it is not surprising to also discover few studies concerning Aboriginal sex offenders. In terms of social history, Ellerby and MacPherson (2002) found that when compared with non-Aboriginal sex offenders, Aboriginal sex offenders were more likely to have had an unstable upbringing, including separation from parents, exposure to general criminal activity, domestic abuse, and higher rates of neglect and sexual abuse. Also, consistent with the literature on general Aboriginal offenders, Aboriginal sex offenders had lower educational achievement, higher rates of unemployment, and greater rates of substance abuse than non-Aboriginal sex offenders. However, interestingly, Aboriginal sex offenders had lower levels of sexual deviance (e.g., coercive sexual fantasies, paraphilias) compared with non-Aboriginal sex offenders (Ellerby & MacPherson, 2002).
Regarding the types of offences committed, Aboriginals are more likely to have female victims (Ellerby & MacPherson, 2002; Nahanee, 1996; Rastin & Johnson, 2002; Rojas & Gretton, 2007) within their own ethnicity (Ellerby & MacPherson, 2002; Nahanee, 1996) and to abuse substances during the commission of the offence (Ellerby & MacPherson, 2002; Nahanee, 1996; Rastin & Johnson, 2002; Rojas & Gretton, 2007). Aboriginal offenders are also less likely to target younger victims (e.g., prepubescent) and victims with whom they hold a position of trust (Ellerby & MacPherson, 2002).
In addition, Aboriginal sex offenders show higher rates of sexual recidivism (Rastin & Johnson, 2002; Rojas & Gretton, 2007; Williams, Vallée, & Staubi, 1997), violent recidivism (Rojas & Gretton, 2007), and general recidivism (Rastin & Johnson, 2002; Rojas & Gretton, 2007) compared with non-Aboriginal sex offenders. The only study we identified that examined possible differences in the predictive accuracy of risk factors or a risk assessment scale found that Static-99 predicted recidivism comparably for Aboriginal and non-Aboriginal sex offenders (area under the receiver operating characteristic curve [AUC] = .67 for both groups; Nicholaichuk, 2001).
Purpose of Current Study
Given that Aboriginal offenders represent nearly 20% of the Canadian prison population, it is surprising that there is not more research examining predictors of recidivism for Aboriginal sex offenders. The purpose of this exploratory study was twofold: (a) examine whether Aboriginal sex offenders have different dynamic risk profiles than non-Aboriginal sex offenders and (b) examine whether the STABLE-2007 scale and its individual items predict sexual, violent, any crime, and any recidivism differently for Aboriginal and non-Aboriginal sex offenders. The hypotheses were that Aboriginal offenders would be significantly at higher risk than non-Aboriginal offenders on STABLE-2007 but that the items and risk scale would predict recidivism similarly for both groups.
Method
Participants
The sample for this study is from the Dynamic Supervision Project (Hanson, Harris, Scott, & Helmus, 2007). More detailed information on the study methodology is available in the original report, although it should be noted that the current analyses included only the Canadian offenders in this project and those who had at least one STABLE-2007 assessment. Also excluded were 4 female offenders and 12 offenders with less than 1 year of follow-up information. All offenders in this project were adult sex offenders starting a period of community supervision (e.g., probation or parole) between 2001 and 2005. In total, 597 offenders were included in this study; 88 (14.7%) were Aboriginal sex offenders and 509 (85.3%) were non-Aboriginal sex offenders. Offenders were identified as Aboriginal by the supervising officers. No explicit criteria were listed to define Aboriginal, and no information is available on the offender’s band or nation. The number of Aboriginal offenders living on reserves is unknown, although they would likely be a minority given that most of the supervising officers were located in major urban areas.
The offenders were supervised in all provinces and territories across Canada. Virtually all offenders were in the provincial/territorial correctional systems; only 13 offenders were supervised by the federal prison system (Correctional Service of Canada [CSC]). Similar to the general Canadian population, the proportion of Aboriginal offenders varied considerably across regions, with the highest proportions in the Prairie provinces and Northern Territories.
Measures
STABLE-2007
An earlier version of the dynamic risk scale, STABLE-2000, was developed for this project (see Hanson et al., 2007). However, based on the data received, three items were dropped from the scale (measuring attitudes tolerant of sexual offending), and the coding rules for three additional items were modified slightly (capacity for relationship stability, emotional identification with children, and deviant sexual interests). The revised version of the scale was called STABLE-2007. STABLE-2007 includes 13 items assessed using a three-point rating scale (from 0 to 2). The items are organized into five sections: capacity for relationship stability (one item), intimacy deficits (five items), general self-regulation (three items), sexual self-regulation (three items), and cooperation with supervision. Total STABLE-2007 scores are obtained by summing the items. The maximum possible score for child molesters is 26 and for rapists is 24 (because the emotional identification with children item is not scored for rapists). Offenders with scores of 0-3 are considered low risk/need, 4-11 are considered moderate risk/need, and 12 and higher are considered high risk/need.
Procedure
Data were collected as part of the routine supervision practices of the officers participating in the project. The STABLE-2007 assessment was to be completed within the first 3 months of supervision, and then every 6 months thereafter. Only the first STABLE assessment will be analyzed in this study because of the small number of offenders for which multiple assessments were received. Because STABLE-2007 was developed at the end of the project, the officers were actually trained on and submitted STABLE-2000 scores. STABLE-2007 scores were calculated from SPSS syntax using the original STABLE-2000 items, victim information submitted by the officers, and from an item on the Static-99 scale (ever lived with a lover for at least 2 years). Supervising officers attended a 2-day training session, although in rare cases, officers submitted data who had been trained by apprenticing with other local officers. Formal interrater reliability information is not available, but scores on training exercises and selective site visits suggested reasonable understanding and application of the scoring rules (for more information, see Hanson et al., 2007).
Recidivism
Information concerning new offences was gathered from reviews of provincial and national criminal history records, as well as from supervising officers and local police jurisdictions. In addition, one recidivist was identified from a newspaper article. National Canadian Police Information Centre (CPIC) records maintained by the Royal Canadian Mounted Police (RCMP) were received in August 2005 and June 2006. Provincial records were received from the following jurisdictions: British Columbia (January 2006), Manitoba (April 2005), and Ontario (December 2005). The Offender Management System of CSC was checked in May 2005 for recidivism information for the CSC offenders registered in the project.
Each recidivism source contained at least some unique information. Part of this can be attributed to time delays. For example, CPIC records only include information after charges have been formally processed (e.g., resulted in conviction or dismissal), whereas the other sources could contain information on charges in progress. In addition, some of the unique information is a reflection of the unreliability of the official criminal records. However, note that all the recidivism sources would be expected to underestimate the true rate of recidivism.
Information on the date and circumstances of new offences was obtained to classify them into one of the four recidivism categories (defined below). Recidivism was considered to have occurred if the agency reporting the information believed that the offence occurred. However, for breaches, an official record of parole revocation or a new conviction for violation of conditional release was required. Given that criminal history records were the major source of recidivism information, the majority of recidivism events were linked to an officially recorded charge or conviction.
Four types of recidivism were recorded. The first category was sexual recidivism, which included all crimes with a sexual motivation, whether or not the name of the offence was explicitly sexual. Violent recidivism was defined as all crimes that involved direct confrontation with the victim and included sexual recidivism. The third category was any criminal recidivism, which included all crimes but excluded breaches. The final category, any recidivism, included all crimes (sexual, violent, and nonviolent) as well as breaches. For breaches, if the offence description indicated that a crime had occurred, the incident was coded as a crime.
The follow-up period was calculated from the date that the first assessment information was collected to the date of the last recidivism information received (or until death or deportation). For the few cases that did not appear on any official record, the follow-up end date was set 1 month after the last assessment information was received. The offender start dates ranged from January 2001 to October 2005, with an average follow-up time of 3.4 years (SD = 1.0; range, 1.0–5.4 years).
Overview of Analyses
The AUC was used to examine whether Aboriginal offenders scored differently on risk factors than non-Aboriginal offenders and to examine the predictive accuracy of STABLE-2007. Cox regression was used to test whether predictive accuracy differed for Aboriginal and non-Aboriginal offenders. AUC is an effect size statistic for dichotomous outcome variables (Swets, Dawes, & Monahan, 2000) and can vary between 0 and 1, with .50 indicating the level of prediction that would be expected by chance. An AUC value of less than .50 indicates negative predictive accuracy (i.e., low scores on a risk factor are associated with higher recidivism rates). The AUC values between .50 and 1 indicate prediction exceeding chance levels, with higher numbers indicating greater accuracy. Also worth noting is that AUCs only examine relative predictive accuracy (e.g., are high-risk offenders more likely to reoffend than low-risk offenders?); they do not provide information about absolute recidivism rates.
As a rough heuristic, an AUC of .56 corresponds to a small effect size, whereas .64 reflects a moderate effect and .71 reflects a large effect size, as these values correspond to Cohen’s ds of 0.2, 0.5, and 0.8, when certain assumptions are satisfied (see Rice & Harris, 2005). An AUC value is statistically significant if the 95% confidence interval (CI) does not include .50. Given the small sample size for Aboriginal offenders, the AUC CIs tended to be extremely wide, indicating low power to detect statistically significant predictive accuracy, particularly compared with non-Aboriginal offenders. As such, our interpretation of the AUC results focused on magnitude and not on significance.
Although the AUCs are most commonly used to predict dichotomous outcome variables (e.g., recidivism), they can also be used to examine differences on a dichotomous grouping variable (e.g., do risk scores predict Aboriginal status?). Group differences could also be examined using t tests, but t tests assume interval data, and the risk scales are technically ordinal. In contrast, the AUCs are appropriate for ordinal data and provide a slightly more conservative test.
Cox regression analysis (Allison, 1984) was used to test whether the predictive accuracy of a STABLE-2007 item or total score was significantly different between Aboriginal and non-Aboriginal sex offenders. Cox regression estimates relative risk ratios (hazard rates) associated with one or more predictor variables from survival data with unequal follow-up times. To test for differences in predictive accuracy, Cox regression models were calculated using the predictor variable, Aboriginal status, and an interaction term for Aboriginal status and the predictor. A significant interaction means that the predictive accuracy of the item differs between Aboriginal and non-Aboriginal offenders. These analyses should be considered exploratory given that the number of tests conducted increases the Type I error rate.
In the Results section, only the test of the interaction was presented (Wald test). Although Wald values are always more than 0, we added positive and negative signs to indicate the direction of the findings, with positive values denoting higher predictive accuracy for Aboriginal offenders compared with non-Aboriginal offenders and negative signs denoting lower predictive accuracy for Aboriginals. The regression coefficients for the predictor variables were not reported because they are difficult to interpret in the presence of an interaction term, and they do not answer the primary research questions because they are combined for Aboriginal and non-Aboriginal offenders. Readers interested in the predictive accuracy of these items for the combined sample are encouraged to refer to the original report (Hanson et al., 2007).
Given variability in crime rates across Canada (Public Safety Canada, 2009), geographical region was entered as a strata variable, which allows Cox regression to estimate separate baseline hazard functions (i.e., recidivism rates) for each value of the stratified variable (in other words, it controls for regional differences in recidivism rates). This provides a more conservative test because the proportion of Aboriginal offenders differs across regions, so removing regional variability may also remove some variability due to Aboriginal status. To determine whether this test was too conservative, all analyses were run both with and without geographic region as a strata variable. Largely similar results were obtained, with a few additional significant findings without the strata variable. Given that the number of analyses increases the Type I error rate, only the more conservative analyses (with region as a strata variable) were reported.
Results
Descriptive Information
Of the 88 Aboriginal offenders, 10 reoffended sexually (11.4%), 28 reoffended violently (31.8%), 36 committed any new crime (40.9%), and 45 recidivated in any way, including breaches (51.1%). Of the 509 non-Aboriginal offenders, 37 reoffended sexually (7.3%), 64 reoffended violently (12.6%), 92 committed any new crime (18.1%), and 118 recidivated in any way, including breaches (23.2%).
Aboriginal offenders were significantly younger (M = 36.2, SD = 10.5) than non-Aboriginal offenders (M = 41.8, SD = 13.9; t (146) = 4.42, p < .001). Additional descriptive information for the two groups is presented in Table 1. There were significant differences between groups in offender type (x2 = 11.06, df = 4, p = .026), with Aboriginal offenders more likely to be rapists or mixed offenders (committing offences against both adults and children) and less likely to have only noncontact offences. The proportion of child molesters (incest or extrafamilial) was similar in both groups. There were also significant differences between groups in the most serious victim injury (χ2 = 9.13, df = 3, p = .028), with Aboriginal offenders more likely to cause victim injury than non-Aboriginal offenders (18.2% vs. 10.4%, respectively) and less likely to commit noncontact offences (3.4% vs. 11.4%, respectively). The proportion of offenders with intellectual impairments or major mental disorders was similar for Aboriginal and non-Aboriginal offenders; approximately 5% of offenders were developmentally delayed, and approximately 9% had previously been hospitalized for a major mental disorder.
Descriptive Information for Sample
Group Comparisons on STABLE-2007
Table 2 presents the means and standard deviations of Aboriginal and non-Aboriginal offenders on STABLE-2007 (items and total scores), and the results of the AUC analyses comparing the two groups. Means could range from 0 to 2 but most were less than one, indicating that the risk factors were generally low in both groups. The AUC values greater than .50 indicate that Aboriginals scored higher than non-Aboriginals. There were no significant differences between Aboriginal and non-Aboriginal offenders in their scores on significant social influences, emotional identification with children, nor for any of the items in the sexual self-regulation section. In the intimacy deficits section, Aboriginal offenders scored significantly higher on lack of concern for others. In the general self-regulation section, compared with non-Aboriginal offenders, Aboriginal offenders scored significantly higher on impulsive acts and poor cognitive problem solving. Aboriginal offenders also had significantly more problems cooperating with supervision than non-Aboriginal offenders. STABLE-2007 total scores were significantly higher among Aboriginal offenders compared with non-Aboriginal offenders, although the average score for both groups was within the moderate risk category.
Comparing Aboriginal (n = 88) and Non-Aboriginal (n = 509) Offenders on STABLE-2007 Item and Total Scores
Predictive Accuracy for STABLE-2007
Sexual recidivism
Table 3 presents the AUC values for the predictive accuracy of STABLE-2007 items and total scores as well as the Wald test from the Cox regression analysis used to examine group differences in predictive accuracy. The AUCs were higher for Aboriginals for only 2 of the 13 items (capacity for relationship stability and sex as coping). For Aboriginal offenders, the AUC values for the items ranged from .42 to .65, with the median effect size at chance levels (AUC = .48). For non-Aboriginals, the AUC values ranged from .51 to .70, with a median of .60. From the Cox regression analyses, four items had significantly lower predictive accuracy for Aboriginal offenders (hostility towards women, lack of concern for others, impulsive acts, and poor cognitive problem solving). For STABLE-2007 total scores, the predictive accuracy appeared lower for Aboriginal offenders (AUC = .529) compared with non-Aboriginal offenders (AUC = .701), but the difference was not significant (Wald = −2.78, df = 1, p = .096).
Predictive Accuracy of STABLE-2007 Items and Total Scores for Sexual Recidivism
Interactions between Aboriginal status and STABLE-2007 items were tested using Cox regression, with 1 degree of freedom for each Wald test and using jurisdiction as a strata variable. Although Wald values are always more than 0, we added positive and negative signs to indicate the direction of the findings, with positive values denoting higher predictive accuracy for Aboriginal offenders compared with non-Aboriginal offenders and negative signs denoting lower predictive accuracy for Aboriginals.
p < .05.
Violent recidivism
Table 4 presents the AUC and Cox regression results. The AUC values were consistently lower for Aboriginal offenders compared with non-Aboriginal offenders, with only 2 of 13 items favouring Aboriginals (capacity for relationship stability and emotional identification with children). For Aboriginals, the AUC values for the items ranged from .48 to .59, with a median of .51. For non-Aboriginals, the AUC values ranged from .49 to .74, with a median of .61. Cox regression analyses identified five items with significantly lower predictive accuracy for Aboriginal offenders (hostility towards women, lack of concern for others, impulsive acts, poor cognitive problem solving, and cooperation with supervision). In addition, STABLE-2007 total scores had significantly lower predictive accuracy for Aboriginal offenders (Wald test = −5.24, df = 1, p = .022).
Predictive Accuracy of STABLE-2007 Items and Total Scores for Violent Recidivism
Interactions between Aboriginal status and STABLE-2007 items were tested using Cox regression, with 1 degree of freedom for each Wald test and using jurisdiction as a strata variable. Although Wald values are always more than 0, we added positive and negative signs to indicate the direction of the findings, with positive values denoting higher predictive accuracy for Aboriginal offenders compared with non-Aboriginal offenders, and negative signs denoting lower predictive accuracy for Aboriginals.
p < .05. **p < .01. ***p ≤ .001.
Any crime
Table 5 presents the AUC and Cox regression results. Of the 13 item comparisons, the AUCs were higher for Aboriginal group for only 3 items (significant social influences, emotional identification with children, and deviant sexual interests). For Aboriginals, the AUC values for the items ranged from .47 to .61, with a median of .53. For non-Aboriginals, the AUC values ranged from .48 to .72, with a median of .61. Five items had significantly lower predictive accuracy for Aboriginal offenders (general social rejection/loneliness, lack of concern for others, impulsive acts, poor cognitive problem solving, and cooperation with supervision). For STABLE-2007 total scores, the predictive accuracy appeared lower for Aboriginal offenders (AUC = .581) compared with non-Aboriginals (AUC = .706), but the difference was not significant (Wald = −2.45, df = 1, p = .117).
Predictive Accuracy of STABLE-2007 Items and Total Scores for Any New Crime
Interactions between Aboriginal status and STABLE-2007 items were tested using Cox regression, with 1 degree of freedom for each Wald test and using jurisdiction as a strata variable. Although Wald values are always more than 0, we added positive and negative signs to indicate the direction of the findings, with positive values denoting higher predictive accuracy for Aboriginal offenders compared with non-Aboriginal offenders and negative signs denoting lower predictive accuracy for Aboriginals.
p < .05. **p ≤ .01.
Any recidivism (including breaches)
Table 6 presents the AUC and Cox regression results. Only two AUC item comparisons favoured Aboriginal offenders (significant social influences and emotional identification with children). For Aboriginals, the AUC values for the items ranged from .47 to .63, with a median of .53, with two items demonstrating significant predictive accuracy (significant social influences and impulsive acts). For non-Aboriginals, the AUC values ranged from .51 to .73, with a median of .62. Six items had significantly lower predictive accuracy for non-Aboriginal offenders (hostility towards women, general social rejection/loneliness, lack of concern for others, impulsive acts, poor cognitive problem solving, and cooperation with supervision). In addition, STABLE-2007 total scores had significantly lower predictive accuracy for Aboriginals (Wald = −4.26, df = 1, p = .039).
Predictive Accuracy of STABLE-2007 Items and Total Scores for Any Recidivism (Including Breaches)
Interactions between Aboriginal status and STABLE-2007 items were tested using Cox regression, with 1 degree of freedom for each Wald test and using jurisdiction as a strata variable. Although Wald values are always more than 0, we added positive and negative signs to indicate the direction of the findings, with positive values denoting higher predictive accuracy for Aboriginal offenders compared with non-Aboriginal offenders and negative signs denoting lower predictive accuracy for Aboriginals.
p < .05. **p < .01. ***p ≤ .001.
Summary of predictive accuracy
Three items had significantly lower predictive accuracy for Aboriginal sex offenders for all four types of recidivism (lack of concern for others, impulsive acts, and poor cognitive problem solving). In addition, hostility towards women had significantly lower predictive accuracy for Aboriginals for the prediction of sexual, violent, and any recidivism. Cooperation with supervision had significantly lower predictive accuracy for Aboriginal offenders for all outcomes except sexual recidivism. General social rejection/loneliness was also less predictive for Aboriginals for any crime and any recidivism. Although several other items had lower AUC values for Aboriginal offenders compared with non-Aboriginal offenders, these differences were not statistically significant based on Cox regression.
For STABLE-2007 total scores, the predictive accuracy for Aboriginal offenders was significantly lower than non-Aboriginal offenders for the prediction of violent recidivism and any recidivism (including breaches) and approached significance for sexual recidivism. Differences in predictive accuracy between groups were not significant for any new crime.
Discussion
This study found several significant differences in the presence and predictive accuracy of dynamic risk factors for Aboriginal and non-Aboriginal sex offenders. In terms of demographic and offence characteristics, Aboriginal sex offenders were younger, more likely to be rapists or mixed offenders, less likely to be noncontact offenders, and more likely to cause victim injury. Consistent with the first hypothesis, Aboriginal sex offenders scored significantly higher than non-Aboriginal sex offenders on STABLE-2007, particularly on the items that assessed general antisociality (e.g., lack of concern for others, impulsivity, poor problem solving, and cooperation with supervision). Higher scores for Aboriginal offenders on these items make sense given that rapists generally exhibit more general criminality and antisociality than child molesters (Firestone, Bradford, Greenberg, & Serran, 2000; Harris, Smallbone, Dennison, & Knight, 2009).
Contrary to our second hypothesis, the same items on which Aboriginal offenders scored higher generally predicted recidivism with significantly lower accuracy than non-Aboriginal offenders. Conversely, Aboriginal and non-Aboriginal offenders had similar profiles on the sexual self-regulation items, and these items had comparable predictive accuracy for both groups. Significant social influences and capacity for relationship stability seemed to be equally predictive for Aboriginal and non-Aboriginal offenders.
The reason for the lower predictive accuracy for several of the general antisociality items is unclear. These results were surprising given that these items are strongly supported predictors of sexual recidivism (Hanson & Morton-Bourgon, 2005) and general recidivism (Andrews & Bonta, 2006), and other items and scales measuring similar constructs have been validated with general Aboriginal offenders (e.g., Gossner & Wormith, 2007; Schmidt et al., 2006). However, there are some key differences in this study. This is the first study we are aware of that examined the predictive accuracy of dynamic risk factors with Aboriginal sex offenders. In addition, most of the previous studies examined whether the items or risk scales predicted recidivism at all with Aboriginal offenders, whereas this study focused on whether the items predicted as well for Aboriginal offenders as they did with non-Aboriginal offenders. Impulsive acts, for example, significantly predicted any recidivism for Aboriginal offenders, but the effect size was still significantly lower than for non-Aboriginal offenders.
The poor performance of some of these items with Aboriginal offenders could be due to measurement issues. It is possible that the causal factors for recidivism are similar for Aboriginal and non-Aboriginal offenders but that the STABLE-2007 items are not adequately measuring these constructs, particularly for Aboriginal offenders. Although the static/dynamic conceptualization of risk factors has been dominant for over a decade, it has been challenged in recent years (e.g., Beech & Ward, 2004). Mann et al. (2010) have recently advocated for a new focus on psychologically meaningful risk factors. In this model, the risk factors generally measured in research studies are indicators of underlying propensities. For example, self-regulation problems may be an underlying propensity that causes offending and reoffending. Certain risk factors such as substance abuse, job instability, gambling, fighting, and poor problem solving may all be indicators of this propensity. Interestingly, this model downplays the importance of the static/dynamic distinction because underlying propensities can have both static and dynamic indicators.
It is possible that some of the risk factors measured in STABLE-2007 are not indicators of the same construct for Aboriginal and non-Aboriginal offenders. Perhaps for Aboriginal offenders, some of the behaviours measured in the impulsive acts item (e.g., substance abuse, fighting, and employment instability) are indicators of something other than self-regulation problems (e.g., cultural marginalization and poverty). A corollary of this is the possibility that similar underlying propensities may be expressed differently by Aboriginal and non-Aboriginal offenders due to social, economic, and cultural differences.
The strengths of this study are its prospective design and its field implementation of a risk assessment scale among trained supervision officers. This study also provides an important contribution by shedding light on an underresearched area. The main limitation of this study is the small sample size of Aboriginal offenders, which would likely lead to increased Type II errors (i.e., nonsignificant AUCs). However, alternately, the large number of analyses would increase the probability of Type I errors. Readers should consider the findings of this study as preliminary and exploratory; further replications with larger sample sizes are sorely needed.
Another limitation is that we used the same sample on which STABLE-2007 was developed. Although it is possible that this could inflate the AUC values, only three of the items (and the total score) were changed from STABLE-2000 (which was developed before these data were being collected) to STABLE-2007, so inflation is anticipated to be minimal. In addition, any inflation effect is unlikely to have a disparate effect on the two groups and is therefore unlikely to affect analyses of differences in predictive accuracy between Aboriginal and non-Aboriginal offenders. Nonetheless, independent validations of STABLE-2007 would be a valuable contribution to this field.
In addition, although the field use of the scale increases generalizability to real-world settings, it also introduces considerable variations in the quality of the STABLE ratings. The original report from this project identified substantial differences in the results for supervision officers considered conscientious (defined as those who submitted all the data that were requested of them) compared with less-conscientious officers (Hanson et al., 2007). Although the assessments in this study were completed by well-trained officers who demonstrated a good understanding of the coding rules, the lower levels of conscientiousness may reflect the tension between ideal practice and the realities of limited resources. Nonetheless, optimal tests of predictive accuracy for Aboriginal offenders (or for any group) should be conducted with risk assessments coded by raters who are properly trained, who have access to complete information, and who have sufficient time to adequately score the measures.
Implications for Researchers
This study provides preliminary data on the use of dynamic risk factors with Aboriginal sex offenders. This study did not find support for many of the general antisociality items of STABLE-2007, but additional studies are needed to replicate these findings, especially with larger samples of Aboriginal offenders. Research should also explore potentially unique risk factors for Aboriginal offenders (e.g., cultural identity) and examine possible differences within Aboriginal groups (e.g., different bands or whether the offenders live on a reserve).
Research should also examine the absolute predictive accuracy of actuarial risk scales (e.g., do the recidivism estimates overrepresent or underrepresent recidivism for Aboriginal offenders?). More research is also needed to understand what underlying propensities are related to sexual offending and to explore optimal ways of measuring these constructs. If the items on current risk assessment scales are not valid indicators of these constructs, it may be necessary to identify better indicators, such as self-report questionnaires or implicit tasks. Finally, another avenue of future research should explore the effects of rater prejudice on the scores and predictive accuracy of risk assessment scales with explicit coding rules.
Implications for Risk Evaluators
It is difficult to provide recommendations to practitioners on the basis of an exploratory study with a restricted sample size. Virtually all the STABLE-2007 items did not achieve statistical significance in predicting recidivism, which means that these items cannot be used with confidence for Aboriginal sex offenders. However, the statistical power of these analyses was quite small. Even for non-Aboriginal offenders (n = 509), some of the items did not reach statistical significance, even though they did in the original report with a larger sample size (Hanson et al., 2007). For these reasons, our interpretation of these exploratory findings focused on the magnitude of the effect sizes and the tests for differences in predictive accuracy, which provided some basis for optimism.
This study suggests that sexual self-regulation, relationship stability, and social influences predict recidivism similarly among Aboriginal and non-Aboriginal sex offenders. Although more research is needed to replicate these findings, we believe that practitioners can use these risk factors in their evaluations with some justification. Evaluators should be more cautious about using STABLE-2007 total scores and the items that had significantly lower predictive accuracy (e.g., hostility towards women, lack of concern for others, impulsivity, poor problem solving, and cooperation with supervision).
It is also worth noting that other risk assessment scales have already been validated for use with Aboriginal offenders, such as the LSI and its variants (e.g., Brews et al., 2009; Gossner & Wormith, 2007) and Static-99/R (Babchishin, Blais, & Helmus, 2012; Nicholaichuk, 2001). Therefore, the evaluations of Aboriginal sex offenders could include Static-99/R to assess static risk factors, the LSI-R or its variants to assess dynamic risk factors, and the sexual self-regulation items of STABLE-2007 (with caution) to supplement the LSI variables and to guide treatment intervention decisions.
Footnotes
Acknowledgements
We would like to thank R. Karl Hanson and Andrew Harris for allowing us to use the data for the current analyses and Karl for his helpful comments on an earlier draft of this article.
Authors’ Note
The views expressed are those of the authors and not necessarily those of Public Safety Canada.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: Funding for this project was provided in part by the Social Science and Humanities Research Council of Canada.
