Abstract
This article provides an evaluation of the effectiveness of the “CoVa” cognitive skills training program. A quasi-experimental design was used to compare the known reoffending behavior of 2,229 CoVa participants with that of two control groups of offenders: (1) one formed using the inclusion criteria of the program, and (2) one formed using propensity score matching (PSM). Outcome measures included the prevalence, frequency, and impact of new, adjudicated crimes, with the impact defined as the combined severity of the offenses. Results indicated that the participants in the CoVa group were reconvicted less frequently than those in Control Group 1. Moreover, the impact of their recidivism was lower than the impact of those of Control Group 2. However, the effect sizes were very small, and no significant differences were found regarding reconviction prevalence. Explanations for the absence of (large) effects are discussed.
To reduce reoffending rates, various countries in Europe and North America invest in offending behavior programs within the prison and probation services (cf. Redondo, Sánchez-Meca, & Garrido, 1999; Tong & Farrington, 2006). The risk–need–responsivity (RNR) model as proposed by Andrews, Bonta, and Gendreau (e.g., Andrews & Bonta, 2003; Andrews et al., 1990; Gendreau & Andrews, 1990) is one of the more influential models for the treatment of offenders (Ward, Melser, & Yates, 2007). The RNR model suggests that reoffending is less likely to occur when offenders participate in programs that (a) match the level of program intensity to the participants’ risk of reoffending, (b) target known criminogenic needs of the offender, and (c) tailor the intervention to the learning style, motivation, abilities, and strengths of the offender (specific responsivity) using cognitive behavioral or social learning principles (general responsivity).
Following the RNR principles, the Dutch Probation Service has developed several behavioral intervention programs for offenders (Ministry of Justice, 2005). One of the most widely implemented programs in the Netherlands is the cognitive skills training CoVa (an acronym of cognitieve vaardigheden [i.e., cognitive skills]). Since 2004, it is possible to administer the CoVa training to adjudicated adult offenders, both inside and outside the prison. The training itself is an adapted version of the Enhanced Thinking Skills (ETS) program 1 (Clarke, Simmonds, & Wydall, 2004), which has been used in England and Wales for over 20 years, although from 2009 the program is gradually replaced with the Thinking Skills Program (TSP; Gobett & Sellen, 2014).
The CoVa program is based on the idea that cognitive deficits are important criminogenic factors that can be remedied or improved (Buysse & Loef, 2012). Deficits in cognitive skills have an effect on both the onset and the continuation of offending behavior (Tong & Farrington, 2006). Various empirical studies provide support for this relationship (Antonowicz & Ross, 1994; Farrington, 2001; Ross & Fabiano, 1985). Conversely, the cognitive model suggests that if offenders manage to improve their cognitive skills, the risk of (re)offending reduces correspondingly (Ross & Fabiano, 1985). Basically, the program theory of CoVa (i.e., the description of how the program can achieve its goals) stipulates that the training will reduce cognitive deficits, which will in turn lead to lower reconviction rates.
Previous Research on CoVa and ETS
Whether CoVa actually reduces reoffending has not yet been thoroughly investigated. The training is only researched as part of a larger investigation into the effectiveness of the Prevention of Recidivism Program in detention (Bosma, 2017). A small part of the offenders in this evaluation participated in CoVa. The reconviction prevalence of 93 offenders who completed CoVa was compared to offenders who did not participate. After controlling for differences in sentence length, prior convictions, and accommodation, no statistically significant effect was found on the reconviction rates in the 6 and 24 months following release. However, there are also studies about the effectiveness of CoVa on other outcomes. Two studies examined the results of preintervention and postintervention psychometric assessments completed by CoVa participants (Buysse & Loef, 2012; Tierolf, 2007). Both studies showed that the CoVa training has a significant positive effect on impulsiveness reduction, problem solving, and perspective taking. However, most of the reported effect sizes were small, and these studies did not include a control group. A third study used neurocognitive tasks to examine the effect of CoVa among prisoners (Cornet, Van der Laan, Nijman, Tollenaar, & De Kogel, 2016). This study did include a control group, but the results did not provide any evidence of a favorable change in neurobiological functioning in the intervention group.
Evaluation research on the predecessor of CoVa―ETS―does include various studies involving reoffending data, though the results of these studies are mixed. Several studies on ETS did not find significant differences in the 1- or 2-year reconviction rates of ETS participants and matched controls (Cann, 2006; Falshaw, Friendship, Travers, & Nugent, 2003; Palmer, Hatcher, McGuire, & Hollin, 2015). Other studies only showed a significant and positive effect when the comparison was limited to those who fully completed the intervention, excluding subjects who dropped out prematurely (Cann, Falshaw, Nugent, & Friendship, 2003; Hollin et al., 2008). Several explanations had been put forward for the fact that no effects were found, including the rapid expansion of the program, the lack of motivation on the part of the participants, and low program completion rates (Falshaw et al., 2003; Hollin et al., 2008). Moreover, the authors acknowledged a methodological limitation common to each of these studies: the inability to control for potential selection bias due to differences in dynamic risk factors for reconviction.
There are, however, also studies that showed significant positive effects of ETS. Reconviction rate reductions of 6% to 14% points were found (Friendship, Blud, Erikson, & Travers, 2002; Sadlier, 2010; Travers, Mann, & Hollin, 2014; Travers, Wakeling, Mann, & Hollin, 2011). In contrast to the other studies, Sadlier (2010) did use both static and dynamic risk factors to compose a control group and showed that the proportion of ETS participants who were reconvicted within 1 year was 6.2% points lower than that of the control group. Moreover, ETS participants were convicted for 60 fewer recordable offenses within 1 year per 100 released offenders than control group participants. Nevertheless, no statistically significant effect was found on the rate of reconviction for severe offenses (for instance, violence against persons, sexual offenses, and vehicular manslaughter).
The Current Study
This article evaluates the CoVa program in terms of reconviction rates. 2 We will examine the prevalence, frequency, and impact of reoffending by the CoVa participants and compare them with a similar group of offenders who did not participate in CoVa. 3 On the basis of the program theory, we expect that the CoVa participants have lower reconviction rates on these three outcome measures than the two control groups, which consist of offenders who have an RISc (Recidive Inschattings Schalen; Reclassering Nederland, 2010) assessment but did not participate in the program.
Although CoVa is one of the most widely implemented behavior programs in the Netherlands, the reconviction rates of the participants of CoVa are largely unknown. They are known only for a small proportion of participants, namely, those who had completed the CoVa training in prison and entered one of the Dutch remand centers between October 2010 and April 2011 (Bosma, 2017). The current study is a large-scale evaluation which also includes participants who follow the training outside of prison or who did not fully complete the training. Therefore, the current study contributes to the research into the effectiveness of CoVa. In contrast to the majority of earlier evaluations of ETS, this evaluation includes both static and dynamic risk factors in the composition of the control groups. Moreover, this study includes three outcome measures instead of only reconviction prevalence. The reconviction prevalence is an extreme dichotomous outcome in terms of human behavior because an offender either stops reoffending or continues (Bushway, Piquero, Broidy, Cauffman, & Mazerolle, 2001). However, if the process of desistance is actually more gradual and dynamic, then it may be of interest to examine the frequency as well as the impact of the repeat crimes to codetermine the success of the program.
Method
In this research, we used a quasi-experimental research design with observational data. The reconviction rates of CoVa participants who followed the training between 2008 and 2011 (n = 2,229) were compared to the reconviction rates in two control groups of offenders (n = 506 and n = 2,229) who did not participate in the CoVa program.
Program Description
The CoVa program consists of 20 group sessions of 2.5 hr, typically held twice a week, with a group consisting of 12 to 14 participants (Buysse & Loef, 2012). Every session is conducted by two regular trainers, with a third trainer on standby. During the training sessions, participants are instructed about values, made to practice new skills, learn to think creatively and rationally, and to consider both the short-term and the long-term consequences of their actions. Between sessions, participants are obliged to complete homework. The total amount of time invested by each participant is approximately 80 hr. When a participant is unable to attend a session, they have to make up for it later. Failure to appear for the second time—without a valid reason—leads to exclusion from the entire course.
Offenders have to meet certain selection criteria to participate in the CoVa program. Whether or not offenders meet the criteria is determined by a screening instrument called “RISc” (Reclassering Nederland, 2010), which closely resembles the OASys (Howard, Clark, & Garnham, 2003). The RISc is used by the Probation Service on a national scale to investigate problems on 12 life domains (e.g., accommodation, drug misuse, and education, work, and training). Together, the individual sections assessing these domains form an overall score indicating the risk of reoffending. To complete a RISc assessment, probation officers use criminal record information, interviews with the offenders and offender’s references, and their own professional judgment. The RISc was evaluated with regard to the interrater reliability, factorial structure, internal consistency, and concurrent validity (Van der Knaap, Leenarts, & Nijssen, 2007). The general conclusion of the evaluation was that the RISc has favorable psychometric qualities. Further research showed that the predictive validity of the RISc was “sufficient” with respect to the prediction of general, serious, and very serious recidivism in the population of probation and aftercare clients (Van der Knaap & Alberda, 2009).
CoVa is developed for offenders who show a moderate to high risk of reoffending as measured by the RISc assessment (i.e., a total score of 32 points or more; RISc version 3.2.5). In addition, offenders should show cognitive deficits on the thinking and behavior scale of the RISc. This scale consists of eight items: level of interpersonal skills, aggressive/controlling behavior, temper control, impulsivity, ability to recognize problems, problem solving skills, achieves goals, and mindset and learning ability (range = 0-2 points for each item). The offender should score at least 4 points on this scale. With regard to the five last mentioned items, the offender should have at least 1 score of 2 points (which indicates a severe problem) on any of these five items or a score of 1 (which indicates a problem) on each. If an offender does not meet the abovementioned selection criteria but is nevertheless considered a suitable candidate for the program by the probation officer, the offender is allowed to participate if the intervention coach agrees with the probation officer’s judgment (Ferwerda, Van Wijk, Arts, & Kuppens, 2009). Other treatment inclusion criteria are as follows: being an adult (i.e., 18 years or over), not being under special psychiatric care, sufficient proficiency in the Dutch language, a valid residence permit, and, if applicable, having to serve at least another 4 months of detention at the start of the training program. Contraindications for participation include addictions or mental health problems that obstruct group functioning. 4 Furthermore, the CoVa training is also not suitable for offenders who either show little willingness to change or are presumed to have a low IQ. 5 Offenders with lower intellectual abilities are generally assigned to an adjusted version of the training named “CoVa plus.” Fischer, Captein, and Zwirs (2012) estimated that 18% to 30% of the offenders who meet the selection criteria for CoVa exhibit one or more contraindications.
Data
We combined different data sources. Information gathered in the earlier mentioned research by Buysse and Loef (2012) was reused. Specifically, we reused data about the persons who participated in the CoVa program, their start and end date in the program, the setting in which the program was applied (inside or outside prison), and whether or not participants dropped out of the program prematurely. On the individual level, we linked the data of each CoVa participant to three databases:
Probation data: We used the RISc database to get more information about the psychosocial functioning of the CoVa participants before they entered the program. Probation officers apply this instrument to assess the risk of reoffending, obtain presentence reports, and to develop plans concerning probation and aftercare service supervision. The RISc database contains information about the prevalence of criminogenic needs in 12 different life areas.
Prison data: The Dutch prison registration system (TenUitvoerleggingprogramma Gevangeniswezen, abbreviated to “TULP-GW”) contains information about prison admission and release dates of all detainees in the Netherlands.
Conviction data: The Dutch Offender Index (Onderzoek- en Beleidsdatabase Justitiële Documentatie, abbreviated to “OBJD”) is an anonymized database. It contains information about all criminal cases of individuals prosecuted in the Netherlands for criminal offenses. Criminal cases are only registered for persons who are 12 years and older.
The resulting combined data set suffered from missing data. Around 12% of the records contained at least one missing value. In most cases, it was unclear whether a person had problems in one or more life areas of the RISc. Occasionally, the offense type or the country of birth of a participant was missing. The absence of data was found to be selective, so if we simply would have discarded the individuals with missing data, a biased estimate could be the result. Therefore, we used switching regression (Van Buuren, Boshuizen, & Knook, 1999) to iteratively impute the missing data. For continuous variables, we applied predictive mean matching (Heitjan & Little, 1991; Schenker & Taylor, 1996) to obtain plausible covariate values.
Participants and Control Groups
This study examined all offenders who had followed the CoVa training at least once somewhere between 2008 and 2011. As referred to above, this information was obtained from the study of Buysse and Loef (2012). If an offender followed CoVa more than once, only the last instance of participation was included. In addition, we excluded participants with missing end dates, intramural participants without a matching detention episode in the prison databases, and those whose records could not be linked to the other data sources. Furthermore, we excluded participants who left detention after 2011, ensuring that every participant was observed for a period of at least 2.5 years. The final treatment group, hereafter referred to as the “CoVa group,” includes 2,229 participants. This group consists of both completers and noncompleters (around one fifth of the participants dropped out prematurely).
To estimate the effect of CoVa on post-training recidivism, the CoVa participants were compared to offenders in two separate control groups. Both comparison groups were selected from the total population of offenders with a valid RISc assessment but who had not followed CoVa between 2008 and 2013. 6 Next, assessments of forensic psychiatric patients and juvenile offenders were removed, as they were not allowed to participate in this particular training, thus making them inadequate controls. The resulting pre-matching group consisted of 52,242 offenders.
The first comparison group was composed by Buysse and Loef (2012). Buysse and Loef selected all RISc assessments of persons who fitted the inclusion criteria for CoVa. Offenders with a presumed low IQ as well as those with a very low score on items measuring motivation to change were removed. From the remaining pool of offenders, 300 offenders with an extramural sanction and 300 detained offenders were randomly selected. Subsequently, Buysse and Loef manually checked in the RISc data for any contraindications (e.g., severe psychiatric disorder or addiction). If none were found, the offender was permitted in the comparison group. From this comparison group, we excluded those offenders in the intramural group who could not be linked to the other data sources and who left detention or completed the RISc assessment after 2011. The resulting first comparison group consists of 506 offenders.
In addition to this first control group, we used the pre-matching group of 52,242 offenders to compose a second control group using propensity score matching (PSM; Rosenbaum & Rubin, 1983). Offenders were matched on the probability to be allocated to the CoVa group on basis of their characteristics. The independent variables used as covariates can be grouped into four categories: offender background characteristics, features of the index case, criminal career characteristics, and the RISc problem scores in 12 life areas. All covariates have shown associations with reconvictions (cf. Wartna, 2009; Zebel, Alberda, & Wartna, 2014). There are several approaches to applying PSM within a multiple imputation framework. We used the method of combining PSM and imputations of missing data that has shown the least bias in simulation studies (Hill, Reiter, & Zanutoo, 2004). This method consists of the following steps:
Estimate propensity score models on each separate imputed data set and find a match for each intervention subject;
Calculate the covariate balance statistics and the treatment effect for each separate matched sample;
Combine the effect estimates and the balance statistics over imputations using the rules for combining statistics established by Rubin (1978).
Outcome Measures
To estimate reconviction, we used conviction data from the criminal justice system. A conviction was defined as a valid disposal by the court or penalty order issued by the public prosecutor (i.e., decisions that found the defendant guilty). Acquittals and technical dismissals were excluded. Furthermore, subdistrict court cases (i.e., minor offenses) were not counted as valid reconviction events. However, criminal justice data do not necessarily display the actual number of committed offenses but only those that were detected by the justice system (Biderman & Reiss, 1967). Nevertheless, as this is a comparison study, the estimates served as lower bounds of the actual number of new offenses for both the treatment and control groups.
Three outcome measures of reoffending were considered: the prevalence of reconvictions, the number of reconvictions, and the impact of all crimes involved in the new penal cases. The prevalence was defined as the proportion of offenders who are convicted of an offense within a certain follow-up time. The observation period with respect to the CoVa participants started at the last day they attended the program or, if applicable, the day that they were released from prison. For the offenders in the comparison groups, the follow-up period started on the day they were released from prison or, if no prison sentence was served, on the day when the RISc questionnaire was completed. Reconviction frequency was defined as the number of new convictions. As some convictions lead to new imprisonments, the time at risk for reoffending was not similar for everyone. For that reason, we corrected the frequency for the duration of the subsequent detentions and rescaled the frequency as the number of new convictions per year of “not being detained.” The impact of the reoffenses that were adjudicated is a measure that weighs the frequency of registered offenses against their severity. Offense severity was measured with the MSI (in Dutch, the Misdaad Straf Index; Beerthuizen, Wartna, Verweij, & Tollenaar, 2015). This scale quantifies the severity of the committed offenses in a penal case on basis of the sentence imposed in that case (refer to the appendix for more information on this process). In this study, we determined a certain amount of “sanction points” for each reconviction within a 4-year observation period. Next, we calculated an overall MSI mean score per year with no detention for each person.
Analysis Strategy
The reconviction prevalence rates were estimated using survival analysis (Kaplan & Meier, 1958). To test whether there were differences in survival between the CoVa group and comparison groups, we used log-rate, Wilcoxon, and Tarone–Ware tests. Differences in frequency and impact of the reoffenses were tested with t tests. We used Cohen’s d to portray the effect sizes of CoVa (Cohen, 1988).
The quasi-experimental design of this study required that we statistically control for any differences found in relevant background characteristics. If there were differences between the participants and a comparison group, we used multiple regression models. These models allowed us to determine the effect of CoVa participation while accounting for group differences in various variables that are typically associated with reoffending (e.g., offense history or drug abuse). As this study used three different kinds of outcome measures, different multiple regression models were deemed appropriate. First, we applied a Cox regression analysis on the reconviction prevalence. The effect sizes of the covariates and this outcome variable were expressed in the exponent beta (eβ) or hazard ratio (Cox, 1972). Second, to analyze the number of reconvictions, we used a negative binomial regression model. This technique is particularly suited for the analyses of “count” variables with a variance in the outcome measure that is larger than the average (Long & Freese, 2001). Third, we used a multiple regression model when analyzing the impact of the reoffenses. All regression models were computed in Stata version 12.1 using the “mi estimate” procedure. This procedure combined multiple imputations and estimated adjusted coefficients.
Results
Outcome of the Matching Procedure
As the effectiveness of CoVa is determined by a comparison of reconviction rates of the treatment and control groups, it was important that these groups were comparable on various background characteristics that (co)influence the risk of recidivism. To test whether there were differences, we performed imbalance tests. Table 1 displays the results of the pre-matching and post-matching t tests. Significant differences between the CoVa group and the pre-matching group existed on every covariate, which underlines the need for matching the treatment and the comparison groups.
Background Characteristics of the CoVa Group and the Comparison Groups
Note. CoVa = cognitieve vaardigheden; PSM = propensity score matching; RISc = Recidive Inschattings Schalen.
p < .05. **p < .01. ***p < .001.
Although the background characteristics of the first comparison group were more similar to the CoVa group than the pre-matching group, many characteristics were still statistically significantly different. The CoVa participants were younger and less often born in Morocco. Furthermore, the index case of the participants more often constituted a violent property crime or a drugs or traffic offense, while the offenders in Comparison Group 1 were more often convicted for violent or sexual offenses. Looking at their criminal careers, the CoVa participants had fewer previous contacts with the justice system. The CoVa participants were, however, younger at the time of their first conviction and spent a longer term in detention. In contrast, Comparison Group 1 showed more problems in several areas of functioning than the CoVa group. In general terms, the first comparison group seemed to consist of offenders with a higher average risk of reoffending than those in the CoVa group. We controlled for these differences when comparing the reconviction rates of both groups.
For the composition of Control Group 2, we used PSM based on entering the covariates in Table 1 into a logit model using CoVa treatment participation as the outcome variable. Figure 1 displays the distribution of the log of the propensity scores in a single imputation for the total pre-matching group and the CoVa group. It shows complete overlap of the propensity scores of the pre-matching group and the CoVa group, which means that using PSM allowed us to find a reasonable match for each CoVa participant.

Common Support of Propensity Scores of CoVa and Pre-matching Group
The fourth column of Table 1 shows that, following matching, there were no significant differences left between the CoVa participants and Comparison Group 2. We therefore conclude that these groups were comparable after matching.
Although Comparison Groups 1 and 2 were extracted from the same population, the overlap was limited. A maximum of 71 individuals was included in both groups. The core strength of Comparison Group 2 is that this group was matched to the CoVa group on various important covariates in advance, while Comparison Group 1 and the CoVa group still differ on many of the background characteristics. Multiple regression models were thus needed to statistically control for these differences. However, the core positive feature of Comparison Group 1 is that persons with contraindications were removed from the sample. This was not feasible for Comparison Group 2, so, on this point, Comparison Group 1 is preferable. In sum, both the comparison groups have their own strengths which is why we decided to use both groups. Moreover, if we were to find similar results when comparing the treatment group to Comparison Groups 1 and 2, this would substantiate the findings on the effectiveness of CoVa. In that case, the outcome seems insensitive to the selection procedure of the comparison group.
Outcome Results
The outcome of the CoVa training was determined in three ways: by measuring the prevalence of reconvictions, the number of reconvictions, and the impact of the adjudicated reoffenses.
Reconviction Prevalence
Figure 2 depicts the failure curves of reconviction prevalence at different survival times. After 1 year, 32.1% of CoVa group participants had been reconvicted. This percentage had risen to 47.9 % and 56.3% after 2 and 3 years, respectively. The reconviction rates of Comparison Group 1 were a bit higher. After 1 year, 35.8% had been reconvicted, after 2 years, 49.5%, and after 3 years, 57.7%. In Comparison Group 2, the prevalence of reconviction was lower. Of the persons in this group, 31.3% were reconvicted within 1 year, 46.3 % within 2 years, and 55.8% within 3 years. These differences between the CoVa group and the comparison groups were not statistically significant. This did not change when we used a Cox regression analysis to control for differences between the CoVa group and Comparison Group 1 (see Table 2). The Exp(B) for program participation remained statistically nonsignificant.

Reconviction Prevalence of the CoVa Group Versus Comparison Groups 1 and 2. The F Statistics for the Differences Between the CoVa Group and Comparison Group 2 Are Obtained by Combining the χ2 From the Imputed Datasets
Multiple Regression Analyses CoVa Group Versus Comparison Group 1
Note. CoVa = cognitieve vaardigheden; CI = confidence interval; RISc = Recidive Inschattings Schalen.
p < .05. **p < .01. ***p < .001.
Reconviction Frequency
The CoVa group displayed a lower average frequency of reconvictions per year of no detention compared with both comparison groups. On an average, the CoVa participants were convicted for 0.51 new criminal cases per year not in detention, whereas Comparison Group 1 recidivated with 0.62 criminal cases and Comparison Group 2 with 0.54 criminal cases. The difference in frequency of the CoVa group and Comparison Group 2 was not statistically significant—t(df) = 35.45 = −1.12, p = .135—but the difference between the CoVa group and Comparison Group 1 was significant—t(df) = 684.772 = −2.75, p = .012. Since the CoVa group and Comparison Group 1 differed on background characteristics, this last result might be biased. For this reason, we next performed a negative binomial regression analysis. From the results in Table 2, it follows that, while controlling for differences in background characteristics, the effect of CoVa program participation was still statistically significant. This effect is comparable to a Cohen’s d of 0.10. According to Cohen (1988), this is a “very small” or “negligible” effect.
Impact of Adjudicated Repeat Offenses
The severity of the offenses was articulated in terms of sanction points and combined with the frequency to determine the impact (refer to the appendix). The average impact of repeat offenses per year of no detention for the CoVa group was 21.2 sanction points. Comparison Group 1 had an average of 23.6 sanction points and Comparison Group 2 an average of 24.3 sanction points. The difference between the CoVa group and Comparison Group 1 was not statistically significant—t(df) = 2,733 = −1.487, p = .137. In the additional multiple regression analysis, a similar result was found (see Table 2). However, the difference between the CoVa group and Comparison Group 2 was significant—t(df) = 332 = −2.30, p = .011. This effect is comparable with a Cohen’s d of 0.07.
Discussion
CoVa is a training program intended to improve cognitive skills and subsequently reduce recidivism among its participants. To evaluate the effectiveness of CoVa, we used a retrospective quasi-experimental design with two control groups and three outcome measures. Contrary to our expectations, the analyses showed no effects on the first outcome measure, the reconviction prevalence. However, statistically significant effects were found on the other outcome measures. The participants were reconvicted less frequently than the offenders in Comparison Group 1, and the impact score of their reoffenses was on average lower than that of Comparison Group 2. Although the comparison with two control groups led to a slightly different outcome, no contradictory results were found. The reconviction frequency in Comparison Group 2 and the impact of the reoffenses of Comparison Group 1 were also higher than in the CoVa group, although those differences were not statistically significant.
Our results are consistent with the analysis of Bosma (2017), which also did not show a significant treatment effect of CoVa based on the reconviction prevalence. Moreover, our results on the reconviction prevalence are consistent with the outcome of a substantial part of the research on ETS (Cann, 2006; Cann et al., 2003; Falshaw et al., 2003; Hollin et al., 2008; Palmer et al., 2015). These studies also did not find significant differences between all offenders allocated to a program and the comparison groups.
Although the nonsignificant results on the reconviction prevalence suggest that CoVa did not reduce the amount of persons with at least one reconviction, it does not necessarily mean that the program has no effect at all. If the process of desisting offending behavior is gradual, then the reconviction prevalence, an outcome measure which focuses only on the first reconviction, is too crude to measure program effects. For that reason, we also evaluated the post-treatment reconviction frequency and the impact of the adjudicated repeat offenses. These measures showed some significant positive effects (with regard to one control group), but the corresponding effect sizes are very small (Cohen, 1988). In other words, the current study did not reveal any large positive effects, thus suggesting that CoVa did not substantially reduce recidivism.
When interpreting the results of this research, it is important to consider some limitations. First, selection bias might have affected the outcome of this study. Because there was no random assignment to the treatment conditions, we cannot rule out unmeasured differences between treatment and control groups that may have hampered the comparison on the outcome variables. Nevertheless, in the current study, we used two different selection procedures for the composition of the control groups and more than 25 static and dynamic risk factors to match the offenders in the treatment and control groups. Hence, the risk of potential selection bias was reduced to a minimum.
Second, the use of convictions as an outcome measure is a limitation as well. It is well known that not every new crime leads to a conviction (e.g., Farrington, Auty, Coid, & Turner, 2013). Hence, it is interesting to see whether the same effects would be found if, for instance, self-report or police data are used in an evaluation about CoVa.
Third, a methodological limitation is the lack of information on the exact nature of the control conditions. To determine the effect of CoVa on reconviction, we contrasted the results on the outcome variables for the participants with those of offenders who received “treatment as usual” (TAU). A TAU design allows for only finding a relative treatment effect (Löfholm, Brännström, Olsson, & Hansson, 2013). Hence, the size of an effect depends on the level of treatment in the control groups, which is information we do not have. For instance, we are unaware of the proportion of persons who also participate in some sort of other treatment program and, if they do, the dosage of that program. In general, though, the level of TAU in the Netherlands can be considered quite high. The CoVa training might be the behavior modification program most frequently applied, but it is definitely not the only one. Other nationwide behavior programs for adults focus on employability or budgeting skills, housing, and the prevention of alcohol abuse and violence, including domestic violence (Fischer et al., 2012). The Probation Service also implements a great variety of locally organized programs that focus on offender rehabilitation and the prevention of recidivism. Moreover, every Dutch prison offers reintegration activities which aim at providing a regular income, proper housing, a valid ID card, health care, and guidance to help relieve financial debts (Beerthuizen, Beijersbergen, Noordhuizen, & Weijters, 2015). Following their release, exprisoners can receive further guidance and help with their reintegration in society through living in halfway houses (Van Wingerden, Alberda, Moerings, Wartna, & Van Wilsem, 2010).
As mentioned above, we do not have detailed information on interventions received by the offenders in the two comparison groups. However, as the default treatment level in the Netherlands is high, the TAU design used in this study may be an important reason for not finding large positive treatment effects for the CoVa training program. A previous meta-analysis of recidivism research on penal interventions in the Netherlands produced an average effect that is even smaller than the significant effects found in the current study (Wartna, Alberda, & Verweij, 2013). Although the effect sizes found in the current study may be very small in terms of the well-known classification of Cohen (1988), they can be termed “normal” in relation to the outcome of other Dutch evaluation studies, which in general could suffer from the fact that there is no such thing as a “nontreatment” condition in the Netherlands.
Conclusion
The current study adds substance to the body of literature regarding evaluation research of the CoVa treatment program. This article represents the first large-scale study examining the effectiveness of CoVa in terms of reconviction. Furthermore, the current study used both static and dynamic risk factors to adequately compose control groups, and did not focus solely on reconviction prevalence, as is often the case in effect studies on treatment programs regarding criminal behavior.
The results showed that there were no differences in reconviction prevalence, but participants in the CoVa group were less frequently reconvicted than the offenders in one comparison group, whereas the average impact of their reoffenses was lower than those of the other comparison group. Although the effects were classified as very small, they do not differ much from the average effects found in other research into penal measures in the Netherlands. Our overall conclusion is that, although there are weak indications of the program being effective in some areas of recidivism, it has yet to be proven that the CoVa program substantially reduces recidivism among adult offenders with established deficits on cognitive skills.
Footnotes
Appendix
Acknowledgements
The authors would like to thank K. Beijersbergen for her helpful comments and suggestions during the writing process. Parts of this paper have previously been published in a Research and Documentation Centre (WODC) report, available in Dutch at
, which was financed by the Dutch Ministry of Security and Justice. The current paper does not necessarily reflect any views, opinions, or policies of either the WODC or the Ministry of Security and Justice. The authors are solely responsible for the contents of the current paper.
