Abstract
Empirical studies on the longitudinal change of risk factors during treatment of forensic psychiatric patients are scarce. Therefore, in the current study, changes in risk factors of 341 high-security forensic psychiatric patients in Flanders were investigated using latent growth curve analyses. Annual assessments of the Historical Clinical Future-Revised (HKT-R) at three time points were considered, starting at time of admission. A significant small improvement was found on the future scale and most future factors. However, the clinical scale did not change significantly and only one of the 14 clinical factors (i.e., problem insight) changed significantly over time. Although the forensic treatment seems to be a slow process, the sensitivity of the HKT-R to detect clinically relevant change is questioned, underscoring the importance of determining whether there are instruments capable of detecting small significant changes over time.
Introduction
In the field of forensic psychiatry, a large body of research has studied risk and protective factors associated with recidivism (Bonta et al., 2014), as risk management is one of the primary goals of treatment. In this context, dynamic risk factors are especially important, because of their changeability over time (Douglas & Skeem, 2005). The degree of change in those dynamic risk factors is included in the examination and decision-making of practitioners about the risk of recidivism and the further treatment trajectory of an individual. However, only in the past decade, attention has shifted to the empirical assessment of longitudinal change of dynamic risk factors during treatment.
The most influential risk factors for recidivism in offenders were described by Andrews and Bonta (2010) as “the Central Eight”: four risk factors that are strongly associated with recidivism (i.e., history of antisocial behavior, an antisocial personality pattern, antisocial cognitions, and antisocial associations) and four risk factors that are moderately and indirectly associated with recidivism (i.e., a dysfunctional family situation, work and school problems, a lack of leisure activities, and substance abuse). This distinction was confirmed in mentally disordered offenders for violent recidivism (Bonta et al., 2014).
In recent decades, several risk assessment instruments have been developed to assess individual changes in empirically supported dynamic risk factors during treatment, for example, the Historical Clinical Risk Management-20 (HCR-20; Webster et al., 1997), the Violence Risk Scale (VRS; Wong & Gordon, 2006), and the Historical Clinical Future-Revised (Historische, Klinische, Toekomst-Revised; HKT-R; Bogaerts et al., 2018; Spreen et al., 2014). Since risk assessment and risk management are merged in practice, it is necessary to understand the changeable aspects of risk factors (Andrews & Bonta, 2010; Blanchard, 2013). However, progress can be made in the empirical assessment of the changeability of dynamic risk factors in inpatient forensic psychiatric patients since the literature is rather scarce.
Douglas and Belfrage (2001, as cited in Belfrage & Douglas, 2002; see also Webster et al., 2000) conducted the first study on changes of the HCR-20 over time in a forensic patient population. After approximately 6 months, significant moderate declines in the clinical and risk management scale were found in high-risk Canadian and Swedish forensic patients. Over the past two decades, multiple studies have been conducted in forensic psychiatric patients reporting significant small improvement on clinical scale scores during treatment (Belfrage & Douglas, 2002; Blanchard, 2013; Chakhssi et al., 2010; De Jonge et al., 2009; Hogan & Olver, 2016, 2019; Janković et al., 2021; Lewis et al., 2013; Mastromanno et al., 2018; Morrissey et al., 2014; Olsson et al., 2013; Olver et al., 2013; O’Shea & Dickens, 2015; Reed et al., 2000; Richter et al., 2018; Van der Linde et al., 2020; Webster et al., 2009). However, no significant change in dynamic factors was found in the study of Van der Veeken et al. (2018), using the Instrument for Forensic Treatment Evaluation (IFTE; Schuringa et al., 2014). The study was conducted in high security forensic patients at six time points with an interval of 4 to 6 months. Additionally, a Flemish study investigating a sample of patients residing in three forensic medium-security units found no significant difference in HCR-20 total scores between admission and discharge (Jeandarme et al., 2017). Similar non-significant findings were reported in the studies of Longdon et al. (2018) and Nitschke et al. (2020).
At the item level, De Jonge et al. (2009) found significant decreases on most clinical factors of the HKT-30 (Werkgroep Risicotaxatie Forensische Psychiatrie, 2002) after 2 years of treatment (problem insight, impulsivity, empathy, hostility, social skills, cooperation with treatment, responsibility for the offense, and coping skills), but the effect sizes were rather small. Other research confirmed the significant small decrease of several clinical factors during treatment, that is, lack of insight, negative attitudes, active symptoms of major mental illness, unresponsiveness to treatment, impulsivity, hostility, and egocentrism (Hildebrand & de Ruiter, 2012; Olsson et al., 2013; O’Shea & Dickens, 2015). Webster et al. (2000) described varying results, as significant decreases on all clinical factors of the HCR-20 were found in a high-security patient population in Canada, but only one clinical factor (i.e., negative attitudes) decreased significantly in Swedish high-security patients. Similarly, none of the clinical factors changed significantly in the study of Longdon et al. (2018).
Several risk assessment instruments, such as the HKT-30 and HCR-20, include risk factors related to the assessment of risk management in the future, after discharge. In the study of De Jonge et al. (2009), the HKT-30 future scale or risk management scale decreased significantly after about 2 years, as did all future factors. Several studies also reported significant small declines on the future scale (Blanchard, 2013; Douglas & Belfrage, 2001, as cited in Belfrage & Douglas, 2002; Hogan & Olver, 2016, 2019; Morrissey et al., 2014; Olsson et al., 2013). Although Belfrage and Douglas (2000) reported a decrease on the intra-institutional risk management scale after 1 year of follow-up, no change was found on the risk management scale for community release. Other authors also did not report significant changes on the future scale between assessments (Longdon et al., 2018; Nitschke et al., 2020; O’Shea & Dickens, 2015; Richter et al., 2018), while Mastromanno et al. (2018) found a significant increase after treatment.
It should be emphasized that various characteristics of the study design, the patient population or the type or duration of treatment or follow-up, can influence the findings and therefore explain differences between studies. For example, treatment duration seems relevant, as Van der Veeken et al. (2018) reported better resocialization skills in patients with a longer treatment history at the forensic psychiatric center (FPC) at first IFTE assessment. Janković et al. (2021) found a significant decrease in the clinical scale of the HKT-R during the first phase of treatment (period from juridical assessment until the first unguided leave), but a slower decrease during the second phase (period from the first unguided leave until unconditional release). This is in line with the finding of Richter et al. (2018), who found that only patients admitted for less than 2,112 days at baseline significantly decreased on the clinical scale. The greatest improvement in discharge plans of patients with longer prior treatment times seems to have occurred before the first assessment, so no significant improvements were reported afterwards. However, Van der Linde et al. (2020) found no differences in trajectories between different treatment duration groups.
Furthermore, psychopathology has been found to be associated with change (Janković et al., 2021). Patients diagnosed with substance use disorder or psychotic disorder showed a greater reduction in risk factors in the first phase of treatment, while patients with a cluster B personality disorder showed a greater decline in the second phase.
The same findings were seen in patients with psychotic disorder and cluster B personality disorder with or without a diagnosis of comorbid substance use. Moreover, the risk of recidivism could have an influence, as patients at higher risk at admission showed significant treatment progress, but an increase in problematic behavior was found in low risk patients (Van der Veeken et al., 2018). However, no different change trajectories were found between first-time offenders and recidivists, nor between different age groups (Van der Linde et al., 2020).
Lastly, according to Mastromanno et al. (2018), there might be differences in treatment between (1) patients who have been found not guilty by reason of mental impairment and (2) offenders who were found guilty of their crime and had served a prison sentence, but who were also diagnosed with a mental disorder and who have been referred to the mental health service. Possibly the focus is more on treating the mental disorder of these mentally ill prisoners than on rehabilitation and reintegration, which could explain why no decrease in the risk management scale for community release was found in their study.
It is important to address some issues raised in several studies that have not found significant decreases of dynamic risk factors during treatment. First, the sensitivity of the HCR-20 was questioned, as the three-point ordinal rating scale at item level may not be sensitive enough to detect change (Longdon et al., 2018; Morrissey et al., 2014; Nitschke et al., 2020; Olsson et al., 2013; Richter et al., 2018). The only study found that used a larger rating scale (i.e., IFTE) could not confirm this idea as no significant changes were discovered (Van der Veeken et al., 2018). The authors suggested that a 17-point rating scale might be too sensitive, detecting not only slight progress but also a decline, which could result in a mean change near zero. However, the prior treatment duration differed between patients at the start of IFTE assessment, which could also have influenced the findings. Second, most studies that found no significant change in dynamic risk factors during treatment applied a follow-up period of less than 2 years (Longdon et al., 2018; Nitschke et al., 2020, O’Shea & Dickens, 2015; Mastromanno et al., 2018). This study period may be too short to achieve significant improvement and longer intervals may be required to observe larger and positive changes.
In conclusion, since periodic risk assessments are an essential part of risk management, it is important to gain empirical understanding of the modifiable aspects of risk factors and to learn which factors would benefit from systematic increased attention during treatment (Andrews & Bonta, 2010; Blanchard, 2013). Most international studies found rather small but significant reductions in risk factors during treatment and after release, which were measured with different risk assessment instruments. However, research is still scarce, especially in Belgium, making this study even more relevant. The aim of the current study therefore contributes to deepening this line of research by assessing the longitudinal change of risk factors during treatment in high-security forensic patients in Flanders. The change of both the risk scales as the individual risk factors of the HKT-R were assessed. Based on prior research, we expected to find a small decrease in the clinical and future scales and risk factors of the HKT-R.
Method
Participants
Participants were 341 male patients (previously) residing in the two high security FPCs in Flanders. All patients were found not guilty by reason of insanity after having committed an offense and were subsequently admitted for mandatory treatment to diminish the risk of future offending. Mean age at admission was 40.7 years old (SD = 11.3, range = 19–77). Most of the patients (85.9%) were of Belgian nationality, followed by Moroccan (5.9%), Turkish (1.2%), Afghan (0.9%), and Dutch (0.9%) nationality. Based on the Diagnostic and Statistical Manual of Mental Disorders 5 (DSM-5; American Psychiatric Association, 2013), 38.1% of patients had a primary diagnosis of psychotic disorder, 27.9% Cluster B personality disorder, 8.2% other specified personality disorder, 8.2% paraphilic disorder, 4.7% substance abuse disorder, 4.7% conduct disorder, 4.1% neurobiological disorder, 0.9% Cluster A personality disorder, and 0.3% Cluster C personality disorder. The patients had committed a variety of index offenses. Among the cases with multiple index offenses, the most severe offense was selected: (attempted) homicide (23.5%), sexual offenses (23.2%), arson (4.4%), threats (5.9%), property crime (9.1%), drugs crime (1.5%), and other violent crimes (32.6%). The latter index offense included the offense of intentional assault and battery, as well as property offenses or extortion with violence, threats, or weapons.
Measure
The HKT-R is a Dutch risk assessment tool used to predict the likelihood of violent recidivism (Spreen et al., 2014) and is the revised successor of the HKT-30. It was created to support the structured professional judgment of the risk of violent recidivism in leave applications and TBS (“terbeschikkingstelling”; involuntary admission by order of the state for mentally disordered offenders) recommendations in the Netherlands. The goal was to develop an instrument to predict and identify characteristics of individuals for diagnosis and risk assessment purposes. Researchers and clinicians were consulted to include all elements deemed relevant.
The HKT-R consists of three scales, the historical, clinical, and future scale. The historical scale contains 12 historical factors related to the life history of the patient. These items are static risk factors and were therefore not investigated in the current study. The clinical scale consists of 14 clinical factors measuring the behavior of the patient in the previous 12 months (see Table 1). Lastly, the future scale includes seven future factors where the rater estimates the risks that emerge during leave modalities or after discharge from the FPC with further supervision (albeit at a lower level of supervision, i.e., supervised future scale), and after discharge from the FPC without further supervision (i.e., unsupervised future scale) (see Table 1). All risk factors are scored on a 5-point Likert scale ranging from 0 (“no risk”) to 4 (“high risk”). Therefore, a maximum of 56 points can be scored on the clinical scale and a maximum of 28 points on both the supervised and unsupervised future scale. After each HKT-R assessment, an individualized weighted structured professional judgment of the risk of recidivism is made by the clinicians for each patient. However, this final clinical judgment was not taken into account in this study. In the present study, the internal consistency of the three scales (clinical scale, supervised future scale, and unsupervised future scale) was considered acceptable at any time point, with Cronbach’s alphas ranging from .74 to .87 (Nunnally & Bernstein, 1994).
Means and Standard Deviations for HKT-R Clinical and Future Scales and Factors.
Note. HKT-R = historical clinical future – revised.
Procedure
The original sample of the study consisted of all patients residing in the two high security FPCs in Flanders from the opening of the centers (November 17th, 2014, for FPC Ghent and August 7th, 2017, for FPC Antwerp) until July 3rd, 2020 (n = 633). Most patients were men (n = 618), as no women were admitted to FPC Ghent and only 10% of the admission capacity in FPC Antwerp can be allocated to female patients. All 15 female patients were excluded from the study because of the small number. Additionally, only patients with at least two HKT-R assessments were eligible for participation to be able to record change over time. Therefore, 200 patients were excluded as only one or no HKT-R assessment was done. This concerned patients who had only recently been admitted, patients who stayed in hospital for a short period of time, and patients for whom an HKT-R was not considered necessary because they had not committed a violent offense. Furthermore, 77 patients were excluded because of invalid HKT-R assessments that could affect the reliability of the measurements. An HKT-R assessment was considered invalid if more than two missing values were detected. This resulted in a final sample of 341 male patients.
The data were collected from the electronic patient files. The HKT-R was prospectively scored by psychologists during treatment, as part of routine outcome monitoring procedures. All psychologists followed a certified HKT-R training. According to good practice, a patient’s risk of recidivism should be assessed annually or when a transition to a lower security level is warranted. For this study, the first three HKT-R assessments were investigated. The first examination of the HKT-R (Time 1) for each patient was considered if it occurred within the first 2 years after admission, with a mean of 8.0 months (SD = 5.8, range = 0–24) after admission. On average, the second assessment (Time 2) took place 13.5 months (SD = 4.2, range = 5–35) after the first assessment and the third assessment (Time 3) followed 11.8 months (SD = 3.6, range = 1–25) after the second assessment. Therefore, there was a period of approximately 12 months between each time point. As previously mentioned, the HKT-R assessment was only considered valid if there were no more than two missing values. The scale scores were prorated when one or two missing values were present. In total, there were 312 valid assessments at Time 1, 341 valid assessments at Time 2, and 176 valid assessments at Time 3.
Statistical Analyses
Preliminary analyses were performed using IBM SPSS Statistics (version 25). All dependent variables were normally distributed, with skewness between −2 and +2 and kurtosis between −3 and +3 (Gunzler et al., 2021). Multivariate outliers were investigated and resulted in the removal of 109 values (0.80%). Missing data were processed using Full Information Maximum Likelihood (FIML) estimation to make optimal use of the data (Stoel & Garre, 2011).
Next, latent growth curve (LGC) analyses were calculated in Mplus version 7 (Muthén & Muthén, 1998–2012) to model changes in risk factors during treatment, from Time 1 until Time 3. The latent growth curve model (LGCM) is a method used to analyze longitudinal data and is suitable for investigating trajectories and changes of risk and protective factors over time. The LGCM compares lines of change across a range of individuals and determines the line of change of the overall model. LGCMs can be used to trace linear or curvilinear trajectories. Since the method uses structural equation modeling, models are adjusted for measurement errors. The mean baseline HKT-R scores (i.e., mean of intercept) and mean rate of change in HKT-R scores per assessment (i.e., mean of slope) were examined. Furthermore, the individual differences at baseline HKT-R scores (i.e., variance of intercept) and interindividual different rates of change per HKT-R assessment or growth trajectories (i.e., variance of slope) were described (Muthén & Muthén, 1998–2012). A total of 31 separate unconditional models were tested, in which the following dependent variables were separately included: the 14 clinical factors, the seven future factors at conditional release, the seven future factors at unconditional release, the clinical scale (i.e., the sum of the 14 clinical factors), the supervised future scale (i.e., the sum of the seven future factors at conditional release), and the unsupervised future scale (i.e., the sum of the seven future factors at unconditional release). A Bonferroni correction was applied for multiple testing (p < .0016). Model fit was assessed by global fit indices provided by Mplus: the Comparative Fit Index (CFI; >0.95 for reasonable fit) and the Standardized Root Mean Square Residual (SRMR; <0.08 for reasonable fit) (Hu & Bentler, 1999).
Finally, the RCI was calculated for the HKT-R clinical scale and factors, as well as the (un)supervised future scale and factors. The RCI determines if the individual change on each measure is statistically reliable and not due to measurement error. These analyses were performed because each HKT-R assessment was scored by a single rater and no conclusions could be made about inter-rater reliability. It was therefore unknown whether the scores were influenced by measurement error. The method of Jacobsen and Truax (1991) was adopted to calculate the RCI, in which the score at baseline was subtracted from the score at follow-up, then divided by the standard error of the difference. The standard error of the difference was calculated by √2(sd1√1−rxx)², where sd1 is the standard deviation of baseline observations and rxx is the reliability coefficient. The one-way random, single measure intraclass-correlation coefficient (ICC) was applied, which was assessed during the development of the HKT-R (Bogaerts et al., 2018; Spreen et al., 2014). The inter-rater reliability for the clinical domain was .85 (.67–.94), and for the future domain .42 (.09–.67). If the RCI was greater than 1.96, the null hypothesis was rejected that the difference was 0 with 95% confidence. The RCIs were calculated within tree assessment intervals, that is, between the first and second assessment, between the second and third assessment, and between the first and third assessment.
Results
Descriptives
Table 1 presents the descriptive statistics of the six risk scales and the 28 individual risk factors in the clinical and future domain, at the three time points.
Group-Level Change
Table 2 presents the mean and variance of the intercept and slope of the clinical scale and the 14 individual clinical factors. The LGC-model regarding the clinical scale and clinical factors fitted the data well. Since the chi-square is sensitive to sample size, only the CFI and SRMR were relied upon. The CFI and SRMR values were acceptable to good, meaning that only a small portion of the residual correlation between the observed variables could not be explained by the estimated model. The clinical scale did not decline significantly over time. Regarding the clinical factors, only one item declined significantly (K01, problem insight; slope of −0.13). Individual differences in slopes were found for the clinical scale and for K04 (impulsivity), K06 (hostility), K08 (self-reliance), K10 (responsibility for the offense), K11 (coping skills), K12 (violation of terms and agreements), K13 (labor skills), and K14 (influence of risky network-members), which signified significant different growth trajectories between individuals during treatment.
Parameter Estimates and Model-Fit Indices of Latent Growth Curve Model for the HKT-R Clinical Scale and Factors.
Note. HKT-R = historical clinical future - revised; CFI = comparative fit index; SRMR = standardized root mean square residual.
p ≤ .0016.
The parameter estimates and model-fit indices of the LGC models of the supervised and unsupervised future scales, as well as the 14 supervised and unsupervised future factors, are presented in Table 3. All models had a good fit to the data. A significant decline over time was found regarding the supervised future scale (slope of −1.29) and the unsupervised future scale (slope of −1.07). Looking at the future factors, significant negative slopes were present in the supervised future domain for T01 (agreement on future conditions; slope of −0.21), T02 (accommodation; slope of −0.34), T03 (financial situation; slope of −0.24), T04 (employment; slope of −0.18), and T05 (daily activities; slope of −0.17). In the unsupervised future domain, there was also a significant decline in T01 (slope of −0.13), T02 (slope of −0.19), T03 (slope of −0.23), T04 (slope of −0.13), and T05 (slope of −0.14). However, individual variations in between-subject trajectories could be found in all 16 models, except for T01 and T02 in the unsupervised domain. In the two latter cases, all individuals had similar rates of change per assessment.
Parameter Estimates and Model-Fit Indices of Latent Growth Curve Model for the HKT-R Supervised and Unsupervised Future Scale and Factors.
Note. HKT-R = historical clinical future - revised; CFI = comparative fit index; SRMR = standardized root mean square residual.
p ≤ .0016.
Individual Level Change
In Table 4 the RCI is presented, which represents the percentage of patients that reliably decreased, reliably increased, or showed no reliable change during treatment on the clinical and future domain. For each time period, the majority of patients did not show any reliable change on the clinical scale and clinical items. Similar results were found for the supervised and unsupervised future scale and items. The threshold values for each item to be considered a reliable change, were included in Table 4.
Reliable Change Index for the HKT-R Clinical and Future Scales and Factors (in %).
Note. HKT-R = historical clinical future – revised.
The threshold values for each item to be considered a reliable change.
Discussion
The aim of this study was to empirically measure longitudinal change of the risk scales and risk factors of the HKT-R in 341 male patients after 2 years of treatment in the two FPCs in Flanders. Based on group-level analyses, the clinical scale did not change significantly, contrary to findings of previous literature that reported a significant small decline (Belfrage & Douglas, 2002; De Jonge et al., 2009; Hogan & Olver, 2016, 2019; Janković et al., 2021; Jeandarme et al., 2017; Mastromanno et al., 2018; Morrissey et al., 2014; O’Shea & Dickens, 2015; Olsson et al., 2013; Richter et al., 2018; Van der Linde et al., 2020). Furthermore, a decline was observed in only one clinical factor. There was a significant improvement of problem insight of the patients, similarly to several previous studies (De Jonge et al., 2009; O’Shea & Dickens, 2015; Olsson et al., 2013). However, no significant change was detected in most clinical factors (Longdon et al., 2018). These findings are similar to the results of the individual-level analyses, as most patients did not change reliably on the clinical scale and all clinical factors across all time periods.
Regarding the future domain, previous studies empirically assessing the evaluation of the future scale, reported varying results, from no significant changes (Belfrage & Douglas, 2002; O’Shea & Dickens, 2015) to significant increases (Mastromanno et al., 2018) and reductions (Blanchard, 2013; De Jonge et al., 2009; Hogan & Olver, 2016, 2019; Morrissey et al., 2014; Olsson et al., 2013) over time. Since the current study made use of the future factors of the HKT-R, which is related to the future factors of the HKT-30, the findings are similar to the study by De Jonge et al. (2009). Indeed, a significant decline was found in the supervised and unsupervised future scale. Five future factors improved significantly in both conditions ((un)supervised). However, the significant reductions were rather small. These findings were again in line with the fact that most patients did not change reliably on the future scales and future factors across all time periods.
Implications
As discussed, we found no significant change after 2 years of treatment on the clinical scale and clinical factors, with the exception of one significantly decreasing clinical factor in the group-level analyses. Thus, the following question arises: Does the treatment have any effect on risk factors in the clinical domain? Before trying to answer this question, more information on the treatment program is warranted. Treatment programs in the FPCs are based on cognitive behavioral therapy, which is one of the most empirically supported interventions in the forensic context (Bonta & Andrews, 2017). Different therapeutic modules are developed and offered to patients based on their needs or dynamic risk factors. For example, as the module aggression regulation therapy targets impulsivity, hostility, and antisocial behavior, it is offered to patients with a Cluster B personality disorder or patients with similar needs. The different modules consist of verbal group therapies as well as non-verbal group therapies such as drama therapy, music therapy, art therapy, and psychomotor therapy. Each module is evaluated every 6 months by the multidisciplinary team.
Should a patient follow the full offer of therapeutic modules that is assigned to them, in the correct intensity and duration, it can be estimated that the therapeutic offer should be sufficient to treat the risk factors and eventually lower the risk of recidivism. However, several factors might hinder such a positive evolution. First, the required intensity of therapy (i.e., an average of four to five modules each week) is structurally not always feasible, since there are many patients in an FPC but only a relatively small capacity of treatment services. In turn, this structurally leads to a larger offer of group therapy, while some patients could benefit from an individual therapy program in addition to their group therapy. Thus, the patients included in the current study may not yet be following all the modules designated for them. Third, the motivation of the forensic patient population should be considered. There is sometimes a lack of internal motivation, which impacts the therapy progress and can even cause a patient to drop out of the module, even though therapy is mandatory. Lastly, it is possible that the skills learned by patients in therapy are not exercised sufficiently in everyday life on the ward, as is theoretically expected. Structural and in-depth discussions between nursing staff and therapists are planned on a weekly basis. However, due to the high workload, these evaluations tend to focus more on practical affairs or problems, resulting in less of a focus on the content of the therapy sessions. Therefore, the nursing staff might not be informed in sufficient detail in how they can support patients in applying the skills learned during therapy in stressful circumstances or in daily life. Additional effort in informing psychiatric nursing staff about the content of the therapy modules and the relevant skills for each individual patient, seems important to significantly improve on these risk factors, such as impulsivity, hostility, and coping skills. In turn, this could have a positive impact on other risk factors, such as the violation of conditions and cooperation with treatment.
Furthermore, it should be noted that some factors that did not change significantly, such as psychotic symptoms and self-reliance, as well as hostility and impulsivity may be specific to psychotic patients. However, since not all patients in our study were diagnosed with psychotic disorders, it is possible that changes may become apparent when the analyses are performed only on patients with psychotic symptoms (Janković et al., 2021). The same conclusion can be drawn for patients who exhibit antisocial behavior. Therefore, future research should include psychopathology as predictor to empirically verify this potential explanation.
Additionally, the follow-up period of the current study seems rather short because previous research has indicated that the reduction of risk factors is a rather slow process (Hildebrand & de Ruiter, 2012). Therefore, a two-year period might be too short to make significant progress in diminishing the risk of recidivism in a high-security patient population. Since most of these high-risk patients are characterized by multiple problems, they need to successfully follow multiple therapy modules to treat the different risk factors. Furthermore, it often takes some time before patients commit to their therapeutic trajectory. Establishing trust between therapists and patients is an essential prerequisite of a secure therapeutic relationship and a necessity to achieve treatment progress. Trust between patients is also important in group therapy, since patients are expected to share personal information with each other and be open to feedback from fellow patients. However, establishing a sufficient level of trust is not an easy task, as many patients have developed cognitive distortions during their prior prison sentence. Therefore, it would be an added value if patients were to move more quickly from a prison to an FPC after the internment sentence. In December 2019, 537 of those found not guilty by reason of mental impairment in Belgium still resided in prisons (Venken et al. v. Belgium, 2021) due to a lack of admission beds in forensic psychiatric facilities. Research has shown that a prolonged stay in prison can cause certain symptoms (e.g., paranoia, inability to engage in relationships, hampered decision-making, spatial disorientation), that are summarized by the term post-incarceration syndrome. These symptoms relate to the development of an antisocial attitude, a decrease in sensorimotor activities and an increase in passivity in several areas. When patients are admitted to an FPC after a long prison term, it takes a lot of time to adapt to the new clinical environment, which demands self-reflection and treatment cooperation which is not evident (Liem & Kunst, 2013). Thus, the effects of long-term incarceration may have influenced the results, especially for patients who have not been transferred from prison that long.
Moreover, the HKT-R may not be specific enough to evaluate short-term changes and to evaluate the treatment progress in sufficient detail. This is in line with the earlier reported doubts about the sensitivity of the HCR-20 to detect change (Longdon et al., 2018; Morrissey et al., 2014; Nitschke et al., 2020; Olsson et al., 2013; Richter et al., 2018). The five-point ordinal rating scale at item level of the HKT-R might not be sensitive enough to detect changes in the first months or years of treatment. This issue was already raised by Schuringa et al. (2014; 2018), who proposed to use a larger rating scale, since they are more likely to detect important minimal change. However, no evidence is currently available to support this suggestion.
Lastly, the fact that most patients did not change reliably on risk factors during treatment based on the individual-level analyses, warrants some further discussion. After approximately 2 years of treatment (i.e., between the first and third assessment), a two point change on the 5-point scale was considered a reliable change for 10 items, a three point change for 17 items and a four point change for one item. Similar thresholds exist after approximately 1 year of treatment (i.e., between the first and second assessment and between the second and third assessment). Rather large levels of change are therefore necessary for the change to be considered reliable, partly because of the poor to moderate ICC of most of the clinical and future items. However, this amount of change might not be feasible for high-security patients after 1 or 2 years of treatment. In comparison, the study by Van der Linde et al. (2020) in Dutch forensic psychiatric patients reported an average decrease of 11.73 points on the clinical scale of the HKT-R between the time of judicial psychiatric assessment and unconditional release. The average treatment time was 5.9 years, while in the current study a decrease of 10 points should be achieved in 1 year to be considered a reliable change. Additionally, the study of Van der Linde et al. (2020) consisted of only patients who were successfully treated, leading to unconditional release, while our study included patients independent of their treatment progress.
Strengths and Limitations
This is the first study in Flanders to empirically assess the longitudinal change of dynamic risk factors during treatment in a high-security forensic setting. Additionally, all high-security/risk patients residing in the FPCs from the opening of the centers until July 3rd 2020 were considered. To receive more reliable results, only patients with valid HKT-R scores were actually included in the study. This led to the total sample of 341 patients, which is a relatively high number of participants and similar to prior studies concerning the evolution of risk factors during treatment. Another strength of the current study is that we did not solely assess the evolution in the different risk scales, but also focused on the different risk factors individually. Even though the risk scales are very valuable to evaluate the risk of recidivism as a whole, it does not inform treatment practitioners on which specific risk factors need more attention. However, this knowledge is essential to make the necessary adjustments in treatment guidelines and ultimately reduce the risk of recidivism.
Apart from the strengths, this study also has some limitations. First, since the HKT-R assessments were scored prospectively by one rater as part of risk assessment during treatment, no inter-rater reliability could be calculated. The findings can therefore be questioned since it is unknown whether the observed changes are the result of measurement error. Second, the variance of the slope was significant for most clinical and future factors, but these interindividual differences were not assessed any further in the current study. Future research should therefore include predictors to the LGC model to try and explain these interindividual differences. As mentioned earlier, psychopathology might play a role (Janković et al., 2021), but the risk at admission could also be relevant (van der Veeken et al., 2018). Also, the duration of the prior prison sentence differed between the participants (in years, M = 4.9, SD = 5.6, range = 0.02–30.6), but was not controlled for in the analyses. As mentioned previously, a prior prison sentence can negatively affect the mental well-being and therefore hinder treatment progress. Lastly, the change in risk factors was only measured based on three time points, covering a period of 2 years. Since the average treatment time of completed treatments in the FPC is 2.8 years (Jeandarme et al., 2020), for most patients, the complete duration of the treatment was not considered. Future studies could include more time-points to set out trajectories from admission until release into society or in an institution with a lower security level.
Conclusion
Since reducing risk of recidivism is the main goal of forensic treatment, the empirical assessment of longitudinal change of risk factors during treatment is an important part of treatment evaluation. Applying LGC analysis to a high-security forensic population in Flanders, significant decreases were found among the HKT-R supervised and unsupervised future scale after 2 years of treatment, as well as on five future factors individually (i.e., agreement on future conditions, accommodation, financial situation, employment, and daily activities). However, no significant decline was found in the clinical scale, as well as most of the clinical factors. Only difficulties regarding problem insight ameliorated significantly. Additionally, most patients did not change reliably on all clinical and future factors. Therefore, a period of 2 years seems insufficient to decrease the HKT-R risk factors significantly and reliably. In addition to addressing difficulties associated with the treatment process in forensic psychiatry, the sensitivity of the HKT-R to detect clinically relevant change should be challenged.
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
