Abstract
Despite the forensic relevance of psychopathy and the overrepresentation of Hispanics in the United States’ criminal justice system, these two issues remain underexplored, particularly with self-report measures of psychopathy. We investigated the criterion validity of three psychopathy measures among African Americans, Caucasians, and Hispanics in a sample of 1,742 offenders. More similarity than dissimilarity emerged across groups. The factor structures of psychopathy measures among Hispanic offenders were consistent with previous findings. Few significant differences emerged between Hispanic and Caucasian offenders, with most differences emerging between African Americans and the other ethnic groups. In such instances, the correlates of psychopathy were typically weaker for African Americans. The Psychopathy Checklist–Revised yielded fewer psychopathy × ethnicity interactions than the Psychopathic Personality Inventory and Levenson Primary and Secondary Psychopathy Scales. Overall, these psychopathy measures showed reasonable validity across these cultural groups.
Keywords
Psychopathy is a personality disorder comprising a broad array of behavioral, emotional, and interpersonal features. It is characterized by glibness, narcissism, deceitfulness, lack of empathy, poor impulse control, and disruptive behavior (Cleckley, 1941; Hare, 1991; Hart & Dempster, 1997). Psychopathy is associated with various negative outcomes, including antisocial conduct (e.g., Leistico, Salekin, DeCoster, & Rogers, 2008; Yang, Wong, & Coid, 2010), criminal recidivism (e.g., Douglas, Nikolova, Kelley, & Edens, 2015), and sexual recidivism (e.g., Hawes, Boccaccini, & Murrie, 2013).
Despite the consensus that psychopathy is an important forensic construct, initial research on the assessment of psychopathy focused largely on Caucasian men. For instance, the most widely used instrument for measuring psychopathy, the Psychopathy Checklist–Revised (PCL-R; Hare, 1991, 2003), was originally developed with, and validated on, predominantly Caucasian samples. In response, more emphasis has been given to potential ethnic differences in the measurement of psychopathy between African Americans and Caucasians (Cooke, Kosson, & Michie, 2001; Neumann & Hare, 2008; Skeem, Edens, Camp, & Colwell, 2004). Nevertheless, questions remain concerning the validity and applicability of psychopathy measures for certain ethnic groups, including Hispanic 1 individuals (e.g., Douglas et al., 2015; Sullivan, Abramowitz, Lopez, & Kosson, 2006).
In applied contexts, the label “psychopath” can influence important forensic decisions (Edens, Magyar, & Cox, 2013). Given psychopathy’s pejorative connotation and its role in legal decision making, it is important to discover if ethnic differences exist across multiple methods of assessment. To address this issue, we compare the reliability and mean scores of three measures of psychopathy among Hispanic, Caucasian, and African American offenders as well as the measures’ associations with commonly evaluated criteria (Hart & Storey, 2013; Patrick, 2006). In addition, this study gives particular accord to Hispanic offenders by examining the factor structure of psychopathy measures within this ethnic group.
Research Comparing Psychopathy Measures in African American and Caucasian Individuals
Factor analysis of the PCL-R originally yielded two correlated factors: Factor 1 (F1) or the Interpersonal/Affective factor and Factor 2 (F2) or the Social Deviance factor (Hare, 1991). Hare (2003) later proposed a four-factor model that split the original two broad factors into four facets: Interpersonal, Affective, Lifestyle, and Antisocial. Cooke and Michie (2001) had previously reported a competing three-factor model that was virtually identical except that it did not include the Antisocial facet. At the manifest level of the PCL, incongruent factor structure was reported for the two-factor model between African Americans and Caucasians (Kosson, Smith, & Newman, 1990). But do PCL-R score differences exist between African Americans and Caucasians? In a meta-analysis, Skeem et al. (2004) found only a 1-point difference in PCL-R total scores between African American and Caucasian individuals—a relatively trivial difference given that the standard error of measurement of the PCL-R is approximately 3 points (if not appreciably higher, at least in applied adversarial settings; see Edens, Cox, Smith, DeMatteo, & Sörman, 2015). Similarly, a meta-analytic review reported only negligible mean differences between African American and Caucasian adolescents on PCL measures (McCoy & Edens, 2006).
Although scale score differences are important, it is also important to establish measurement invariance to confirm if an instrument is assessing the same construct across subpopulations (e.g., ethnic groups) without bias (Millsap, 2007). A confirmatory factor analysis (CFA) revealed that the three-factor structure of the PCL-R was invariant between African American and Caucasian samples (Cooke et al., 2001). Additionally, item response theory analyses revealed that although certain items of the PCL-R demonstrated ethnic bias, item-level differences “cancelled out,” indicating that the measure as a whole did not demonstrate differences in test functioning between African American and European American samples (Cooke et al., 2001). The PCL: Screening Version (PCL: SV; Hart, Cox, & Hare, 1995) also evidenced structural invariance between African American and Caucasian adults (Neumann & Hare, 2008; Skeem, Mulvey, & Grisso, 2003; Vitacco, Neumann, & Jackson, 2005). For African American, Caucasian, and Hispanic adolescent offenders, the PCL: Youth Version (PCL: YV; Forth, Kosson, & Hare, 2003) was invariant across ethnicity (Jones, Cauffman, Miller, & Mulvey, 2006) but, in the same sample, differential item functioning emerged on 15 of 20 items (Tsang, Piquero, & Cauffman, 2014).
Similar findings have emerged in other ethnic groups. For instance, a CFA of Aboriginal and non-Aboriginal offenders revealed structural invariance of the PCL-R (Olver, Neumann, Wong, & Hare, 2013). The PCL-R has also generally evidenced structural invariance across nationalities (e.g., Cooke, Michie, Hart, & Clark, 2005; Mokros et al., 2011; cf. Mokros et al., 2014). Although the Interpersonal Measure of Psychopathy (Kosson, Steuerwald, Forth, & Kirkhart, 1997)—an observational measure of interpersonal interactions designed to supplement the PCL-R—displayed structural invariance between African American and Caucasian inmates (Vitacco & Kosson, 2010), research on measurement invariance across ethnicities using other psychopathy measures is scant.
In addition to testing measurement invariance, the criterion-related validity of psychopathy measures has been investigated among ethnicities. Certain findings have suggested that, across ethnicity, the PCL-R did not vary in its associations with external criteria. Vachon, Lynam, Loeber, and Stouthamer-Loeber (2012) found only 3 (out of a possible 125) ethnicity-based moderations in the criterion-related validity of the PCL-R. The number of moderations was less than the expected family-wise error rate, and the effect sizes were small in magnitude. Others have found that psychopathy is a significant predictor of violence in African American offenders (Walsh & Kosson, 2007), and that the PCL-R three- and four-factor models were not moderated by ethnicity in predicting aggressive behavior among civil psychiatric patients (Vitacco et al., 2005).
However, in other studies, the relationship between psychopathy and external correlates has varied by ethnicity. Meta-analytic reviews have reported that PCL measures display stronger effect sizes with antisocial outcomes (e.g., recidivism) when samples had greater ethnic homogeneity (i.e., Caucasian samples) for both adults (Leistico et al., 2008) and youth (Edens, Campbell, & Weir, 2007). Ethnicity has also moderated the relationship between psychopathy and impulsivity (Jackson, Neumann, & Vitacco, 2007; Kosson et al., 1990). In comparison with Caucasians, African Americans displayed weaker associations between impulsivity and both F1 and F2 scores. Moreover, African American inmates demonstrated differing patterns of emotional and informational processing in comparison with Caucasians such that reaction times on lexical decision making did not differentiate a low-anxiety psychopathy group from the comparison, nonpsychopathy group (Lorenz & Newman, 2002).
Most of the extant research has focused on the PCL-R and its derivatives (e.g., PCL: SV [Hart et al., 1995] and PCL: YV [Forth et al., 2003]). Yet the PCL-R is only one measure of psychopathy (see Skeem & Cooke, 2010), and other conceptualizations of the disorder exist. For instance, the Psychopathic Personality Inventory (PPI; Lilienfeld & Andrews, 1996), the most widely researched self-report measure of psychopathy, was developed to assess psychopathy with less emphasis on criminality or antisocial behavior. Still, investigations of ethnic differences among non-PCL-R conceptualizations and measures of psychopathy have been limited and mixed. Only 4 out of 672 potential racial and gender differences were found in an investigation of several self-report measures of psychopathy and external criteria, including normal-range personality traits and personality disorder symptoms (Benning, Patrick, Salekin, & Leistico, 2005). Similarly, no ethnic differences were reported between the self-report Levenson Primary and Secondary Psychopathy Scales (LPSP; Levenson, Kiehl, & Fitzpatrick, 1995) and passive avoidance learning in a sample of noninstitutionalized offenders (Epstein, Poythress, & Brandon, 2006).
Nevertheless, others have reported that the nomological nets of self-report psychopathy measures have differed between African Americans and Caucasians. For instance, in an offender sample, the LPSP was less strongly related to indices of antisociality, nonviolent criminality, and passive avoidance for African American inmates compared with Caucasians (Brinkley, Schmitt, Smith, & Newman, 2001). Similar ethnic variability was reported for the PPI (Lilienfeld & Andrews, 1996). Contrary to findings with the PCL-R (i.e., smaller relations with aggression for non-Caucasians), PPI total scores were predictive of proximate violence for African American inmates but not for Caucasians (Camp, Skeem, Barchard, Lilienfeld, & Poythress, 2013).
In sum, similar to the latent structure of psychopathy measures, there is generally evidence for consistency in the pattern of correlates across ethnic groups. However, there are scattered reports of ethnic differences that vary in effect size. Although scholars should be commended for examining the generalizability of psychopathy in African Americans—a group whose overrepresentation in prisons has been well documented (National Association for the Advancement of Colored People, 2014)—other ethnic minorities have been largely overlooked in the assessment of psychopathy. Hispanics are of particular forensic relevance in the United States, as they comprise 16% of the prison population (U.S. Census Bureau, 2012). Approximately 1 in 100 U.S. citizens are in prison, but roughly 1 in 36 Hispanic American men are in prison (Pew, 2008).
Measures of Psychopathy in Hispanic Samples
Initial findings concerning the assessment of psychopathy in Hispanic samples have varied. In a study of Hispanic adolescent offenders, Roth (2006) reported that the PCL: YV (Forth et al., 2003) was positively correlated with the Structured Assessment of Violence Risk in Youth (Borum, Bartel, & Forth, 2006), and the PCL: YV displayed good concurrent validity. In contrast, Walsh (2013) found that for Latino American adult inmates, PCL-R scores were not predictive of future violence save for a small positive effect for the fourth (antisocial) facet.
There have also been several investigations of the psychometric properties of the PCL-R in Hispanic samples. Tubb (2002) found that the three-factor model proposed by Cooke and Michie (2001) afforded a better fit in a Hispanic offender sample than the two-factor model. Sullivan et al. (2006) found broad similarity in the reliability and construct validity of the PCL-R among European Americans, African American, and Latino inmates. Several interactions were reported between psychopathy and ethnicity in predicting outcomes. For example, the affective facet was positively related to the Interpersonal Measure of Psychopathy among European Americans, whereas a negative association was present among Latinos. A similar interaction was reported between the affective facet and socioeconomic status. However, the number of significant interactions was less than the expected family-wise error rate (i.e., 4 significant interactions out of 360).
Even less is known regarding the validity of self-report measures of psychopathy among Hispanic adults. In a youth community sample, the LPSP was invariant between Hispanic and African American youth on the basis of multigroup CFA (Horan, Brown, Jones, & Aber, 2015). Nevertheless, there is a need to examine similar ethnic differences in at-risk adult populations for both the LPSP and the PPI. In the revised PPI (PPI-R; Lilienfeld & Widows, 2005), an analysis of normative data did not reveal significant mean ethnic differences (i.e., Caucasian vs. other races). Yet, in a meta-analytic review, small moderating effects were reported for both PPI higher order factors (i.e., Fearless Dominance [FD] and Self-Centered Impulsivity) and the percentage of Caucasians in a sample (Miller & Lynam, 2012). Specifically, the relationship between the PPI factors and internalizing behaviors was stronger in studies with fewer Caucasian participants. Furthermore, small ethnic differences have been reported between primary and secondary psychopathy subtypes—derived by the PPI—whereby non-Caucasians were more likely to be classified as falling within the secondary subtype, which is associated with weak impulse control and antisocial behavior (Lee & Salekin, 2010).
Present Study
The purpose of the present study was to investigate potential ethnic differences in measures of psychopathy among Hispanic, African American, and Caucasian offenders. Our research questions were fourfold: (a) Does the internal consistency of psychopathy measures differ among ethnic groups? (b) Do mean differences exist among ethnic groups for measures of psychopathy? (c) Is the factor structure of psychopathy measures’ replicable in Hispanic offenders? and (d) Does the construct validity of psychopathy measures differ among ethnic groups? We used multiple methods of assessment to clarify potential ethnic differences. Despite a growing body of work focusing on differences in the assessment of psychopathy between particular ethnicities (e.g., Caucasian vs. African Americans), few studies have addressed differences in measures of psychopathy, especially self-report measures, among Hispanic populations.
Method
Participants
The sample consisted of 1,742 offenders who were participating in court-ordered residential community drug treatment programs or serving an institutional prison sentence in Nevada, Oregon, Florida, Texas, or Utah. Inclusion criteria included (a) English speaking, (b) not receiving psychotropic medication or residing on the mental health unit of a prison, and (c) having an IQ of at least 70 based on an intelligence screening measure (Quick Test; Ammons & Ammons, 1962). Participants in community drug treatment programs must have completed detoxification plans before enrollment in the study. The sample consisted predominantly of men (83%), and ages ranged from 17 to 59 years (M = 30.51, SD = 6.63) at the time of recruitment, although an effort was made to recruit participants aged 19 to 40 years.
Participants reported whether they identified as Caucasian or African American and then whether they identified as Hispanic or non-Hispanic. The sample consisted of 591 African Americans, of whom 27 identified as Hispanic, and 1,068 Caucasians, of whom 119 identified as Hispanic. Furthermore, 19 offenders reported being Hispanic, but had missing data for whether they identified as African American or Caucasian. Given the small sample sizes for the Hispanic Caucasians and Hispanic African Americans, we combined these individuals into an overall Hispanic group. Thus, the finalized ethnic composition was 10% Hispanic (n = 165), 32% non-Hispanic African American (n = 564), and 55% non-Hispanic Caucasian (n = 949), and 4% (n = 65) missing data. Participants were approximately evenly distributed between community drug programs (47.8%) and prisons (52%). Sample characteristics are described in more detail elsewhere (Poythress et al., 2010).
Measures
Measures of Psychopathy
PCL-R (Hare, 1991, 2003)
The PCL-R is a 20-item measure used to assess psychopathy based on an interview and file review. We used the four-factor model to examine mean-level differences in psychopathy features across groups and their relations with external criteria. Descriptive statistics and internal consistency (Cronbach’s alpha) for total and facet scores were as follows: PCL Total scores, M = 22.55, SD = 7.50, α = .82; Facet 1, M = 3.79, SD = 2.27, α = .70; Facet 2, M = 4.36, SD = 2.37, α = .77; Facet 3, M = 6.52, SD = 2.14, α = .60; Facet 4, M = 5.84, SD = 2.54, α = .63. The interrater reliability was high, with an intraclass correlation coefficient of .88. Across the sample, 15% had a score of 30 or higher, which has traditionally been considered a cutoff for diagnosing psychopathy.
PPI (Lilienfeld & Andrews, 1996)
The PPI is a self-report measure of psychopathy with 187 items that are answered on a 4-point Likert-type scale. The PPI contains eight content scales (subscales), seven of which have often been found to load onto two factors (Benning, Patrick, Hicks, Blonigen, & Krueger, 2003): FD comprising the Social Potency, Stress Immunity, and Fearlessness subscales and Impulsive Antisociality (IA; now commonly referred to as Self-Centered Impulsivity in relation to the PPI-R [Lilienfeld & Widows, 2005]) comprising the Impulsive Nonconformity, Blame Externalization, Machiavellian Egocentricity, and Carefree Nonplanfulness subscales (but see Neumann, Malterer, & Newman, 2008, for an alternative factor structure). The eighth scale, Coldheartedness, did not load onto either factor, and it is often considered a third dimension of psychopathy on its own, although it has been understudied (see Berg, Hecht, Latzman, & Lilienfeld, 2015; Marcus, Fulton, & Edens, 2013). Descriptive statistics and internal consistency for total and factor scores were as follows: PPI Total scores, M = 386.28, SD = 41.32, α = .91; FD scores, M = 144.68, SD = 20.13, α = .88; IA scores, M = 190.72, SD = 31.60, α = .93; Coldheartedness scores, M = 44.15, SD = 8.38, α = .78.
LPSP (Levenson et al.,1995)
The LPSP is a self-report measure of psychopathy consisting of 26 items that are scored on a 4-point Likert-type scale. The LPSP was intended to capture two broad forms of psychopathy: primary and secondary psychopathy. The Primary and Secondary scales were intended to be indices of PCL-R F1 and F2, respectively, although there is evidence to suggest that Primary scale may be a stronger marker of antisocial propensities (e.g., Lilienfeld & Fowler, 2006; Poythress et al., 2010). Descriptive statistics and internal consistency for the factor scores in this sample were as follows: LPSP Total score, M = 55.90, SD = 11.66, α = .85; Primary scale, M = 32.91, SD = 8.14, α = .76; Secondary scale, M = 22.99, SD = 5.27, α = .75.
Measures of External Correlates
Personality Assessment Inventory (PAI; Morey, 1991)
The PAI is a widely used multiscale self-report measure that focuses on distress-related psychopathology, personality disorder features, and treatment needs. Participants respond to 344 declarative statements on a 4-point scale by answering the degree to which each statement is true. The PAI includes 22 full, nonoverlapping scales, including 11 clinical scales, 5 treatment scales, and 2 interpersonal scales. In the present study, the Antisocial Features (ANT), Aggression (AGG), Borderline Features (BOR), and Drug Problems (DRG) subscales were selected for analyses because of their theoretical and empirical relevance to psychopathy (Edens, Hart, Johnson, Johnson, & Olver, 2000; Edens, Poythress, & Watkins, 2001; Miller et al., 2010; Taylor & Lang, 2006). The range of internal consistencies for the clinical scales used in this study was α = .79-.94.
Harm Avoidance (HA)
The HA scale from the Multidimensional Personality Questionnaire (Tellegen, 1982) consists of 28 true/false items. HA is negatively associated with psychopathy (see Lilienfeld & Andrews, 1996), and is considered a conceptual inverse of Lykken’s (1995) fearlessness construct, which is regarded by many scholars as central to primary psychopathy. For this sample, the internal consistency was high (α = .86).
Impulsivity
The Barratt Impulsiveness Scale (BIS Version 11; Barratt, 1994) is a 30 item self-report measure of impulsivity that has been defined as a “predisposition toward rapid, unplanned reactions to internal or external stimuli without regard to the negative consequences of these reactions to the impulsive individual or to others” (Moeller, Barratt, Dougherty, Schmitz, & Swann, 2001, p. 1784). Impulsivity is considered by many to be a central component of psychopathy (Cleckley, 1976; Hare, 2003), although it is more strongly associated with the behavioral than with the affective and interpersonal features of psychopathy (Edens & McDermott, 2010; Ruiz, Skeem, Poythress, Douglas, & Lilienfeld, 2010). For this sample, the internal consistency for the BIS was high (α = .86).
Criminal Recidivism
Offenders in community drug programs or inmates who were released from prison (n = 1,154) were monitored for recidivism over a 1-year follow-up period. Recidivism was coded on the basis of criminal record review from the National Crime Information Center. Two dichotomous recidivism variables were coded: the presence of any arrest and the presence of any violent arrest. A violent arrest was defined as any arrest for murder, manslaughter, robbery, assault, and/or sexual assault. Although 41% of offenders were rearrested for some type of criminal activity over the 1-year follow-up period, few were rearrested for a violent offense (4%).
Measures of Antisocial Personality Disorder (ASPD)
Psychopathy and ASPD are moderately correlated (Lilienfeld, 1994), stemming largely from the shared impulsive and antisocial behavioral components of psychopathy (Hare, Neumann, & Widiger, 2012). In the present study, ASPD was measured using the Structured Clinical Interview for the DSM-IV Axis II Personality Disorders (SCID-II; First, Gibbon, Spitzer, Williams, & Benjamin, 1996). The internal consistency for the SCID ASPD was α = .83.
Behavioral Inhibition System (BIS) and Behavioral Activation System (BAS; Carver & White, 1994)
The BIS and BAS scales consist of 20 total items in a 4-point Likert-type scale format. Gray’s (1987) reinforcement sensitivity theory has been used to explain differences between primary psychopathy, which is ostensibly associated with weak BIS reactivity (Fowles, 1980), and secondary psychopathy, which is ostensibly associated with strong BAS reactivity (Lykken, 1995). The internal consistency for the BIS scale was adequate (α = .70) as were the internal consistencies of the BAS Drive (α = .85) and BAS Fun scales (α = .78).
Child Abuse and Trauma Scale (CATS; Sanders & Giolas, 1991)
The CATS is a self-report measure focusing on negative experiences in childhood and adolescence such as physical, sexual, and psychological abuse. It consists of 38 items that are answered on a 5-point scale. Porter (1996) theorized that childhood abuse and trauma is an etiological mechanism for secondary psychopathy (but see Poythress, Skeem, & Lilienfeld, 2006). In this sample, the internal consistency was excellent (α = .95).
Procedure
The current study derived from a larger multisite research project focusing on psychopathic personality (e.g., Poythress et al., 2010). Research assistants were advanced graduate students in clinical psychology or social work. After receiving formal training by published experts on study measures, they scored 10 training tapes for PCL-R and ASPD diagnoses and required an interrater reliability above .80 before starting data collection. Every 6 months, the project manager observed two interviews conducted by research assistants to examine interrater reliability for these rating scales.
Participants were sampled randomly from an institutional list of offenders who met basic eligibility criteria. Enrollment procedures were completed in private rooms, and on informed consent, IQ screening tests were administered. In a fixed order, self-report measures were completed on a laptop except for the PAI, which was completed in paper-and-pencil form. Participants without a 10th-grade education or who could not read the first few items of the PAI were assessed for reading ability using the Basic Reading Inventory (Johns, 1997). Items were read out loud if participants did not achieve a 9th-grade level on this measure. The research protocol lasted approximately 4.5 hours, and was typically completed over two sessions. Participants received $20 for their participation aside from one institutional site that did not allow financial reimbursement.
Data Analytic Strategy
Cronbach’s alpha and mean interitem correlation (MIC) values were calculated to examine potential ethnic differences in the reliability of the psychopathy measures. Potential differences in mean levels of psychopathy across ethnic groups were calculated using Tukey’s Honestly Significant Difference procedure. To investigate the criterion validity of psychopathy across ethnic groups, bivariate analyses were completed by computing zero-order Pearson product–moment correlations for each ethnic group (i.e., Hispanic, Caucasian, and African American) between total and subscale scores of the PCL-R, PPI, LPSP, and external correlates. Given the large number of analyses, differences were considered statistically significant using a conservative alpha level of .001.
Next, CFAs were conducted for the PCL-R, PPI, and LPSP within the Hispanic subsample to examine if models could be replicated in this ethnic group. CFAs were computed in “lavaan,” a structural equation modelling package for R (Rosseel, 2012), using robust weighted least squares estimation given the ordinal nature of the PCL-R and LPSP indicators. Maximum likelihood estimation was used for the PPI three-factor model given the continuous nature of its subscale indicators. As suggested by Hu and Bentler (1999), in addition to the χ2 goodness-of-fit statistic, model fit was evaluated with the comparative fit index (CFI) and root-mean-square error of approximation (RMSEA). Given the sensitivity of χ2 values to sample size and model correlations, normed chi-square (NC; i.e., χ2/df) values below 2.00 were used to represent acceptable model fit (Kline, 2005). Other commonly used thresholds of acceptable fit include CFI >.90 and RMSEA <.08, although some argue these rules of thumb are too strict (Marsh, Hau, & Wen, 2004). Using statistical code to calculate power for RMSEA in R (Preacher & Coffman, 2006), power was greater than .98 for both the LPSP and PCL-R models, but only .42 for the PPI three-factor model; hence, the results for the lattermost measure should be interpreted with some caution.
Hierarchical regression was used to analyze the potential moderating effect of ethnicity on the relation between measures of psychopathy and external correlates. Dummy-coded ethnicity variables were created to examine differences between each unique ethnicity pairing. Because the ethnic groups differed by IQ screening score, F(2, 1653) = 104.31, η2 = .11, p < .001, gender, χ2 = 7.15, df = 2, φ = .07, p = .028, and the proportion of participants in community drug treatment programs, χ2 = 99.11, df = 2, φ = .24, p < .001, these variables were entered as covariates. After mean centering, the psychopathy variable and dummy coded variables were entered into the second step of the regression models. Next, psychopathy × ethnicity interaction terms were entered in the third step. Moderation analyses were conducted for each total and subscale score for each theoretically relevant criterion variable. Interactions are challenging to detect because they can be attenuated by the multiplicative of the main effects’ error terms (Cohen, Cohen, West, & Aiken, 2003). Therefore, the alpha level was kept at .05; we focused on the consistency in patterns of interactional differences in interpreting these results.
Results
Reliability Analyses
Table 1 presents the Cronbach’s alpha and MIC values for the PCL-R, PPI, and LPSP among Hispanic, African American, and Hispanic offenders. For the PCL-R and LPSP scales, no internal consistency differences emerged. However, Cronbach’s alpha was somewhat higher for Caucasians compared with African Americans for the PPI total (χ2 = 52.38, φ = .26, p < .001), FD (χ2 = 38.74, φ = .23, p < .001), and IA (χ2 = 29.14, φ = .19, p < .001) scales. No significant differences emerged in the internal consistency of the PPI Coldheartedness scale, nor were significant differences observed between Hispanic offenders and the other ethnic groups.
Internal Consistency of Psychopathy Measures Between Ethnic Groups.
Note. α = Cronbach’s alpha; PPI = Psychopathic Personality Inventory; PCL-R = Psychopathy Checklist–Revised; MIC = mean interitem correlation; FD = PPI Fearless Dominance; IA = PPI Impulsive Antisociality; Cold = PPI Coldheartedness.
Differences in Psychopathy Scores
Analysis of variance revealed ethnic differences for PCL-R Total, F(2, 1542) = 22.26, p < .001, Facet 1, F(2, 1542) = 20.52, p < .001, Facet 2, F(2, 1542) = 33.40, p < .001, and Facet 4 scores, F(2, 1531) = 15.89, p < .001 (see Table 2 for descriptive statistics). Post hoc analyses revealed higher mean scores for African Americans than Caucasians for these PCL-R scales (d = .36-.37). No significant ethnic differences emerged for the PPI and LPSP scales, nor were significant differences observed between Hispanic offenders and the other ethnic groups.
Descriptive Statistics of Measures of Psychopathy Across Ethnic Groups.
Note. d1 = Cohen’s d between Hispanic and Caucasian groups; d2 = Cohen’s d between Hispanic and African American groups; d3 = Cohen’s d between Caucasian and African American groups; PCL-R Tot = PCL-R Total Score; Facet 1 = PCL-R Facet 1 Score; Facet 2 = PCL-R Facet 2 Score; Facet 3 = PCL-R Facet 3 Score; Facet 4 = PCL-R Facet 4 Score; PPI Tot = PPI Total Score; PPI FD = PPI Fearless Dominance Score; PPI IA = PPI Impulsive Antisociality Score; PPI Cold = PPI Coldheartedness Score; LPSP Tot = LPSP Total Score; LPSP Pri = LPSP Primary Scale Score; LPSP Sec = LPSP Secondary Score.
p < .001.
Confirmatory Factor Analysis
For the PCL-R, when specifying a four-factor model (Hare, 2003), the results indicated inadequate fit, χ2(129) = 193.21, p < .001, CFI = .87, RMSEA = .06, 90% confidence interval (CI) [.04, .08]. Correlations between factors and most of their loadings were significant at a level of p ≤ .001, except for PCL-R Items 15 (p = .007) and 19 (p = .090). Yet for the PCL-R three-factor, hierarchical model with parcels (Cooke & Michie, 2001), the results indicated good fit, χ2(56) = 94.72, p = .001, CFI = .91, RMSEA = .07, 90% CI [.04, .09]. All correlations between the three factors and their respective loadings were significant at p < .001. For the LPSP, the two-factor model (Levenson et al., 1995) had inadequate fit, χ2(298) = 410.06, p < .001, CFI = .87, RMSEA = .05, 90% CI [.04, .06]; but the factor loading of Item 11 was nonsignificant and small (z = 1.22, p = .222). By removing LPSP Item 11 and 6 other items that have been previously removed (see Brinkley, Diamond, Magaletta, & Heigel, 2008), the respecified model demonstrated good fit, χ2(151) = 208.70, p = .001, CFI = .92, RMSEA = .05, 90% CI [.03, .06]. All correlations between LPSP factors and their respective loadings were significant at a level of p < .001. In contrast, the PPI three-factor model (Benning et al., 2005) had inadequate fit (CFI = .47, RMSEA = .33, 90% CI [.28, .38]).
Criterion Validity Across Ethnicity
Bivariate Analyses
For each ethnic group, the relationship between the PCL-R, PPI, LPSP, and external criteria was generally consistent with the existing literature (see Tables 3, 4, 5, respectively). Across ethnic groups, the PCL-R shared a modest association with LPSP total scores (r = .22-.33) and a moderate to high association with PPI total scores (r = .40-.46), whereas the PPI and LPSP total scores were highly correlated (r = .66-.70). Of note, PPI FD was negatively associated with PAI DRG for African Americans, but not for Hispanics or Caucasians. The associations across measures of psychopathy and indices of antisociality were generally consistent across groups. Last, the relationship between psychopathy scores and other external criteria were typically consistent among ethnic groups. For Hispanic offenders, but not other offenders, the Levenson Secondary scale was positively associated with the presence of a future arrest. In contrast, PCL-R total and facet scores typically shared small negative associations with general and violent arrests among Hispanics.
Relations Between the PCL-R Total, Facet, and External Criteria Scores.
Note. PPI = Psychopathic Personality Inventory; PCL-R = Psychopathy Checklist–Revised; H = Hispanic; C = Caucasian; AA = African American; Factor 1 = PCL-R Factor 1; Factor 2 = PCL-R Factor 2; FD = PPI Fearless Dominance; IA = PPI Impulsive Antisociality; Cold = PPI Coldheartedness; LPSP Total = Levenson Psychopathy Total Score; LPP = Levenson Psychopathy Primary Scale; LPS = Levenson Psychopathy Secondary Scale; SCID = SCID APD Total Symptom Count; PAI = Personality Assessment Inventory; AGG = PAI Aggression; ANT = Antisocial Features; BOR = PAI Borderline Features; DRG = PAI Drug Problems; BIS 11 = BIS-11 Total; BIS = BIS Scale; Drive = BAS Drive Scale; Fun = BAS Fun; CATS = CATS Total; HA = Harm Avoidance; Arrest = Any Arrest; Vio Arrest = Any Violent Arrest.
Presence of an interaction between African Americans and Caucasians. bPresence of an interaction between Hispanics and Caucasians. cPresence of an interaction between African Americans and Hispanics.
Relations Between the PPI Total, Higher Order Factors, and External Criteria Scores.
Note. PPI = Psychopathic Personality Inventory; PCL-R = Psychopathy Checklist–Revised; H = Hispanic; C = Caucasian; AA = African American; FD = PPI Fearless Dominance; IA = PPI Impulsive Antisociality; Coldheartedness = PPI Coldheartedness; Factor 1 = PCL-R Factor 1; Factor 2 = PCL-R Factor 2; LPSP Total = Levenson Psychopathy Total Score; LPP = Levenson Psychopathy Primary Scale; LPS = Levenson Psychopathy Secondary Scale; SCID = SCID APD Total Symptom Count; PAI = Personality Assessment Inventory; AGG = PAI Aggression; ANT = Antisocial Features; BOR = PAI Borderline Features; DRG = PAI Drug Problems; BIS 11 = BIS-11 Total; BIS = BIS Scale; Drive = BAS Drive Scale; Fun = BAS Fun; CATS = CATS Total; HA = Harm Avoidance; Arrest = Any Arrest; Violent Arrest = Any Violent Arrest.
Presence of an interaction between African Americans and Caucasians. bPresence of an interaction between Hispanics and Caucasians. cPresence of an interaction between African Americans and Hispanics.
Relations Between the Levenson Self-Report Psychopathy Scales and External Criteria.
Note. PPI = Psychopathic Personality Inventory; PCL-R = Psychopathy Checklist–Revised; H = Hispanic; C = Caucasian; AA = African American; LPSP Total = Levenson Psychopathy Total Score; LPP = Levenson Psychopathy Primary Scale; LPS = Levenson Psychopathy Secondary Scale; Factor 1 = PCL-R Factor 1; Factor 2 = PCL-R Factor 2; FD = PPI Fearless Dominance; IA = PPI Impulsive Antisociality; Coldheartedness = PPI Coldheartedness; SCID = SCID APD Total Symptom Count; PAI = Personality Assessment Inventory; AGG = PAI Aggression; ANT = Antisocial Features; BOR = PAI Borderline Features; DRG = PAI Drug Problems; BIS 11 = BIS-11 Total; BIS = BIS Scale; Drive = BAS Drive Scale; Fun = BAS Fun; CATS = CATS Total; HA = Harm Avoidance; Arrest = Any Arrest; Violent Arrest = Any Violent Arrest.
Presence of an interaction between African Americans and Caucasians. bPresence of an interaction between Hispanics and Caucasians. cPresence of an interaction between African Americans and Hispanics.
Moderation Analyses
To investigate potential ethnic differences in the relationship between psychopathy and external criteria, 828 moderating effects were tested across the three ethnic groups, 12 psychopathy scales, and 23 external criteria (see the online Supplemental material for the standardized beta weights available at http://asm.sagepub.com/content/by/supplemental-data). The expected number of Type I errors (828 × 0.05) was 41.40, but a total of 98 significant psychopathy × ethnicity interactions occurred—2.37 times or 56 more interactions than expected by chance. 2 The most common interaction was between Caucasian and African American offenders (54%), then Hispanic and African Americans (25%), and the least frequently occurring interaction was between Caucasians and Hispanics (21%). The significant psychopathy × ethnicity interaction terms contributed very small incremental effects to the extant models (mean ΔR2 = .005), and the significant unique contributions of these interactions were also small in magnitude (mean β = .09). 3
PCL-R
Despite having the most scales of the psychopathy measures, the PCL-R evidenced only 23 psychopathy × ethnicity interactions—fewer than the PPI and LPSP. The number of significant moderation analyses expected by chance is 17.25. As seen in Table 3, a large proportion (i.e., 17) of these psychopathy × ethnicity interactions were observed between Caucasians and African Americans in which the relationships between PCL scores and external criteria were weaker for African Americans than Caucasians. For instance, for both CATS total and PAI Aggression scores, the relationships with Facet 1 scores were of stronger magnitude for Caucasians. Similarly, in most interactions between Hispanic and African Americans, the relationships between PCL scores and external criteria were generally weaker for African Americans. However, Facet 4 scores were negatively predictive of violent arrests among Hispanics, but positively predictive for African Americans. Only two psychopathy × ethnicity interactions were observed between Hispanics and Caucasians, with stronger relations for Caucasians.
PPI
Across the PPI Total, FD, IA, and Coldheartedness scales, 13.80 interactions were expected by chance. Nevertheless, a total of 40 significant psychopathy × ethnicity interactions were observed (see Table 4). Eight psychopathy × ethnicity interactions emerged between Hispanics and African Americans, where the relationships between PPI scores and external criteria were typically weaker for African Americans. Of the 21 interactions between Caucasian and African Americans, the relationships were generally stronger for Caucasians. However, in several instances there were stronger associations for African Americans that were theoretically unexpected in the context of psychopathy (e.g., stronger negative relations with PAI DRG and FD scores). A total of 11 psychopathy × ethnicity interactions were found between Hispanic and Caucasians; most were stronger for Caucasian offenders with the exception of CATS and PAI Borderline Features scores.
LPSP
The expected number of interactions by chance across the Levenson Total, Primary, and Secondary scales was 10.35, and yet 35 psychopathy × ethnicity interactions were observed (see Table 5). There were 15 psychopathy × ethnicity interactions between Caucasian and African Americans and 12 interactions between African American and Hispanics. As was the case for the PPI and PCL-R, weaker associations were observed between LPSP scale scores and external criteria for African Americans in comparison with other ethnic groups. A total of eight psychopathy × ethnicity moderations emerged between Hispanics and Caucasians; the relations with external criteria were stronger for Hispanics in five of eight instances.
Discussion
The current study investigated potential ethnic variability in the reliability, mean scores, and criterion-related validity in multiple psychopathy measures. We accorded particular emphasis to Hispanic offenders, including psychopathy factor structure testing, given that the implications of measuring psychopathy in this ethnic group have been substantially understudied. To our knowledge, this study is the first to explicitly investigate self-report measures of psychopathy among Hispanic offenders in comparison with Caucasian and African American offenders. Broadly, there were more similarities than differences among ethnic groups in the psychometric performance of the psychopathy measures. Hispanics did not differ in the reliability and mean scores of psychopathy measures compared with other ethnic groups. It appears that—at the manifest or observable level—the assessment of psychopathy is comparable between Hispanics and Caucasian offenders.
Differences, however, did emerge between Caucasians and African Americans, as the latter group displayed slightly higher scores. This finding is broadly consistent with the meta-analysis of Skeem et al. (2004), who reported that African Americans exhibited slightly higher scores but well within the range of the instrument’s standard error of measurement. Moreover, these mean differences do not necessarily indicate inherent differences in psychopathy, as they could be measure specific. That is, our study is limited such that differences in test scores may reflect either true variance, variance due to error, or both. Future research should build on past analyses of measurement bias (e.g., Cooke et al., 2001) using classical and modern test theory (e.g., factor and item response theory analysis) that compare Hispanic people with Caucasian and African American people.
Nevertheless, the factor structure of the three psychopathy measures was examined in the Hispanic subsample. The PCL-R three-factor model fit the Hispanic offenders, which is in keeping with the work of Cooke and Michie (2001). In contrast, the four-factor model (Hare, 2003) did not demonstrate adequate fit. Yet the PCL-R four-factor model yielded two of three acceptable fit indices, and the CFI was approaching the threshold for appropriate model fit. With respecification, the two-factor structure of the LPSP evidenced good model fit, suggesting—as others have (see Brinkley et al., 2008)—that consideration should be given to revising redundant or poor functioning items.
In comparison with the PCL-R and LPSP, the PPI three-factor structure could not be reproduced. Although controversies surrounding the PPI factor structure have been previously reported (Neumann et al., 2008; cf. Benning et al., 2003), power was low for the three-factor model, as it comprised only eight indicators (i.e., the PPI subscales). Overall, our findings offer provisional evidence that the factor structure of psychopathy measures for Hispanics is broadly similar to that in African Americans and Caucasians, although these findings will require replication in other samples, including nonforensic samples.
The moderating effect of ethnicity was examined across multiple methods of measurement. Of the 85 psychopathy × ethnicity interactions, there were 22 ethnicity moderations for the PCL-R, 39 for the PPI, and 24 for the LPSP. In consideration of the different number of scales per measure, the proportion of significant interactions was highest for the LPSP (18% of total tested interactions) and lowest for the PCL-R (7% of total tested interactions). The Primary scale of the LPSP was particularly noteworthy given its 17 moderated effects. The PPI—with 15% significant psychopathy × ethnicity analyses—may generalize slightly more across different ethnic groups in comparison with the LPSP. These differences in generalization across measures, although important and worthy of replication, do not address the competing possibility that such ethnic differences are genuine (i.e., they exist in reality) and reflect variations in the conceptualizations used to derive the PCL-R, LPSP, and PPI. That is, greater ethnic differences for a given psychopathy measure could reflect actual differences in the manifestation of the disorder rather than a bias in the measure.
Consideration of the associations with external correlates revealed a fair number of psychopathy × ethnicity interactions (i.e., 85 moderations out of 726 analyses, or 12%). These findings are fairly consistent with the work of Sullivan et al. (2006), who found that differences were evident primarily between African Americans and the other ethnic groups. Specifically, after controlling for gender, intelligence, and the site of the offender (i.e., prison vs. community drug treatment program), over three quarters (79%) of the moderating effects were observed between African Americans and the two other ethnic groups. In contrast, only 21% of the moderating effects emerged between Hispanics and Caucasians. Parallel to the reliability and mean scores, there appears to be substantial overlap in the criterion-related validity of psychopathy measures for Caucasian and Hispanic offenders.
The strength of validity coefficients is capped by the reliability of the two indices in question (see Winne & Belfry, 1982). Consequently, the differences in the criterion validity of psychopathy measures for African Americans could be partially attributed to the relatively lower levels of internal consistency in comparison with Caucasian offenders. Specifically, Cronbach’s alpha was lower for PPI Total, FD, and IA scales among African Americans. However, these differences were small in magnitude and isolated to the PPI. Other psychopathy scales demonstrated as many criterion-related ethnic differences without diminished reliability coefficients for African Americans. It is likely that differential reliabilities account for at least some of the observed ethnic differences. Future latent analyses will be necessary to fully disentangle the role of true variance and error variance.
Despite their high alpha coefficients, PPI total and subscale MIC coefficients—which are less influenced by the total number of items in a scale—were low across all three ethnic groups. In particular, Coldheartedness, a single subscale, should contain items that better reflect a unitary construct. Weak MIC values for the PPI total scale are consistent with the relative orthogonality of the FD and IA factors, and they suggest that future research should place less emphasis on the total composite score. These inadequate MIC coefficients are congruent with other findings (Neumann, Uzieblo, Crombez, & Hare, 2013), suggesting low internal consistency of the PPI factors—a necessary component of unidimensionality (Clark & Watson, 1995). A lack of unidimensionality of the subscales might also explain the poor factorial model fit for this Hispanic sample and other factor analyses of the PPI (e.g., Neumann et al., 2008).
The latent properties of psychopathy measures are important to consider in the context of the current findings. The PCL-R measurement invariance between African Americans and Caucasians (e.g., Cooke et al., 2001) provides support for the validity of the differences we found between these two groups. Because the PCL-R measures the content of psychopathy comparably between these groups, the differences can be interpreted with some confidence. This is particularly the case, as all mean score differences and the majority of criterion-related differences occurred between African Americans and Caucasians. Similarly, evidence suggests that the LPSP is structurally invariant between Hispanic and African American adolescents (Horan et al., 2015), suggesting that the differences we observed for the LPSP may be interpreted with reasonable confidence. Because PPI measurement invariance remains untested across ethnicities, our findings for differences on this measure must be interpreted with caution.
Although many moderations were reported, and a notable difference emerged in the associations between psychopathy and external criteria for African Americans, the effect sizes of the interactions were typically small. Although not dispositive, small incremental effects are common in regression models with a large number of predictors (Hunsley & Meyer, 2003). When a significant moderation effect appeared, the relationship between psychopathy and the external criterion usually presented itself in the same direction for both ethnic groups. That is, unlike Sullivan et al. (2006), opposing associations between ethnicities were generally not observed. Hence, our findings do not point to major ethnic differences in the relationship between psychopathy and other psychological or criminogenic variables, but rather a small, consistent pattern of findings of modest magnitude that fits with the extant literature (e.g., Jackson et al., 2007; Walsh, 2013). Clinically, it is promising that the most frequently used measure of psychopathy (i.e., PCL-R) yielded the fewest ethnic discrepancies.
Implications
One area of caution concerns the slightly substandard criterion-related validity for African Americans. Theoretically, this finding may suggest slight differences in the nomological network of psychopathy for African Americans in comparison with other ethnic groups. Consequently, clinicians may want to accord additional consideration to the case formulations of African American offenders who obtain high scores on psychopathy measures, including the PCL-R. Techniques that include the clustering of risk factors with psychopathy as the central or precipitating factor may be less effective if its association with other risk factors is diminished. Ultimately, formulation is conducted at the individual level, but it is also theoretically driven. Our findings suggest that the interplay between psychopathy and other criminogenic variables may differ slightly for African Americans, and clinicians may want to consider this possibility when relying on theory to facilitate their formulation.
Across the different measures of psychopathy, criterion-related validity was relatively similar between Hispanics and Caucasians. Nevertheless, our findings were consistent with the work of Walsh (2013), who reported that PCL-R scores did not significantly predict violence for Latinos. In the current study, psychopathy was approaching a significant negative relationship with criminal offending among Hispanic offenders. Granted, the predictive validity of the PCL-R vis-à-vis criminal recidivism was modest for all ethnic groups. This lack of predictive power raises concerns about incorporating the PCL-R into risk assessments for Hispanics, and is consistent with meta-analytic work suggesting that the PCL-R is slightly less accurate in its prediction of violence in samples with a higher proportion of ethnic minorities (e.g., Singh, Grann, & Fazel, 2011). This possibility should be investigated using a variety of recidivism variables and employing a longer follow-up period.
Strengths and Limitations
The current study addressed a novel research question by comparing ethnic differences in a large sample of Hispanic, Caucasian, and African American offenders using a multimethod assessment of psychopathy. Nonetheless, our large sample size was not evenly distributed among the three ethnic groups. Although the Hispanic subsample size (n = 165) was large in comparison with other studies, it precluded multiple group CFA to test factorial invariance. Hispanic offenders’ disproportionately small sample (i.e., Caucasian [n = 949], African American [n = 564]) may have affected analyses of ethnic differences in mean levels and criterion validity. Nonetheless, the power for detecting beta coefficients was extremely strong for all ethnic comparisons (i.e., Power > .99). Furthermore, if the differing sample sizes affected the number of moderations, we would expect to see the fewest number of moderations between African Americans and Hispanics (n = 729). However, this was not the case, as the least frequently occurring moderation occurred between Caucasians and Hispanics (n = 1,114). In addition, because the term “Hispanic” is an ethnonym (Martinez-Brawley & Gualda, 2011) that comprises a variety of languages and cultural norms originating from various nationalities, it is likely that this group was heterogeneous in its ethnic composition. Additionally, Hispanics, Caucasians, and African Americans were not mutually exclusive; in the present study, African American and Caucasian participants also self-identified as Hispanic or non-Hispanic. However, for statistical purposes (sizes of subsamples), we were unable to conduct analyses with a reasonable degree of power that further classified individuals (e.g., Caucasian and Hispanic vs. African American and not Hispanic). We advise for research to further parse these cultural groups.
Footnotes
Declaration of Conflicting Interests
The author(s) declared the following potential conflicts of interest with respect to the research, authorship, and/or publication of this article: Dr. Scott O. Lilienfeld is the first author of the Psychopathic Personality Inventory.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research was supported by a grant from the National Institute of Mental Health. This research was supported by the Social Sciences and Humanities Research Council of Canada.
Notes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
