Abstract
Extant research on the effects of judicial background characteristics suggests minimal influence from the race or gender of the sentencing judge in criminal cases. This raises at least two possibilities: (1) the combined influence of judicial recruitment, indoctrination, and socialization into the judgeship results in a homogenous body of criminal court judges; or (2) current approaches to identifying judge effects in criminal sentencing have methodological and conceptual flaws that limit their ability to detect important influences from judicial background characteristics. The current article examines this issue with data from the Pennsylvania Commission on Sentencing that is augmented to include information on sentencing judges and criminal court contexts. It argues that the mode of conviction shapes the locus of sentencing discretion in ways that systematically underestimate judge effects for pooled estimates of incarceration and sentence length. The empirical results support this interpretation, especially for incarceration in trial cases, where older, female, and minority judges are substantially less likely to sentence offenders to jail or prison terms. The article concludes with a discussion of future research directions and policy implications for judge effects and disparity in sentencing.
For decades, criminologists, legal scholars, and policy makers have been captivated by continuing attempts to unravel the causes, consequences, and loci of disparate treatment in criminal sentencing. A voluminous literature has developed examining the influence that individual offender characteristics, such as race and gender, exert on courtroom decision-making outcomes. A limited but related literature has also emerged focusing on the influence of courtroom actor background characteristics in sentencing. While the former finds continuing evidence of individual offender disparities, the latter reports relatively few important influences from judicial background characteristics (Zatz, 2000). This has led some scholars to conclude that the collective impact of formal sentencing innovations, judicial selection and socialization processes, and courtroom workgroup pressures for conformity largely nullify any preexisting differences tied to judicial background characteristics (Spohn, 1990a; Steffensmeier & Britt, 2001; Steffensmeier & Hebert, 1999).
While the advent of sentencing guidelines and the importance of judicial socialization should not be overlooked, recent theorizing on courtroom actor discretion may provide valuable insights into the importance of judge characteristics in sentencing. Contemporary sentencing research emphasizes the fact that courtroom decision-making processes are likely to vary depending on whether or not a case is plea-bargained or convicted at trial (Alschuler, 1979; Johnson, 2003; Padgett, 1985; Smith, 1986). If the underlying processes determining sentencing outcomes are tied to the mode of conviction, then it may be necessary to disaggregate sentencing outcomes to better capture the effects of different decision makers in the punishment process. In particular, if the exercise of judicial discretion is most prominent in the sentencing of trial cases, as recent research argues (Johnson, 2003), then the influence of judicial background factors should be most pronounced for these types of cases. Given that guilty pleas account for about 95% of all felony convictions (Durose & Langan, 2005), the general lack of significant findings for judge characteristics in prior research may simply reflect the high prevalence of pled cases. The present study investigates this possibility by examining variations in judge-level predictors of sentencing for cases convicted through different types of pleas and trials. Using data from the Pennsylvania Commission on Sentencing (PCS), this study evaluates hypotheses about the relative import of a variety of judge-level background characteristics across modes of conviction. Such an investigation holds the potential to clarify past discrepancies regarding the importance of judge factors in sentencing, while also providing useful directions for future theoretical and policy developments regarding the identification and control of court actor discretion and sentencing disparity.
Prior Research and Theorizing
The Importance of Judge Characteristics in Prior Research
Relative to the copious literature on individual offender attributes, research on the characteristics of organizational decision makers in criminal courts is rare. In the past three decades, only about a dozen studies have incorporated measures of judge characteristics in the study of criminal punishment. Table 1 provides a summary of this research. Although not necessarily exhaustive, it represents the most recent and influential investigations of judge effects in sentencing. 1 The limited scholarly attention devoted to this aspect of criminal punishment is partly the product of data constraints that preclude examination of court actor background characteristics, and partly the result of prior research reporting few or inconsistent findings for judicial background factors (e.g., Gruhl, Spohn, & Welch, 1981; Spohn 1990a; Steffensmeier & Hebert, 1999). Overall, though, the collective research on judicial background characteristics has been limited in important ways, including small and inadequate samples of judges, limited measures of judicial backgrounds, and relatively simplistic analytical approaches (Johnson, 2006).
Prior Studies of Judge Characteristics in Criminal Sentencing.
X = study examined judicial characteristic.
Early investigations into the importance of judicial attitudes and sentencing philosophies argued forcefully that the sentencing judge was critically important for determining individual sanctions. In his theoretical exposition of judicial decision making, Gibson averred, “there can be little doubt that what judges prefer to do—i.e. their substantive attitudes and values—influences their decision-making behavior” (Gibson, 1983, p. 9). Similarly, Hogarth (1971) examined the influence of judicial attitudes, sentencing philosophies and organizational constraints on the sentencing behavior of a large sample of Canadian magistrates and found that “enormous variation exists among magistrates in the way in which they go about making sentencing decisions” (Hogarth, 1971, p. 91). He ultimately concluded that “one can explain more about sentencing by knowing a few things about a judge than by knowing a great deal about the facts of the case” (Hogarth, 1971, p. 350).
Early empirical work, however, failed to provide evidence for the importance of judge characteristics in sentencing. For instance, three early studies utilizing the same data from an unspecified large urban court reported few important differences along racial and gender lines (Gruhl et al., 1981; Kritzer & Uhlman, 1977; Uhlman, 1978). In their study of judge’s gender, Kritzer and Uhlman (1977, p. 86) concluded that “female judges behave no differently than their male colleagues.” This research was important for identifying the potential importance of judicial variations in punishment, but it was limited by small samples (e.g., seven female judges), a limited focus on race or gender, and simplistic analytical techniques (e.g., cross-tabulations). Later work by Welch, Combs, and Gruhl (1988) using the same data with additional control variables suggested that Black judges were slightly more lenient toward Black offenders than White judges.
The work of Myers and Talarico (1987) substantially advanced early work on the influence of judge characteristics in sentencing. Their investigation of contextual variations in Georgia incorporated a range of judicial background factors, such as their age, religious affiliation, prior prosecutorial experience, and tenure on the bench, along with various measures of county-level social context. Their findings for the direct effects of judicial backgrounds “conveyed the impression that these factors are relatively unimportant” (Myers & Talarico, 1987, p. 122), but evidence of indirect effects suggested that judge “variables operate as impressive conditioners of differential treatment” in punishment. However, they were unable to include any direct measures of some key characteristics such as the individual gender and race of the judge in their sample.
More recent research has focused on judge effects in additional jurisdictions. Spohn (1990a, 1990b) investigated the impact of judge factors in Detroit. Her first study examined the effect of judge’s race for violent felonies and uncovered only mild evidence that black judges were less punitive, with both Black and White judges punishing minority offenders more harshly. The second study examined the role of judge’s gender in sexual assault cases and reported that female judges sentenced offenders to longer terms of incarceration. The overall conclusion in both these studies, however, was that judicial backgrounds have minimal impact on punishment decisions in criminal courts. These studies were limited, however, by a lack of information on other judicial background characteristics.
Steffensmeier and Britt (2001) reported that Black judges were overall more severe in their treatment of criminal defendants. Given that the effect of judge’s race was relatively small, though, they concluded that “the job, not so much the individual, apparently makes the ‘judge’” (Steffensmeier & Britt, 2001, p. 749). In a related study, Steffensmeier and Hebert (1999) examined the influence of judge’s gender on variations in criminal punishments across 18 Pennsylvania counties. After controlling for age, tenure, prosecutorial experience, and marital status of the judge, they concluded female judges sentenced more harshly than male judges, particularly when faced with repeat Black offenders. Muhlhausen (2004) found contradictory effects for judge’s race, ethnicity, and gender in his study of Pennsylvania courts. Using Tobit regression with fixed effects for counties, he found that female and Hispanic judges were more lenient in sentencing, but that African American judges were more severe.
Johnson (2006) reexamined the influences of judicial race and gender in Pennsylvania using three-level hierarchical models incorporating individual-, judge-, and county-level sentencing measures. Contrary to early work using Pennsylvania data, he found minority judges were less likely to render incarceration sentences, especially when faced with minority defendants, and when they did so they meted out slightly shorter sentences. Furthermore, his analysis uncovered no significant relationships between judges’ gender and punishment decisions. Overall, judge-level predictors accounted for less than one quarter of the between-judge variation in both incarceration and sentence lengths. Wooldredge (2010) analyzed data on felony convictions in Ohio and reported relatively larger differences between judges. He found that even after controlling for compositional case characteristics, about 14% of the variation in the use of prison was attributable to judges. Moreover, the relative weight placed on different sentencing factors, including offender characteristics, differed significantly across judges.
Select work has also examined judge effects in federal sentencing. The United States Sentencing Commission (USSC, 2004, p. 101) conducted a comprehensive analysis of federal sentencing which concluded that only “relatively minor inter-judge and regional disparity” exists that is not attributable to case characteristics. However, using unique data collected from the federal District of Massachusetts, Scott (2009, p. 105) examined inter-judge sentencing disparity following the Booker decision and reported that “Sharp and growing differences are apparent in guideline sentencing patterns between judges in the same courthouse.” Similarly, Tiede (2009) demonstrated that variation in federal sentences was far more pronounced when judges depart from the sentencing guidelines, suggesting meaningful differences remain between judges for departure cases. Anderson and Spohn (2010) also found evidence of sentencing differences among federal judges. They examined inter-judge disparity in three federal district courts and found that significant variation existed between judges in mean sentence lengths after controlling for legally relevant case characteristics. Their results suggested that different federal judges evaluate individual offenders and cases differently, with some evidence emerging for the importance of judicial caseload and prior legal experience in the sentencing process.
Although a number of commentators have argued that the collective effects of judicial legal training, selection, socialization, organizational conformity pressures and presumptive sentencing guidelines largely negate preexisting differences in judicial attitudes, beliefs, and sentencing philosophies (Zatz, 2000), such a conclusion may be premature given recent evidence and the relative paucity of studies examining judge characteristics in sentencing. Because sentencing discretion can vary by case type, a true test of the influence of judicial characteristics in sentencing needs to account for variation in the exercise of judicial discretion across cases. One useful approach for gaining leverage on this issue is to disaggregate cases by the mode of conviction.
Judicial Discretion and JudgeCharacteristics Across Modes of Conviction
In his influential and eloquent diatribe chastising unfettered judicial discretion, Judge Marvin Frankel avouched, “the evidence is conclusive that judges of widely varying attitudes on sentencing . . . mete out widely divergent sentences . . . explainable only by the variations among the judges, not by material differences in the defendants or their crimes” (Frankel, 1973, p. 21). This assertion is consistent with early theoretical arguments that suggest judicial attitudes and sentencing philosophies act as filters through which sentencing judges selectively perceive their social worlds. According to Gibson (1983, p. 9), “judges’ decisions are a function of what they prefer to do, tempered by what they think they ought to do, but constrained by what they perceive is feasible to do.” What judges prefer to do is a product of their attitudes and sentencing philosophies, what they think they ought to do stems from their role orientation on the bench (i.e., their perception of what it means to be a judge), and what they perceive is feasible to do results from organizational constraints associated with their courtroom workgroup and community social environment.
However, the salience of any of these factors is restricted unless the sentencing judge is free to exercise significant individual discretion in the determination of individual sentencing outcomes. From a court community perspective, “judges do not exercise discretion freely even in sentencing. The power of individual judges is circumscribed by the role of ‘court work groups’” (Welch et al., 1988, p. 134). As Eisenstein and Jacob (1977, p. 8) argued long ago, “the singular focus on the judge cannot stand comfortably with the widespread use of plea bargaining.” For these cases “one would not expect the judge’s background or attitudes to be closely related to the disposition of charges against defendants” (Eisenstein & Jacob, 1977, p. 8). Their court community perspective suggested instead that criminal punishments result from the dynamic interaction of key court personnel, including the prosecutor, defense counsel, and presiding judge, and importantly, that different case processing strategies emerge for different modes of conviction that condition the relative influence of the sentencing judge.
Similar arguments have been made on less theoretical, more methodological grounds. Klepper, Nagin, and Tierney (1983, p. 102), for instance, explicitly criticize contemporary research for combining trial and plea dispositions into single regression equations, arguing that “This implicitly incorporates the assumption that the relationship between sentence . . . and various observational features of the case is the same for guilty pleas and trial convictions. If this is not correct, then the estimating equations . . . are misspecified.” They observe that because “guilty pleas generally dominate trial convictions,” analyses performed on aggregate data will provide information primarily on the factors influencing sentences for guilty pleas (Klepper et al., 1983, p. 102). In order to adequately separate and isolate the unique influence of different court actors and their background experiences on criminal punishment decisions, then, it becomes necessary to separate trial cases from guilty pleas.
In this respect, recent research that emphasizes the conditioning effects of different modes of conviction on the sentencing process provides useful insights into the influence of judicial factors in sentencing. Two recent studies are particularly apposite. The first, conducted by Lacasse and Payne (1999) using federal sentencing data, offers one of the few studies to compare sentencing variations across judges. These scholars examined the amount of variation attributable to judges for plea and trial cases before and after the implementation of the federal sentencing guidelines. Their results offered no evidence that inter-judge variation had been reduced by the guidelines (in fact it appeared to increase). Instead, their work offered convincing evidence that sentencing variations were intricately tied to the mode of conviction. Specifically, these researchers concluded that “the variation attributable to judge assignment represents . . . a higher proportion [for trials] than for pleas . . .” supporting “the notion that the prosecutor has an important influence on the sentence in plea bargains, while the judge has the determining influence on trial sentences, even in the post-reform period” (Lacasse & Payne, 1999, pp. 262-264).
In the second study, Johnson (2003) offered further support for the theoretical import of modes of conviction in criminal sentencing. He suggested that the exercise of judicial discretion is greatest for cases convicted at trial, while prosecutorial sentencing discretion is more likely to emerge in negotiated pleas. His empirical findings were largely consistent with this theoretical framework. Extralegal disparities in judicial departures from the sentencing guidelines tended to be most pronounced for modes of conviction in which judges and/or prosecutors exercised significant sentencing discretion. Nonnegotiated pleas, which he argued largely reflected “going rates” (Eisenstein & Jacob, 1977) applied to “normal crimes” (Sudnow, 1965), demonstrated less influence from offender characteristics. Collectively, these perspectives clearly suggest that sentencing discretion, and the associated sentencing disparities that potentially accompany it (at the individual or court actor level of analysis), may be tied to and reflected in different types of case disposition.
Theoretically, this raises important issues for research on judge effects in sentencing. If it is the case that judges exercise significantly more sentencing discretion for trials compared to pleas, then trial cases should provide greater opportunity for judicial background characteristics and career experiences to influence these sentencing outcomes. Conversely, if judicial decision making is constrained in cases disposed of through guilty pleas, then one would expect judge characteristics to exert weaker influences for these cases. Given that guilty pleas account for all but about 5% of criminal convictions, the general lack of significant findings for judge characteristics in prior research may be due to a failure to disaggregate sentencing outcomes by mode of conviction. All prior empirical work on judge effects jointly examines plea and trial cases. If, however, judge characteristics fail to demonstrate substantively important effects across modes of conviction, then the argument for the importance of judicial selection and socialization is strengthened.
The present work investigates this issue by reanalyzing the data examined in Johnson (2006) after disaggregating sentencing outcomes across modes of conviction. The guiding research question is the extent to which judge effects are more pronounced for trial cases compared to those convicted through nonnegotiated and negotiated pleas. 2 Specifically, the following two propositions are examined:
Hypothesis 1: Between-judge variation in criminal punishments is expected to be most pronounced for trial cases.
Hypothesis 2: Judicial background characteristics should have more pronounced effects in trial cases.
If trial cases allow for greater exercise of judicial discretion, they are likely to be associated with greater inter-judge variation in punishment. Although specific directional hypotheses are not specified due to the inconsistency of findings in prior work, 3 the age, race, and gender of the judge are expected to have stronger effects for trial cases than for negotiated and non-negotiated pleas. Overall, then, this research presents an important opportunity to better evaluate the effects of judicial background characteristics in the meting out of criminal punishments while simultaneously enhancing our theoretical understanding of the potentially important intersection between courtroom actor discretion and various modes of conviction.
Data and Method
The current research examines the influence of judicial characteristics in sentencing across modes of conviction by utilizing 2 years of data (1999 to 2000) from the PCS. The PCS is required by law to collect information on sentenced cases in the state of Pennsylvania. The PCS data are among the most complete state-level resource for studying criminal sentencing. They contain a variety of theoretically important individual-level predictors of sentencing, such as offense severity and prior criminal history, as well as various offender characteristics such as age, race/ethnicity, and gender. Moreover, detailed information is available on whether an offender pled guilty outright, negotiated a plea, or was convicted at bench or jury trial. The Pennsylvania sentencing guidelines, enacted in 1982, are among the oldest presumptive guidelines in operation, and extensive prior work has been conducted in this research context, including previous studies of judge effects in sentencing (Johnson, 2006; Steffensmeier & Britt, 2001). The current results can therefore be evaluated relative to previous findings regarding judicial effect sizes.
The individual-level PCS sentencing data were extended to include judge and court characteristics using publicly available judicial biographies. 4 In addition, select county- and courtroom-level control variables were also obtained through the U.S. Census Bureau and the Administrative Office of Pennsylvania Courts. Overall, this final data set provides a very rich and detailed body of information that is well suited for examining the influences of theoretically important judge-level factors in sentencing. In line with prior research, cases in this analysis are limited to the most serious offense per judicial transaction and to those cases sentenced under the 1997 guidelines, resulting in 148,590 total cases, 79,333 of which resulted in incarceration. 5 Table 2 provides descriptions of the specific individual-, judge-, and county-level measures utilized in this study.
Statistical Coding and Descriptions of Dependent and Independent Variables.
Analytic Procedures
The analyses separately examine the probability and length of incarceration using a combination of hierarchical generalized and linear modeling techniques. Sentence type is modeled as a binary outcome, whereas sentence length is a continuous measure based on the minimum number of months of detainment. 6 Because the current data structure is characterized by the nesting of cases within judges and of judges within courts, three-level hierarchical models were utilized to simultaneously incorporate individual-, judge-, and court-level predictors. Useful discussions of the application and advantages of hierarchical modeling in the study of criminal sentencing are available in prior research (see Bontrager et al., 2005; Britt, 2000; Johnson, 2006, 2010; Kautt, 2002; Ulmer & Johnson, 2004; Weidner, Frase, & Schultz, 2005). For the sake of illustration, the simple three-level linear model for sentence length is presented below. Its application to the incarceration decision is straightforward.
In the multilevel model, Yijk is the minimum number of months of incarceration that the ith offender is sentenced to by the jth judge in the kth court. π0jk is the individual-level intercept and π1jk represents the effects of a vector of individual-level variables Aijk centered around their grand means. eijkis the individual-level residual error term. Parameters in the individual-level Equation 1 are then modeled at higher levels of analysis using judge- and county-level predictors. Variation in the individual-level intercept π0jk is modeled with a judge-level intercept β00k and a vector of judge-level covariates Xjk also centered on their grand means. β01k therefore represents the effect of judicial background characteristics on average sentence lengths after controlling for individual predictors, with r0jk representing the judge-specific error term. In this way, judge characteristics are utilized to explain between-judge variation in sentence lengths. Variation in the intercept β00k is further modeled using county court characteristics Wk centered on their grand means, where γ001 symbolizes the effects of county-level predictors on sentence length with u00k representing the county-specific error term. For all models, fixed effects were specified and robust standard errors were utilized. 7
The application of hierarchical modeling procedures provides several advantages over many prior studies of judge effects in punishment. This approach provides standard error estimates that are corrected for the clustering of cases within judges and county courts, as well as statistical significance tests that are appropriately adjusted at each level of analysis. Hierarchical estimates also provide variance components that are useful for evaluating the variation in outcomes across higher units of analysis. Very little research on judges controls for aggregate court contexts, only one prior study utilizes multilevel models (Johnson, 2006), and no work has examined the moderating effects of modes of conviction. The current study therefore provides several advantages over prior work examining judicial characteristics and modes of conviction in criminal sentencing.
Results
Table 3 provides descriptive statistics for each level of analysis with individual predictors disaggregated by mode of conviction. The typical offender is a White male in his early 30s who is convicted though a negotiated guilty plea. Few differences emerge between offenders who negotiate a plea and those who plead guilty outright, but offenders who go to trial are distinct in several ways—they tend to be charged with more serious crimes, have longer prior records, and are disproportionately male, Black and Hispanic. These individual-level descriptive findings comport with prior research on who is most likely to exercise their right to trial (Albonetti, 1990).
Descriptive Statistics for Individual-, Judge-, and Court-Level Predictors.
A typical judge in Pennsylvania is a White male in his mid-50s who has sat on the bench for nearly a dozen years. About 15% of judges were female and only 7% were minorities. 8 Of the 303 judges in the data, 46 were female and 22 were minorities. The majority of criminal courts in the state are located in small rural counties, but the number of criminal cases sentenced is distributed about evenly across small, medium, and large courts. Typical county contexts were characterized by minority populations near 5% and average crime rates just over 24 crimes per 1,000 people.
Table 4 presents the results from the unconditional Hierarchical Linear Models (HLM) models that parcel the variation in sentencing across levels of analysis for each mode of conviction. These results provide some preliminary evidence that variation in inter-judge punishments is greatest for trial cases. For all cases, the between judge variance component for the incarceration decision is 0.17, but for trial cases it increases to 0.31. The estimate of the proportion of total variance in incarceration probabilities that is attributable to judges is about 8%, compared to an estimate of 5% when all modes of conviction are combined. 9 Although these differences are not of large magnitude, they are consistent with the theoretical argument that judges have greater influence over individual punishment outcomes in trial cases. The probability of incarceration varies most starkly across judges for trial cases, with the differences being greatest between trials and negotiated pleas.
HLM Unconditional Models—Judge Characteristics Across Modes of Conviction.
Examination of the absolute magnitude of inter-judge variance in sentence lengths reveals that it is nearly five times larger for trial cases compared to pled cases. The between-judge (level 2) variance components range between 50 and 55 for negotiated and nonnegotiated guilty pleas but increases dramatically to 241 for trial cases. Translated into concrete numbers, this suggests that a one standard deviation change in sentence length for pled cases results in a change of about 7 months, whereas for trial cases a one standard deviation change alters one’s sentence length between 15 and 16 months. Somewhat surprisingly, though, the relative proportion of total variation in sentence lengths attributable to judges is greatest for negotiated pleas, with an intraclass correlation of 0.25. This highlights the fact that sentence lengths following trial are substantially more varied at each level of analysis in the data. The proportionally larger judge-level variation for negotiated pleas may provide some evidence that plea bargaining occurs “in the shadow of the judge” (Lacasse & Payne, 1999) or it may reflect that fact that sentencing guidelines are less restrictive on sentence length decisions in negotiated pleas, but future work is needed to investigate these possibilities. Overall, the unconditional models for sentence length suggest that the relative variation attributable to judges compared to other levels of analysis is greatest in negotiated pleas, but that the absolute magnitude of inter-judge variance is nearly five times larger in trial cases compared to guilty pleas.
Table 5 presents results from the fully specified three-level model of incarceration across modes of conviction. Consistent with prior research, individual legal considerations exert the strongest effects on all incarceration decisions (e.g., Steffensmeier, Ulmer, & Kramer, 1998). Not surprisingly, offense severity and criminal history are strong predictors of incarceration, and mandatory minimums virtually guarantee time spent behind bars. Several individual offender characteristics also significantly influence the odds of incarceration though. Young, male, and minority offenders are disadvantaged across all modes of conviction. Females are approximately two thirds as likely to be incarcerated whereas Blacks and Hispanics are about 40% more likely to be sentenced to jail or prison. Also consistent with prior work, the odds of incarceration are especially pronounced in small rural courts and in counties with greater available jail capacity. Overall, individual- and county-level effects exert relatively consistent, anticipated effects across conviction categories.
HLM Judge Effects on Incarceration Across Individual Modes of Conviction.
Note. The model for all cases includes controls for mode of conviction. The sample sizes for submodels do not equal the total sample size (N = 148,590) due to missing data on mode of conviction. The coefficients of determination are calculated as proportionate reduction in error (PRE) measures.
*p ≤ .10. **p ≤ .05. ***p ≤ .01. ****p ≤ .001.
Turning to the judge-level of analysis, minority judges are consistently less likely to incarcerate offenders, regardless of the mode of conviction. This may suggest that the race of the judge is influential in the plea negotiation process, or it may partially reflect the fact that minority judges are more likely to sentence cases in large urban courts. To tease out this relationship, future research is needed in additional contexts with more diverse samples of judges that span a broader range of county court environments.
The other judge characteristics exerted noticeably stronger effects in trial cases. The gender of the judge had no significant influences on negotiated and nonnegotiated pleas, but it emerged as a significantly stronger predictor of incarceration for trial cases (Z = 3.07). 10 Female judges were 0.63 times as likely to incarcerate offenders convicted at trial. Similarly, the age of the judge had a significantly stronger effect at trial (Z = 2.99), with each additional year reducing the odds of incarceration by about 4%. Judicial tenure on the bench also emerged as a small and marginally significant predictor of incarceration for trial cases alone, but this effect was not statistically distinguishable from the plea effects. Taken together, these results offer some tentative support for the contention that judicial background characteristics have more influence on incarceration decisions for trial cases than for other modes of conviction. Theoretically, trial convictions should introduce greater opportunity for judges to exercise their individual sentencing discretion, resulting in stronger influence from demographic factors that may be tied to past experiences, role orientations and individual sentencing philosophies. Although the judge characteristics only explain 18% of the total between-judge variation in the use of incarceration for trial cases, this was substantially greater than the 2% of the variation explainable for cases convicted through guilty pleas.
Table 6 reports parallel analyses for sentence length. Overall, results from these models are less consistent with theoretical expectations. Mandatory minimums and offense severity again emerge as powerful individual-level predictors of punishment. While offender characteristics influence sentence lengths across all modes of conviction, though, it is interesting to note that their effects are strongest for trial cases. The effect of Hispanic ethnicity, for instance, is more than twice as great for cases convicted through trials. The only court context measure that emerged as a consistent predictor of sentence lengths was the size of the court. Larger courts meted out slightly shorter sentence lengths, though substantively, this effect was small.
HLM Judge Effects on Sentence Length Across Individual Modes of Conviction.
Note. The model for all cases includes controls for mode of conviction. R2 estimates not computed for models with insignificant variation in sentence length.
*p ≤ .10. **p ≤ .05. ***p ≤ .01. ****p ≤ .001.
Judicial background characteristics had mixed and inconsistent effects on sentence lengths across modes of conviction. In line with theoretical expectations, judge characteristics explained the greatest proportion of judicial variation for trial cases. Judge factors accounted for 11% of this variance, but this was only marginally greater than for pled cases. The coefficient for female judges was negative for all modes of conviction but failed to reach statistical significance for trial cases (its effect was not statistically different across models). Substantively, this effect was negligible, amounting to less than a 1-month discount across all conviction types. Race of the judge produced stronger effects, but its influence was inconsistent. At trial, minority judges meted out sentences that were about 3.6 months longer than their majority counterparts. This effect was significantly stronger than the effect for both nonnegotiated pleas (Z = 2.42) and negotiated pleas (Z = 7.23), and for negotiated pleas the effect was negative. This highlights the importance of disaggregating judge effects and suggests that the race of the judge may have different meaning and different influence over negotiated pleas and trial cases. Overall magnitude of judge effects tended to be greatest in trial cases (excluding caseload), but the substantive impact of many of these predictors tended to be small and often statistically insignificant. Interestingly, the relative lack of strong judge effects in the sentence length models is consistent with research on individual sentencing disparities that reports stronger extralegal effects for incarceration decisions compared to sentence length outcomes (e.g., Steffensmeier et al., 1993). Future work is needed that investigates the reasons for this finding along with the potential influence of individual judge factors on additional outcomes of interest, such as guidelines departures, the use of intermediate sanctions, and the application of sentencing enhancements such as mandatory minimums.
Discussion
The past three decades have witnessed a virtual revolution in criminal sentencing practices in America (Tonry, 1996). Although contemporary changes in the justice system have been driven by myriad forces, one key concern associated with sentencing reform remains the presences of inter-judge disparities in punishment. Frankel (1973, p. 44) contended, “the splatter of varied sentences” among judges “nourishes the view that there is no justice in the law.” This viewpoint sparked early interest in the effects of judge’s race and gender, but a lack of substantial findings and data limitations soon led to stagnation. Modern attempts at revival have met with a similar fate, leading some scholars to conclude that the powerful forces of selective recruitment and judicial socialization result in a uniform “subculture of justice” among jurists (Frazier & Bock, 1982). Recent policy innovations like presumptive sentencing guidelines almost certainly contribute further to conformity in judicial punishment standards. Despite the considerable impetus for uniformity, though, extant research suffered from numerous shortcomings, including an implicit assumption that judges exercise equivalent sentencing discretion in all types of cases. The current research set out to test this assumption by disaggregating judge effects across modes of conviction.
If judges are free to exercise greater discretion in trial cases, as theoretical expectations suggest, then greater judicial variation should exist and judicial demographic factors should matter more for these cases. The current findings offer tentative support for this expectation. A greater proportion of variation in incarceration decisions emerged between judges for trial cases than for other dispositional outcomes. Similarly, the magnitude of between-judge differences in sentence lengths was twice as great for trials compared to pleas. Overall, variation among judges was greatest for trial convictions, but the magnitude of these differences was not overwhelming.
Regarding the influence of judicial background characteristics, strong support for the importance of modes of conviction emerged for the incarceration decision. Judge’s gender, for instance, had no discernible impact on overall incarceration decisions, nor did it significantly affect negotiated and nonnegotiated pleas. Yet it demonstrated sizeable effects in trial cases, reducing the odds of incarceration by a factor of 0.63 for female judges. Similarly, the age and tenure of the judge only appeared to matter for trial convictions, whereas the race of the judge exerted significant effects across all modes of conviction. One possibility is that the racial background of the judge conditions the negotiation process, though it should be noted that the majority of minority judges were clustered in only a few county courts, so it may be difficult to parcel out the unique effect of judge race from the broader influences of localized courtroom-sentencing norms. Future work is needed with more diverse samples in additional jurisdictions to better investigate these issues. Because trial cases are relatively rare, it was also necessary to combine jail and prison sentences into a single incarceration outcome in the current analysis. Future work on judge effects would benefit from examining these related but distinct dispositions separately (Holleran & Spohn, 2004), and additional work is also needed that begins to investigate the join impact, or intersectionality, of multiple judicial background characteristics in sentencing (Collins & Moyer, 2007). Overall, though, the current study found that judge characteristics tended to exert more pronounced effects on the incarceration decision in trial cases relative to guilty pleas.
The pattern of findings for models examining sentence lengths was less consistent and produced few notable judge effects with the exception of judge’s race. Interestingly, the pooled race effect was watered down by the fact that minority judges exert opposite effects on negotiated pleas and trials. This provides some suggestive evidence that the sentencing process, and the concomitant effect of judge characteristics, differs according to the mode of conviction. It may also suggest that judge characteristics exert indirect effects on the plea negotiation process, but future research is again needed to substantiate that possibility.
Taken together, these results are consistent with the theoretical argument that trial cases provide greater and perhaps different opportunities for judges and other court actors to exercise their sentencing discretion. Oftentimes, the effects of judge factors are on par with individual sentencing effects. For instance, being a minority offender at trial increases the probability of incarceration by about 35% whereas being sentenced by a minority judge at trial decreases it by 22%. Future research is needed that examines the interaction of judge and offender characteristics across modes of conviction. It may be that judge characteristics are particularly strong conditioners of individual disparities in punishment in trial cases where judges exercise the most discretion. Examination of these interactions would provide an additional test of the theoretical propositions outlined in the current work.
Future research is also needed that extends the gamut of punishment outcomes to additional decisions of consequence. Primary among them is the decision to depart from the sentencing guidelines. Because guidelines departures entail substantial exercise of judicial discretion, particularly for cases convicted through trials, judge effects are likely to have important influences over these outcomes. No research to date examines the influence of judge characteristics on guidelines departures. A similar outcome of interest is the use of intermediate sanctions (Johnson & DiPietro, 2012). In Pennsylvania, judges have carte blanche power to divert offenders from incarceration in specified cells of the sentencing guidelines matrix (i.e., the shaded cells in the Appendix). Because this decision is wholly discretionary, one should expect judge characteristics to emerge as potentially important predictors of these decisions. Future research should be guided by thoughtful consideration of how different court actors exercise sentencing discretion for different types of cases and different sentencing decisions. It is necessary to isolate decision-making outcomes that provide reasonable opportunities for individual sentencing philosophies and background experiences to be considered in order to provide a true test of the influence of judicial background characteristics.
Conclusion
As Lafree (1985, p. 291) suggested more than two decades ago, “Comparing sentencing severity for guilty pleas and trials is difficult because they represent different processes to which defendants are nonrandomly assigned.” Despite this poignant observation, the vast majority of research on criminal sentencing continues to utilize combined samples of guilty pleas and trials. If plea and trial convictions do in fact represent separate punishment processes, though, then it is ill-advised to aggregate them in pooled analyses. Moreover, disaggregating trials and pleas may provide important analytical leverage to better disentangle the separate sentencing processes that underlie different types of criminal case disposition (Bushway & Piehl, 2007; Piehl & Bushway, 2007). The present study investigated this issue through a reexamination of the influence that judge’s race and gender exert on criminal case outcomes.
The current findings provide some tentative support that judge characteristics matter more in trial cases because these cases entail greater exercise of judicial sentencing discretion. Overall, individual legal considerations, such as the severity of the offense, dominate individual punishment decisions, but beyond these influences, characteristics of sentencing judges and county court contexts also matter. Rather than impugning prior research, however, the current work serves as a call for additional work on variations in court outcomes across different social contexts of punishment. Ideally, background information on other court actors such as prosecutors and defense counsel would be included and the purview of study extended to other decision-making points in the justice system beyond the final sentence determination. More proximate measures of judicial sentencing philosophies, punitive attitudes, and role orientations are also needed. Judge’s race and gender are imperfect proxies for attitudinal variables that need to be analyzed. Future research should address these and other limitations of the present study.
From a public policy perspective, the current results provide a more nuanced test of inter-judge sentencing disparity under sentencing guidelines. Guidelines, such as other sentencing innovations that have been enacted in recent decades, were powered in large part by concern over unwarranted disparity in sentencing. Although a passel of studies examines the role of individual offender characteristics in punishment, relatively few studies incorporate relevant judge characteristics. The present study suggests that judicial discretion is an important conditioner of individual punishment decisions, but that it is manifest primarily in trial cases which constitute only a small minority of all criminal cases. Thus, greater attention should be devoted in future work to capturing the discretionary powers exercised by additional court actors such as the public prosecutor. The combined forces of judicial selection, socialization and formal sentencing rules may well serve to constrain judicial discretion and limit inter-judge sentencing variation, but very little is known about interprosecutor variation or potential disparities that may arise from early case processing decisions. Future research is therefore needed that identifies and measures these effects if we are to better inform future policy innovations aimed at increasing fairness and effectiveness in criminal punishment in America.
Footnotes
Appendix
1997 Pennsylvania Sentencing Guidelines
| Prior record score |
||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Level | OGS | 0 | 1 | 2 | 3 | 4 | 5 | RFEL | REVOC | AGG/MIT |
| Level 5 | 14 | 72-240 | 84-240 | 96-240 | 120-240 | 168-240 | 192-240 | 204-240 | 240 | +/−12 |
| State Incar | 13 | 60-78 | 66-84 | 72-90 | 78-96 | 84-102 | 96-114 | 108-126 | 240 | +/−12 |
| 12 | 48-66 | 54-72 | 60-78 | 66-84 | 72-90 | 84-102 | 96-114 | 120 | +/−12 | |
| 11 | 36-54 | 42-60 | 48-66 | 54-72 | 60-78 | 72-90 | 84-102 | 120 | +/−12 | |
| BC | ||||||||||
| 10 | 22-36 | 30-42 | 36-48 | 42-54 | 48-60 | 60-72 | 72-84 | 120 | +/−12 | |
| BC | BC | |||||||||
| 9 | 12-24 | 18-30 | 24-36 | 30-42 | 36-48 | 48-60 | 60-72 | 120 | +/−12 | |
| BC | BC | BC | BC | BC | ||||||
| Level 4 | 8 | 9-16 | 12-18 | 15-21 | 18-24 | 21-27 | 27-33 | 40-52 | NA | +/−9 |
| State Incar/RIP trade | BC | BC | BC | BC | BC | BC | ||||
| 7 | 6-14 | 9-16 | 12-18 | 15-21 | 18-24 | 24-30 | 35-45 | NA | +/−6 | |
| BC | BC | BC | BC | BC | BC | BC | ||||
| Level 3 State/Cnty | 6 | 3-12 | 6-14 | 9-16 | 12-18 | 15-21 | 21-27 | 27-40 | NA | +/−6 |
| Incar | BC | BC | BC | BC | BC | BC | BC | |||
| RIP trade | ||||||||||
| Level 2 | 5 | RS-9 | 1-12 | 3-14 | 6-16 | 9-16 | 12-18 | 24-36 | NA | +/−3 |
| Cnty Incar | BC | BC | BC | BC | BC | BC | ||||
| RIP | 4 | RS-3 | RS-9 | RS-<12 | 3-14 | 6-16 | 9-16 | 21-30 | NA | +/−3 |
| RS | BC | BC | BC | BC | ||||||
| 3 | RS-1 | RS-6 | RS-9 | RS-<12 | 3-14 | 6-16 | 12-18 | NA | +/−3 | |
| BC | BC | BC | ||||||||
| LEVEL 1 | 2 | RS | RS-2 | RS-3 | RS-4 | RS-6 | 1-9 | 6-<12 | NA | +/−3 |
| RS | 1 | RS | RS-1 | RS-2 | RS-3 | RS-4 | RS-6 | 3-6 | NA | +/−3 |
Note. Aggravated and mitigated ranges add or subtract, respectively, the months shown from the standard range.
Shaded areas of the matrix indicate restrictive intermediate punishments may be imposed as a substitute for incarceration.
When restrictive intermediate punishments are appropriate, the duration of the restrictive intermediate punishment program shall not exceed the guideline ranges.
When the range is RS through a number of months (e.g., RS-6), RIP may be appropriate
CNTY = county; RS = restorative sanctions; REVOC = repeat violent offender category; RFEL = repeat felony 1 and felony 2 offender category; RIP = restrictive intermediate punishments; <;> = less than; greater than; BC = boot camp.
Declaration of Conflicting Interests
The author declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author received no financial support for the research, authorship, and/or publication of this article.
