Refinements to Effect Sizes for Tests of Categorical Moderation and Differential Prediction

Abstract

We provide a follow-up treatment of Nye and Sackett’s (2017) recently proposed d_Mod standardized effect-size measures for categorical-moderation analyses. We offer several refinements to Nye and Sackett’s effect-size equations that increase the precision of d_Mod estimates by accounting for asymmetries in predictor distributions, facilitate the interpretation of moderated effects by separately quantifying positive and negative differences in prediction, and permit the computation of nonparametric effect sizes. To aid in the implementation of our refinements to d_Mod, we provide software written in the R programming language that computes Nye and Sackett’s effect sizes with all of our refinements and that includes options for easily computing bootstrapped standard errors and bootstrapped confidence intervals.

Keywords

categorical moderation multiple regression differential prediction bias effect size

Nye and Sackett (2017) recently derived a class of effect-size measures to quantify categorically moderated effects. These new d_Mod effect sizes summarize interactions in a consistent metric across studies, are easier to interpret than R² values from regression models (see Nye & Sackett, 2017, for a discussion of the difficulties of using R² as an effect size for interactions), and provide an intuitive way to detect categorical moderation without significance testing. We build on Nye and Sackett’s equations and introduce several refinements that increase d_Mod’s versatility and ease of interpretation.

We begin with a brief summary of Nye and Sackett’s (2017) methods as context for the present work, followed by a discussion of updates to d_Mod effect sizes. These updates include (a) adjustment factors to offset bias that results from violating distributional assumptions, (b) directional effect sizes to separately quantify negative and positive differences in prediction between two groups’ regression lines, and (c) nonparametric versions of the d_Mod equations for use with observed distributions of predictor scores. Our review of Nye and Sackett’s d_Mod equations is meant to provide minimally sufficient context for the presently proposed updates; we encourage readers to consult Nye and Sackett (2017) for detailed information about d_Mod.

Nye and Sackett’s (2017) d_Mod Effect-Size Measures

Nye and Sackett’s (2017) d_Mod effect-size measures facilitate the comparison of two regression models: one model regressing Y on X for each of two categorically different groups.¹ The interpretation of d_Mod effect sizes is similar to the interpretation of Cohen’s d, except that d_Mod summarizes differences between distributions of predicted scores rather than distributions of observed scores. A d_Mod effect size indicates the weighted average difference in prediction between a referent regression model (e.g., a model summarizing data from a majority or control group) and a focal regression model (e.g., a model summarizing data from a minority or experimental group), scaled in terms of the referent group’s criterion standard deviation ( $S D_{Y 1}$ ).

Table 1 arrays all of the equations that will be discussed in this article, including Nye and Sackett’s (2017) formulations of the d_Mod equations (lightly modified to be in slope-intercept form). Nye and Sackett’s (2017) d_{Mod_Signed} effect-size measure (see Equation 1a) represents the weighted average net difference in prediction between two models across an operational range of predictor scores. A positive (negative) sign for d_{Mod_Signed} means that focal-group criterion scores predicted from the focal-group regression model are, on average, lower (higher) than focal-group criterion scores predicted from the referent-group regression model.

Table 1.

Compendium of Parametric and Nonparametric d_Mod Formulas.

Type of Effect	Effect-Size Measure	Parametric Version		Nonparametric Version
Type of Effect	Effect-Size Measure	Eq. No.	Formula	Eq. No.	Formula
Overall (direct computation)	$d_{M o d_S i g n e d}$ Nye & Sackett (2017) formulation	(1a)	$\frac{1}{S D_{Y_{1}}} \int f_{2} (X) [X (b_{1_{1}} - b_{1_{2}}) + b_{0_{1}} - b_{0_{2}}] d X$
	$d_{M o d_S i g n e d}$ revised formulation	(1b)	$\frac{\int f_{2} (X_{I n f}) d X_{I n f}}{S D_{Y_{1}} \int f_{2} (X) d X} \int f_{2} (X) [X (b_{1_{1}} - b_{1_{2}}) + b_{0_{1}} - b_{0_{2}}] d X$	(9)	$\frac{n^{T} [x (b_{1_{1}} - b_{1_{2}}) + b_{0_{1}} - b_{0_{2}}]}{S D_{Y_{1}} n^{T} 1}$
	$d_{M o d_U n s i g n e d}$ Nye & Sackett (2017) formulation	(2a)	$\frac{1}{S D_{Y_{1}}} \int \sqrt{f_{2} (X) {[X (b_{1_{1}} - b_{1_{2}}) + b_{0_{1}} - b_{0_{2}}]}^{2}} d X$
	$d_{M o d_U n s i g n e d}$ revised formulation	(2b)	$\frac{\int f_{2} (X_{I n f}) d X_{I n f}}{S D_{Y_{1}} \int f_{2} (X) d X} \int f_{2} (X) \| X (b_{1_{1}} - b_{1_{2}}) + b_{0_{1}} - b_{0_{2}} \| d X$	(10)	$\frac{n^{T} \| x (b_{1_{1}} - b_{1_{2}}) + b_{0_{1}} - b_{0_{2}} \|}{S D_{Y_{1}} n^{T} 1}$
Extrema	$d_{M i n}$	(3)	$\frac{1}{S D_{Y_{1}}} Min [\| X (b_{1_{1}} - b_{1_{2}}) + b_{0_{1}} - b_{0_{2}} \|]$	(11)	$\frac{1}{S D_{Y_{1}}} Min [\| x (b_{1_{1}} - b_{1_{2}}) + b_{0_{1}} - b_{0_{2}} \|]$
	$d_{M a x}$	(4)	$\frac{1}{S D_{Y_{1}}} Max [\| X (b_{1_{1}} - b_{1_{2}}) + b_{0_{1}} - b_{0_{2}} \|]$	(12)	$\frac{1}{S D_{Y_{1}}} Max [\| x (b_{1_{1}} - b_{1_{2}}) + b_{0_{1}} - b_{0_{2}} \|]$
Directional	$d_{M o d_U n d e r}$	(5)	$\frac{\int_{X_{I n f} : {\hat{Y}}_{1} < {\hat{Y}}_{2}} f_{2} (X_{I n f}) d X_{I n f}}{S D_{Y_{1}} \int_{X : {\hat{Y}}_{1} < {\hat{Y}}_{2}} f_{2} (X) d X} \int_{X : {\hat{Y}}_{1} < {\hat{Y}}_{2}} f_{2} (X) [X (b_{1_{1}} - b_{1_{2}}) + b_{0_{1}} - b_{0_{2}}] d X$	(13)	$\frac{n_{U n d e r}^{T} [x_{U n d e r} (b_{1_{1}} - b_{1_{2}}) + b_{0_{1}} - b_{0_{2}}]}{S D_{Y_{1}} n^{T} 1}$
	$d_{M o d_O v e r}$	(6)	$\frac{\int_{X_{I n f} : {\hat{Y}}_{1} > {\hat{Y}}_{2}} f_{2} (X_{I n f}) d X_{I n f}}{S D_{Y_{1}} \int_{X : {\hat{Y}}_{1} > {\hat{Y}}_{2}} f_{2} (X) d X} \int_{X : {\hat{Y}}_{1} > {\hat{Y}}_{2}} f_{2} (X) [X (b_{1_{1}} - b_{1_{2}}) + b_{0_{1}} - b_{0_{2}}] d X$	(14)	$\frac{n_{O v e r}^{T} [x_{O v e r} (b_{1_{1}} - b_{1_{2}}) + b_{0_{1}} - b_{0_{2}}]}{S D_{Y_{1}} n^{T} 1}$
Overall (additive computation; recommended over direct computation)	$d_{M o d_S i g n e d}^{*}$	(7)	$d_{M o d_U n d e r} + d_{M o d_O v e r}$
	$d_{M o d_U n s i g n e d}^{*}$	(8)	$\| d_{M o d_U n d e r} \| + d_{M o d_O v e r}$

Note: X represents values within the range of observed focal-group predictor scores; X is bounded by the minimum and maximum possible scores in the focal group.

$X_{I n f}$ represents values from an infinite distribution of predictor scores; the distribution of $X_{I n f}$ is integrated to account for the finite range of X.

Eq. No. is the formula’s equation number for in-text reference.

$S D_{Y_{1}}$ is the referent group’s observed criterion standard deviation.

f₂ is the normal-density function for the focal group’s unrestricted predictor scores.

$b_{1_{1}}$ and $b_{1_{2}}$ are the referent- and focal-group slopes, respectively and $b_{0_{1}}$ and $b_{0_{2}}$ are the referent- and focal-group intercepts, respectively.

The subscripts of the integrals for $d_{M o d_U n d e r}$ and $d_{M o d_O v e r}$ specify the values of X to include in the integral; X values are included in the integral as a function of the predicted values of Y associated with X for the referent and focal regression models, where ${\hat{Y}}_{1} = b_{0_{1}} + X b_{1_{1}}$ and ${\hat{Y}}_{2} = b_{0_{2}} + X b_{1_{2}}$ .

$x$ is a column vector of observed focal-group predictor scores.

$n^{T}$ is a row vector of focal-group frequencies with elements that correspond in an ordered fashion to the predictor scores arrayed in $x$ .

$x_{U n d e r}$ and $x_{O v e r}$ are column vectors of focal-group predictor scores that contain the values from $x$ that satisfy ${\hat{Y}}_{1} < {\hat{Y}}_{2}$ and ${\hat{Y}}_{1} > {\hat{Y}}_{2}$ , respectively.

$n_{U n d e r}^{T}$ and $n_{O v e r}^{T}$ are row vectors of focal-group frequencies with elements that correspond in an ordered fashion to the predictor scores arrayed in $x_{U n d e r}$ and $x_{O v e r}$ , respectively.

1 is a column vector of 1 s with as many elements as are in $n^{T}$ .

When subgroup regression lines cross within an operational range of predictor scores and one computes d_{Mod_Signed}, positive and negative differences in prediction will cancel out because the signed effect size summarizes the net difference in predicted criterion scores. For example, if two groups’ regression lines cross at the focal group’s mean predictor score when the predictor is normally distributed, positive and negative differences would completely cancel out and d_{Mod_Signed} would be zero. In settings such as this, d_{Mod_Signed} would fail to suggest the existence of a moderated effect. To overcome this, Nye and Sackett (2017) created an unsigned effect size, d_{Mod_Unsigned}, that quantifies differences between subgroup regression lines without allowing signed differences to cancel out (see Equation 2a). As an unsigned index of an effect, d_{Mod_Unsigned} is useful when one wishes to quantify the overall magnitude of a moderated effect and the net direction of predicted differences between groups is not relevant to one’s research question.

Nye and Sackett (2017) offered two supplementary effect sizes that facilitate the interpretation of d_{Mod_Signed} and d_{Mod_Unsigned}. The d_Min (see Equation 3) and d_Max (see Equation 4) effect sizes indicate the smallest and largest absolute-value differences, respectively, between groups’ regression lines. By computing d_Min and d_Max, one can easily communicate the range of differences that were used to compute d_{Mod_Signed} and d_{Mod_Unsigned} and identify whether subgroup regression lines cross within the operational range of predictor scores (an occurrence signaled by a d_Min equal to zero that is paired with a nonzero d_Max).

We refer readers to Nye and Sackett (2017) for more information about d_{Mod_Signed}, d_{Mod_Unsigned}, d_Min, and d_Max. The remainder of this article will describe our refinements to d_Mod.

Refinements to Nye and Sackett’s (2017) d_Mod Effect-Size Measures

Our suggested refinements include (a) adjustments to d_Mod to reduce bias from modest violations of distributional assumptions, (b) special cases of d_{Mod_Signed} that quantify directional differences in predicted criterion scores, and (c) nonparametric methods for computing d_Mod effect sizes. We introduce each of these refinements in the sections that follow.

Modifications to Reduce Bias From Modest Violations of Normality

A subtlety of the d_Mod effect-size measures that makes them particularly well-suited for use in operational settings is that one can integrate differences in prediction within a finite range of scores (i.e., one need not integrate the infinite normal distribution if one is studying predictor scores that span a bounded operational range). However, when d_Mod requires integration involving a finite minimum value and/or a finite maximum value, the cumulative density within that range may fall short of unity by a nontrivial amount due to departures from normality. A cumulative density smaller than 1 indicates that the d_Mod effect size one has computed may differ from what one would have obtained if one’s operational distribution of scores were indeed normally distributed. We propose a simple adjustment to d_{Mod_Signed} and d_{Mod_Unsigned} to offset this bias; we describe an even more effective adjustment later in the article, but the logic of the initial refinement described below is necessary to lay the groundwork for the more advanced method.

Integrating over a finite distribution can result in a biased d_Mod effect size unless one rescales the effect size by the sum of the weights generated by the normal-density function (i.e., the cumulative density). The problem of integrating over a finite distribution of scores is analogous to algebraically computing a weighted average (i.e., $\bar{x} = \frac{\sum_{i}^{n} w_{i} x_{i}}{\sum_{i}^{n} w_{i}}$ , where w is a weight and x is a score) in which the sum of weights in the denominator exceeds the sum of the weights used in the numerator. Of course, cumulative densities appreciably smaller than unity may indicate that the assumption of a normal distribution is inappropriate. When normality cannot be reasonably assumed of one’s data, one should use the nonparametric versions of d_Mod described later in this article. For our present discussion, we assume that departures from normality are modest and that using a normal-density function to compute d_Mod is a reasonable choice.

When integrating a d_Mod function over a finite range of predictor scores, an option for addressing a cumulative density smaller than one is to simply divide the d_Mod effect size by the cumulative density of scores within the operational range. For example, if the operational range of scores includes 95% of the theoretical normal distribution, dividing the d_Mod effect size by .95 will rescale the weights to account for the fact that scores outside of the operational range are impossible to achieve and that 95% of the theoretical distribution represents 100% of the distribution of possible scores. An adjustment factor that implements this correction is incorporated in Equations 1b and 2b and we recommend using these updated equations over Equations 1a and 2a. The gains in precision from using our corrections are likely to be small in most settings, but the corrections will consistently attenuate the bias from minor violations of normality. Unless otherwise noted, all subsequent mentions of d_{Mod_Signed} and d_{Mod_Unsigned} in this article will refer to Equations 1b and 2b, respectively.

In formulating our updated version of d_{Mod_Unsigned} in Equation 2b, we have further reduced bias by modifying the way in which the sign is removed from differences in prediction. Although Nye and Sackett’s d_{Mod_Unsigned} formula (Equation 2a) accomplishes the advertised goal of removing the sign from differences in prediction, it does so in a way that slightly alters the meaning of the resulting d_{Mod_Unsigned} effect size relative to the meaning of the d_{Mod_Signed} effect size (Equation 1a). Under the radical in Equation 2a, one only squares the differences between regression lines and does not square the density weights. Due to the fact that only the differences between regression predictions are squared (rather than squaring the products of the differences and densities), the square root of the product under the radical does not result in an absolute difference like one might expect. Over the range of predictor scores, taking the square root of the unsquared densities in Equation 2a alters the proportional weight given to each predictor score: Scores toward the middle of the distribution receive too little weight and extreme scores receive too much weight. See Figure 1 for comparisons of the distributions of raw and square-root densities, as well as the corresponding distributions of proportional weights. As absolute differences are the intuitive metric for unsigned differences, d_{Mod_Unsigned} should be computed using Equation 2b rather than Equation 2a.

Figure 1.

Comparisons of the distributions of densities and square roots of densities for the normal distributions used in Equations 2a and 2b, respectively. Distributions of proportional weights were computed by dividing the density associated with each predictor score by the corresponding cumulative density. Comparison of the figure panels reveals that taking the square roots of densities distorts the weight given to each predictor score, whereas using raw densities does not.

A limitation of Equations 1b and 2b is that they assume that violations of normality are symmetric, with similar effects on the low and high ends of a score distribution, but this not likely to be the case in all settings. We present methods that account for asymmetric violations of assumptions after first introducing our directional d_Mod effect sizes.

Separate Effect Sizes for Positive and Negative Differences in Predicted Criterion Scores

The d_{Mod_Signed} and d_{Mod_Unsigned} effect sizes are useful for quantifying effects that are moderated by a categorical variable. However, some categorically moderated effects occur in domains in which the regions of negative and positive differences in predicted criterion values have substantive importance. An example of this is predictive bias from the industrial and organizational psychology literature. Predictive bias occurs when assessment scores predict performance differently as a function of one’s protected-class status (e.g., one’s sex or race). For example, a biased test might predict lower performance for Black job applicants when the White (referent group) regression model is used to make predictions than when the Black (focal group) regression model is used. If predicted criterion scores are ${\hat{Y}}_{1} = b_{0_{1}} + X b_{1_{1}}$ using the referent model (where X is a vector of focal predictor scores) and ${\hat{Y}}_{2} = b_{0_{1}} + X b_{1_{1}}$ using the focal model, “underprediction” for the focal group occurs when ${\hat{Y}}_{1} < {\hat{Y}}_{2}$ and “overprediction” occurs when ${\hat{Y}}_{1} > {\hat{Y}}_{2}$ . We suggest that quantifying these directional differences can have interpretive value.

The d_{Mod_Signed} and d_{Mod_Unsigned} effect sizes both combine directional differences in prediction into a single effect size, which can occasionally make it difficult to interpret the precise form of an interaction. Nye and Sackett (2017) suggested that d_Mod effect sizes could be computed over any meaningful range of predictor scores, which means that one could compute separate d_Mod effect sizes within any segments of a distribution that are of interest to one’s research question. In our use of d_Mod effect sizes to quantify predictive bias, we have found it informative to break d_{Mod_Signed} into two directional effect sizes. We propose an effect size called d_{Mod_Under} that only quantifies differences in prediction in the score range where negative differences in prediction occur (d_{Mod_Under} is the standardized average of differences in prediction for all ${\hat{Y}}_{1} < {\hat{Y}}_{2}$ ; see Equation 5) and an effect size called d_{Mod_Over} that only quantifies differences in prediction in the score range where positive differences in prediction occur (d_{Mod_Over} is the standardized average of differences in prediction for all ${\hat{Y}}_{1} > {\hat{Y}}_{2}$ ; see Equation 6). These directional effect sizes facilitate the interpretation of moderated effects, especially when groups’ regression lines cross within an operational score range. If subgroup regression lines do not cross and there are no ${\hat{Y}}_{1} < {\hat{Y}}_{2}$ differences in prediction within an operational score range, d_{Mod_Under} will be zero and d_{Mod_Over} will be equal to d_{Mod_Signed}. Similarly, if subgroup regression lines do not cross and there are only ${\hat{Y}}_{1} < {\hat{Y}}_{2}$ differences in prediction within an operational score range, d_{Mod_Over} will be zero and d_{Mod_Under} will be equal to d_{Mod_Signed}.

The d_{Mod_Under} and d_{Mod_Over} directional effect sizes are useful for isolating the magnitudes of negative (i.e., ${\hat{Y}}_{1} < {\hat{Y}}_{2}$ ) and positive (i.e., ${\hat{Y}}_{1} > {\hat{Y}}_{2}$ ) differences in prediction, respectively. However, provided that the cumulative density of the operational predictor distribution equals unity, d_{Mod_Under} and d_{Mod_Over} can also be used to compute d_{Mod_Signed} and d_{Mod_Unsigned} via addition. When d_{Mod_Under} and d_{Mod_Over} effect sizes computed from a complete normal distribution are added together, they equal d_{Mod_Signed} and, when their absolute values are added together, they equal d_{Mod_Unsigned}. The two directional effect sizes are therefore quite versatile and, because of this, we view d_{Mod_Under} and d_{Mod_Over} as elemental equations for quantifying moderated effects. The results from d_{Mod_Under} and d_{Mod_Over} capture all of the information necessary to make sense of the magnitude of a moderated effect: Separately, they communicate information about negative and positive differences in prediction and, by addition, they can result in d_{Mod_Signed} and d_{Mod_Unsigned}. The information gained from d_{Mod_Under} and d_{Mod_Over} is complementary to the information found by comparing d_Min and d_Max and these directional effect sizes can be helpful for making sense of d_{Mod_Signed} and d_{Mod_Unsigned}.

Based on our recommendations for rescaling d_{Mod_Signed} and d_{Mod_Unsigned} by the cumulative density of predictor scores, Equations 5 and 6 include adjustment factors that rescale the effect sizes by the ratio of the cumulative density from integrating the infinite normal distribution to the cumulative density from integrating over operational scores (i.e., the ratio of the sum of the theoretical weights to the sum of the actual weights). In general, the adjustment to the d_{Mod_Under} and d_{Mod_Over} effect sizes is more appropriate than making a global correction to the d_{Mod_Signed} and d_{Mod_Unsigned} effect sizes as we did in Equations 1b and 2b. This is because the separate corrections made to d_{Mod_Under} and d_{Mod_Over} capture the possibility that violations of normality are asymmetric and differentially impact regions where ${\hat{Y}}_{1} < {\hat{Y}}_{2}$ and where ${\hat{Y}}_{1} > {\hat{Y}}_{2}$ . We recommend computing d_{Mod_Under} and d_{Mod_Over} effect sizes separately (ensuring that the raw directional effect sizes are appropriately rescaled, as shown in Equations 5 and 6) and then adding them together to obtain $d_{M o d_S i g n e d}^{*}$ and $d_{M o d_U n s i g n e d}^{*}$ (see Equations 7 and 8). Although the adjustment factors included in Equations 1b and 2b account for the overall cumulative density’s deviation from unity, we recommend using $d_{M o d_S i g n e d}^{*}$ and $d_{M o d_U n s i g n e d}^{*}$ instead of d_{Mod_Signed} and d_{Mod_Unsigned} when possible because the asterisked additive effect sizes are more robust to asymmetric violations of normality than are the directly computed effect sizes.

We note that our corrections for nonunity cumulative densities do not perfectly correct for abnormalities in the distribution of actual predictor scores. However, they offer a relatively simple way to approximate the effect size of interest. If deviations from the assumed normal distribution are substantial, use of normal-density weights will result in effect sizes of questionable validity. When deviations from normality are great and one has access to the actual distribution of predictor scores, we recommend computing d_Mod using nonparametric equations.

Nonparametric Methods for Computing d_Mod Effect Sizes

The d_Mod equations discussed thus far have assumed that there is an underlying parametric function that describes the distribution of predictor scores. This assumption is not always reasonable and there may be settings in which researchers wish to use observed frequencies as weights when they compute d_Mod. To compute nonparametric versions of all of the effect-size measures described above, we recommend using Equations 9 through 14 in Table 1. These equations provide standardized weighted averages of the differences in predicted criterion scores between referent and focal models using frequencies of observed focal-group scores as weights. These equations are direct algebraic analogs of their parametric counterparts listed in Table 1; if the observed data are normally distributed, the results from corresponding parametric and nonparametric procedures should agree within a reasonable margin of error.

Our notation for the nonparametric formulas implies that one is using a frequency distribution as weights, but it is also possible to perform this procedure with raw data directly. If X is a vector of the focal group’s observed scores in which each entry represents a case, one can simply compute the average of $X (b_{1_{1}} - b_{1_{2}}) + b_{0_{1}} - b_{0_{2}}$ and divide that average by the referent-group criterion standard deviation to obtain a nonparametric version of d_{Mod_Signed}. The same logic applies to computing the other nonparametric d_Mod effect sizes from observed scores.

These nonparametric versions of d_Mod will be appropriate only if one can reasonably assume that the population to which one seeks to generalize will follow approximately the same distribution as the observed frequencies. If the sample used to derive the weights is atypical of the population of interest, these nonparametric equations will only be suitable for describing differences in prediction in the sample and will not be useful for making generalizations.

Empirical Examples

To illustrate the impact of our refinements, we analyzed a data set from the GATB validation project that substantially overlaps with the data set analyzed by Nye and Sackett (2017) in terms of the occupational groups represented. We computed unstandardized regression models for all of the occupational groups reported by Nye and Sackett that were represented in our version of the data set (see Nye & Sackett, 2017, for information on how these occupational groups were identified). Table 2 presents comparisons of the effect sizes computed using Nye and Sackett’s equations and using our revised equations. Some results with large proportional differences in Table 2 are associated with very small raw differences, but other differences (e.g., raw differences of .07 and .08 for d_{Mod_Unsigned}, associated with proportional differences of 24.14% and 18.18%, respectively) are of magnitudes that could impact interpretations of effect sizes and complicate comparisons of effect sizes across contexts. The majority of the differences from using the revised equations came from our use of absolute values in computing d_{Mod_Unsigned}, which illustrates that the differences between the weight distributions depicted in Figure 1 can have noticeable, practical impacts on effect-size estimates.

Table 2.

d_Mod Effect Sizes Computed for Selected Occupational Groups in the GATB Validation Database.

Occupational Group	Sample Sizes		Results From Nye and Sackett’s (2017) Equations		Results From Revised Equations		Directional Effects		Raw Difference Between Nye and Sackett’s (2017) Equations and Revised Equations		Percentage Difference Between Nye and Sackett’s (2017) Equations and Revised Equations
Occupational Group	n White	n Black	d_{Mod_Signed}	d_{Mod_Unsigned}	d_{Mod_Signed}	d_{Mod_Unsigned}	d_{Mod_Under}	d_{Mod_Over}	d_{Mod_Signed}	d_{Mod_Unsigned}	d_{Mod_Signed}	d_{Mod_Unsigned}
Clerks	268	118	.06	.18	.06	.15	–.04	.10	.00	.03	0.00	20.00
Bindery workers	97	51	.38	.40	.40	.40	.00	.40	–.02	.00	–5.00	0.00
Ship fitters	150	75	.28	.28	.28	.28	.00	.28	.00	.00	0.00	0.00
Child care workers	57	96	–.18	.18	–.18	.18	–.18	.00	.00	.00	0.00	0.00
Bench assemblers	67	91	.22	.36	.22	.29	–.04	.26	.00	.07	0.00	24.14
Forming machine operators	195	52	.67	.70	.68	.68	.00	.68	–.01	.02	–1.47	2.94
Millwrights	522	84	.56	.57	.56	.56	.00	.56	.00	.01	0.00	1.79
Typists	348	244	–.01	.07	–.01	.05	–.03	.02	.00	.02	0.00	40.00
Salespersons	151	64	.10	.18	.11	.15	–.02	.13	–.01	.03	–9.09	20.00
Steel workers	276	59	.43	.45	.44	.44	.00	.44	–.01	.01	–2.27	2.27
Computer operators	131	53	.46	.56	.47	.49	–.01	.48	–.01	.07	–2.13	14.29
Psychiatric aides	112	185	–.04	.04	–.03	.03	–.03	.00	–.01	.01	33.33	33.33
Electrical assemblers (employer 1)	118	56	–.02	.02	–.02	.02	–.02	.00	.00	.00	0.00	0.00
Etchers	130	65	–.39	.52	–.39	.44	–.41	.02	.00	.08	0.00	18.18

Note: Equations used to compute all tabled effect-size estimates are displayed in Table 1. The occupations listed in this table are those for which we could match the occupational groups analyzed by Nye and Sackett (2017) to cases in our version of the GATB data set. Sample sizes for millwrights and typists differ from the sample sizes reported by Nye and Sackett (2017) because we included all individuals in the data set whose Dictionary of Occupational Titles (DOT) codes matched with these occupations’ DOT codes. All effect sizes were computed using unstandardized regression models for congruence with how the effect sizes are computed in practice; thus, the effect sizes computed here using Nye and Sackett’s equations may differ from the effect sizes computed from standardized data reported by Nye and Sackett (2017). Raw difference = Nye and Sackett equation – revised equation. Percentage difference = (Nye and Sackett equation – revised equation) / revised equation × 100.

Discussion

In this article, we have outlined several refinements to Nye and Sackett’s (2017) d_Mod effect-size measures that we discovered during our use of these effect sizes. Our goal has been to update readers on methods for computing d_Mod effect sizes and to share the progress that has been made since the concept of d_Mod was first introduced.

As a supplement to this article, we have produced software to compute d_Mod effect sizes. Our software is written in the R programming language (R Core Team, 2017) and is part of the “psychmeta” R package (Dahlke & Wiernik, 2017). The general-purpose “compute_dmod” function computes parametric and nonparametric d_Mod effect sizes from a raw data set and can compute corresponding bootstrapped uncertainty statistics. We also offer functions for computing d_Mod from descriptive statistics and regression coefficients without raw data.

We have found d_Mod effect-size measures to be informative for interpreting moderated effects and we hope that our modifications to Nye and Sackett’s (2017) equations will encourage more researchers to use d_Mod. Our open-source software’s compatibility with all common operating systems will support the use of these effect-size measures in future research.

Footnotes

Acknowledgments

The authors would like to thank Christopher D. Nye for his helpful comments on an early version of this article.

Declaration of Conflicting Interests

The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The author(s) received no financial support for the research, authorship, and/or publication of this article.

ORCID iD

Jeffrey A. Dahlke .

Note

References

Dahlke

J. A.

Wiernik

B. M.

(2017). psychmeta: Psychometric meta-analysis toolkit (Version 0.1.1) [Computer software]. Retrieved from https://CRAN.R-project.org/package=psychmeta.

Nye

C. D.

Sackett

P. R.

(2017). New effect sizes for tests of categorical moderation and differential prediction. Organizational Research Methods, 20, 639–664. doi:10.1177/1094428116644505

R Core Team. (2017). R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. Retrieved from https://www.R-project.org/