Abstract
Additive transformations are often offered as a remedy for the common problem of collinearity in moderated regression and polynomial regression analysis. As the authors demonstrate in this article, mean-centering reduces nonessential collinearity but not essential collinearity. Therefore, in most cases, mean-centering of predictors does not accomplish its intended goal. In this article, the authors discuss and explain, through derivation of equations and empirical examples, that mean-centering changes lower order regression coefficients but not the highest order coefficients, does not change the fit of regression models, does not impact the power to detect moderating effects, and does not alter the reliability of product terms. The authors outline the positive effects of mean-centering, namely, the increased interpretability of the results and its importance for moderator analysis in structural equations and multilevel analysis. It is recommended that researchers center their predictor variables when their variables do not have meaningful zero-points within the range of the variables to assist in interpreting the results.
Organizational theories continue to grow in complexity, with more researchers including moderating and nonlinear relationships into their theories of workplace behaviors (Bobko, 2001; Cortina, 1993). To analyze these complex relationships, researchers often use moderated multiple regression analysis (MMRA) and polynomial regression analysis (PRA). Although these techniques are flexible enough to perform many complex analyses using commonly available statistical software, quite often the correlation between components and the product terms used to represent moderating or nonlinear effects are high. This collinearity is often blamed for the lack of statistical support for these moderating and nonlinear effects (e.g., Dunlap & Kemery, 1987; Morris, Sherman, & Mansfield, 1986; Sockloff, 1973).
A remedy commonly offered to reduce this collinearity is to transform the variables—usually using mean-centering—before creating the interaction or nonlinear term. Despite the frequency with which predictors are mean-centered, it seems that the original rationales for these additive transformations may be lost. In fact, we believe that in most cases that mean-centering is used, it fails to accomplish the goal intended by the researcher. The purpose of this article, then, is to address some of the myths surrounding mean-centering in MMRA and PRA and to clarify what these additive transformations do and do not accomplish. Throughout the article, we demonstrate our points with derivation of relevant equations as well as illustrative empirical analyses. 1 Before continuing, however, it is prudent to acknowledge that the purpose of this article is educational; the substantive content presented here has been presented elsewhere (e.g., Aiken & West, 1991; Cohen, 1978; Cohen, Cohen, West, & Aiken, 2003; Cronbach, 1987; Echambadi & Hess, 2007; Friedrich, 1982; Jaccard & Turrisi, 2003; Jaccard, Wan, & Turrisi, 1990; Smith & Sasaki, 1979). For example, recent work by Echambadi and Hess (2007) demonstrates some of the results outlined here. However, our treatment of the topic is considerably less technical by not relying on complex matrix algebra (e.g., Kroenecker products). Furthermore, we expand on these works by discussing situations when centering has no impact. In addition, we discuss the role of centering in multilevel models, moderated structural equation modeling, and when analyzing categorical moderator variables. Our intent is to integrate findings across different publications into a single resource to guide organizational researchers in understanding what mean-centering will accomplish in MMRA and PRA and in deciding if and when to center their predictor variables.
Prevalence of MMRA and PRA in Organizational Research
A variable (X 2) is said to moderate the relationship between an antecedent (X 1) and consequence (Y) when the relationship between X 1 and Y varies at different levels of the moderator X 2. Saunders (1955, 1956) credits Gaylord and Carroll’s paper at the 1948 American Psychological Association meeting for the first discussion of a moderator variable where they cited examples of moderating relationships from the fields of personnel selection, experimental psychology, and personality dynamics. A nonlinear relationship between an antecedent (X 1) and the outcome (Y) implies that the relationship between X 1 and Y is not constant across the range of X 1. Perhaps the most recognizable nonlinear relationship is the Yerkes-Dodson law—arousal improves performance to a certain point, but then arousal hinders performance (Yerkes & Dodson, 1908).
Additive predictor score transformations in MMRA and PRA were first advocated by Marquardt and Snee (1975), who demonstrated that mean-centering predictor variables reduced ill-conditioning in the data (see also Cohen, 1978; Cronbach, 1987). Loosely, conditioning in the data can be thought of as the stability of the results obtained to fluctuations in said data—the correlations among variables in the data cause ill-conditioning in the data that will then inflate the variances of the coefficients estimated from that data set and cause unstable results. Nonessential ill-conditioning results simply from the scaling of the variables, whereas essential ill-conditioning results from substantive relationships among the variables (Cohen et al., 2003). As we will detail, mean-centering will remove the nonessential ill-conditioning because mean-centering changes the scaling of the variables in moderator analysis. Bradley and Srivastava (1979) and later Marquardt (1980) extended this work to polynomial regression analysis. Since then, popular regression texts have recommended mean-centering predictor variables prior to conducting MMRA or PRA (e.g., Aiken & West, 1991; Bobko, 2001; Cohen et al., 2003; Jaccard & Turrisi, 2003; Pedhazer, 1997).
As Table 1 shows, an appreciable number of articles, between 2000 and 2010, in three organizational journals have investigated moderating effects; as expected, fewer investigated nonlinear relationships. For those researchers who did transform their variables, mean-centering was by far the most popular, and collinearity reduction was the most cited justification. Interestingly, only two articles noted that their transformation reduced nonessential collinearity—collinearity due simply to the scaling of the variables—even though the source cited most often for these transformations (i.e., Aiken & West, 1991) explicitly states this point (pp. 35-36). In our view, this reliance on centering predictor variables to “reduce collinearity” has become an indoctrinated practice achieving a status of urban legend and has engendered a few myths about what centering does and does not accomplish. Like most urban legends, there is a kernel of truth regarding the capability of additive transformations to reduce collinearity. Throughout this article, we try to caution readers about the inaccuracies regarding centering and collinearity as well as clarifying the kernels of truth.
Prevalence of Moderating and Nonlinear Relationships in Organizational Research
aThe author(s) centered to both reduce collinearity and improve interpretability.
bEach of these authors was cited only once in different articles.
The Urban Legend and Kernel of Truth
As seen in Table 1, a common justification for additively transforming variables prior to conducting MMRA or PRA is that it reduces collinearity. Furthermore, a common misconception we have heard is that reducing collinearity will then change the results and conclusions of the regression analysis or even make it easier to find significance. Kromrey and Foster-Johnson (1998) had also encountered this misconception with other researchers. They paraphrase the misconception voiced by many that “The reduction in collinearity afforded by centered data when product terms are included in correlation matrices should ‘logically’ provide better (or at least different) hypothesis test results” (pp. 51-52); they go on to correct this misconception. In an example of PRA, Luchak and Gellatly (2007) stated that mean-centering variables before computing the squared term “had no noticeable impact on our results and did not alter our conclusions” (p. 791), implying that there was a possibility that mean-centering might have influenced statistical conclusions. In addition, we have had many conversations with researchers, faculty, students, and practitioners who believe that centering is a panacea for the collinearity problem. We believe this thinking is common among organizational researchers.
The kernel of truth to this myth is that mean-centering does reduce collinearity but reduces nonessential collinearity, which is the type of collinearity most researchers do not care about. Indeed, the seminal articles on this topic (e.g., Bradley & Srivastava, 1979; Cohen, 1978; Cronbach, 1987; Friedrich, 1982; Marquardt & Snee, 1975; Smith & Sasaki, 1979) have all noted that the large degree of collinearity would raise computational errors, but mean-centering variables will not alter the substantive conclusions of the analyses (see also Aiken & West, 1991; Jaccard et al., 1990; Jaccard & Turrisi, 2003). Although we demonstrate the following points using mean-centering, the results do generalize to other forms of additive transformations.
Situations Where Additive Transformations Do Not Matter
We begin by outlining results and conclusions that do not change even if predictors in MMRA or PRA are mean-centered.
Mean-centering only reduces nonessential ill-conditioning
Bohrnstedt and Goldberger (1969) derived the equation for the covariance between an interaction term and its constituent part for normally distributed variables. Taking their work, the correlation between an interaction term and its component is:
Bohrnstedt and Goldberger (1969) demonstrated that the covariance of a product term with its component parts is dependent on the scale of the variables; Equation 1 shows that the correlation is influenced by the means of X
1 and X
2. Marquardt and Snee (1975), Marquardt (1980), and Bradley and Srivastava (1979) showed that a large part of the relationship between a product term and its component parts can be reduced by shifting the origin. This reduction in collinearity is termed nonessential collinearity. Nonessential collinearity is collinearity that exists merely due to the scaling of X
1 and X
2 (Cohen et al., 2003); when the scaling of X
1 and X
2 changes, the correlations between an interaction and polynomial term with its components will also change. This can be demonstrated quite effectively by showing that a set of five scores (i.e., 1, 2, 3, 4, 5) correlates 0.98 with their squares (i.e., 1, 4, 9, 16, 25). However, after centering the scores (i.e., –2, –1, 0, 1, 2), the correlation of these centered scores with the squares (i.e., 4, 1, 0, 1, 4) is reduced to 0.00. Here, one can see the large impact scaling has on the correlation between multiplicative terms and their components. Mean-centering reduces the correlation between an interaction and polynomial term with its components by making all expected values 0. If X
1 and X
2 are bivariate normally distributed, mean-centering will reduce
More consequential, however, is that mean-centering will not alter the correlation between X 1 and X 2. The correlation between X 1 and X 2 is all essential collinearity and can have deleterious effects on the estimation of regression models, regardless of mean-centering. In addition, additive transformations will not alter the relationships between X 1 and Y or X 2 and Y. The intercorrelations involving the X 1 X 2 term, on the other hand, will be different depending on whether or not X 1 and X 2 are centered; indeed, Cohen (1978) cites these potentially drastic (p. 861) changes in the correlations as the reason that researchers worry about using MMRA and PRA. The differences in these correlations, however, are attributed to the differences in the means of the centered and uncentered X 1 X 2 term (see also, Arnold & Evans, 1979; Cronbach, 1987). As demonstrated in the following, the changes in the intercorrelations due to mean-centering will not alter the value of the regression coefficient for the interaction term (i.e., b 3).
Table 2
presents the correlation matrices and descriptive statistics for randomly generated data where X
1, X
2, and Y are normally distributed and to be correlated at
Descriptive Statistics and Correlations Among Predictors, Outcomes, Interaction, and Polynomial Terms With Centered and Uncentered Data a
aData were randomly generated to be normal; however, some non-normality in the data remains.
bThe scale of Y was not altered.
Mean-centering does change lower order regression coefficients, but not the highest order one
Although the correlations involving the interaction and polynomial terms differ depending on whether they are calculated with centered or raw data, this difference has no effect on the regression coefficient for the interaction or polynomial term. The regression coefficients for the component predictors (e.g., X 1 and X 2) will change, but in a systematic way (Aiken & West, 1991; Cohen, 1978). Note that for multiplicative transformations (i.e., when a constant is multiplied with or divided from the predictors), the highest order terms will change, but the substantive interpretation of the effect represented by the terms will remain unaffected (for a review, see Cohen, 1978).
The form of the multiple regression equation for testing moderation is:
Given that the b
3 coefficient in MMRA does not differ between centered and uncentered analyses, the standard error and t value also do not change, resulting in the same statistical conclusions. The formula for the t value of a regression coefficient is:
Equivalence of Centered and Uncentered Standard Errors for Moderated Multiple Regression Analysis (MMRA) a
Note: MSres = mean square residual for full regression model; differences are due to rounding error.
aResults obtained from data generated for Table 2.
Finally, when an interaction is present (i.e., b
3 ≠ 0), the lower order terms—b
1 and b
2—do differ between centered and uncentered analyses often in a “dramatic” way (Cohen et al., 2003, p. 264). As described in greater detail in the following, however, when b
3 ≠ 0 the lower order terms are interpreted conditionally. The interpretation of these simple slopes—the relationship between X
1 (X
2) and Y at specific values of X
2 (X
1)—though is also the same for centered and uncentered analyses (Aiken & West, 1991). Equation 3 can be restructured to:
Centered and Uncentered Moderated Multiple Regression Analyses
aDiagonal elements represent variance.
bFormula from Aiken and West, 1991:
cValues are distributed on (n – k – 1) degrees of freedom.
Mean-centering does not change the fit of the regression model
Thus far, we have demonstrated that mean-centering does not alter the value of or significance test related to the b 3 regression term in MMRA, and a similar logic holds for the highest order regression term in PRA. When the b 3 term is tested hierarchically, the significance test of the b 3 term is essentially a test of the significance of the change in model fit (i.e., ΔR 2). It follows, then, that neither the ΔR 2 value of the regression model nor its significance differs between centered and uncentered analyses; a similar logic holds for the overall R 2 values as well. Indeed, Table 4 shows that the R 2 and ΔR 2 values are identical for the MMRA with centered and uncentered data—again, the results hold for PRA as well. To demonstrate these points further, we begin with a situation when the interaction (or polynomial) term is not included. In the model without the interaction 2
In the model containing the higher order term, we can demonstrate that the R
2 value is the same using the sum of squares regression and total:
Finally, by combining the previous results, it is evident that the ΔR
2 from the model containing only the lower order terms (Model 1 in Table 4) to the model containing the interaction term (Model 2 in Table 4) are the same for centered and uncentered analyses. Given that
Mean-centering does not impact the power to detect moderating or nonlinear effects
The power of a statistical test is the probability that the test will detect an effect when one exists in the population. In the context of MMRA or PRA, the power of the analysis is the probability that the interactive or nonlinear effect will be detected. A myriad of factors affect the power of MMRA and PRA (Aguinis, 1995; Aguinis & Stone-Romero, 1997; Stone-Romero & Liakhovitski, 2002), but many of them are unaffected by mean-centering.
Cohen (1988, 1992) outlines four factors that affect the power of statistical tests familiar to most researchers: the statistical test chosen (i.e., parametric vs. nonparametric), the particular α level selected, the sample size, and the effect size for multiple regression analysis (i.e., f 2, which is driven by R 2). None of these factors change as a result of mean-centering.
McClelland and Judd (1993) demonstrate that range restriction in the component part of an interaction can affect the power of MMRA or PRA. They show that the ability of MMRA and PRA to detect the effect is contingent on the residual variance of the product (i.e., V[X
1
X
2.X
1,X
2];
Shieh (2009) developed an approach to determine both the a priori and post hoc power for a MMRA with two continuous predictors, demonstrating that the power is determined by the distributions of X 1 and X 2 as well as the correlations of X 1 and X 2. As discussed earlier, however, centering does not impact the distributions of X 1 and X 2 or the correlation between X 1 and X 2 (see also Shieh, 2010). Furthermore, the aforementioned discussion was related to the power associated with continuous moderators and predictors. Aguinis and Pierce (1998) and Aguinis, Boik, and Pierce (2001) provide a detailed discussion of the determinants of the power of MMRA with categorical moderators; interested readers are directed to their papers. It is worth noting, however, that again, centering has no impact on the determinants they outline in their papers. A factor affecting the power of MMRA and PRA that appears to change after centering is the reliability of the higher order term.
Mean-centering does not alter the reliability of the product term
Unreliability in predictor variables attenuates the power of a statistical test and has been investigated with less than encouraging results for the power of MMRA (Busemeyer & Jones, 1983; Dunlap & Kemery, 1988; Edwards, 2009; Evans, 1985). Like the effects of range restriction, unreliability in the product or polynomial term is exacerbated when unreliable measures are combined to create the higher order terms. Furthermore, given that constructs in organizational research are almost always measured with error, the effects of predictor unreliability are of great importance to organizational researchers investigating moderating and nonlinear effects. Indeed, Jaccard and Turrisi (2003) showed that the unreliability in the product (or polynomial) term will attenuate the ΔR 2 according to:
Bohrnstedt and Marwell (1978) derived the equation for the reliability of a product term:
From Equation 15, one can see that the reliability of the product term is a function of the reliability of the components and the correlation of the components. Edwards (2009) provides an example: If X 1 and X 2 are uncorrelated and have reliabilities of .70, the reliability of X 1 X 2 is only .49. He goes on to demonstrate that if the correlation between X 1 and X 2 increases to .25 and .50, the reliability of X 1 X 2 increases to .52 and .59, respectively. As another example, if the reliability of X 1 and X 2 are both 0.80, the reliability of the product term can vary from as low as 0.64 if the variables are uncorrelated to as large as 0.82 if the variables are perfectly correlated. However, if two variables have a reliability of 0.60, the maximum reliability the product term can have is 0.68. Finally, suppose one variable is measured very reliably (i.e., 0.90), whereas the other is measured poorly (i.e., 0.60); the reliability of the product term can be as low as 0.54 for uncorrelated variables and only as high as 0.77 with perfectly correlated variables. The conclusion from this is that even when the components have reliabilities that meet standard cut-offs (see Lance, Butts, & Michels, 2006), the interaction term may not achieve the same standard (Edwards, 2009, p. 149). Moosbrugger et al. (2009) note the paradox that “While multicollinearity of predictors enhances the reliability of an interaction term, it also causes severe estimation and interpretation problems because of the high correlations between all variables in the nonlinear regression model” (pp. 107-108).
With respect to centering, Bohrnstedt and Marwell (1978) recognize that the reliability of a product term is scale dependent. More specifically, they showed that any transformation that alters the ratios of means to standard deviations of the component terms (i.e.,
What Additive Transformations Do Accomplish
Mean-centering does have some beneficial outcomes that aid researchers. In particular, the traditional reason to center was the reduction in nonessential ill-conditioning, limiting the likelihood of computational errors. However, given the computational capacity of today’s computers, it is unlikely that failing to remove nonessential ill-conditioning will lead to computational errors for traditional MMRA. There are other reasons, however, that still justify centering.
Mean-centering improves the interpretability of results
In the past, some debate existed as to whether the lower order terms can be interpreted in MMRA and PRA. One view suggested, as Stone (1988) summarizes, that “Main effects cannot be considered in isolation of the interaction effect” (p. 201). Indeed, Katrichis (1993) notes that the arbitrary nature of scales can complicate the interpretation of the lower order term. Alternatively, others believed that the lower order terms have a meaningful interpretation. This is true when there is and is not a moderating effect (see Aguinis, 2004; Irwin & McClelland, 2001). To outline the meaning of lower order terms, suppose a researcher is interested in the effects of job performance and work stress on intentions to quit. In this example, Y is intentions to quit (ITQ), X 1 is work stress (WS), X 2 is job performance (JP), and X 1 X 2 is the multiplicative term capturing the moderating impact of JP on WS and ITQ.
In a typical MMRA, the researcher regresses ITQ on WS and JP in the first model (i.e., Model 1) and in the second model (i.e., Model 2) adds the interaction term. The interpretation of the regression weights for WS and JP depends on which model is being interpreted and whether or not the data are centered. Regardless of centering, however, the regression weights for WS and JP in the first model (i.e., the model excluding the interaction term) are interpreted as average effects (Aguinis, 2004; Jaccard & Turrisi, 2003). That is, b 1 is the WS-ITQ relationship averaged across JP; similarly, b 2 is the JP-ITQ relationship averaged across WS. Similarly, b 1 and b 2 can be interpreted as the relationship between X 1 (X 2) and Y holding X 2 (X 1) constant. In the second model (i.e., the model including the interaction term), the first-order effects are interpreted as conditional or simple effects (Aguinis, 2004; Jaccard & Turrisi, 2003). In other words, b 1 is the relationship between WS and ITQ at a specific value of JP—that is, when JP equals 0. When variables are centered, JP = 0 at the mean value of JP. Similarly, b 2 is the relationship between JP and ITQ at the mean value of WS. In Model 2, then, the regression weights for lower order terms no longer represent average effects, but are conditional on the value of the other lower order term—more generally, with centered variables b 1 (b 2) is the relationship between X 1 (X 2) and Y at the mean value of X 2 (X 1) (Jaccard & Turrisi, 2003).
When variables are not centered in Model 2, b 1 and b 2 are interpreted the same as in the centered case; however, in this instance, 0 is not the mean value of X 2 or X 1. Often, 0 is not a meaningful point in many psychological scales. Indeed, most operationalizations of the constructs in organizational research, and in psychology in general, are arbitrary (Blanton & Jaccard, 2006). Because of the arbitrary nature of most of the scales used by organizational researchers, zero-points fall outside the range of the raw-score variables; this makes interpreting results more complicated.
As just outlined, the zero-point is important in multiple regression because the interpretation of b 1 and b 2 depends on a meaningful zero-point. Similarly, the intercept b 0 corresponds to the expected value of Y when all predictors have a value of zero (Aguinis, 2004; Jaccard & Turrisi, 2003). Centering assists researchers by making the results interpretable. This is shown in Table 4. Following Aiken and West’s (1991) procedures for interpreting an interaction at three levels (i.e., at the mean, one standard deviation above the mean, and one standard deviation below the mean), we computed the simple slopes equations. All that differs is that the value of X 2 for the mean in centered analyses is 0. The relationship between X 1 and Y when X 2 = 0 is the relationship at the mean value of X 2.
It should be noted that mean-centering may not be necessary when a meaningful zero-point naturally occurs and zero falls within the range of the data. Ratio scales are an example of scales with a meaningful zero-point that organizational researchers do use in their research; some common variables are: tenure, counts of absences, production counts, and number of job searches. When using scales like these, centering for interpretation is not necessary (as long as zero falls in the range of the data). Other than ratio scales, any interval scale that has a meaningful zero-point can also be used without centering. Examples of this type of measurement are item response theory (IRT) parameter estimates; the zero-point indicates average standing on the latent trait continuum. In this situation, researchers would not necessarily need to transform the θ-estimates to make zero the mean θ-level since a θ score of zero is meaningful. Note, however, that these types of variables may still be mean-centered with no impact on the interpretation of the interactive effect (i.e., b 3).
Furthermore, researchers may wish to center their variables at some other value on that variable when he or she wants zero to represent that value. Any constant subtracted from X 1 and X 2 will not affect the significance or interpretation of b 3, and all of the equivalences between mean-centered and uncentered analyses demonstrated previously will hold for centering on any constant when applying MMRA and PRA. In organizational research, some examples of values other than the mean to center at include: a cut score on a selection test, a minimum level of stress, or a specific level of performance. Although the substantive conclusions will be the same, additive transformations other than mean-centering can result in different effects on collinearity (Jaccard & Turrisi, 2003).
Situations Where Additive Transformations Do Matter
Mean-centering matters when testing moderated structural equation models (MSEM)
Structural equation modeling (SEM) is a valuable method because it allows researchers to estimate the relationships among variables while controlling for measurement error (Kenny & Judd, 1984). Kenny and Judd (1984) introduced an approach to modeling interactive and nonlinear effects using SEM to account for the measurement error in MMRA and PRA (Busemeyer & Jones, 1983). Since Kenny and Judd’s seminal work, many approaches to testing these effects have been proposed (for reviews, see Cortina, Chen, & Dunlap, 2001; Marsh, Wen, & Hau, 2004; Schumacker & Marcoulides, 1998). In MSEM, a researcher specifies both a structural model (i.e., the relationships among the latent factors of X
1, X
2, X
1
X
2, and Y) as well as the measurement model (i.e., the relationships among the observed variables and the latent factors). When specifying the measurement model, the researcher has to decide how to use indicators of the latent factors of X
1 and X
2 (i.e.,
Mean-centering and model convergence
Many of the approaches used in MSEM are complicated by the use of nonlinear constraints—that is, parameters within the model are constrained to be nonlinear functions of other parameters in the model (for a review, see Cortina et al., 2001). Because of these complicated constraints, these types of models are prone to nonconvergence and improper solutions (Ping, 1998). Structural equation models are tested using iterative processes (e.g., maximum likelihood estimation); a model converges when the values of the unknown parameters are sufficiently close to the convergence criterion (Bollen, 1989); otherwise, model nonconvergence occurs. A myriad of causes for nonconvergence have been identified. Central to MSEM, nonconvergence can result from ill-conditioning in the data (Moosbrugger et al., 2009) that can cause fluctuations in the observed variance and covariances (Bollen, 1989). Furthermore, large input matrices that need to be inverted can cause convergence issues (Ping, 1995, 1996, 1998; Rigdon, Schumacker, & Wothke, 1998).
Kenny and Judd (1984) specified indicators and latent factors with zero-means, and therefore suggested there was no need to model the means of the latent factors. However, Joreskog and Yang (1996) demonstrated that even with zero-means, the mean-structure still needed to be modeled. With zero-means, only the mean for the latent interaction term is needed (i.e., κ
3 in LISREL notation); however, with non-zero-means, the means for all three latent variables (i.e., κ
1, κ
2, κ
3) need to be modeled. In their demonstrations, Joreskog and Yang simulated data with means of zero meaning κ
1 and κ
2 could be set to zero. Algina and Moulder (2001) later demonstrated that nonconvergence can result when modeling the mean structure with indicator means greater than zero. In particular, they showed that the nonconvergence occurred when trying to estimate the error variances of the observed indicators (in LISREL terms, the
Ping (1995, 1996, 1998) also demonstrated that the more indicators used to model the latent interaction, the larger the input matrix will become—this will make it more difficult to invert the matrix (as is necessary in SEM estimation). Similarly, Marsh et al. (2007) and Moosbrugger et al. (2009) showed that multicollinearity can cause nonessential ill-conditioning in the input matrix (see previous), which can cause nonconvergence issues in MSEM. They suggest mean-centering the indicator variables to reduce the nonessential ill-conditioning and nonconvergence issues.
Mean-centering variables in MSEM can reduce convergence issues by reducing the nonessential ill-conditioning in the input matrix that needs to be inverted when running the estimation procedure. Furthermore, centering variables prior to modeling the mean-structure can limit convergence issues related to the estimation of the
Mean-centering and relationships among latent factors
Other than alleviating nonconvergence issues, mean-centering the indicators of latent variables can alter the relationships among the latent variables. In particular, assuming multivariate normality and after mean-centering, the relationships between the latent factors for X 1 and X 2 will not be related to the latent factor for X 1 X 2. In LISREL terms, the Φ31 and Φ32 parameters become zero. Indeed, this relationship is a major assumption of all of the constrained models—an assumption that is often violated (Marsh et al., 2004). Relaxing this assumption of multivariate normality is what separates approaches like Marsh et al.’s (2004) unconstrained approach from the constrained approaches. Nevertheless, these alternative approaches to testing MSEMs still rely on mean-centering of the indicator variables to alleviate convergence issues.
It is worth noting, also, that centering the manifest variables will not change the relationship between the latent factors for X 1 and X 2. Stated in LISREL terms, the Φ21 parameter will remain unchanged after mean-centering (Bollen, 1989). Moosbrugger et al. (2009) noted that with multivariate normality, centering will remove the nonessential ill-conditioning in the data, resulting in the Φ31 and Φ32 parameters being zero. With non-normal data, these parameters will be greater than zero because of essential ill-conditioning. However, if these variables are not centered, then Φ31 and Φ32 will be much larger. The unconstrained approaches model the essential ill-conditioning due to non-normal data. Alternatively, the constrained approaches assume Φ31 and Φ32 are zero when indicators have zero-means (see Marsh et al., 2004, for a review).
In all, it is important to center one’s manifest variables prior to modeling interactions in SEM to alleviate convergence issues related to (a) nonessential ill-conditioning and (b) complex, nonlinear constraints. If one uses a constrained approach that assumes Φ31 and Φ32 are zero, mean-centering is necessary if the indicator variables have non-zero means. If one uses an unconstrained approach, mean-centering is still recommended to limit convergence issues. Finally, even with indicator variables with meaningful zero-points, centering is still useful to reduce the nonessential ill-conditioning in MSEM regardless of the approach used.
Mean-centering matters when testing multilevel models (MLM)
Researchers interested in investigating multilevel phenomena need to decide how to center their variables. That is, researchers have shown that how variables are centered when testing MLM has implications for interpreting the results. In MLM analyses, which are regression based, the results of the Level 1 analysis (i.e., the intercept and slope) become the outcome variables in the Level 2 analysis—the Level 1 results must have a clear meaning (Raudenbush & Bryk, 2002).
In a simple two-level model, the MLM takes the form of Equations 16, 17.1, and 17.2:
Grand mean-centering
In grand mean-centering, the scores are centered about the grand mean of the predictor variable across groups. Grand mean-centering of variables does not change the rank order of scores on either Xij or Yij . Because grand mean-centering does not alter the rank ordering of the scores, the multilevel association between Xij and Yij does not change after grand mean-centering (Enders & Tofighi, 2007). Furthermore, the mean differences and correlations among the variables remain the same.
Finally, grand mean-centering results in scores that are correlated at Level 1 and Level 2. Subsequently, Xij becomes a composite variable that contains both within-group and between-group variation. As a result, the interpretation of the model parameters differs greatly compared to group mean-centering. Table 5 outlines how the different model parameters are interpreted with grand mean-centering.
aThis table only discusses parameters that have an interpretational difference between the centering types.
bInterpretations based on Enders and Tofighi (2007) and Hofmann and Gavin (1998).
Group mean-centering
In group mean-centering, scores are centered about the mean of the predictor variable within groups. Unlike grand mean-centering, group mean-centering does change the rank order on both Xij and Yij . Since scores are now centered about the means of their respective groups, observations are now expressed relative to other cases belonging to the same group. Furthermore, group mean-centering will change the mean differences and correlations among the variables (Enders & Tofighi, 2007).
As a result then, group mean-centering will remove all between-group variation; Xij becomes a variable containing only within-group variation that correlates only with Level 1 variables but not with Level 2 variables (Enders & Tofighi, 2007). Because the Level 2 variation is removed with group mean-centering, variables correlate less strongly with each other (compared to grand mean-centered and raw score variables). The interpretation of the model parameters following group mean-centering is also shown in Table 5.
Research questions and centering decisions
Given the structural differences between group mean- and grand mean-centering, as well as the interpretational differences, different research questions call for different types of centering (see Enders & Tofighi, 2007; Hofmann & Gavin, 1998). Centering decisions should always be determined by the researcher based on the particular research question being addressed. We briefly cover four types of questions here, but interested readers can obtain a more detailed review in Enders and Tofighi (2007) and Raudenbush and Bryk (2002).
When the researcher is interested in the effect of the Level 1 predictor (i.e., Xij ) on the outcome variable (i.e., Yij ), group mean-centering is the best option. Because group mean-centering removes all between-group variation, the β1j is a pure estimate of the pooled, within-group relationship between Xij and Yij . Alternatively, when the researcher is interested in knowing how the Level 2 variable (i.e., Zj ) impacts the outcome variable, grand mean-centering is the best option. With grand mean-centering, γ01 is the partial regression coefficient that reflects the influence of Zj controlling for Xij (Enders & Tofighi, 2007).
When one is interested in the interactive effect across levels of analyses (i.e., cross-level interaction), the researcher should use group mean-centering. Cross-level interactions require an unbiased estimate of the Level 1 association (i.e., β1j must be meaningful). As stated earlier, β1j is an unbiased estimate of the within-group variation with group mean-centering (Enders & Tofighi, 2007; Hofmann & Gavin, 1998).
Finally, if a researcher is interested in the effects of a variable at both levels of analyses, the researcher can use either type of centering. That is, the researcher may be interested in whether the relationship between the predictor and outcome is the same at both levels of analyses. In this situation, the researcher can choose to either group mean- or grand mean-center because this is one of the rare situations where the parameter values will be algebraically equivalent and lead to similar substantive conclusions (for the derivations and discussion, see Enders & Tofighi, 2007).
Additive Transformations With Categorical Moderators
Often, researchers are interested in how a categorical variable moderates a relationship between a predictor and an outcome. Cohen (1968) demonstrated the flexibility of multiple regression analyses to handle categorical variables. Here, we briefly describe how centering continuous variables impacts the interpretation of the model parameters when the moderator is categorical. For a more detailed discussion, interested readers are referred to Aguinis (2004) and Cohen et al. (2003).
The two most typical coding schemes for categorical variables are dummy coding and effect coding (for other coding schemes, see Aguinis, 2004; Cohen, 1968). With each type of coding, the researcher creates (c – 1) variables, where c is the number of categories in the moderator variables that are then multiplied with the predictor variable. Then, the component terms and interaction terms are entered into the model—hierarchically if desired—to control for the component terms. Importantly, centering continuous variables prior to conducting the analyses aids in interpreting the results without altering the results (Aguinis, 2004). One may want to center continuous variables for ease of interpreting the results; categorical variables should be coded based on the research questions and should not be altered.
Table 6 outlines the interpretation of the model parameters for a model containing one continuous predictor and a categorical moderator with two levels. Table 6 assumes the continuous variable has been centered; differences in interpretation of the various regression coefficients for dummy and effect coding are discussed. For an expanded discussion, including interpretations of other coding schemes, see Aguinis (2004) or Cohen et al. (2003).
aInterpretations adapted from Aguinis (2004) and Cohen, Cohen, West, and Aiken (2003).
bInterpretations for a model containing one continuous predictor—centered—and one categorical moderator with two levels; for more detailed treatments, see Aguinis (2004).
It is important for researchers to carefully consider how they choose to code their variables (Irwin & McClelland, 2001) and should consider centering continuous variables to help interpret the results (Cohen et al., 2003). Also, researchers need to fully consider the power issues specific to MMRA with categorical variables (see Aguinis, 2004; Aguinis et al., 2001; Aguinis & Pierce, 1998).
Final Conclusions and Recommendations
This article has addressed the myth that mean-centering can alter substantive conclusions in moderated multiple regression analysis and polynomial regression analysis. We looked to integrate findings from different fields into a single article that addresses many of the confusions surrounding additive transformations in MMRA and PRA.
We have shown that mean-centering reduces nonessential collinearity between interaction terms and its components. We further showed that although b 1 and b 2 can change after mean-centering, the b 3 coefficient, its standard error, and significance do not change. In line with this, we have demonstrated that the R 2 and ΔR 2 are also the same. Additionally, the simple slopes interpretation of the interaction does not differ between mean-centered and raw score variables.
We further showed that mean-centering does not change the power of the tests of MMRA or PRA. Indeed, if a study suffers from low power, mean-centering the variables will not increase the power. We also showed that mean-centering variables may appear to reduce the reliability of the product term. However, the inclusion of X 1 and X 2 in the model negates the effects of scaling.
Finally, we note, like most scholars on the topic, that mean-centering reduces nonessential collinearity, reducing ill-conditioning in the data. We demonstrated that centering can help interpret the results of the regression analysis by giving X 1 and X 2 meaningful zero-points. We also showed two areas where additive transformations can make an appreciable impact, including when testing MSEMs and when testing hierarchical linear models (HLMs). It is important to recognize that additive transformations will not improve one’s chances of finding significant results, and although not demonstrated here, the conclusions outlined previously extend to higher order interactions and polynomial effects (for a discussion, see Jaccard & Turrisi, 2003).
Before concluding with our recommendations on when to center in MMRA and PRA, we note an interesting finding from Landis and Dunlap (2000) regarding the structure of an MMRA. Specifically, Landis and Dunlap argue that in some cases, specifying the criterion and the predictor variable can be interchangeable. Stated differently, they note, “It is possible to examine the influence of [a moderator] of the x→y and y→x relationships” (p. 255). Although they do note that in most cases this ordering is clear, there are situations where it is more ambiguous (e.g., performance→satisfaction or satisfaction→performance). In these latter situations, then, it is possible that specifying the criterion and predictor in the opposite order can result in a different conclusion related to the moderating effect. They argue for the need for a clear theoretical reason to specify one variable as a criterion and the other as the predictor—if this does not exist, though, they recommend analyzing the moderating relationship on both the x→y and y→x relationships.
We conclude with some recommendations on when to center variables prior to conducting MMRA or PRA. If one is testing an MSEM or HLM, we recommend centering according to the recommendations of Cortina et al. (2001) or Marsh et al. (2004) for MSEM and Enders and Tofighi (2007) or Hofmann and Gavin (1998) for HLM. If one is testing relationships according to Equation 3, we recommend centering the variables if the variables do not have meaningful zero-points. If the variables have meaningful zero-points (e.g., tenure) within the range of the variable, the variables do not need to be centered to increase interpretability.
Often moderating and nonlinear relationships are investigated together—some researchers investigate moderators of a curvilinear relationship. At a more general level, researchers have suggested that quadratic terms (i.e.,
In general, there are some benefits to centering, though they are often exaggerated and misunderstood, especially when discussing regression. We hope this article clarifies the advantages and disadvantages for centering and helps researchers make more informed decisions.
Footnotes
Parts of this article were presented at the 2011 Annual Meeting for the Society of Industrial and Organizational Psychology, Chicago, Illinois. The authors would like to thank the editor and three anonymous reviewers for their detailed and insight suggestions in shaping this manuscript.
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
The author(s) received no financial support for the research, authorship, and/or publication of this article.
