Abstract
Currently, the most popular analytical method for testing moderated mediation is the regression approach, which is based on observed variables and assumes no measurement error. It is generally acknowledged that measurement errors result in biased estimates of regression coefficients. What has drawn relatively less attention is that the confidence intervals produced by regression are also biased when the variables are measured with errors. Therefore, we extend the latent moderated structural equations (LMS) method—which corrects for measurement errors when estimating latent interaction effects—to the study of the moderated mediation of latent variables. Simulations were conducted to compare the regression approach and the LMS approach. The results show that the LMS method produces accurate estimated effects and confidence intervals. By contrast, regression not only substantially underestimates the effects but also produces inaccurate confidence intervals. It is likely that the statistically significant moderated mediation effects that have been reported in previous studies using regression include biased estimated effects and confidence intervals that do not include the true values.
In recent years, a growing number of studies have investigated models that integrate moderation and mediation. In moderated mediation models, 1 researchers investigate whether a mediating effect differs according to contexts, groups, or the values of another variable. In other words, the strength of an indirect effect depends on the value of the moderator, resulting in a conditional indirect effect (Preacher, Rucker, & Hayes, 2007). Researchers have been searching for the most appropriate analytical method for testing moderated mediation. Among all of the analytical procedures, the observed variable regression approach proposed by Muller, Judd, and Yzerbyt (2005); Edwards and Lambert (2007); Preacher et al. (2007); and recently by Hayes (2013) has been the most popular. This approach includes the product of an independent variable and a moderator in the regression equations to estimate the moderating effect. Nonetheless, the estimates of both mediating and moderating effects using the regression approach are typically underestimated because of measurement errors (Aguinis, 1995; Evans, 1985; Ledgerwood & Shrout, 2011).
Structural equation modeling (SEM) has been extensively used to estimate structural parameters because it corrects for measurement errors. However, it is not widely utilized in testing latent interactions because most of the methods for modeling latent interactions involve some type of multiplication of indicators to form the indicators of the latent variable that represents the product of the two latent variables and require the inclusion of complicated nonlinear constraints in the model. Approximately a decade ago, Klein and Moosbrugger (2000) introduced the latent moderated structural equations (LMS) approach to estimate interaction effects in SEM; this approach does not require the creation of a latent variable to represent the latent interaction. The LMS was later incorporated as the latent variable interaction function in Mplus, which can be easily implemented. Therefore, our objective is to extend the LMS approach to estimate the moderated mediation of latent variables and compare the regression approach with the LMS approach in testing latent moderated mediation effects, particularly with respect to the accuracy of estimates, statistical power, and Type I error in significance tests and the accuracy of confidence intervals (CIs).
The Regression Approach for Testing Moderated Mediation
The regression approach introduced by Muller et al. (2005) extends the methods for testing mediation by examining the moderator’s effect on each of the paths in the two models introduced by Baron and Kenny (1986). Specifically, researchers first examine the moderator’s effect on the overall effect that the independent variable has on the dependent variable in a model without the mediator. Researchers then examine the moderator’s effect on both the direct effect that the independent variable has on the dependent variable and the indirect effect that the independent variable has on the dependent variable through the mediator. The existence of moderated mediation and mediated moderation is assessed by estimating the regression coefficients in these two models. The framework introduced by Edwards and Lambert (2007) combines moderated multiple regression (MMR) and path analysis. These authors developed regression equations for testing the basic mediation model and seven moderated mediation models. Substituting different moderator values into each equation enables an analysis of the moderator’s effect on the direct, indirect, and total effects of the independent variable on the dependent variable. Preacher et al. (2007) developed an SPSS syntax named MODMED for testing moderated mediation. More recently, Hayes (2013) developed a new syntax called PROCESS for SPSS and SAS to analyze 76 moderated mediation models. Researchers can use these syntaxes to obtain estimates for all path coefficients, the conditional direct and indirect effects at certain particular values of the moderator, and the bias-corrected bootstrap confidence intervals (BCCIs) for the conditional effects.
One critical limitation of these methods is that they use regression analyses that assume that variables are measured without error. However, the variables in organizational and psychological studies are typically not directly observable and include measurement errors. The presence of measurement errors not only impedes the power to test for the significance of various parameters in the model but also results in biased estimates of the parameters. In fact, researchers have long realized this power issue in MMR and have stated that the power of MMR is a function of various factors, including the independent variable range restriction, the total sample size, the correlation between the independent variable and the moderator, the effect size of the moderating effect, and the reliabilities of the independent variable and moderator (Aguinis, 1995; Aguinis & Stone-Romero, 1997; Evans, 1985; Stone-Romero & Anderson, 1994). When the independent variable or the moderator has less than perfect reliability, the reliability of the product term of these two variables is likely to be adversely affected, which in turn results in a biased regression coefficient for the product term.
In fact, the biasing effects of measurement errors on the regression coefficients of a model that integrates moderation and mediation are complex, and these regression coefficients can be biased upward or downward. While the measurement errors in endogenous variables have no biasing effect on the regression coefficients (Ree & Carretta, 2006), the measurement error of one independent variable may result in a biasing effect on the regression coefficient of another independent variable, even if that variable is measured without errors if the two independent variables are correlated (Antonakis, Bendahan, Jacquart, & Lalive, 2010; Ree & Carretta, 2006). For example, researchers have asserted that measurement errors may result in the underestimation of mediating effects when using the regression approach. The simulation study conducted by Cheung and Lau (2008) demonstrated that both the size of the measurement errors and the effect size caused deviations in the parameters of a mediation model that was estimated by regression. Specifically, true mediation was underestimated by 5.5%, on average, when item reliability was equal to 0.90 and by 12.4%, on average, when item reliability was equal to 0.75. The simulation by Ledgerwood and Shrout (2011) further showed that while the estimates of the path between the independent variable and the mediator and of the path between the mediator and the dependent variable were attenuated by measurement errors, the direct path between the independent and dependent variables was underestimated in some cases and overestimated in other cases. This problem, which stems from less than perfect reliabilities in testing moderating and mediating effects with the regression approach, may become more serious when testing moderated mediation. In moderated mediation models, when the independent variable and/or the moderator have reliabilities that are lower than unity, the interaction effect is underestimated. When the mediator is also measured with errors, the estimated moderated mediation is further biased toward zero. As a consequence, the power available for detecting the moderated mediation suffers considerably.
The LMS Approach for Testing Moderated Mediation
Given the limitations of regression, we suggest the use of SEM to examine moderated mediation with latent interaction models. Because SEM corrects for measurement errors when estimating structural parameters, it avoids the problem generated by less than perfect reliability. However, the modeling of a latent interaction is not a straightforward procedure. Moosbrugger, Schermelleh-Engel, and Klein (1997) summarized five methodological problems of latent interactions in SEM as follows: (a) the measurement error of indicators, (b) the nonlinearity of the factor loadings of product indicators, (c) the non-zero mean structure of the latent product term, (d) the effect of linear transformation on the structural coefficient, and (e) the non-normally distributed latent product term. The search for appropriate analytical methods for the latent interaction effect has been undertaken for three decades. The first method to receive wide attention was proposed by Kenny and Judd (1984). This method requires the researcher to create products of indicators as indicators of the latent interaction term and to impose various nonlinear constraints in the model. Hayduk (1987), Jaccard and Wan (1995), Jöreskog and Yang (1996), and Algina and Moulder (2001) suggested further development of the product-indicator methods with modifications to the nonlinear constraints. The two-step approach proposed by Ping (1996) and the method recommended by Marsh, Wen, and Hau (2004), which uses matched indicators, do not require nonlinear constraints. However, all these methods have a number of practical and statistical drawbacks. There are many reviews and comparisons of various methods for modeling latent interactions (e.g., Cortina, Chen, & Dunlap, 2001; Jöreskog, 1998; Marsh et al., 2004; Marsh, Wen, Hau, & Nagengast, 2013; Moosbrugger et al., 1997), and interested readers are encouraged to read those reviews. While simulations have shown that the two-stage least squares (2SLS) method suggested by Bollen (1995, 1996) and Bollen and Paxton (1998), which uses instrumental variables (by using special linear combinations of indicators) to estimate interaction effects, provides unbiased estimates (Klein & Moosbrugger, 2000; Klein & Stoolmiller, 2003), the 2SLS method requires a large sample size to achieve a given power (it is less efficient) because the specific distributions induced by the product term are not exploited (Klein & Moosbrugger, 2000; Klein & Stoolmiller, 2003; Moosbrugger et al., 1997). In addition, because only one indicator of the outcome variable is used, the estimated regression coefficients and standard errors will depend on which indicator of the outcome variable is chosen (Jöreskog, 1998). In sum, in empirical studies, testing interactions using SEM is far less popular than employing the regression approach with observed variables.
Among all the methods for modeling latent interactions, the LMS approach is investigated in the current study for two reasons. First, this method does not require the creation of product indicators and can be easily implemented with Mplus without imposing complicated nonlinear constraints. Second, simulation studies have shown that the LMS approach provides unbiased estimators and standard errors (Klein & Moosbrugger, 2000; Schermelleh-Engel, Klein, & Moosbrugger, 1998). If the LMS approach provides accurate estimates and CIs, it might be a good alternative to the regression approach to estimate a moderated mediation effect. The primary criticism of the LMS approach is that it is computationally intensive. However, with the rapid development of desktop computing capabilities and power, computational intensiveness is no longer an issue. The LMS approach makes use of a new maximum likelihood (ML) method that explicitly takes into account the non-normal distributional characteristics of the joint indicator vector in a general latent interaction model. An elementary interaction model (Kenny & Judd, 1984) with one latent interaction effect can be represented as follows:
By incorporating the matrix Ω, an upper triangular matrix that contains zeros in the diagonal, a structural equation of the general interaction model that includes multiple latent interactions is created (Klein & Moosbrugger, 2000, p. 460):
where η is a (1 × 1) latent endogenous variable; α is a (1 × 1) intercept term; ξ is a (n × 1) vector of latent exogenous variables; Γ is the (1 × n) coefficient matrix representing ξ’s impact on η; Ω is the (n × n) coefficient matrix representing the impact of the product terms ξ
i
ξ
j
(i < j) on η; ζ is a (1 × 1) disturbance variable with E(ζ) = 0 and Cov(ζ, ξ′) = 0; and
where
In LMS, the latent exogenous variables ξ are decomposed into two vectors z1 and z2 , whereas η (and the y indicators) is linear in z2 and nonlinear in z1 . The vector z1 is used to derive the multivariate density function (joint probability distribution) of the indicators, which is then used to estimate the parameters in Equation 2 with the expectation-maximation (EM) algorithm (Dempster, Laird, & Rubin, 1977; Redner & Walker, 1984). In adopting the EM algorithm, the LMS approach performs an iterative estimation of the model parameters and stops when the log-likelihood function of the observed variables is maximized. During the whole process, no products of indicators are created. The simulation conducted by Klein and Moosbrugger (2000) showed that LMS generated unbiased parameter estimations. Similarly, the LMS estimation of standard errors did not show substantial bias. When distribution assumptions were only moderately violated, the simulation results revealed that the approach remained rather robust.
The Reliability-Corrected Single-Indicator LMS Approach
The LMS approach may not be desirable for analyzing certain types of latent variable models, such as models with many indicators and/or studies with small sample sizes. Large numbers of indicators frequently lead to less than adequate overall model fit because a large number of parameters are fixed at zero. In addition, small sample size may result in model non-convergence. Under such conditions, researchers may consider using item parcels rather than all indicators for the constructs. At the extreme, researchers may revert to testing the structural equation model with a simple average score as the single-indicator of a construct. However, rather than assuming that the variables are measured without error, as in the regression approach, the unique variances can be fixed at (1 – reliability) × indicator variances, and the factor loadings can be fixed at 1 (Bollen, 1989; Coffman & MacCallum, 2005; Hayduk, 1987). Then, the model is analyzed with LMS to estimate the moderation, direct, and indirect effects. We refer to this approach as the reliability-corrected single-indicator LMS (RCSLMS) approach. Nevertheless, it is important to note that only items with a well-known unidimensional structure, no cross-loadings on multiple factors, no correlated uniqueness, and no other sources of misspecification could be combined into parcels or single indicators (Bandalos, 2002; Bandalos & Finney, 2001; Marsh, Ludtke, Nagengast, Morin, & Vov Davier, 2013).
Index of Moderated Mediation
Figure 1a shows a first-stage moderation model (Edwards & Lambert, 2007), where moderator W affects the relationship between independent variable X and mediator M. While moderation effect a3 and effect b form the moderated mediation effect, either parameter informs the magnitude of the moderated mediation effect. Hence, Morgan-Lopez and MacKinnon (2006) and Hayes (in press) suggested examining the product term a3b, which represents the change in the indirect effect of X on Y through M for a unit change in W, as a formal test of moderated mediation. Hayes (in press) referred to this parameter as the index of moderated mediation (Index MM) and derived the function for various moderated mediation models. When the CI of the index of moderated mediation does not include zero, it might be concluded that the indirect effects at various levels of the moderator are statistically significantly different.

(a) Moderated mediation model with observed variables used in simulation. (b) Moderated mediation model with latent variables used in simulation.
Bias-Corrected Bootstrap Confidence Intervals
Researchers have long indicated that the sampling distribution of mediation (indirect effect) as the product of two path coefficients is not normal (Bollen & Stine, 1990; MacKinnon, Lockwood, & Williams, 2004; Stone & Sobel, 1990). It is also generally acknowledged that the distribution of the product of two independent normal distribution variables is not normal; in other words, even if both the independent variable X and the moderator W are normally distributed, their product term XW is not normally distributed. Thus, using a t test or a Sobel test, which assume normality, to test the significance of the parameters may not be appropriate. Previous studies have advocated for the construction of BCCIs for testing mediating effects (e.g., Cheung & Lau, 2008; MacKinnon et al., 2004; Williams & MacKinnon, 2008). The SPSS and SAS syntaxes developed by Preacher et al. (2007) and Hayes (2013) also generate BCCIs for moderated mediation effects in the output. Edwards and Lambert (2007) also presented SPSS syntax (CNLR syntax) to produce bootstrap estimates for the regression coefficients and Excel files to produce the BCCIs of the estimated coefficients. With bootstrapping, the sampling distribution of the moderated mediation effect is first estimated nonparametrically. Then, information from the bootstrap sampling distribution is used to construct a confidence interval for the moderated mediation effect (Preacher et al., 2007). We thus also recommend using BCCIs in testing moderated mediation effects in latent interaction models.
In the following sections, we first present the results of our comprehensive simulation that compares the observed variable regression approach, the LMS approach, and the newly proposed RCSLMS approach, illustrating their performance as measured by accuracy in parameter estimation, CI coverage, statistical power, and Type I error rates. Then, we provide a numerical example that demonstrates how to use the LMS approach with Mplus to test for moderated mediation with latent variable models.
Simulation
Method
We conducted two simulation studies to compare the regression approach, the LMS approach, and the RCSLMS approach. These studies aimed to illustrate the performance of the three approaches in correcting for measurement errors when estimating the confidence intervals of the estimated parameters. First, as shown in Figure 1a, a population model with the following four variables was generated: independent variable X, mediator M, moderator W, and dependent variable Y. Moderator W was proposed to moderate the effect of X on M, thus moderating the mediating effect of X on Y through M. Specifically, three levels of effect size of a3 (0.0, 0.2, and 0.4) and three levels of sample size N (100, 200, and 500) were manipulated. The population parameters (true values) of the moderation effect (a3 ) are shown at the top of Table 1. The manipulations of the effect size of a3 = 0.0 and 0.2 followed those adopted by Klein and Moosbrugger (2000). The effect size of 0.7, as used by Klein and Moosbrugger in their simulation, was not included because 0.7 might be considered too large as an effect size in organization studies. Instead, 0.4 was included in the current study. The value of 0.0 was used to test for Type I error, and the values of 0.2 and 0.4 were used to test for power. Because a3 (XW on M) was varied due to manipulations, a3b (Index MM) was also varied. The manipulations of the sample size represent certain commonly used sample sizes in organizational research. The population values of other direct paths are shown on the left side of Table 1. The population parameters a1 (X on M) = 0.2 and a2 (W on M) = 0.4 followed those adopted by Klein and Moosbrugger (2000), and the direct paths c’ (X on Y) = 0.14 and b (M on Y) = 0.59 represented the small and large direct effects in a mediation model (MacKinnon et al., 2004). The covariance between independent variable X and moderator W was set at 0.5.
Simulation Results for Data With Small Unique Variance.
Note: REG = regression; LMS = latent moderated structural equations; RCSLMS = reliability-corrected single-indicator LMS; Est. = estimate; SE = standard error; AB = absolute bias; COV = 95% coverage; SIG = Type I error/power.
Next, the four variables were treated as latent variables, and each was measured by four indicators, the values of which were simulated by adding random error to the indicators. The latent interaction model is presented in Figure 1b. Both the factor loading and unique variance of all indicators were set at 1. These parameters simulated the situation in which the Cronbach’s alpha of the simple average score for each latent variable was approximately equal to 0.8. To examine the effect of lower reliability of the estimated parameters and confidence intervals, we conducted a second simulation with the unique variance set at 2 and a3 set at 0.2 and with three levels of sample size (N = 100, 200, and 500). These values simulated the situation in which the Cronbach’s alpha of the simple average score for each latent variable was approximately equal to 0.67. Taken together, there were 9 conditions (3 sample sizes × 3 moderated mediation effects) in Simulation 1 and 3 conditions (3 sample sizes) in Simulation 2. Five hundred simulated data sets were created for each of the 12 sets of population values, and all the simulated data sets were analyzed under the three approaches.
For the regression approach, the simple average scores of the four indicators of each latent variable were treated as observed variables for estimating the model parameters in Figure 1a, with the procedures repeated for all 500 simulated data sets. A bias-corrected bootstrap confidence interval for each estimated parameter was created with 1,000 bootstrap samples. This procedure simulates the most commonly used approach to analyzing moderated mediation models with latent variables. For the LMS approach, we used the Mplus 7.3 syntax in Appendix 1 to estimate the parameters and create BCCIs for each simulated data set. For the RCSLMS approach, the simple average score of the four indicators of each latent variable was calculated to form the single-indicator. The unique variance was fixed at (1 – 0.8) × 1.25 = 0.25 for the high reliability condition in Simulation 1 and at (1 – 0.67) × 1.5 = 0.5 for the low reliability condition in Simulation 2. The Mplus 7.3 syntax that was used is shown in Appendix 2. The procedure for testing moderated mediation with Mplus is explained in greater detail in the numerical example.
Six criteria were used to evaluate the performance of the three approaches. First, accuracy was measured by the average estimates, standard error of estimates, and average of the absolute bias values obtained from the 500 simulated samples. Absolute bias was computed as the difference between the estimate and the true value. Second, CI coverage was examined by calculating the percentage of the 500 simulated samples in which the CIs included the true value of the parameter. Next, statistical power was examined as the percentage of the 95% CIs that did not include zero when the population value of the parameter was not zero. When the population value of a3 was 0.0, this percentage reflected Type I error for testing a3 and Index MM. Finally, completion rate of the bootstrap samples was examined.
Results of Simulation 1
Table 1 presents the simulation results produced by the regression approach, the LMS approach, and the RCSLMS approach. The average and standard error of the parameter estimates, in addition to the average of the absolute bias over the 500 simulated samples, are presented for three sample sizes (100, 200, and 500) and three levels of the true values of the interaction effect, a3 . The reliabilities of the four constructs, X, W, M, and Y, in the regression approach were 0.800, 0.798, 0.792, and 0.793, respectively.
Average estimates
In general, Table 1 reveals that the average values of the estimated coefficients of a1 , a2 , and b obtained from the observed variable regression approach were smaller than the true values under all conditions. When the true value of a3 equaled 0.0, the average estimates of a3 were close to the true value for all three sample sizes. However, when the true value of a3 equaled 0.2 or 0.4, the average estimates of a3 were all smaller than the true value. As a result, the average estimates of Index MM obtained from the regression were close to the true value (0.0) when a3 was set at 0.0 but smaller than the true values of 0.118 and 0.236 when a3 was set at 0.2 and 0.4, respectively. Finally, for c’, for all three sample sizes, the average values of the estimated coefficients were all larger than the true value.
However, Table 1 shows that the estimated coefficients obtained from the LMS and RCSLMS approaches, in general, were near the true values. As a result, in most cases, the estimates of Index MM that were calculated from these two approaches were also near the true values. For all three approaches, increasing the sample size had little effect on the average estimated coefficients.
Standard error of estimates
Following Ledgerwood and Shrout (2011), standard errors of estimates were used to measure the precision of the estimates. Table 1 shows that in all conditions, the standard errors of the estimates produced by the regression approach were the smallest among those produced by the three approaches. The standard errors of the estimates produced by the LMS and RCSLMS approaches were similar, although the LMS approach produced slightly larger standard errors. In all three approaches, increasing the sample size produced smaller standard errors under all conditions.
Average absolute bias
In addition, Table 1 presents the average absolute bias of the three approaches at different true values of a3 and at different sample sizes. Because the standard error of estimates was centered on the estimated value rather than the true value, it was not a good measure of accuracy. Hence, we calculated the average absolute bias as an additional measure of accuracy. As the value of average absolute bias decreases, the accuracy increases. The results showed that in general, the estimates produced by the LMS and RCSLMS approaches were similar in accuracy, although the RCSLMS approach produced slightly more accurate estimates than the LMS approach under all conditions. A closer examination of the average absolute bias values reveals that the estimated coefficients produced by all three approaches were more accurate with smaller effect sizes and larger sample sizes. However, the effects were stronger for the regression approach than for the LMS and RCSLMS approaches. Hence, the regression approach produced more accurate estimates of parameters with small true values, namely, a1 and c’, than did the LMS and RCSLMS approaches across all three sample size levels. However, the LMS and RCSLMS approaches produced more accurate estimates of b with effect size at 0.59 than did the regression approach. For direct effect a2 with moderate effect size, the regression approach produced more accurate estimates when N = 100, whereas the LMS and RCSLMS approaches produced more accurate estimates when N = 500. Similarly, the estimates of interaction effect a3 using the LMS and RCSLMS approaches achieved higher accuracy than the regression approach when the true value of a3 was set at 0.4 for all three sample sizes. By contrast, when the true value of a3 was set at 0, the accuracy of the regression approach was higher than that of the LMS and RCSLMS approaches for all three sample sizes. When the true value of a3 was 0.2, the regression approach gave more accurate estimates when N = 100, whereas the LMS and RCSLMS approaches gave more accurate estimates when N = 500. The three approaches produced estimates with a similar level of accuracy when N = 200. The results of Index MM demonstrated a similar pattern.
CI coverage
Table 1 also presents the coverage rates of CIs using different approaches under various conditions of a3 effect sizes and sample sizes. Because 95% CIs were constructed, a coverage rate close to 0.95 was desirable. For the LMS approach, the CI coverage rates ranged from 0.930 to 0.972, whereas for the RCSLMS approach, the CI coverage rates were between 0.924 and 0.970. The coverage rates of the true values were near 0.95 in most cases when these two approaches were employed, suggesting that these two approaches performed in a desirable manner. For the regression approach, the range of the CI coverage rates was much larger, from 0.024 to 0.958. Table 1 reveals that the regression approach produced CIs with good coverage rates for parameters with small population values. For instance, the CI coverage rate of a1 with population value at 0.2 ranged from 0.930 to 0.958. Similarly accurate CIs were found for parameters c’ with population value at 0.14 and a3 and Index MM with population values at zero. However, when the effect size was large, such as in the case of parameter b with population value at 0.59, the CI coverage rate was 0.748 when N = 100 but deteriorated to 0.258 when N = 500. The CI coverage rates for interaction effect a3 and Index MM produced by the regression approach were even worse. When the true value of a3 equaled 0.4, the CI coverage rate of the true value of a3 was 0.716 when N = 100 and only 0.112 when N = 500. Similarly, the CI coverage rate for Index MM was only 0.554 when N = 100 and merely 0.024 when N = 500. These CI coverage rates that were generated by the regression approach suggested that the larger the sample size, the greater the likelihood that the 95% CIs constructed did not include the true values.
Power and Type I error
Table 1 suggests that the CIs of the parameters produced by all three approaches enjoyed higher power with larger effect sizes and larger sample sizes. All three approaches produced sufficient power for testing direct effect b, which had a large effect size, even with a sample size of 100. The regression approach had adequate power to test for a2 when N = 100, whereas the LMS and RCSLMS approaches required a sample size of 200 or 500 to provide sufficient power for the test. When testing the parameter a1 , which had a smaller effect size, all three approaches achieved adequate power only when N = 500. Finally, when the effect size was even smaller, only the regression approach provided adequate power for testing c’ when N = 500. Neither the LMS nor the RCSLMS approaches had sufficient power for testing a small effect, even when N = 500. While the regression approach produced CIs with greater capabilities of testing direct effects than the LMS and RCSLMS approaches, there were no substantial differences in power for testing the interaction effect a3 and Index MM among the three approaches. All three approaches had adequate power to test for a3 and Index MM with a sample size of 100 when the true value of a3 equaled 0.4. When the true value of a3 equaled 0.2, all three approaches provided adequate power for testing a3 and Index MM only when N = 500. Finally, as shown in Table 1, an examination of the Type I error rates for true values of a3 and Index MM = 0 reveals that all three approaches produced accurate Type I error rates with the 95% CIs, ranging from 0.044 to 0.076.
Completion rate
Although no instances of non-convergence or improper solutions were observed when using all three approaches, the completion rates of the 1,000 bootstrap samples for each simulated data set were less than 100% in some conditions when using LMS and RCSLMS approaches. The completion rate was less than 100% because there were instances of non-convergence or improper solutions. The completion rate decreases with smaller sample sizes. When using the LMS approach, the completion rate for N = 500 was 100%. The average completion rate for N = 200 was 99.9997%, whereas the completion rate for N = 100 was 99.8905%. When using the RCSLMS approach, the completion rates for N = 500 and N = 200 were both 100%. The average completion rate for N = 100 was 99.9989%.
Results of Simulation 2
Table 2 shows the results of the simulation for data with larger unique variance than the first simulation. The conditions simulated data with a population value of a3 at 0.2 and scales with Cronbach’s alpha at approximately 0.67. The lower reliability resulted in larger standard errors, larger average absolute bias, and lower power for all three approaches. Although the lower reliability resulted in slightly inflated estimates and larger CI coverage rates for both the LMS and RCSLMS approaches, it resulted in more attenuated estimates and a much lower CI coverage rate using the regression approach. The average estimated value of a3 dropped to 0.10, and the CI coverage rate was lowered to 0.574 when N = 200. In the extreme case, the CI coverage rate for Index MM when N = 500 was only 0.012 using the regression approach. That is, only 6 of the 500 simulated samples had CIs that included the true value of 0.118! No instances of non-convergence or improper solutions were observed when using all three approaches. When using the LMS approach, the completion rate for N = 500 was 100%. The average completion rate for N = 200 and N = 100 were 99.9757% and 96.022%, respectively. When using the RCSLMS approach, the completion rates for N = 500, N = 200, and N = 100 were 100%, 99.9964%, and 99.4826%, respectively.
Simulation Results for Data With Large Unique Variance and Population Interaction Effect = 0.2.
Note: REG = regression; LMS = latent moderated structural equations; RCSLMS = reliability-corrected single-indicator LMS; Est. = estimate; SE = standard error; AB = absolute bias; COV = 95% coverage; SIG = power.
Numerical Example
In this section, we provide the details of the analytical procedures that are involved in the LMS approach for examining moderated mediation. This is a three-step procedure for testing moderated mediation in the LMS approach with Mplus 7.3. The model shown in Figure 1b is used for illustration, and the same procedure can be used to analyze any moderated mediation model.
In the first step, the model is estimated without the interaction term. Because the LMS approach does not provide conventional fit indices for assessing overall model fit, the test for a moderated mediation model requires the estimation of two models (Muthén, 2012). In the first model, the latent interaction term is not included (the model in Figure 1b without latent interaction XW and path a3 ). If the model without the latent interaction provides adequate fit and the factor loadings are significant, then we can conclude that the indicators measure the constructs well and proceed to the second step. Table 3 shows the various fit indices of the model without the latent interaction. For this numerical example, the χ2(99) = 116.81 (p = .1069) indicates that the model with no latent interaction fits the data well. The estimated factor loadings and their standard errors, t values, and p values under the assumption of normal distribution, in addition to the 95% and 99% BCCIs, are reported in Table 4. The factor loading of the first indicator for each latent variable was fixed to unity to provide identification. The results of the numerical example showed that the 99% BCCIs for all of the factor loadings did not contain zero. Hence, we conclude that all of the factor loadings were statistically significant.
Numerical Example: Fit Indices for Baseline Model Without Latent Interaction.
Note: The results of a model with a latent interaction should only be interpreted if the model without a latent interaction has adequate fit.
Numerical Example: Factor Loadings and Bias-Corrected Confidence Intervals.
Note: Std Err = standard error of estimate; Norm p = p value under the assumption of normal distribution; BCLL = lower limit of bias-corrected bootstrap confidence interval; BCUL = upper limit of bias-corrected bootstrap confidence interval.
In the second step, the model in Figure 1b with latent interaction XW and path a3 is estimated (Klein & Moosbrugger, 2000; Muthén, 2012). The Mplus syntax used for the second step is presented in Appendix 1. 2 Under the MODEL command, the XWITH option is short for “multiplied with” and is used in conjunction with the | symbol to name and define a product term of continuous latent variables (Muthén & Muthén, 1998-2012). For instance, “XW | X XWITH W” defines XW as the product of X and W. In the ANALYSIS command, the statement “TYPE = RANDOM” requests an estimation of random intercepts and slopes, and it must be stated to define interactions among latent variables. Because the interactions among latent variables require numerical integration, the statement “ALGORITHM = INTEGRATION” is also included. Then, the MODEL CONSTRAINT command and the NEW option are used to create a new variable for the index of moderated mediation. In this demonstration, the index of moderated mediation a3b is given the label INDEXMM. Finally, the BOOTSTRAP option under the ANALYSIS command and the CINTERVAL(BCBOOTSTRAP) option under the OUTPUT command request 1,000 bootstrap samples and the BC bootstrap confidence intervals for the estimates. Bootstrapping with latent interactions has only been made available beginning with Mplus 7.3, which was released in May 2014.
Table 5 presents the estimates, standard errors, t values, p values under a normal distribution assumption, and the lower and upper limits of the 95% and 99% BCCIs for the model parameters. The results showed that regression coefficient a3 of the latent interaction term on the mediator was 0.4621, 95% CI [0.3343, 0.6755], and regression coefficient b of the mediator on the dependent variable was 0.7156, 95% CI [0.4659, 0.9758]. The estimated index of moderated mediation a3b was 0.3307, 95% CI [0.2155, 0.5091]. The regression coefficient c’, of the independent variable on the dependent variable was 0.1981, 95% CI [0.0375, 0.3450].
Numerical Example: Estimated Effects and Bias-Corrected Confidence Intervals.
Note: Std Err = estimated standard error; norm p = p value under the assumption of normal distribution; BCLL = lower limit of bias-corrected bootstrap confidence interval; BCUL = upper limit of bias-corrected bootstrap confidence interval.
In the third step, following Preacher et al. (2007), Hayes and Preacher (2013), and Hayes (in press), we estimate the moderated mediation effect of X on Y at various levels of moderator W to probe the moderated mediation effect. The moderated mediation effect of X on Y for the model in Figure 1 is (a1 + a3 W)b. Although the moderator W is a latent variable without any defined value, the variance of W is estimated and the latent mean of W is set to zero as the default in Mplus. Hence, we can substitute five levels of W in the equation that defines the moderated mediation effect of X on Y, as follows: (a) mean plus two standard deviations, (b) mean plus one standard deviation, (c) mean, (d) mean minus one standard deviation, and (e) mean minus two standard deviations. The BCCI for each parameter is also estimated. The moderated mediation effects at various levels of the moderator are reported in Table 6. The results showed that when W was at one standard deviation above the mean (+1 sdw), the mediating effect of X on Y through M was positive (0.4191, 95% CI [0.2654, 0.6529]). When W as at one standard deviation below the mean (–1 sdw), the mediating effect of X on Y through M was negative (–0.2671, 95% CI [–0.4494, –0.1239]). These results imply that moderator W may change the direction of the mediating effect of X on Y through M.
Numerical Example: Moderated Mediation Effect of X on Y at Various Values of Moderator W.
Note: +2 sdw = two standard deviations above mean; +1 sdw = one standard deviation above mean; –1 sdw = one standard deviation below mean; –2 sdw = two standard deviations below mean; W = value of the moderator W; Effect = estimated moderated mediation effect; SE (Boot) = bootstrap standard error; BCLL = lower limit of bias-corrected bootstrap confidence interval; BCUL = upper limit of bias-corrected bootstrap confidence interval.
Discussion
Given the increasing popularity of moderated mediation models in organizational and psychological studies, the importance of identifying an accurate and user-friendly analytic strategy has increased. Because CIs of estimates provide more useful information than do significance tests, reporting CIs has become a requirement for publication in many organizational and psychological journals. Hence, in addition to examining the accuracy of the estimates and the statistical power for testing the statistical significance of the direct and moderated mediation effects, the current study examined the accuracy of CIs produced under three approaches.
The Regression Approach Versus the Latent Moderated Structural Equations Approach
Although regression has been the most frequently used approach for testing moderated mediation, the existence of measurement errors may lead to biased estimates of moderated mediation effects, which in turn may result in invalid conclusions for management theory and for implementing management practices. These biased estimates also lead to problems in meta-analyses. As the bias in the effect size of moderated mediation is replicated in all empirical studies that use the regression approach, the meta-analytic findings based on these inaccurate estimates of moderated mediation effects are also likely to be biased. In multiple regression with more than one independent variable, measurement errors may decrease or increase or even change the sign of a partial relationship (Cohen & Cohen, 1983). Similarly, the estimated parameters in a mediation model can be larger or smaller than the true values (Ledgerwood & Schrout, 2011). Our simulation results reflect these biases. Even when reliability reached 0.80, which is considered adequate in social science studies, the regression approach continued to produce estimates of parameters a1 , a2 , and b that were smaller than the true values by 5% to 20%. Both larger effect size and lower reliability result in larger attenuation. The estimates of the direct effect c’ from X to Y, however, were slightly larger than the true value. Because product term XW presented poorer reliability, the inaccuracy of the regression approach was particularly serious in relation to the estimates of the moderation effect a3 , which was underestimated by 30% when reliability was 0.8 and by 50% when reliability was 0.67. The reliability ρ xw of the product term XW is a decreasing function of reliabilities ρ xx and ρ ww and the correlation between X and W (Busemeyer & Jones, 1983). Busemeyer and Jones (1983) showed that ρ xw is typically lower than ρ xx and ρ ww and reaches its minimum value of ρ xx ρ ww when X and W are not correlated. However, the average estimates produced by the LMS approach were all slightly larger than the true values under all conditions. With respect to the simulated data sets with reliability at 0.8, the relative bias in estimates were mostly below 2%, with the largest overestimation at 5.7% for c’ when N = 100. Although lower reliability enlarged the relative bias, the relative biases of estimates using the LMS approach were all within the acceptable range of 10% (Cham, West, Ma, & Aiken, 2012). These results demonstrate that the LMS approach is effective in correcting for measurement errors, even under marginally acceptable reliability conditions.
While the regression approach consistently produced a smaller standard error than the LMS approach, the absolute bias produced by the regression approach is not necessarily smaller than that produced by the LMS approach. Our simulation results show that the following three factors affect the magnitude of absolute bias: effect size, sample size, and reliability. In general, larger absolute bias is associated with larger effect size and smaller sample size. However, the impact of effect size on absolute bias is greater for the regression approach, whereas the impact of sample size on absolute bias is greater for the LMS approach. Thus, the regression approach resulted in smaller absolute bias than the LMS approach when the effect size was small and larger absolute bias when the effect size was substantial. For a moderate effect size, the regression approach resulted in smaller absolute bias with small sample sizes but larger absolute bias with large samples. The same pattern of absolute bias was observed for the low reliability samples. The absolute bias of the regression approach was smaller than that of the LMS approach when the sample size was small but was larger than that of the LMS approach when the sample size was large. Although lower reliability has a slight effect on the estimates but a substantial effect on the standard errors produced by the LMS approach, it has a substantial effect on the estimates but little effect on the standard errors produced by the regression approach.
What has drawn relatively less attention is that regression typically generates biased CIs, in other words, CIs that do not include the true values. This result can be attributed to the small standard errors and underestimated parameters. Following Collins, Schafer, and Kam (2001) and Cham et al. (2012), in the current study, it was considered acceptable if the coverage rate of the 95% CIs exceeded 90%. Our simulation results showed that all the CIs using the LMS approach had acceptable coverage rates and were near the theoretical 95%. However, the CI coverage rates using the regression approach were only acceptable for small direct effects (a1 = 0.2 and c’ = 0.14). The coverage rates for medium and large direct effects (a2 = 0.4 and b = 0.59) and for small and medium interaction effects (a3 = 0.2 and 0.4) were all below 90%. The CI coverage rates for Index MM were worse than the interaction effects. The coverage rate was adversely affected by a larger effect size, larger sample size, and lower reliability because a larger effect size and lower reliability resulted in a more substantial underestimation of parameters and a larger sample size resulted in smaller standard errors using the regression approach. Taken together, the CI is narrower, and the center of the CI is farther from the true value. Hence, the CI coverage rate deteriorates.
Table 1 also reveals the power of all three approaches to test moderated mediation. In general, the regression approach has greater power than the LMS approach under all conditions. This result is consistent with previous simulations in which regression produced more powerful tests than SEM (Klein & Muthén, 2007; Ledgerwood & Shrout, 2011). All three approaches had adequate power for testing an interaction effect with effect size at 0.4, even when N = 100. When the effect size of a3 was 0.2, the power deteriorated rapidly with the smaller sample size and lower reliability. None of the approaches provided adequate power for testing either the interaction effect or the Index MM when the sample size was 100 or 200. It is interesting to note that although its power for testing main effects and interaction effects is similar to that of the regression approach, the LMS approach provides more power for testing interaction effects than main effects with the same effect sizes. Regarding Type I error, both the regression and LMS approaches show accurate Type I error rates.
Reliability-Corrected Single-Indicator LMS Approach
An appealing finding of the simulation is the similarity in the results of the RCSLMS approach and the LMS approach and the slightly greater power of the RCSLMS approach. One limitation of the LMS approach is that when testing a complicated model with many indicators and when the sample size is small, the LMS approach may result in poor model fit or the model may not converge. However, because the RCSLMS approach combines the indicators of each construct into one single-indicator, it has a greater chance of producing converged solutions even when the model is complicated and the sample size is small. Despite advantages over the LMS approach, the current results of the RCSLMS approach should be generalized with caution because the conditions in the simulation were all favorable to the RCSLMS approach. First, parallel indicators with equal factor loadings and equal unique variance were simulated. We believe that tau-equivalent indicators with only equal factor loadings and congeneric indicators with both unequal factor loadings and unequal unique variance may work against the RCSLMS approach because simple average scores may not be good representations of the latent constructs under such conditions. In fact, the regression approach may face similar problems. Second, while the uniqueness term for each indicator is the sum of the random measurement error and the specific error, only random measurement error was included in the simulation. If the indicator specific error is not zero, then the random measurement error is an underestimation of the uniqueness and the correction will not be adequate. Future studies are needed to investigate how various factors, such as unequal factor loadings, unequal unique variance, and the existence of specific errors may affect the performance of the RCSLMS approach in testing moderated mediation. Although the RCSLMS approach may be a good alternative under the most favorable conditions, researchers should use the LMS approach as much as possible because the aforementioned factors will not affect the accuracy of estimates or the CIs of the LMS approach. Readers are referred to Petrescu (2013) for a review on using single indicators in SEM and Bandalos and Finney (2001) and Little, Rhemtulla, Gibson, and Schoemann (2013) for reviews on item parceling issues in SEM.
Forms of Moderated Mediation
When the index of moderated mediation is significant, we can refer to the results in Table 6, which provide a better picture of how the moderator W affects the mediating effect from X to Y through M. This is similar to examining the simple main effects when the interaction of a factorial design is significant in an ANOVA. One form of moderated mediation is where the mediating effects at various levels of the moderator are statistically significant but in opposite directions, as reported in Table 6. A second form of moderated mediation is where the mediating effects at various levels of the moderator are all statistically significant but in the same direction, which makes them stronger at some moderator levels and weaker at others. In this case, the moderator alters the strength of the mediating effect in the relationship between the independent and dependent variables. A third form of moderated mediation is when the mediating effects are statistically significant at some moderator levels and statistically nonsignificant at others. This form of moderated mediation assists researchers in identifying specific critical boundary assumptions and theoretical constraints, which are vital because a theory can be properly utilized only when the theory’s boundary conditions are understood (Bacharach, 1989). Identifying this form of moderated mediation is particularly useful in the interaction analysis suggested by Cortina and Folger (1998) for studies that attempt to retain the null hypothesis. In the mediation study context, if the objective is to find support for the nonexistence of a mediating effect (a null mediation hypothesis), then it should also be established that the mediating effect can be detected at a certain level of a moderator. Evidence of sufficient power to test for the mediating effects should also be provided, and the results should show that the index of moderated mediation is statistically significant and that the confidence interval for the mediating effect at one moderator level (at least) includes zero (statistically nonsignificant) while at another moderator level (at least) does not include zero (statistically significant).
Limitations
Although the current discussion is based on the first stage moderation model, in which moderator W affects the relationship between independent variable X and mediator M (Edwards & Lambert, 2007), the procedures described herein can be easily extended to other moderated mediation models, in which the moderator also affects the direct effect from X to Y (first stage and direct effect moderation model), second stage (and direct effect) moderation model, and models with multiple serial mediators. Hayes (in press) provided the equations for an index of moderated mediation for different models. One limitation of the index of moderated mediation is that when the moderator affects both the first stage and second stage mediation processes, although the CIs of all the path coefficients can be estimated with the LMS approach, the CI of the index of moderated mediation cannot be constructed by bootstrapping because it is a nonlinear function of W (Hayes, in press). However, it does not affect Step 3 in our proposed three-step Mplus procedures, and the results of this step will play a more important role in making inferences about the moderated mediation effects (Edwards & Lambert, 2007; Hayes, in press).
Although we have explicitly discussed the problem of measurement errors in examining moderated mediation effects, there are other assumptions for testing a mediating effect using regression, such as homoscedasticity and independence of errors. There is also the assumption of no missing variable that is correlated with both the mediator and dependent variable, such that the disturbances of the mediator and the dependent variable are not correlated (Shaver, 2005). Other assumptions that may affect the exogeneity of the independent variable when testing mediation and moderated mediation models include equivalent intact groups, no simultaneous causation, and the absence of common-source or common-method variance. Interested readers are referred to the list of 14 threats to validity for making causal claims by Antonakis et al. (2010). Although SEM can include correlated errors and clustered errors in the model, additional studies are required to examine the impact of violating other assumptions of the estimates and interpretations when testing moderated mediation effects with LMS.
An additional limitation of this study is that despite the inclusion of different levels of effect size, levels of random measurement error, and sample sizes in the simulations, this study did not examine the effect of the number of indicators, unequal factor loadings and unequal unique variances across items, multiple mediators, or non-normal observed variables on the accuracy of the parameter estimates and CIs. Nevertheless, it is speculated that unequal factor loadings and unequal random measurement errors tend to have a considerable impact on the regression and the RCSLMS approaches, which require the averaging of indicator scores to represent the constructs. However, the impact on the LMS approach is relatively small. Comparing the moderation effect a3 and Index MM (a3b) results in Table 1, we suspect that the inclusion of more mediators in the moderated mediation model will have little impact on the CI coverage rates when using the LMS and RCSLMS approaches but that the CI coverage rate for Index MM will be further decreased when using the regression approach. Although the impact of non-normal observed variables on the LMS approach is not examined in this study, previous simulation studies have found that the LMS approach provides unbiased estimates when the observed variables do not severely deviate from normality, such as when distributions are symmetric and when kurtosis is less than 1 (Cham et al., 2012; Klein & Moosbrugger, 2000). Because previous simulations have found that 2SLS also provides unbiased estimators in models with interactions, future simulation studies are needed to examine the behavior of the CIs produced by the 2SLS method under various conditions.
Conclusion
An increasing number of organizational and psychological researchers are incorporating moderated mediation effects (conditional indirect effects) into their theoretical models, and most empirical studies currently use the observed variable regression approach in their data analyses. Our simulation has demonstrated that despite the greater power provided by the regression approach, this approach underestimates the moderated mediation effects and produces inaccurate confidence intervals. Unfortunately, increasing the sample size does not make the biased estimates and confidence intervals from the regression approach more accurate. In other words, it is likely that the regression approach may conclude statistically significant moderation or moderated mediation effects even when the confidence interval of the estimated effects does not include the true value. Hence, the results of previous studies should be interpreted with caution. Researchers’ previous focus on conducting significance tests rather than reporting confidence intervals may explain why the bias in confidence intervals produced by regression analysis has been overlooked. Therefore, we suggest that future empirical studies on moderated mediation use the LMS approach, which provides accurate estimates and confidence intervals, although a larger sample size may be needed to achieve adequate power for testing small effect sizes.
Footnotes
Appendix 1
Appendix 2
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
