Abstract
Despite that common method variance (CMV) is widely regarded as a serious threat to the validity of findings based on self-reports, there is insufficient research on its confounding influence. We extend Evans’s (1985) pioneering work, and the more recent works by Ostroff, Kinicki, and Clark (2002) and Siemsen, Roth, and Oliveira (2010), to delineate the influence of CMV in a two-level hierarchical linear model based on self-report data. Our simulation results clearly show that in the absence of true effects, it is extremely unlikely for CMV to generate significant cross-level interactions. In fact, if a true cross-level interaction exists, CMV tends to lower the likelihood of its identification and erroneously underestimate the regression coefficient. Our simulation results also show that CMV may lead to a false significant cross-level main effect and overestimate the regression coefficient when no true effect exists. To reduce the probability of Type I errors, we show that raising the significance level to .01, the split sample strategy, and the addition of more CMV contaminated variables are effective in the vast majority of real-life situations and are more effective than increasing the number of groups or persons in each group. Both the split sample strategy and the addition of more CMV contaminated variables are also effective in reducing parameter bias when no true cross-level main effect exists. Trade-offs associated with different strategies are discussed.
Keywords
The confounding influence of common method variance (CMV) has attracted a great deal of attention in organizational research (e.g., Brannick, Chan, Conway, Lance, & Spector, 2010; Conway, 2002; Podsakoff, MacKenzie, Lee, & Podsakoff, 2003; Spector, 2006; Williams & Anderson, 1994). Defined as the shared variance among variables due to the use of a common measurement method (Spector & Brannick, 2009), CMV may inflate the observed relationships between variables (Spector, 2006). Pace (2010) concluded that “CMV may be the most common and dangerous threat to correct interpretation of research results or the most indiscriminately used reviewer critique leading to rejection of good manuscripts” (p. 421). Podsakoff et al. (2003) identified four potential sources of CMV, namely, self-report data, item characteristics, item context, and measurement context, but concerns about CMV are “almost exclusively limited to the use of self-report surveys” (Spector & Brannick, 2010, p. 403). There is consensus that the use of self-report measures threatens the validity of findings, and many journal reviewers indiscriminately reject papers that are based on self-report data. For instance, if respondents rate their leaders on their leadership style and leader effectiveness, they may try to maintain consistency between these two types of ratings, thus artificially inflating the correlation involved (Antonakis, Bendahan, Jacquart, & Lalive, 2010).
Two common strategies to circumvent the problem of self-report data include the collection of data from multiple sources and temporal spacing of variables (see a review by Podsakoff, MacKenzie, & Podsakoff, 2012). However, the use of multiple sources is not always applicable because many important variables, such as personality, attitudes, cognitions, and values, are primarily measured by self-reports and hard to be observed by others (Podsakoff et al., 2012). These variables often occupy key roles in theoretical models, functioning as independent variables, mediators, moderators, and dependent variables. If a moratorium is put on investigating percept–percept relationships, many conceptual models will be significantly impoverished because of the exclusion of intra-individual processes. The remedy of temporal spacing of measures may not be feasible in some research contexts, and the time gap involved may introduce other types of biases, such as the nonequivalence of contextual characteristics across the two waves of measurement. A more productive approach is to identify conditions in which CMV is problematic and develop strategies to minimize its influence (see reviews by Podsakoff et al., 2003, 2012; Richardson, Simmering, & Sturman, 2009). Despite the fact that CMV has received attention for several decades, empirical investigation on the extent to which CMV is a problem is limited (for exceptions, see Cote & Buckley, 1988; Evans, 1985; Siemsen, Roth, & Oliveira, 2010; Williams & Brown, 1994). Pioneering this line of research, a Monte Carlo study by Evans (1985) and a statistical proof by Siemsen et al. (2010) showed that an interaction effect cannot be produced by CMV, but bivariate linear relationships could be either inflated or deflated by CMV.
Most empirical work concerning the influence of CMV is confined to single-level designs, typically at the individual level (for an exception, see Ostroff, Kinicki, & Clark, 2002). However, multilevel perspectives on organizational phenomena are now commonplace (e.g., Herold, Fedor, Caldwell, & Liu, 2008; Hofmann, Griffin, & Gavin, 2000; Kirkman, Chen, Farh, Chen, & Lowe, 2009; Klein & Kozlowski, 2000). The use of self-report measures is very common in multilevel research, namely, measures at different levels are collected from the same respondents (Ostroff et al., 2002). For this type of design, there is the concern about the confounding influence of CMV on cross-level interactions, namely, the moderating effects of higher level variables on the relationships between lower lever variables (e.g., Erdogan & Enders, 2007; Henderson, Wayne, Shore, Bommer, & Tetrick, 2008; Hofmann, Morgeson, & Gerras, 2003; Kirkman et al., 2009; Scott & Barnes, 2011; Walumbwa, Hartnell, & Oke, 2010), and on cross-level main effects, namely, the influence of higher level variables on lower level variables (e.g., Herold et al., 2008; Liao & Rupp, 2005). However, there is limited research that explicitly investigates the extent to which CMV is a problem in multilevel designs.
When higher level variables are formed by aggregating individual-level responses, it is intuitive that their effects may be less susceptible to the influence of CMV (e.g., Kirkman et al., 2009; Liao & Rupp, 2005). Some researchers doubt that CMV is likely to give rise to significant cross-level moderation effects (e.g., Henderson et al., 2008; Hofmann et al., 2003; Kirkman et al., 2009; Scott & Barnes, 2011; Walumbwa et al., 2010). The single-level results of Evans (1985) are sometimes used to defend the conclusion that cross-level interactions cannot be generated by CMV. For example, Erdogan and Enders (2007) defended their cross-level interaction effects based on several self-report variables by citing Evans (1985). Nonetheless, to the best of our knowledge, there is no compelling evidence to support the relevance of Evans’s (1985) single-level conclusions for cross-level interaction effects. Hence, the first objective of the current research is to extend previous single-level findings to the multilevel context by addressing whether and when CMV inflates cross-level interaction effects.
Equally important, prior research has demonstrated the confounding influence of CMV on cross-level main effects. Schulte, Ostroff, and Kinicki (2006) discussed the possibility of the confounding influence of CMV with regard to their cross-level results. More specifically, in a study of the relationship between climate and satisfaction, Ostroff et al. (2002) compared cross-level correlations with cross-level split correlations, with half of the sample randomly selected to provide the overall climate score and the other half, the overall satisfaction score. There was clear evidence for the inflationary effect of common method bias, as the cross-level correlations were larger than the cross-level split correlations, leading to the conclusion that “researchers need to be concerned about the presence of method variance when conducting cross-level and multilevel studies” (Ostroff et al., 2002, p. 366).
Note that there is a methodological concern in the method employed by Ostroff et al. (2002), namely, the assignment of group-level scores to individuals for computing the cross-level correlations violated the independence assumption. Individuals in the same group “are exposed to similar stimuli within the group” (Hofmann, 1997, p. 725), which should be taken into account for drawing valid conclusions. Hierarchical linear modeling (HLM), the multilevel analogue of ordinary regression, is a more appropriate analytic technique to evaluate whether CMV plays a role in multilevel contexts. HLM is perhaps the most popular analytical tool in multilevel organizational research (e.g., Klein, Dansereau, & Hall, 1994; Klein & Kozlowski, 2000; Morgeson & Hofmann, 1999) because of its ability to “simultaneously investigate relationships within a particular hierarchical level, as well as relationships between or across hierarchical levels” (Hofmann, 1997, p. 726). Given that there is no systematic research on how CMV influences cross-level main effects under different circumstances, the second objective of this research is to augment the works of Evans (1985), Siemsen et al. (2010), and Ostroff et al. (2002) by evaluating the spurious influence of CMV on cross-level main effects with an HLM analysis as well as investigating the conditions in which CMV is problematic. Finally, to deal with the confounding influence of CMV in multilevel designs, the third purpose is to identify feasible statistical remedies to mitigate the effects of CMV and avoid erroneous inferences.
In sum, this study extends Evans’s (1985) pioneering work and the more recent works of Ostroff et al. (2002) and Siemsen et al. (2010) to evaluate the extent to which CMV associated with self-report data is a problem as well as identify feasible statistical remedies to control for CMV in a two-level hierarchical linear model based on the Monte Carlo approach. We begin by introducing CMV in the multilevel context and specifying the parameters and their values for the simulation study. We then present simulation results derived from a systematic assessment of the influence of CMV on cross-level interaction and main effects across different parameter values. Finally, we offer conclusions about the effects of CMV on cross-level relationships and provide recommendations to mitigate these effects to avoid erroneous inferences.
Hierarchical Linear Model and CMV
To test the influence of CMV, we propose a two-level hierarchical linear model with an individual-level predictor
We assume that self-report data at the two levels are collected from the same respondents or raters. The group-level predictor is based on the aggregation of an individual-level variable (
Following Bryk and Raudenbush (1992), the two-level hierarchical linear model examined is as follows:
where
This model provides a direct test for a cross-level interaction effect by regressing the individual-level slopes on the group-level variable
The aforementioned model can be rewritten as:
HLM provides t tests to determine whether the parameter estimates (i.e., the individual- and group-level regression parameters,
According to Evans (1985), the observed scores on each variable are consisted of true and error components, and the error term can be partitioned into a correlated component and a random component (Campbell, 1976):
where the subscript
The terms involving
where
To assess the impact of CMV on a cross-level main effect, we introduce the observed scores as given in Equation 5 into Equation 4 and simplify the equation by combining the random error terms
where
It can be seen from Equation 7 and Equation 8 that the random error
Choice of Parameters and Setting of Values
In analyzing the influence of CMV, Evans (1985) included four parameters in his simulation study: the size of the relationships between the outcome and the two predictors, whether the interaction effect truly exists, the proportion of CMV to true variance, and the size of the relationship between CMV in the outcome and that in the two predictors (see p. 309). Hence, we adapt these four parameters for our two-level model: (1) the size of the true relationship between
We derive two equations to simultaneously estimate all the parameters in our model (see Equation 7 for cross-level interaction and Equation 8 for cross-level main effect). Both Equation 7 and Equation 8 contain a product term
With regard to setting the values of the parameters, we follow Evans (1985) to set the true relationships of the predictors (
With regard to the correlation of the CMV of
In our review of the HLM studies in the organizational literature, only a handful of studies published in leading journals report the size of the variance of slopes (
With regard to the variance of group-level predictors (
Parameters and Their Values for Different Models.
Note: For ratio of true variance to CMV,
In summary, the current research investigates whether and when CMV inflates cross-level interaction and main effects across systematic variations of the seven parameters in a simulation study. We also offer suggestions to mitigate the confounding influence of CMV and avoid erroneous inferences based on the simulation results.
Method
We used a simulation program, S-Plus version 6.2, Professional Edition, for constructing multiple data sets to evaluate the impact of CMV in the HLM context. In the simulation, there were two levels of the relationship between a predictor and the outcome, resulting in six scenarios (two levels for
Although it is difficult to determine the optimal sample size required for conducting HLM (Hofmann, 1997), several researchers have made some general recommendations. For instance, both Bassiri (1988) and Van Der Leeden and Busing (1994) showed that a sample of 30 groups with 30 persons in each group or a sample of 150 groups with 5 persons in each group was needed to obtain sufficient power to detect cross-level interactions. However, the use of much smaller samples has become the standard practice, typically involving less than 40 groups with around 5 persons in each group (e.g., Erdogan & Enders, 2007; Henderson et al., 2008; Herold et al., 2008; Herold, Fedor, & Caldwell, 2007; Hirst, Van Knippenberg, & Zhou, 2009; Marrone, Tesluk, & Carson, 2007). Due to practical difficulties, it is not surprising to observe such a gap in the sample size recommended and the sample size actually adopted in practice. To make our simulations realistic, we adopted 40 groups with 5 persons in each group as the baseline design for the simulations, denoted as the 40 × 5 design. However, when we observed a significant influence of CMV, we included larger sample sizes to provide a comprehensive account of its effects.
Procedure for Data Generation
To generate data sets by adapting the approach of Evans (1985, p. 309) to the two-level hierarchical linear model, we assumed two independence relationships: the independence between the true scores of the individual-level variable and of the group-level variable ( the independence between the true scores and correlated errors (i.e.,
The first step was to generate the true scores of the two independent variables
where both
The S-Plus program provides t tests that indicate whether a parameter estimate (a cross-level interaction or main effect term) in each replication significantly departs from zero (Hofmann, 1997), and the conventional p value of .05 was used as a cut-off point for hypothesis testing. To investigate the effects of CMV, we used Type I error rate, defined as the probability of making wrong inferences when there is no true effect, and true positive rate, defined as the probability of making correct inferences when there is a true effect (absence of Type II errors). Type I error rates and true positive rates shown in Tables 2 through 6 represent the proportion of incorrect and correct inferences among the 100 replications for each scenario, respectively. We note that low statistical power, which is a function of small sample size and a small effect, may also result in inaccurate assessment of a model, and this point is considered in interpreting the results.
Type I Error Rates for No True Cross-Level Interaction.
Note: CMV = common method variance.
To assess the extent of the bias in parameter estimates for cross-level interactions and main effects, we systematically selected some scenarios typical of real-life situations: no CMV, high and low ratios of true variance to CMV (4:1 vs. 1:1). The value for parameter bias in each scenario was obtained by subtracting the true value from the average parameter estimate based on 100 replications. A positive value represents an overestimation of a regression coefficient, and a negative value denotes an underestimation. The farther a value deviates from 0, the more severe the parameter bias. To conserve space, the results are only briefly described because the levels of Type I error rates and true positive rates tend to correspond to the extent of parameter bias.
Results
No True Cross-Level Interaction
The results computed from the models without cross-level interactions were organized in two blocks, as shown in Table 2. As our focus was on the cross-level interaction effect (
Table 2 clearly shows that regardless of the variation of the parameter values, an increase in CMV in relation to true variance (from left to right) has little impact on the Type I error rates. In other words, the vast majority of the 100 replications in each scenario detected no significant cross-level interaction given that there was no such interaction in the data. Regarding the bias in parameter estimates, we found that for both large and small ratios of true variance to CMV (4:1 and 1:1), the values for parameter bias did not deviate largely from 0. CMV did not have a large influence on parameter bias for a cross-level interaction effect when there is no true effect in the data. This finding is consistent with the conclusion of Evans (1985) and Siemsen et al. (2010) for single-level results, suggesting that a significant cross-level interaction effect is extremely unlikely to be produced by CMV.
The Presence of a True Cross-Level Interaction
Table 3 presents the results of true positive rates estimated from models in which a true cross-level interaction exists. The format of this table is similar to that of Table 2, with two levels of correlation between
True Positive Rates for a True Cross-Level Interaction.
Note: CMV = common method variance.
It is clear that CMV can decrease the probability of detecting a statistically significant cross-level interaction effect, especially when both the relative amount of true variance and the interaction effect were low. We also note some similar results across the four blocks. The true positive rates were generally reduced by an increase in the variance of Level 1 slopes (
In sum, consistent with the single-level results of Evans (1985) and Siemsen et al. (2010), CMV generally reduces the statistical power to detect true cross-level interactions as well as leads to an underestimation of the regression coefficient. Type II errors are especially serious when the ratio of true variance to CMV is low, when the interaction effect is low, when the variance of Level 1 slopes is high, and when the variance of the group-level predictor is low. In our simulations, the values of the variance of Level 1 slopes and of the group-level predictor were realistic as they were based on published articles. Thus, a sizeable number of papers employing HLM may underreport significant cross-level interactions because of high variance of Level 1 slopes and low variance of the group-level predictor. Our simulation results also reveal that the parameter for a true cross-level interaction effect can be severely underestimated due to CMV, as in a single-level model (Siemsen et al., 2010).
No True Cross-Level Main Effect
The results for the models without a true cross-level main effect are presented in Table 4. The format of this table is similar to that of Table 2, except that the two effect sizes of the individual-level predictor
Type I Error Rates for No True Cross-Level Main Effect.
Note: CMV = common method variance.
Blocks A and B show the results generated from a model where
Consistent with Ostroff et al. (2002), who showed the inflationary influence of CMV on cross-level correlations, our findings provide support for the widespread worry that CMV may result in Type I errors, namely, the false identification of a significant cross-level main effect. Note that Ostroff et al. only provide a general test of the effect of CMV without exploring the conditions under which its effects are more serious. Because HLM takes into account the interdependence of individuals in a nested data structure and simultaneously estimates individual- and group-level parameters (Bryk & Raudenbush, 1992), our results are based on a more appropriate analytic technique and refine Ostroff et al.’s findings by showing that Type I errors are noticeable only when the proportion of CMV is relatively large (true variance to CMV ratios: 1.53:1, 1:1, and 1:1.5). Our results also revealed that for the cases of no true cross-level main effect, the higher the proportion of CMV, the more serious the parameter bias, which is consistent with the conclusion implied in Equation 7 in Siemsen et al. (2010, p. 461) for a single-level regression model. In a nutshell, the parameter estimation of cross-level main effects could be severely confounded by the magnitude of CMV. To sum up, a larger proportion of CMV may lead to a significant cross-level main effect that does not truly exist as well as an overestimation of its regression coefficient.
Solutions for Controlling for Type I Errors and Parameter Bias in the Presence of CMV
To control for Type I errors caused by CMV in assessing cross-level main effects, we investigated the effectiveness of two common strategies: increasing the significance level (Underwood, 1997) and the use of a larger sample. We first adopted the .01 instead of the .05 significance level in the t tests for significance testing. As expected, the first section of Table 5 (the 40 × 5 design) shows that the Type I error rates decrease except for the situations where there is more CMV than true variance (see Table 4) and at the same time the CMV of
Type I Error Rates for Effects of Significance Level and Sample Size for No True Cross-Level Main Effect.
Note: CMV = common method variance.
We increased the number of persons in each group while maintaining the significance level at .01. As expected, the 40 × 12 design presented in the second section showed negligible Type I errors, and if there is serious doubt about the threat of CMV, this more conservative design could be adopted.
We also explored the effectiveness of setting the significance level at .05, but increasing the sample size, both in terms of the number of groups and persons in each group. Contrary to our expectation, increasing the number of groups to 60 and 150 (the 60 × 5 and 150 × 5 designs) increased the Type I error rates with no alleviation of parameter bias. Parameter bias in both the 60 × 5 and 150 × 5 designs was similar to that of the 40 × 5 design. We then explored the effect of increasing the number of persons per group (30 and 10) while maintaining the number of groups at 40 (the 40 × 30 and 40 × 10 designs). As expected, the Type I error rates decreased with the number of persons in each group, with the lowest rates in the 40 × 30 design (see the third section of Table 5), but the level was still on average slightly higher than that of the 40 × 5 design at p < .01. With regard to parameter bias for the cases of no true cross-level main effect, parameter bias was lower for the 40 × 10 design and lowest for the 40 × 30 design in comparison with the 40 × 5 design.
We therefore conclude that increasing the number of groups or persons in each group is not as effective as raising the significance level to .01 in reducing the probability of Type I errors, but increasing the number of persons in each group is more effective in reducing parameter bias than increasing the number of groups when no true cross-level main effect exists. The 40 × 30 design seems effective in controlling for Type I errors in real-life situations, in which CMV is unlikely to exceed true variance. As noted before, using the p value of .01 involves a significant trade-off of reducing the power to detect a true cross-level main effect (the inflation of Type II errors) with no reduction of parameter bias.
Exploratory Analysis of Two Other Strategies for Mitigating CMV
In the extant literature, two other methods are proposed to control for CMV in multilevel designs. The split sample strategy (Ostroff et al., 2002) involves the use of half of a group to calculate the group-level predictor and the other half of the group for evaluating a model, thus avoiding the common source problem. The addition of more CMV contaminated variables is shown to be effective for single-level designs (Siemsen et al., 2010), and this strategy may be applied in a multilevel context.
5
To explore the effectiveness of the split sample strategy, we used a relatively larger sample size (40 groups with 10 persons in each group) and randomly split each group into two halves. One half (40 × 5) was used to obtain the aggregate measure of the group-level variable
The results clearly showed that for the cases of no cross-level main effect, both strategies were as effective as raising the significance level to .01 in reducing Type I error rates and led to less serious parameter bias as compared to the corresponding values for the 40 × 5 design at p < .05, especially when there was a larger proportion of CMV (1:1 ratio of true variance to CMV). These two strategies seem more desirable than raising the significance level to .01 due to their ability in reducing parameter bias in addition to Type I errors.
However, for the cases of true cross-level main effects, the true positive rates for both strategies were generally smaller than the corresponding values for the 40 × 5 design at p < .05, suggesting a higher level of Type II errors. In addition, both strategies also resulted in more serious parameter bias as compared to the corresponding values for the 40 × 5 design at p < .05. The split sample strategy may lead to more Type II errors and parameter bias, probably because a very large sample size is required by this approach. It is unclear why the addition of more CMV contaminated variables leads to more Type II errors and parameter bias. There are important unresolved issues associated with these two strategies, which deserve careful attention in future research. In sum, it turns out that these two strategies, like the use of p value of .01, also involve a trade-off between Type I and Type II errors and result in parameter bias, and they should be employed with caution.
Existence of a True Cross-Level Main Effect
In this section, we evaluated the cases in which a true cross-level main effect exists. To provide a detailed picture, we varied the correlations between
True Positive Rates for a True Cross-Level Main Effect.
Note: CMV = common method variance.
Our results show that CMV either lowered or increased the true positive rates depending on different combinations of parameter values. When
With regard to parameter bias for a strong cross-level main effect, a trend of underestimation was noted, with the magnitude ranging from –.30 to –.01, with a mean level of –.09. More noticeable parameter bias occurred for the 1:1 ratio of true variance to CMV (ranging from –.30 to –.13) than for the model with a ratio of 4:1 (ranging from –.04 to –.01). The parameter bias for the 1:1 ratio was on average eight times larger than that for the 4:1 ratio.
When there was a weak cross-level main effect (
With regard to the bias in parameter estimates for a weak cross-level main effect, a general trend of underestimation was noted, with the magnitude ranging from –.11 to .03 and a mean level of –.02. More noticeable parameter bias occurred for the 1:1 ratio of true variance to CMV (ranging from –.11 to .01, with a mean level of –.06) than for the ratio of 4:1 (ranging from –.03 to .03, with a mean level of –.01). The parameter bias for the 1:1 ratio was on average four times larger than for the 4:1 ratio.
To sum up, when there is a strong true cross-level main effect, CMV tends to lower the probability of detecting the effect. When there is a weak true cross-level main effect, the statistical power of the 40 × 5 design in detecting the effect is generally low. CMV may increase or decrease true positive rates, depending on the true variance to CMV ratio, the correlation between the CMV of the predictors and that of the outcome, and the variance of
Supplementary Analysis With a Large Sample
To explore if the previous results generalize to a larger sample size, we considered a relatively large sample size (60 groups with 12 persons in each group) for the four different types of models (without/with a true cross-level interaction and without/with a true cross-level main effect). For the sake of parsimony, we systematically selected several variations of the seven parameters in the simulation.
For models without a true cross-level interaction or main effect, CMV can create an artificial cross-level main effect, but not a cross-level interaction effect as in the 40 × 5 design. With regard to parameter bias, CMV does not result in large parameter bias for a cross-level interaction when no true effect exists, but leads to noticeable overestimation of the parameter for a cross-level main effect when there is no true effect and large CMV, which is similar to the results for the 40 × 5 design. For models with a true cross-level interaction or main effect, the true positive rates for the 60 × 12 design were on average higher than the corresponding values for the 40 × 5 design, which may be due to the increased power associated with a larger sample size. Regarding parameter bias, a general trend of underestimation was found when the proportion of CMV or the correlation between CMV of the predictors and that of the outcome variable increased. Parameter bias became severe for the 1:1 ratio of CMV to true variance. To sum up, the pattern for the Type I error rates, true positive rates, and parameter bias was generally consistent with those reported for the 40 × 5 design, thus supporting the generalizability of our findings to a larger sample size.
Discussion
There is a dearth of research on whether and when CMV inflates cross-level relationships in hierarchical linear modeling in organizational research and whether the role of CMV in single levels of analysis generalizes to multilevel designs. Based on a Monte Carlo approach, we address this important gap by delineating the influence of CMV on cross-level interaction and main effects and identifying strategies to reduce the probability of Type I errors and parameter bias by mitigating the confounding role of CMV.
The Role of CMV in Cross-Level Interactions
Our results clearly show that CMV cannot result in significant cross-level interaction effects that do not exist in the data nor any bias in parameter estimates. In contrast, CMV tends to suppress the identification of a true cross-level interaction and lead to an underestimation of the regression coefficient. This conclusion is in line with the single-level results of Evans (1985) and Siemsen et al. (2010). Evans concluded that “virtually no interactions were created by adding correlated error … interaction effects are strongly attenuated by the addition of correlated error” (pp. 316-317). In a similar vein, Siemsen et al. concluded based on statistical derivations that “CMV cannot create an artificial interaction effect. CMV can only deflate existing interactions” (p. 469) and emphasized that “researchers should not be criticized for CMV if the main purpose of their study is to establish interaction effects” (p. 470).
Our results strongly support the view that when there is a good theoretical reason to examine whether a group-level variable based on self-report moderates a percept-percept relationship, reviewers should not categorically raise the issue of CMV as a major methodological criticism. In fact, researchers have to be vigilant about the possibility that CMV may reduce the likelihood of confirming a well-grounded prediction of a cross-level interaction effect based on self-report measures. Interestingly, our simulation reveals a general trend that the higher the variance of the group-level predictor, the higher the true positive rates for detecting cross-level interactions. Thus, a potentially useful strategy is to collect data from geographically dispersed groups or diverse organizations, which may increase the variance of a group-level predictor. Strategies that help alleviate the underestimation problem caused by CMV in detecting a true cross-level interaction effect deserve attention in future research.
To account for the underidentification of true cross-level interactions due to CMV, we speculate that the reduction in covariation of CMV across levels may be a possible explanation.
6
Specifically, individual-level method variance tends to be reduced in the aggregated score
Because of the consistency between single-level results and our multilevel results, it is possible that the explanation for the impact of CMV on single-level interactions may generalize to the multilevel context. 7 For single-level interactions, as implied in Siemsen et al.’s (2010, p. 469) Equation 24 and Equation 25, when there is no true effect, CMV does not contribute to the true covariance between the outcome variable and the product term, which is the numerator in the parameter estimation formula. When there is a true interaction effect, however, the denominator becomes larger because of the variance contributed by the CMV contamination, thus attenuating the true relationship between the product term and the outcome variable. In other words, we speculate that as CMV is modeled as linear, additive variance (see Equation 5), which is different from the multiplicative nature of an interaction term (Cortina, 1993), CMV may not affect the true covariance between the outcome variable and the cross-level interaction term, but rather add noise to the denominator and lead to underestimation. To sum up, the mechanisms underlying the underestimation of true cross-level interaction effects due to CMV deserve careful scrutiny in future research.
The Role of CMV in Cross-Level Main Effects
Some researchers may believe intuitively that the influence of CMV is less severe in a multilevel than in a single-level data set (e.g., Liao & Rupp, 2005). Consistent with this view, Ostroff et al. (2002) reported that the influence of CMV appears to be more serious for aggregate correlations as compared to cross-level correlations. However, our results do not support the optimistic expectation that CMV can be generally ignored for cross-level main effects, because the CMV correlation between the individual-level outcome (
Going beyond the general conclusion that method variance is present at multiple levels of analysis (Ostroff et al., 2002), our research delineates when CMV can affect a cross-level main effect. That is, when the ratios of true variance to CMV are relatively low (1.53:1, 1:1, and 1:1.5), Type I errors may occur. Large CMV may lead researchers to erroneously report a significant cross-level main effect that does not truly exist and result in an overestimation of the regression coefficient. Following the logic for explaining the role of CMV in cross-level interactions, one speculation is that because the reduction in the covariation of the method variance of
Strategies for Reducing Type I Errors and Parameter Bias in Assessing Cross-Level Main Effects
Our results have important implications for mitigating the confounding influence of CMV in an HLM context. First, our results suggest that the use of the .01 significance level in the 40 groups × 5 persons design is effective in reducing the probability of Type I errors in the presence of CMV. This strategy is problematic only under very extreme conditions, in which CMV is larger than true variance, and at the same time the CMV of the predictors correlates highly with that of the outcome. In other words, unless CMV is extremely strong, which is unlikely in real-life situations, it will not generate a cross-level correlation that is significant at the .01 level. However, we note that this strategy involves a very significant trade-off of reducing the power to detect a true effect (the inflation of Type II errors) and that it cannot alleviate the problem of parameter bias. The decision to adopt this strategy depends on the cost of Type I errors vis-à-vis that of Type II errors.
Second, an increase in the number of groups or the number of persons per group, while maintaining the significance level of .05, turns out to be ineffective. Following Hofmann’s (1997) recommendation, we analyzed the 60 or 150 groups × 5 persons design and the 40 groups × 30 persons design. Surprisingly, Type I errors are more likely for the 60 × 5 and 150 × 5 designs than for the 40 × 5 design, and parameter bias is not alleviated by increasing the number of groups. We speculate that when the number of persons per group (5 persons in a group) does not change, the CMV correlation remains the same. But for a larger number of groups (i.e., 60 or 150), more random effects representing the group effect need to be estimated. Beal (1991) noted that when “there are too many random effects in the model, some of the variances and covariances will be poorly estimated” (p. 219). In this situation, the potentially poor estimation of standard deviation may result in more Type I errors, a speculation that awaits future verification. In addition, although the 40 groups × 30 persons design seems inadequate for controlling for Type I errors in the simulation results, it is on average better than the 40 × 5 design in this regard. Regarding parameter bias, for the cases of no cross-level main effect, the increase in the number of persons per group can alleviate parameter bias, whereas increasing the number of groups does not reduce parameter bias.
We speculate that increasing the number of persons in each group can directly lower the correlation between the CMV of
Third, we conducted exploratory analysis to investigate the effectiveness of the split sample procedure and the addition of more CMV contaminated variables to control for the influence of CMV. Our findings provide support for Ostroff et al.’s (2002) split sample strategy in reducing Type I errors and parameter bias in an HLM context. Regarding the addition of more CMV contaminated variables, our research extends Siemsen et al.’s (2010) conclusion for the cases of no true effect for single-level models to a multilevel context such that when there is no true cross-level main effect, adding more CMV contaminated variables is effective in reducing Type I error rates and parameter bias.
Interestingly, our results show that both strategies are similar to the use of p = .01 in that Type II error is elevated. For the split sample approach, a future research direction is to systematically explore its effectiveness as a function of sample size, especially the number of persons per group. A larger sample size gives rise to more statistical power, which should result in more accurate estimates (cf. Cohen, 1988). With regard to the addition of more CMV contaminated variables, our results show that Siemsen et al.’s (2010) analytical approach for single-level designs, and the mathematical formula (Equation 18, p. 466) proposed for deciding the number of such variables needed in reducing parameter bias, are unlikely to be directly applicable in a multilevel context. Deriving a mathematical equation that can appropriately describe the effects of CMV on parameter bias in multilevel models is very complex, but it is an important topic for future research.
CMV in Assessing Cross-Level Main Effects That Truly Exist
For the case in which a true cross-level main effect exists, CMV may reduce the likelihood of identifying the effect when the effect size is large. CMV may increase or decrease true positive rates when the effect size is small. Similar to cross-level interactions, a high between-group variance may increase true positive rates for cross-level main effects. To enhance the probability of correctly detecting a true main effect, one strategy is to gather data from diverse sources (e.g., units in different locations) to increase between group variance as in the case of cross-level interactions. Contrary to the popular belief that CMV tends to inflate observed relationships, it may sometimes suppress the likelihood of identifying a true cross-level relationship. Equally important, our results also suggest that the extent of parameter bias is affected by the ratio of CMV to true variance. A large proportion of CMV results in noticeable parameter bias with a trend of underestimation for both weak and strong cross-level main effects.
Researchers have to be careful about the effect of CMV when the data provide no support for their theoretical predictions of cross-level main effects. With regard to the mixed effects of CMV for models where true cross-level main effects exist, we speculate that the impact of CMV may depend on the relative size of the correlation between the CMV of
It is also possible that the additive nature of CMV may account for its impact on cross-level main effects, which are also additive in nature (Cortina, 1993). Indeed, as indicated in both Equation 7 and Equation 8 in Siemsen et al. (2010, p. 461), CMV tends to exert additive impact on single-level main effects. How the additive nature of CMV causes mixed effects on cross-level main effects deserves attention in future research.
Overview of Strategies to Control for CMV in Multilevel Modeling
Previous research has identified some statistical remedies to address the problem of CMV, such as the omitted variables model (Kim & Frees, 2006, 2007), Lewbel’s (1997) internal instrumental variable method (Ebbes, Bockenholt, & Wedel, 2004), and two-stage least squares (see a review by Antonakis et al., 2010; Ebbes et al., 2004). These strategies are effective under some specific circumstances and are reviewed together with the strategies evaluated in the present research in Table 7.
Summary of Strategies to Control for Common Method Variance (CMV) in Multilevel Modeling.
First, researchers in education and economics have developed some techniques to correct for the influence of CMV in a multilevel context, known as “omitted variables” (e.g., Ebbes et al., 2004; Kim & Frees, 2006, 2007). An important assumption made by Kim and Frees (2006, 2007) is that CMV is at a higher level than the predictors in a model. Under this situation, the authors recommended the use of an “orthogonal” transformation method to address the problem of parameter bias caused by an omitted variable, which is CMV in this context. A type of data amenable to this approach is panel data, because respondents are observed over multiple time points and they can be viewed as a higher level source of common method variance. For instance, this approach is suitable for analyzing ecological momentary assessment data, the collection of which requires raters to report their typical experiences and behavioral responses at many time points (Beal & Weiss, 2003). A different approach to eliminating the influence of an omitted effect is Lewbel’s (1997) internal instrumental variable method. For this approach to work, the skewness of a predictor needs to be strong (Ebbes et al., 2004), but the source of CMV can be at a higher level than or within the same level as the predictors.
Second, the work of Ebbes et al. (2004) and a review by Antonakis et al. (2010) suggested the use of a two-stage least squares procedure to address the problem of CMV if researchers can identify an external instrumental variable that does not correlate with
Third, our study focuses on self-report as the source of common method bias, in which case the association between CMV in the outcome variable and predictors is assumed to be within the same level. Our results show that to reduce the probability of Type I errors, raising the significance level to .01 is more effective than increasing the number of groups or persons in each group, although at the risk of heightened Type II errors and with no reduction of parameter bias. In addition, both the split sample procedure (Ostroff et al., 2002) and the addition of more CMV contaminated variables (Siemsen et al., 2010) are effective in reducing Type I errors and parameter bias in the cases of no true cross-level main effect, but also at the expense of increasing Type II errors. These two strategies tend to increase parameter bias in the cases of a true cross-level main effect.
Fourth, if self-report measures are assumed to be the source of CMV, the use of nonaggregated group-level variables from a different source, such as supervisory ratings, should be effective because the group-level predictor and the outcome variable come from different sources. However, Podsakoff et al. (2003) noted that other sources of common method bias, including item characteristics, item context, and measurement context, may bias cross-level relationships despite of the collection of data from multiple sources. For instance, Feldman and Lynch (1988) noted that the use of a similar response format may give rise to a bias in which “cognitions generated in answering one question will be retrieved to answer subsequent questions” (p. 427). This type of CMV can bias the cross-level results obtained.
To illustrate this point, we assessed a model where the Level 2 variable was not based on the aggregation of a Level 1 variable but was provided by a different source, such as supervisors. We investigated some typical scenarios (no CMV, 4:1, and 1:1 ratios of true variance to CMV) and generated the CMV of
In a similar vein, the split sample design proposed by Ostroff et al. (2002) can address CMV due to the use of self-report data, but may not be able to address other sources of CMV. Note that the split sample design has one more constraint, namely, “the inherent difficulty of obtaining large enough samples to conduct cross-level spilt” (Ostroff et al., 2002, p. 366).
Finally, temporal spacing of measures may help minimize the impact of CMV because Ostroff et al.’s (2002) results clearly suggest that incorporating time delays between independent and dependent variables can help reduce response bias associated with CMV. However, this strategy may involve other types of biases, such as the nonequivalence of the context in which the different waves of data are collected.
To conclude, despite the fact that CMV has been regarded as a serious threat to the validity of findings based on self-report measures for decades, very little empirical research has been undertaken to investigate how serious the threat is. Our study adds to this important literature by an extension to the HLM context. Our results are in line with the single-level results of Evans (1985) and Siemsen et al. (2010) in that CMV does not pose a threat to the validity of significant cross-level interaction effects. With regard to cross-level main effects, we extend Ostroff et al.’s (2002) study by documenting the possible influence of CMV in an HLM context and provide researchers with a range of possible strategies to mitigate such confounding influence.
Footnotes
Acknowledgments
We would like to thank Associate Editor Adam W. Meade and three anonymous reviewers for their constructive comments and suggestions.
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship and/or publication of this article: This research was supported in part by an MOE (Ministry of Education in China) Youth Foundation Project of Humanities and Social Sciences (grant number 12YJC630087) and by National Natural Science Foundation of China (NSFC) (grant number 81201817).
