Abstract
Recently, quantitative researchers have shown increased interest in two-step factor score regression (FSR) approaches to structural model estimation. A particularly promising approach proposed by Croon involves first extracting factor scores for each latent factor in a larger model, then correcting the variance−covariance matrix of the factor scores for bias before using this matrix as input data in a subsequent regression analysis or path model. Although not immediately obvious, Croon’s bias correction formulas are predicated upon the standard assumption of conditionally independent uniquenesses (measurement residuals). To our knowledge, the method’s performance has never been evaluated under conditions in which this assumption is violated. In the present research, we rederive Croon’s formulas for the case of correlated uniqueness and present the results of two Monte Carlo simulations comparing the method’s performance with standard methods when the unique factors were correlated in the population model. In our simulations, our proposed Croon FSR approaches outperformed methods that blindly assumed conditionally independent uniquenesses (e.g., uncorrected FSR, traditional Croon FSR, structural equation modeling [SEM] using standard specification), performed comparably to a correctly specified SEM, and outperformed SEMs that correctly specified the unique factor covariances but misspecified the structural model. We discuss the implications of our results for substantive researchers.
Structural equation modeling (SEM) is a powerful, flexible modeling framework that allows for the simultaneous estimation of (a) postulated latent constructs, via a measurement model and (b) postulated causal relations among them, via a structural regression model (cf. Bollen, 1989; Hayduk, 1987; Jöreskog & Sörbom, 1993; Kline, 2016). The ability to estimate measurement and structural parameters simultaneously is as much a weakness as a strength, however, as misspecification in any part of an SEM may result in bias that proliferates throughout the system of equations, rendering all parameter estimates questionable. For example, misspecification in the structural portion of a model, such as omitting a nonzero path or incorrectly assuming conditionally independent disturbances, can cause parameter estimates in the measurement model to shift, obscuring the nature and meaning of the latent constructs being estimated (Devlieger & Rosseel, 2017; Hoshino & Bentler, 2013). Alternatively, misspecification in any part of the model (measurement or structural) risks injecting bias into key structural regression coefficients capturing the relations between latent constructs. This outcome is particularly harmful, because structural relationships are often of greatest interest to empirical researchers (cf. Devlieger, Mayer, & Rosseel, 2016; Devlieger & Rosseel, 2017; Hancock & Mueller, 2011; Hoshino & Bentler, 2013; Lu, Kwan, Thomas, & Cedzynski, 2011). Perhaps more troubling, such misspecifications are not always easily detected using traditional methods (Hancock & Mueller, 2011).
As a potential solution to this problem, a small but growing group of quantitative researchers have begun to recommend switching from simultaneous to multistage estimation procedures such as factor score regression (FSR; Croon, 2002; Devlieger et al., 2016; Hoshino & Bentler, 2013; Lu et al., 2011; Skrondal & Laake, 2001) and factor score path analysis 1 (Devlieger & Rosseel, 2017). In brief, FSR involves two steps: (a) first, estimate the measurement model for each latent factor in a structural regression model separately, extracting factor scores for each; (b) use these factor scores as input data in a subsequent ordinary least squares (OLS) regression or path analysis.
Because factor scores are indeterminate and, therefore, not fully reliable (cf. Grice, 2001; Steiger & Schönemann, 1978), extra care must be taken to avoid bias when treating factor scores as data. Recently, quantitative researchers have suggested two promising approaches to address possible bias when performing FSR. A first approach is to strategically extract factor scores using estimation methods designed to avoid injecting bias in the first place (e.g., using regression factor score estimation for exogenous and Bartlett factor score estimation for endogenous latent variables in the bias-avoiding approach of Skrondal & Laake, 2001). A second approach is to first extract factor scores for all latent constructs using a single estimation method, and then correct the variances and covariances of the factors for bias using analytic formulas (Croon, 2002; cf. Hoshino & Bentler, 2013). Once these variances and covariances have been corrected, they may be used either as sufficient statistics for computing regression coefficients (Croon, 2002; Devlieger et al., 2016; Lu et al., 2011) or as covariance-matrix input for a subsequent path analysis (Devlieger & Rosseel, 2017).
A series of recent simulation studies have shown that Croon’s (2002) bias-correcting FSR approach outperforms the bias-avoiding approach of Skrondal and Laake (2001) under a variety of circumstances (Devlieger et al., 2016; Lu et al., 2011). For example, the bias-avoiding method suffers from attenuated standard errors and its unbiasedness breaks down when coefficients are standardized (Devlieger et al., 2016). Perhaps more crucially, because the bias-avoiding method hinges on using different factor score extraction methods for predictors and outcomes in an analysis, this method cannot easily be extended to the path analytic framework, in which a construct’s role may shift from outcome to predictor in different parts of the model (e.g., a mediating variable that is both predicted by X and predictive of Y; Devlieger & Rosseel, 2017).
For these reasons, in the present article we focus our attention on Croon’s (2002) FSR approach, using his analytic method to derive formulas that accommodate correlated uniquenesses. 2 In the next section, we briefly review the standard Croon formulas, derived under the assumption of conditionally independent unique factors. Then, we use Croon’s bias-correcting strategy to rederive these formulas under scenarios where unique factors may be correlated. Finally, we briefly compare the rederived Croon formulas with another cutting edge FSR approach proposed by Hoshino and Bentler (2013).
Review of Croon’s FSR Formulas
As a basis for introducing Croon’s (2002) bias-correction formulas, assume a researcher is interested in fitting a structural regression model in which an exogenous latent factor,

Figure 1. (A)-(C) visually depict factor score regression (FSR) estimation under conditional independence. (D)-(F) depict FSR under nonzero across-factor correlated uniquenesses. (G)-(I) depict FSR under nonzero within-factor correlated uniquenesses.
As stated above, due to factor indeterminacy, simply conducting an FSR using the raw factor scores would result in a biased structural regression coefficient,
Let
where
Similarly, a bias-corrected estimate of the covariance between
where
Or they can be used to construct a variance–covariance matrix that may be used as summary data input in a subsequent path analysis:
where
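Because the displayed equations in this section did not survive in the text above, the following is a minimal population-level sketch of the conditional-independence correction as it is usually stated for a single predictor and a single outcome: the factor score variance is reduced by the unique-variance contribution and rescaled, and the factor score covariance is rescaled. The symbol names (`a_x`, `lam_x`, `theta_x` for the scoring weights, loadings, and unique variances) and all numeric values are assumptions chosen for illustration, not the article's notation or data.

```python
import numpy as np

def regression_weights(lam, theta, phi=1.0):
    """Regression-method scoring weights for one factor: phi * lam' * Sigma^{-1}."""
    sigma = phi * np.outer(lam, lam) + theta
    return phi * lam @ np.linalg.inv(sigma)

def croon_correct(var_fx, cov_fxy, a_x, a_y, lam_x, lam_y, theta_x):
    """Corrected var(xi) and cov(xi, eta), assuming conditionally independent uniquenesses."""
    k_x = a_x @ lam_x  # scalar product of scoring weights and loadings
    k_y = a_y @ lam_y
    var_xi = (var_fx - a_x @ theta_x @ a_x) / k_x**2
    cov_xi_eta = cov_fxy / (k_x * k_y)
    return var_xi, cov_xi_eta

# Hypothetical population values: standardized factors, diagonal unique variances.
lam_x = np.array([0.8, 0.7, 0.6])
lam_y = np.array([0.7, 0.7, 0.7])
theta_x = np.diag(1 - lam_x**2)
theta_y = np.diag(1 - lam_y**2)
phi_xy = 0.5  # true cov(xi, eta); with var(xi) = 1 this is also the true slope

# Population covariances of the indicators implied by the model.
sigma_xx = np.outer(lam_x, lam_x) + theta_x
sigma_xy = phi_xy * np.outer(lam_x, lam_y)

a_x = regression_weights(lam_x, theta_x)
a_y = regression_weights(lam_y, theta_y)

# Variance and covariance of the factor scores F = a'y.
var_fx = a_x @ sigma_xx @ a_x
cov_fxy = a_x @ sigma_xy @ a_y

beta_naive = cov_fxy / var_fx  # uncorrected FSR slope
var_xi, cov_xi_eta = croon_correct(var_fx, cov_fxy, a_x, a_y, lam_x, lam_y, theta_x)
beta_croon = cov_xi_eta / var_xi  # corrected slope
```

In this sketch the corrected slope recovers the population value, while the uncorrected slope is attenuated. With sample data, the same function would be applied to estimated loadings, unique variances, and scoring weights.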
The Issue of Correlated Uniquenesses
In simulation studies, Croon’s method outperformed its competitors under a variety of conditions and sample sizes (Devlieger et al., 2016; Devlieger & Rosseel, 2017; Lu et al., 2011). Importantly, FSR appears more robust than simultaneous SEM to structural model misspecifications (Devlieger & Rosseel, 2017). However, the population models in these studies have always assumed conditionally independent unique factors. What if unique factors are not independent? In the following sections, we first address the issue of diagnosing nonzero unique factor covariances before turning to the issue of incorporating unique factor covariances into FSR models.
Diagnosing and Including Correlated Uniquenesses
Before unique factor covariances can be addressed using SEM or FSR, they first have to be known and correctly specified in one’s model. For example, nonzero unique factor covariances might be specified a priori on the basis of theory, or suspected based on research design considerations (as when residuals may be autocorrelated over time in longitudinal studies, cf. Bollen, 1980; Rubio & Gillespie, 1995; Singer & Willett, 2003). Absent an a priori theory dictating the structure of the unique factor covariance matrix, however, analysts must resort to exploratory searches to uncover possible nonzero unique covariances.
If the unique factors were observed variables in one’s data set, it would be a trivial matter to simply compute the complete covariance matrix of the uniquenesses, examining which seem to depart from zero. Because both the common and unique factors are latent rather than observed, however, estimating the full unique factor covariance matrix is impossible in practice. This is because simultaneous SEM estimation requires the specification of at least one across-factor item pair whose covariation is caused entirely by the covariance between the common factors. That is, in a simultaneous SEM model like the one depicted in Figure 1D, the structural regression coefficient,
For these reasons, nonzero correlated uniquenesses must typically be diagnosed indirectly through the inspection of either standardized covariance residuals or modification indices when standard conditional independence models return substandard model fit (Bollen, 1989; McDonald, 1999; McDonald & Ho, 2002; Saris, Satorra, & Sörbom, 1987; Sörbom, 1989). Because it is well known that sequential specification searches using modification indices generally do not recover the correct population model (MacCallum, 1986; MacCallum, Roznowski, & Necowitz, 1992), we prefer the inspection of standardized (or normalized) covariance residuals.
Although a detailed treatment of this topic is far beyond the scope of the present article, we provide a brief summary here. For item pairs that load on the same factor, larger 4 positive standardized (or normalized) covariance residuals indicate item pairs whose sample covariances far exceed the covariances implied by the conditional independence model. For item pairs that load on different factors, standardized (or normalized) covariance residuals with larger absolute values that track in the same direction as the covariance between the latent factors indicate item pairs whose sample covariances exceed in absolute value the covariances implied by the conditional independence model. In either case, when the absolute value of the sample covariance between a pair of items far exceeds the absolute value of the model-implied covariance under conditional independence, this suggests that the items in question share additional covariation above and beyond that predicted by the common factor. As such, nonzero unique factor covariances are particularly plausible for these item pairs.
As the preceding discussion implies, unique factors may be correlated either with (a) other unique factors loading on indicators of a different common factor in a larger model (across-factor correlated uniquenesses) or (b) other unique factors loading on indicators of the same common factor (within-factor correlated uniquenesses). The next two sections use Croon’s analytic approach to derive formulas that apply to each of these scenarios, in turn.
Across-Factor Correlated Uniquenesses
Correlated unique factors occur whenever nonzero covariation between two or more indicators remains even after their prediction by a common factor. Such residual covariation may result from a variety of causes, such as the mutual influence of both unique factors by a second common factor in a bifactor model (cf. Gerbing & Anderson, 1984). Perhaps more interestingly, it is possible that the specific factors that influence items tapping different constructs may correlate. Though the decomposition of item variance into common factor variance, specific factor variance, and error variance is as old as factor analysis itself (Crocker & Algina, 2008; Guttman, 1945; McDonald, 1999; Spearman, 1904; Thurstone, 1935, 1947), the implications of specific factors have largely been ignored, and have only recently been the subject of renewed interest in the SEM literature (cf. Bentler, 2017, for a recent discussion of specific factors).
As an example, consider loneliness and depression, which have been well established in the literature as correlated-yet-distinct constructs (Cacioppo, Hawkley, & Berntson, 2003; Hawkley et al., 2008; Russell, Peplau, & Cutrona, 1980). Imagine, further, three items: “I feel alone,” from a loneliness scale, as well as “I feel hopeless,” and “sometimes it is hard to get out of bed in the morning,” both from a depression scale. It seems reasonable to expect that hopelessness and lethargy (amotivation) are both mutually caused by a latent depression factor. But what if there is a stronger correlation between feelings of hopelessness and feelings of loneliness than between feelings of loneliness and feelings of lethargy? If this stronger correlation results from the specific aspects of these items, rather than the strength of the relationship between these items and their respective common factors, a unique factor correlation may be at play.
Figure 1D depicts nonzero covariances between the first indicators and the third indicators on each common factor in our two-factor structural regression model. Assume that the structural regression coefficient,
What would happen to the estimate of
However, in addition to fitting the covariances between items loading on separate factors, the factor loadings must also minimize misfit in modeling the model-implied covariances between items loading on the same factor. Assuming standardized factors, the model-implied covariance between any two items loading on the same factor is
If Figure 1D is the true model, then, ignoring the nonzero across-factor correlated uniquenesses may result in an inflated structural coefficient,
Alternatively, what would happen to the estimate of
Unfortunately, this intuition turns out to be incorrect. As formally derived in Appendix A, Croon’s original formulas were predicated upon an assumption of conditionally independent uniquenesses, both within- and across-factors. In a model like Figure 1D, the formula for bias-corrected factor variances in Equation (1) once again remains accurate. In the presence of across-factor correlated uniquenesses, however, Equation (2), for estimating
where
where
The necessity of accounting for the covariances in
A second strategy applies Croon’s bias-correction method at the level of each connected measurement model, rather than each individual factor model. In the present context, a connected measurement model refers to a measurement model with ≥2 latent factors connected by across-factor correlated uniquenesses. In this approach, factor scores are estimated and extracted for multiple factors at once in a simultaneous model such as the model of Figure 1E. Subsequently, the variance–covariance matrix of the factor scores is corrected for bias resulting from nonzero
Factor model Croon
Arguably the most direct application of Croon’s (2002) method to structural regression models featuring across-factor correlated residuals would be to retain the factor-model-by-factor-model nature of the original Croon approach, incorporating an estimate of
Step 1. First, estimate separate factor models as in Figure 1B. Extract factor scores for each, as well as factor loading matrices
Step 2. Estimate the simultaneous model of Figure 1E, fixing the parameters of each factor model to their estimates from Step 1. 5 As such, the only estimated parameters should be the covariance(s) between the latent factors, as well as any nonzero unique factor covariances. 6 Extract the unique factor variance–covariance matrix
Step 3. Correct the factor score variances and covariances using the quantities obtained in Steps 1 and 2, via Equations (1) and (6).
Step 4. Run the FSR model, as depicted in Figure 1I.
Based on the analytic derivations in Appendix A, this method should serve to correct Croon’s (2002) original formulas for bias in the presence of across-factor correlated uniquenesses, yielding consistent estimates of the true factor covariances.
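The logic of the factor-model-level correction can be illustrated at the population level. Under the linear factor model, the covariance between two sets of factor scores equals the scaled true factor covariance plus a term contributed by the across-factor unique covariances, so subtracting that term before rescaling recovers the true value. The sketch below assumes Bartlett scoring weights from separate one-factor models, four indicators per factor, and correlated uniquenesses between the first and third indicator pairs; the names (`theta_xy`, `a_x`) and all values are hypothetical, and whether this matches the article's Equation (6) symbol for symbol is an assumption.

```python
import numpy as np

def bartlett_weights(lam, theta):
    """Bartlett scoring weights for one factor: (lam' Theta^-1 lam)^-1 lam' Theta^-1."""
    theta_inv = np.linalg.inv(theta)
    return (lam @ theta_inv) / (lam @ theta_inv @ lam)

# Hypothetical population values: equal loadings, standardized items and factors.
lam_x = np.full(4, 0.7)
lam_y = np.full(4, 0.7)
theta_x = np.diag(1 - lam_x**2)
theta_y = np.diag(1 - lam_y**2)
phi_xy = 0.3    # true covariance between the two latent factors
r_unique = 0.3  # correlation between paired unique factors

# Across-factor unique covariances for indicator pairs (1, 1) and (3, 3).
theta_xy = np.zeros((4, 4))
for i in (0, 2):
    theta_xy[i, i] = r_unique * np.sqrt(theta_x[i, i] * theta_y[i, i])

# Population cross-covariance of the two indicator sets.
sigma_xy = phi_xy * np.outer(lam_x, lam_y) + theta_xy

a_x = bartlett_weights(lam_x, theta_x)
a_y = bartlett_weights(lam_y, theta_y)
cov_fxy = a_x @ sigma_xy @ a_y          # covariance of the factor scores
scale = (a_x @ lam_x) * (a_y @ lam_y)   # equals 1 for Bartlett weights

cov_original = cov_fxy / scale                                # ignores theta_xy
cov_corrected = (cov_fxy - a_x @ theta_xy @ a_y) / scale      # subtracts its contribution
```

Here `cov_original` overstates the factor covariance because the shared unique variation is attributed to the common factors, whereas `cov_corrected` returns the population value of .3.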
Measurement model Croon
A second strategy applies Croon’s bias-correction method at the level of the entire connected measurement model. Let
Solving algebraically for the variance–covariance matrix of the true latent factors,
Although in principle this formula can be used regardless of which factor score estimator is employed (e.g., regression vs. Bartlett) so long as the matrix product
In scenarios involving only a single connected measurement model, such as the example of Figure 1D and E and the template models featured in our simulations, this formula is all that is required to obtain a corrected covariance matrix of all latent factors in the model. Although not our main focus, we note that Appendix B contains additional formulas for computing the corrected covariance matrix between two connected measurement models,
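A matrix version of the same idea can be sketched for a single connected measurement model. If the factor scores are a linear function of the indicators with scoring matrix A, their covariance matrix is (AΛ)Φ(AΛ)' + AΘA', so Φ can be recovered by subtracting AΘA' and pre- and post-multiplying by the inverse of AΛ. The example below assumes two factors with four indicators each, regression-method scores, and hypothetical parameter values; the variable names are illustrative rather than the article's notation.

```python
import numpy as np

# Hypothetical connected measurement model: 8 indicators, 2 factors.
lam = np.zeros((8, 2))
lam[:4, 0] = 0.7
lam[4:, 1] = 0.7
phi = np.array([[1.0, 0.3],
                [0.3, 1.0]])  # true latent covariance matrix

# Unique factor covariance matrix with two across-factor covariances.
theta = np.diag(np.full(8, 0.51))
theta[0, 4] = theta[4, 0] = 0.15
theta[2, 6] = theta[6, 2] = 0.15

sigma = lam @ phi @ lam.T + theta  # model-implied indicator covariance matrix

# Regression-method scoring matrix (2 x 8) from the joint model.
a = phi @ lam.T @ np.linalg.inv(sigma)

cov_f = a @ sigma @ a.T  # covariance matrix of the factor scores

# Correction: Phi = (A Lam)^-1 (cov_f - A Theta A') (A Lam)^-T
al_inv = np.linalg.inv(a @ lam)
phi_hat = al_inv @ (cov_f - a @ theta @ a.T) @ al_inv.T
```

The correction requires that the product of the scoring and loading matrices be invertible. With Bartlett scores that product is the identity, which removes the rescaling step.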
Within-Factor Correlated Uniquenesses
Nonzero covariances may also occur between unique factors that load on items tapping the same construct. Once again, it is possible that such unique factor covariation is due to the unique factors’ mutual causation by a second factor or to the existence of correlated specific factors that influence both items. Whatever the case may be, suppose a researcher is interested in fitting the same structural regression between exogenous
What would happen to the estimate of the key structural coefficient,
For example, let
Luckily, using Croon’s original formulas in the case of within-factor correlated uniquenesses involves only a simple respecification of the initial factor models from which factor scores are extracted. Instead of fitting each measurement model in a standard manner, assuming conditionally independent uniquenesses, one simply fits each factor model including the correlated unique factors, as depicted in Figure 1H. As a result, the
Alternatively, though not required to estimate within-factor correlated uniquenesses, the Measurement Model Croon formulas described previously may just as easily be applied. For example, the entire model of Figure 1H may be fit simultaneously (including the factor covariance, represented by a dotted line) and the bias-corrected variance covariance matrix of
Hoshino and Bentler’s (2013) Alternative FSR Approach
Although Croon’s (2002) FSR method has attracted recent attention, it is important to mention an alternative approach proposed by Hoshino and Bentler (2013). These authors correctly noted that, under standard conditions, the Bartlett factor score estimator (Bartlett, 1937) is a consistent estimator of the true population covariances among the latent factors (cf. Hoshino & Bentler, 2013, section 4.7.2). For example, under conditional independence the true factor covariance is defined by Equation (2) as
Similarly, the denominator vanishes in Equation (1), leaving
Like Croon’s method, the Hoshino–Bentler approach can be applied at either the level of the individual factor models or the level of each connected measurement model (Hoshino & Bentler, 2013). We focus here on the connected measurement model implementation, since we employed this version in our simulations below. Recall that, at the level of a single connected measurement model,
When there are no across-factor correlated uniquenesses, however—even when there are some nonzero within-factor covariances—all off-diagonal elements of
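The algebraic point in this passage can be checked numerically. With Bartlett weights computed from a connected measurement model, the product of the scoring and loading matrices is the identity, so the factor score covariance matrix equals the true latent covariance matrix plus AΘA', which simplifies to the inverse of Λ'Θ⁻¹Λ. The sketch below, using hypothetical values and a simple-structure loading matrix, compares that extra term when the unique covariances are within-factor only and when one is across-factor. It illustrates the algebra only and is not an implementation of the Hoshino–Bentler procedure.

```python
import numpy as np

def bartlett_matrix(lam, theta):
    """Bartlett scoring matrix: (Lam' Theta^-1 Lam)^-1 Lam' Theta^-1."""
    theta_inv = np.linalg.inv(theta)
    return np.linalg.inv(lam.T @ theta_inv @ lam) @ lam.T @ theta_inv

# Hypothetical simple-structure model: 2 factors, 3 indicators each.
lam = np.zeros((6, 2))
lam[:3, 0] = 0.7
lam[3:, 1] = 0.7

# Case 1: one within-factor unique covariance (items 1 and 2 of factor 1).
theta_within = np.diag(np.full(6, 0.51))
theta_within[0, 1] = theta_within[1, 0] = 0.15

# Case 2: one across-factor unique covariance (item 1 of each factor).
theta_across = np.diag(np.full(6, 0.51))
theta_across[0, 3] = theta_across[3, 0] = 0.15

a_within = bartlett_matrix(lam, theta_within)
a_across = bartlett_matrix(lam, theta_across)

# Extra term added to the latent covariance matrix in the factor score covariances.
extra_within = a_within @ theta_within @ a_within.T
extra_across = a_across @ theta_across @ a_across.T

identity_check = a_across @ lam  # should be the 2 x 2 identity
```

In the within-factor case the off-diagonal element of the extra term is zero, so the Bartlett score covariance equals the true factor covariance. In the across-factor case the off-diagonal element is positive, so the uncorrected covariance is inflated.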
The Present Research
The formulas and rhetorical arguments presented thus far are based entirely on statistical theory. The severity of possible degradations in the performance of SEM estimation, standard (uncorrected) Croon FSR, and (in the case of across-factor correlated uniquenesses) the Hoshino–Bentler method under a population model with nonzero correlated uniquenesses remains unknown. Additionally, it is unclear whether the Factor Model and Measurement Model variants of our Croon formulas will perform identically or whether one method will outperform the other under particular circumstances. For example, at smaller sample sizes it is possible that one method might outperform the other. It could be that the smaller, factor-by-factor models will be easier to estimate accurately than a larger connected measurement model, leading to less biased results in smaller samples. Alternatively, it is just as plausible that estimates from larger, connected measurement models will be less biased in small samples, since these estimates are informed by a greater overall number of variables in a richer model.
To provide an initial assessment of the performance of these methods, then, we compare our proposed Factor Model and Measurement Model Croon formulas with (a) simultaneous SEM estimation, (b) uncorrected Croon FSR, (c) FSR using the Hoshino–Bentler (2013) method, and (d) uncorrected FSR using regression and Bartlett factor scores in two Monte Carlo simulations. Below, we refer to Croon’s (2002) original FSR method as either Croon FSR or FSR assuming conditional independence. We refer to our proposed corrections for correlated uniquenesses at the factor model level as either Croon FM or Factor Model Croon and to our proposed corrections for correlated uniquenesses at the measurement model level as either Croon MM or Measurement Model Croon. Finally, we refer to the Hoshino–Bentler method as either the HB method or simply Hoshino–Bentler.
Simulation Studies
To evaluate the efficacy of our proposed FSR methods, and to assess the robustness of simultaneous SEM estimation and standard Croon FSR to misspecification of the measurement model unique factor structure, we conducted two Monte Carlo simulation studies. Simulation 1 examined these methods using a population model with nonzero across-factor correlated uniquenesses. Simulation 2 used a population model with nonzero within-factor correlated uniquenesses. We coded both simulations in
Simulation 1: Across-Factor Correlated Uniquenesses
Population Model Used in the Simulation
As a population model for Simulation 1, we used the latent variable mediation model displayed in Figure 2. We chose the mediational framework because of its widespread use in the educational and psychological literature, because we wished to assess the potential proliferation of bias in models with at least one indirect pathway, and because, following other authors in this area, we wished to assess the effects of misspecifying the structural model by incorrectly fixing a direct pathway to zero (Devlieger et al., 2016; Devlieger & Rosseel, 2017). The coefficients were taken from the moderate effect size conditions reported in Ledgerwood and Shrout (2011), in which the indirect effect equals ab = .546 * .546 ≈ .30. All variables in the model were standardized.

Figure 2. Population model for Simulation 1.
Factors Varied in the Simulation
We varied four primary factors in Simulation 1: the sample size, the reliability of the four indicators used to measure each construct, the strength of the unique factor correlations, and the number of unique factor correlations.
The sample size, N
We simulated four sample sizes: N = {125, 250, 500, 1,000}, representing a range of small and large sample sizes.
Reliability of the indicators
For convenience, we held all population factor loadings equal in all conditions and selected three levels of Cronbach’s alpha (Cronbach, 1951),
Because the unique factor variances are necessarily larger under lower reliability, when less of the total variance of each item is explained by the common factor, we expected the biasing effects of ignoring nonzero correlations among the unique factors to be most severe in the
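As a rough guide to what these reliability levels imply, the sketch below converts a target Cronbach’s alpha into a common standardized loading. It assumes k standardized items with equal loadings, a unit-variance factor, and uncorrelated uniquenesses, in which case every inter-item correlation equals the squared loading and alpha reduces to k·r / (1 + (k − 1)·r). The mapping the authors actually used is not shown in the text above, so the functions and the resulting loadings are illustrative assumptions only.

```python
import math

def loading_for_alpha(alpha, k):
    """Common standardized loading implied by a target alpha for k parallel items."""
    r = alpha / (k - (k - 1) * alpha)  # implied common inter-item correlation
    return math.sqrt(r)

def alpha_for_loading(lam, k):
    """Standardized alpha for k items with common loading lam and independent uniquenesses."""
    r = lam**2
    return k * r / (1 + (k - 1) * r)

# Alpha levels discussed in the results, with four indicators per construct.
k_items = 4
implied = {alpha: loading_for_alpha(alpha, k_items) for alpha in (0.7, 0.8, 0.9)}

# Unique variance of each standardized item under the same assumptions.
unique_var = {alpha: 1 - lam**2 for alpha, lam in implied.items()}
```

Under these assumptions the unique variance per item grows as alpha falls, which is why a unique factor correlation of fixed size translates into a larger unique covariance at lower reliability.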
Strength of the unique factor correlations
We selected unique factor correlations of three different strengths: |r| = {.1, .3, .5}, corresponding to Cohen’s (1988) conventions for small, moderate, and large correlation effect sizes. All else being equal, we expected greater bias to result from misspecifying the measurement model unique factor structure when the strength of the unique factor correlations was higher.
Number of unique factor correlations
Finally, we generated either one or two nonzero across-factor correlated uniquenesses per common factor. This is depicted visually in Figure 2 via the use of solid versus dashed lines in the measurement model unique factor structure. The solid lines represent all unique factor correlations in cells with one correlated uniqueness per common factor. The dashed lines represent additional unique factor correlations in the population model for cells with two correlated uniquenesses per common factor. All else being equal, the more nonzero correlated uniquenesses in a model, the more disruptive an influence these unique factor correlations should have on the resulting parameter estimates.

Diagram of factor score regression (FSR) approach to fitting the model of Figure 2.
Analyses Conducted on Each Simulated Data Set
We analyzed each data set using eight different methods: SEM specified under conditional independence, Croon FSR assuming conditional independence, uncorrected FSR using the regression estimator, uncorrected FSR using the Bartlett estimator, Factor Model Croon, Measurement Model Croon, simultaneous SEM with a correctly specified residual structure, and Hoshino–Bentler FSR computed at the connected measurement model level under a correctly specified residual structure. 8 We note that the first four of these methods assumed conditional independence and ignored possible correlated uniquenesses whereas the latter four of these methods correctly specified the measurement model residual structure, including all nonzero unique factor covariances. For reasons mentioned above, we used Bartlett-estimated factor scores for all variants of Croon’s method.
Additionally, for each estimation method, we conducted two analyses: one with the structural model correctly specified, freely estimating all mediation model pathways, and one with the structural model misspecified, incorrectly fixing the
Simulation Outcomes
We assessed the performance of each method using two primary outcome measures: percent bias and mean square error (MSE) of our key structural parameters.
Percent bias
For each population parameter of interest,
where
Mean square error
Additionally, for each parameter of interest we computed MSE in each unique simulation cell using the formula:
where
MSE ratios
In the results below, instead of reporting raw MSE values, we report MSE ratios, computed as MSE_Estimator / MSE_Croon MM.
That is, for each analysis cell, we formed the ratio of the MSE of a given estimator to the MSE of the measurement model version of Croon’s method. MSE ratios equal to 1 indicate estimators that are equivalent in their overall accuracy. Ratios less than 1 indicate scenarios in which a comparison estimator exhibits lower MSE than Croon MM. Ratios greater than 1 indicate scenarios in which the comparison estimator exhibits higher MSE than Croon MM. When both estimators are unbiased, the MSE ratio can be construed as a measure of the relative efficiency of the two estimators.
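The outcome measures described in this section can be sketched with their conventional definitions: percent bias as the mean estimate minus the population value, divided by the population value and multiplied by 100, and MSE as the mean squared deviation of the estimates from the population value. The article's displayed formulas are not reproduced in the text above, so these definitions, the function names, and the replicate values are assumptions for illustration.

```python
import numpy as np

def percent_bias(estimates, theta):
    """100 * (mean estimate - population value) / population value."""
    return 100.0 * (np.mean(estimates) - theta) / theta

def mse(estimates, theta):
    """Mean squared deviation of the estimates from the population value."""
    return np.mean((np.asarray(estimates) - theta) ** 2)

def mse_ratio(est_comparison, est_reference, theta):
    """MSE of a comparison estimator divided by MSE of the reference estimator."""
    return mse(est_comparison, theta) / mse(est_reference, theta)

# Hypothetical replicate estimates of one parameter within one simulation cell.
theta = 0.5
est_reference = np.array([0.45, 0.50, 0.55])   # stands in for the reference method
est_comparison = np.array([0.50, 0.55, 0.60])  # stands in for a comparison method

pb_reference = percent_bias(est_reference, theta)
pb_comparison = percent_bias(est_comparison, theta)
ratio = mse_ratio(est_comparison, est_reference, theta)
```

In this toy cell the comparison estimator has the same spread as the reference but is shifted upward, so its percent bias is positive and its MSE ratio exceeds 1.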
Simulation 1: Results
The results of Simulation 1 generally fell in line with our predictions. Because the pattern of results was stable across sample sizes, we present results for the N = 125 condition here. Additionally, to preserve space, we present full tables of results from the two correlated uniqueness cells. The one correlated uniqueness conditions followed similar trends, with less pronounced bias (but see the Supplemental Material, available online, for comprehensive results from all other conditions).
Percent bias
Tables 1 and 2 display percent bias of our key structural parameters, a, b, and the ab indirect effect, by (a) estimation method, (b) unique factor structure specification, (c) structural model specification, (d) reliability (alpha level), and (e) strength of the unique factor correlation in the N = 125, two correlated uniqueness cells. Table 1 presents results for the four methods that assume conditional independence, whereas Table 2 presents results for the four methods that correctly specified the measurement model residual structure.
Table 1. Percent Bias in Models Omitting the Unique Factor Correlations by Parameter and Simulation Condition, Simulation 1: N = 125, 2 Unique Factor Correlations.
Note. Regression FS = Regression FSR method; Bartlett FS = Bartlett FSR method; Croon = Croon’s method using the original formulas uncorrected for unique factor correlations; SEM = structural equation modeling (simultaneous estimation) under the assumption of conditionally independent uniquenesses; CS = correct structural model specification (c′ path freely estimated); MS = structural misspecification (c′ path constrained to 0). Boldfaced entries indicate absolute values of percent bias >10.
Table 2. Percent Bias in Models Correctly Specifying the Unique Factor Structure by Parameter and Simulation Condition, Simulation 1: N = 125, 2 Unique Factor Correlations.
Note. Hoshino–Bentler indicates Hoshino and Bentler’s (2013) FSR method; Croon FM = Croon’s method corrected for correlated uniquenesses at the factor model level; Croon MM = Croon’s method corrected for correlated uniquenesses at the measurement model level; SEM = structural equation modeling (simultaneous estimation) correctly specifying the correlated residual structure; CS = correct structural model specification (c′ path freely estimated); MS = structural misspecification (c′ path constrained to 0). Boldfaced entries indicate absolute values of percent bias >10.
Several trends are worth highlighting. First, examining Table 1, it is clear that standard, uncorrected FSR using the regression or Bartlett estimators exhibited substantial negative bias in all conditions, in line with previous simulation results (Devlieger et al., 2016; Devlieger & Rosseel, 2017; Lu et al., 2011). Second, when the structural model was correctly specified but the unique factor correlations were moderate (.3) or large (.5), both standard simultaneous SEM and standard Croon FSR, which assume conditional independence of all unique factors, showed problematic levels of positive bias.
The bias resulting from ignoring unique factor correlations and assuming conditional independence was worse when reliability was low (α = .7 and .8), and minimal when reliability was high (α = .9). This is intuitive, since high reliability (communality) implies very little leftover unique item variation. Bias was also greatest in the estimates of the indirect effect, a * b. Because the indirect effect is a product of coefficients, the bias in this parameter grew more quickly than that of the direct path coefficients. Since the indirect effect is often the quantity of greatest interest in a mediation analysis, the susceptibility of this coefficient to larger levels of bias is concerning.
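A short worked equation shows why bias compounds in a product of coefficients. The notation is introduced here for illustration only: δ_a and δ_b denote the relative biases of the two path estimates. The calculation treats the biased estimates as fixed quantities and ignores any sampling covariance between them, which would also contribute to the expected value of the product.

```latex
\hat{a} = a(1 + \delta_a), \qquad \hat{b} = b(1 + \delta_b)
\quad\Longrightarrow\quad
\frac{\hat{a}\hat{b} - ab}{ab} = (1 + \delta_a)(1 + \delta_b) - 1
= \delta_a + \delta_b + \delta_a \delta_b .
```

For example, if each path is overestimated by 10% (δ_a = δ_b = .10), the indirect effect is overestimated by .10 + .10 + .01 = .21, or 21%.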
Turning to Table 2, we see that the uncorrected Hoshino–Bentler method returned problematically biased parameter estimates when the unique factor correlations were moderate (.3) or high (.5) and when reliability was low (.7) or moderate (.8). In contrast, Croon FM, Croon MM, and simultaneous SEM exhibited little bias when the structural model was correctly specified, even in the lowest sample size condition of N = 125. At this sample size Croon FM did display slightly greater negative bias at lower levels of reliability than Croon MM and correctly specified SEM. This trend quickly dissipated as sample size increased, however.
So far, we have highlighted comparisons among analyses that correctly specified the structural portion of the model, assessing the magnitude of bias resulting only from misspecification of the unique factors in the measurement model. In these cells, Croon FM, Croon MM, and simultaneous SEM all exhibited low levels of bias. When the structural model was misspecified (rows labeled MS in Table 1) by erroneously fixing the direct
MSE ratios
MSE ratios are presented in Table 3 for the N = 125, two correlated uniquenesses per factor conditions for the estimators that correctly specified the unique factor covariance structure. Comparing Croon MM to simultaneous SEM, we see that the methods exhibited equivalent performance when the entire model was correctly specified, but that Croon MM outperformed SEM when the structural model was misspecified. This suggests that Croon MM is no less efficient than simultaneous SEM, all else being equal. Comparing Croon MM with Croon FM, we see that Croon FM exhibited somewhat lower MSEs than Croon MM in a portion of simulation cells. Although the ratios generally did not depart drastically from 1, this suggests that Croon FM may be somewhat more efficient than Croon MM in some cases.
Mean Square Error Ratios in Models Correctly Specifying the Unique Factor Structure by Parameter and Simulation Condition, Simulation 1: N = 125, 2 Unique Factor Correlations.
Note. FSR = factor score regression; MSE = mean square error; Hoshino–Bentler indicates Hoshino and Bentler’s (2013) FSR method; Croon FM = Croon’s method corrected for correlated uniquenesses at the factor model level; SEM = structural equation modeling (simultaneous estimation), with correctly specified unique factor structure; CS = correct structural model specification (c′ path freely estimated); MS = structural misspecification (c′ path constrained to 0). Each MSE ratio is an estimator’s MSE divided by the MSE for Croon’s method corrected for correlated uniquenesses at the measurement model level (Croon MM), that is, MSE(Estimator)/MSE(Croon MM).
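For readers who wish to compute comparable summaries from their own Monte Carlo output, the following minimal Python sketch shows one way to obtain percent bias and an MSE ratio with Croon MM as the reference estimator. The percent bias formula is the conventional one and is assumed here; the function names and the replication estimates are hypothetical and are not taken from the simulations reported in this article.

```python
from statistics import fmean


def percent_bias(estimates, true_value):
    """Percent bias of the mean estimate relative to the population value."""
    return 100.0 * (fmean(estimates) - true_value) / true_value


def mse(estimates, true_value):
    """Mean squared error across replications."""
    return fmean([(est - true_value) ** 2 for est in estimates])


def mse_ratio(estimates, reference_estimates, true_value):
    """MSE of one estimator divided by the MSE of a reference estimator."""
    return mse(estimates, true_value) / mse(reference_estimates, true_value)


# Hypothetical replication estimates of a single path (population value .30).
true_a = 0.30
croon_mm = [0.28, 0.33, 0.31, 0.27, 0.32]  # reference estimator (assumed values)
sem = [0.25, 0.36, 0.34, 0.24, 0.33]       # comparison estimator (assumed values)

print(round(percent_bias(croon_mm, true_a), 2))
print(round(mse_ratio(sem, croon_mm, true_a), 2))
```

With this orientation, a ratio above 1 indicates that the reference estimator (here, Croon MM) had the smaller MSE, and a ratio below 1 indicates that the comparison estimator did.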
Simulation 1: Discussion
Simulation 1 provided empirical evidence for several important phenomena. First, ignoring correlated uniquenesses can result in distorted estimates of structural model parameters. These effects are especially pronounced when the number of across-factor correlated uniquenesses is larger, the strength of the unique factor correlations is greater, and the reliability of the indicators is lower. Second, both correctly specified simultaneous SEM and our proposed Croon methods, accounting for correlated uniquenesses, successfully eliminated this bias when the structural model was correctly specified. Third, Croon FSR estimation outperformed simultaneous SEM in terms of bias when the structural model was misspecified. Finally, standard Hoshino–Bentler estimation exhibits bias in the presence of nonzero across-factor correlated uniquenesses, even though the connected measurement model used to compute the HB factor covariance matrix was correctly specified.
Simulation 1 specifically examined across-factor correlated uniquenesses but did not assess within-factor correlated uniquenesses. Simulation 2 used a similar procedure to compare these methods in the presence of nonzero within-factor correlated uniquenesses.
Simulation 2: Within-Factor Correlated Uniquenesses
Simulation Design
Figure 4 displays the population model for Simulation 2. This simulation closely mirrored that of Simulation 1 but with two key changes. First, we generated data according to the model in Figure 4, featuring within-factor correlated uniquenesses. Second, because preliminary simulations found noticeable negative bias in key parameters even with only one within-factor correlated uniqueness per common factor, for simplicity we did not include a set of conditions examining two correlated uniquenesses per factor. Otherwise, all design factors were the same.

Figure 4. Population model for Simulation 2.
Simulation 2: Results and Discussion
Mirroring Simulation 1, Tables 4 and 5 display the results for percent bias and Table 6 displays the results for MSE ratios in the N = 125 conditions. As expected, the general trends from Simulation 1 are all apparent here, but the direction of bias has reversed: In the presence of within-factor correlated uniquenesses, biased structural parameters are nearly always attenuated rather than magnified. Once again, simultaneous SEM estimation and standard Croon FSR estimation exhibited bias when the unique factor structure was specified to be conditionally independent. And, once again, this bias was most pronounced under moderate to strong unique factor correlations (.3 and .5) and lower levels of reliability (.7 and .8).
Percent Bias in Models Omitting the Unique Factor Correlations by Parameter and Simulation Condition, Simulation 2: N = 125.
Note. Regression FS = regression FSR method; Bartlett FS = Bartlett FSR method; Croon = Croon’s method using the original formulas uncorrected for unique factor correlations; SEM = structural equation modeling (simultaneous estimation) under the assumption of conditionally independent uniquenesses; CS = correct structural model specification (c′ path freely estimated); MS = structural misspecification (c′ path constrained to 0). Boldfaced entries indicate absolute values of percent bias >10.
Percent Bias in Models Correctly Specifying the Unique Factor Structure by Parameter and Simulation Condition, Simulation 2: N = 125.
Note. FSR = factor score regression; Hoshino–Bentler indicates Hoshino and Bentler’s (2013) FSR method; Croon FM = Croon’s method corrected for correlated uniquenesses at the factor model level; Croon MM = Croon’s method corrected for correlated uniquenesses at the measurement model level; SEM = structural equation modeling (simultaneous estimation) correctly specifying the correlated residual structure; CS = correct structural model specification (c′ path freely estimated); MS = structural misspecification (c′ path constrained to 0). Boldfaced entries indicate absolute values of percent bias >10.
Mean Square Error Ratios in Models Correctly Specifying the Unique Factor Structure by Parameter and Simulation Condition, Simulation 2: N = 125.
Note. FSR = factor score regression; MSE = mean square error; Hoshino–Bentler indicates Hoshino and Bentler’s (2013) FSR method; Croon FM = Croon’s method corrected for correlated uniquenesses at the factor model level; SEM = structural equation modeling (simultaneous estimation), with correctly specified unique factor structure; CS = correct structural model specification (c′ path freely estimated); MS = structural misspecification (c′ path constrained to 0). Each MSE ratio is an estimator’s MSE divided by the MSE for Croon’s method corrected for correlated uniquenesses at the measurement model level (Croon MM), that is, MSE(Estimator)/MSE(Croon MM).
Examining Table 5, all methods performed well when both the measurement and structural models were correctly specified. We note, however, that there was a small but noteworthy effect of sample size in Simulation 2. Specifically, Croon FM was somewhat more biased at lower Ns, but this bias became negligible at higher Ns (500 and 1,000). For example, the indirect effects for this method fell above the 10% cutoff when α was low (.7) across the N = 125 conditions. At all other sample sizes, this method produced acceptable levels of bias, although the degree of bias was still noteworthy at N = 250 (see supplemental results [available online]).
In contrast, both Hoshino–Bentler and Croon MM exhibited minimal bias when the entire model was correctly specified. It is worth noting that because these methods both employed the Bartlett estimator to estimate the latent factor covariances and both employed equivalent corrections to the factor variances, the results for these two methods are identical. Simultaneous SEM performed comparably to these methods when both the measurement model and the structural model were correctly specified, but HB, Croon FM, and Croon MM all outperformed simultaneous SEM when the structural model was misspecified. This was particularly true under low and moderate levels of reliability (.7 and .8) and remained true across all sample size conditions (see supplemental results for details [available online]).
Finally, the vast majority of MSE ratios comparing Croon MM with HB and SEM either approached or exceeded 1, indicating that FSR with correlated uniquenesses performed equivalently or better in the majority of cases. Once again, Croon FM exhibited lower MSEs in several cases, but this result should be qualified by the higher levels of bias Croon FM displayed in the N = 125 and 250 conditions. Although the lower MSE suggests that Croon FM may be more efficient, the higher levels of bias observed in Table 5 suggest that this reduced sampling variability may be centered on a biased estimate.
Simulation 2: Discussion
Like Simulation 1, Simulation 2 provided a clear pattern of results. This simulation demonstrates that correct estimation of structural parameters suffers when the measurement model unique factor structure is ignored. Furthermore, simple steps can be taken to respecify the model in a manner that preserves the correct unique factor structure, using either simultaneous SEM or FSR. Finally, the Croon FM, Croon MM, and Hoshino–Bentler methods outperformed standard simultaneous SEM methods when the structural model was misspecified (see also Devlieger & Rosseel, 2017).
General Discussion
In the present research, we used Croon’s (2002) bias correction approach to derive formulas for the case of across-factor correlated uniquenesses and explicated how Croon’s original formulas may be employed in the case of within-factor correlated uniquenesses. Additionally, we reported the results of two Monte Carlo simulations comparing these methods’ performance with uncorrected regression and Bartlett FSR, standard Croon FSR assuming conditional independence, and simultaneous SEM estimation. Correctly specified simultaneous SEM estimation, Croon FM, and Croon MM, incorporating correlated uniquenesses, exhibited strong performance in our simulations. In line with previous studies (Devlieger et al., 2016; Devlieger & Rosseel, 2017), Croon FSR outperformed simultaneous SEM estimation when the structural model was misspecified.
Although the estimation of correlated uniquenesses is often discouraged in psychometric studies as a form of fishing for ways to improve fit (cf. Cole, Ciesla, & Steiger, 2007), when the focus is on accurately estimating the structural parameters, rather than the measurement model, our results suggest that the unique factor covariance structure ought not be ignored. Our simulations clearly showed that ignoring the measurement model covariance structure resulted in distorted estimates of key structural parameters across a variety of conditions. This is especially true when the primary goal is the assessment of indirect (mediational) effects, since these parameters quickly grew more biased than any other structural parameter in our simulations.
Our simulations suggest several guidelines for researchers considering applying these methods. If reliability is high (.9) or unique factor correlations low (.1), the measurement model unique factor structure might be ignored with little consequence. If reliability is lower (alpha or omega of .7-.8), the choice of how to model unique factors becomes more urgent. Furthermore, ignoring within-factor correlated uniquenesses risks attenuated structural regression coefficients, whereas ignoring across-factor correlated uniquenesses risks inflated coefficients.
Though both Croon FM and Croon MM performed well in the majority of conditions, our results suggest that when sample sizes are smaller (e.g., N = 125 or 250) Croon MM may be somewhat more accurate. This was especially true when the factor models featured within-factor correlated uniquenesses. For this reason, we recommend Croon MM when attempting to use FSR in small samples with correlated uniquenesses.
The results of our first simulation suggest that across-factor correlated uniquenesses are more harmful to the extent that they are more numerous. Although bias resulted from ignoring even one correlated uniqueness per factor, the bias was more severe when there were two correlated uniquenesses per factor. Our simulations assessed only a simple case in which there were four indicators per factor, but it is possible to (cautiously) extrapolate to other scenarios. All else being equal, it seems reasonable to expect that even one or two correlated uniquenesses might be harmful in factor models with fewer indicators (e.g., two-indicator or three-indicator factors in the context of a larger structural model). Conversely, one or two correlated uniquenesses may not disrupt estimation as severely in factor models with more indicators (e.g., 10 or 20). This suggests that, all else being equal, applied researchers should pay greater heed to the issue of correlated uniquenesses in scenarios with fewer indicators and more numerous unique variable covariances.
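A rough heuristic calculation illustrates why the number of indicators and the number of correlated uniquenesses might matter in this way. It is not one of the derivations reported in this article, and it rests on strong simplifying assumptions: unit-weighted sum scores S_x and S_y stand in for factor scores, each factor has p standardized indicators with a common loading \lambda, the factor covariance is \phi, and k pairs of indicators share an across-factor unique covariance \theta. All of these symbols and the numerical values below are hypothetical.

```latex
\begin{align*}
\operatorname{Cov}(S_x, S_y) &= p^2\lambda^2\phi + k\theta \\
\text{relative inflation} &= \frac{k\theta}{p^2\lambda^2\phi}
\end{align*}
```

Taking \lambda = .7, \phi = .3, and a unique factor correlation of .3 (so that \theta = .3 \times .51 = .153), the relative inflation in the composite covariance is about .065 with p = 4 and k = 1, about .130 with p = 4 and k = 2, and about .021 with p = 10 and k = 2. Under these assumptions the distortion grows linearly in the number of ignored across-factor covariances and shrinks with the square of the number of indicators.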
Our simulation results mirror our analytic derivations in showing that Hoshino and Bentler’s (2013) method remains unbiased in the presence of within-factor correlated residuals but becomes biased in the presence of across-factor correlated residuals. Of course, correcting the HB method for this bias would be relatively simple. In the spirit of the original method, one way to accomplish this would be to use the covariances of the Bartlett factor scores to estimate the covariances between all factors whose measurement models do not feature across-factor correlated uniquenesses. For any factors whose measurement models do feature across-factor correlated uniquenesses, the estimated factor covariances from the initial simultaneous SEM runs would be substituted in place of the Bartlett covariances.
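The substitution step suggested above can be sketched in a few lines of Python. This is only an illustration of the bookkeeping involved, not a tested implementation of a corrected HB estimator: the function name, the argument names, and the numerical matrices are hypothetical, and the sketch assumes that both covariance matrices have already been estimated and that the diagonal (factor variances) is left as supplied.

```python
def hybrid_factor_covariance(hb_cov, sem_cov, flagged_pairs):
    """Copy hb_cov, replacing flagged off-diagonal entries with those in sem_cov.

    hb_cov, sem_cov: square nested lists indexed by factor position.
    flagged_pairs: (i, j) index pairs of factors whose measurement models
        share across-factor correlated uniquenesses.
    """
    n = len(hb_cov)
    if len(sem_cov) != n or any(len(row) != n for row in hb_cov + sem_cov):
        raise ValueError("both matrices must be square with the same dimension")
    hybrid = [row[:] for row in hb_cov]  # leave the input matrix unmodified
    for i, j in flagged_pairs:
        if i == j:
            raise ValueError("only covariances between distinct factors can be flagged")
        # Substitute the SEM-based estimate symmetrically.
        hybrid[i][j] = sem_cov[i][j]
        hybrid[j][i] = sem_cov[j][i]
    return hybrid


# Hypothetical three-factor example (X, M, Y); only X and M are flagged.
hb_cov = [[1.00, 0.41, 0.30],
          [0.41, 1.00, 0.35],
          [0.30, 0.35, 1.00]]
sem_cov = [[1.00, 0.33, 0.29],
           [0.33, 1.00, 0.36],
           [0.29, 0.36, 1.00]]

hybrid = hybrid_factor_covariance(hb_cov, sem_cov, [(0, 1)])
print(hybrid)
```

The resulting matrix would then serve as input to the second-step regression or path analysis. Whether a matrix assembled from two sources remains positive definite in a given sample is not checked in this sketch and would need to be verified in practice.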
Our simulation results also suggest several possible areas for future research. First, if correlated unique factor structures are to be taken seriously by substantive researchers, it will be important to develop reliable and user-friendly methods for correctly identifying nonzero unique factor covariances. Because a model freely estimating all such correlated uniquenesses can never be identified (cf. Kenny, 1979; Kenny et al., 1998), alternative approaches must be used to diagnose the correct covariance structure. If the goal is to extract factor scores and conduct FSR, an intriguing possibility would be to utilize unique factor score estimates from each measurement model of interest to diagnose possible nonzero unique factor covariances. Given the potential bias in estimated uniquenesses based on factor scores, however, it will be important to assess the analytic properties of these estimators as well as to empirically test their performance.
Second, future research is needed to extend FSR methods to more complex factor structures, such as hierarchical or bifactor models (Holzinger & Swineford, 1937; McDonald, 1999; Schmid & Leiman, 1957), as well as more complex residual structures (cf. Singer & Willett, 2003, for a review of residual structures in longitudinal models). Finally, as noted by others (Devlieger et al., 2016; Devlieger & Rosseel, 2017), before FSR can be widely implemented, a crucial area of future research will necessarily involve the derivation of accurate standard errors for the path analytic formulation (but see Devlieger et al., 2016, for a viable OLS regression-based standard error for FSR).
In sum, we believe that the present research makes an important contribution to the literature on FSR methods by extending these methods to the case of correlated uniquenesses. Since implementation of matrix-oriented bias correction formulas may prove challenging for many applied researchers, it is our hope that software implementations of these methods (e.g., in lavaan; Rosseel, 2012) incorporate functionality for implementing FSR with correlated unique factor structures in the future.
Supplemental Material
Supplemental material, Online_Supplemental_Results for Factor Score Regression in the Presence of Correlated Unique Factors by Timothy Hayes and Satoshi Usami in Educational and Psychological Measurement
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Supplemental Material
Supplemental material for this article is available online.
