Abstract
The Vale and Maurelli algorithm is a widely used method that allows researchers to generate multivariate, nonnormal data with user-specified levels of skewness, excess kurtosis, and a correlation structure. Before obtaining the desired correlation structure, a transitional step requires the user to calculate the roots of a cubic polynomial referred to as the intermediate correlation equation. The Cardano method and a corollary of Rouché’s theorem were used to derive closed-form solutions to this equation. These solutions highlight the fact that three real-valued roots are possible, and solutions can either contain multiple roots within the allowable correlation range or can all be greater than 1 for many combinations of skewness and excess kurtosis. It was also found that large values of excess kurtosis exacerbate the issue of multiple solutions or solutions greater than 1, bringing further restrictions into the combinations of higher order moments that researchers can simulate from. A small computer study on the power of the t test for the correlation coefficient also uncovered that different values of the intermediate correlation can influence the results from Monte Carlos simulations. This note is intended to inform both researchers who conduct simulations with nonnormal data and users who inform their data analysis practice from simulation studies.
Keywords
1. The Third-Order Polynomial Transformation
Within the social sciences, few algorithms have seen the level of popularity of the third-order polynomial method proposed by Fleishman (1978) in order to generate nonnormal data for Monte Carlo simulation studies. If a researcher is interested in generating nonnormal variables with a prespecified level of skewness
where a, b, c, and d are real-valued polynomial coefficients and
so that for values of
The Fleishman method was originally derived to generate univariate distributions, and modifications are needed to generalize it into a multivariate setting. If variables following a specific covariance structure are Fleishman-transformed, the final nonnormal variables will have different correlations from those originally intended. If Fleishman-transformed values are rotated to follow a covariance structure by some procedures such as the Kaiser and Dickman (1962) or Cholesky decomposition, the higher order moments will not match the initial ones. A solution to this problem was proposed by Vale and Maurelli (1983), where they conceived a second correlation matrix, referred to as the “intermediate correlation matrix,” which would impose an initial covariance structure on the data. Once the variables were correlated, each one would be individually subjected to the third-order polynomial transformation, so that the intermediate correlation matrix would be modified into the correlation structure initially intended by the researcher. Consider again a random variable Y, which has been Fleishman-transformed from a standard normal variable defined as above,
Since Y is standardized by construction, the cross product of any two variables
where the expectation of
An algebraic rearrangement of Equations 7 and 8 yields the final intermediate correlation formula as conceived by Vale and Maurelli (1983):
2. Limitations of the Vale–Maurelli Transformation
The algorithm suggested by Vale and Maurelli (1983) exhibits certain limitations, many of which stem from the way in which the Fleishman (1978) method operates. Given that the focus of this note is the Vale–Maurelli approach, the interested reader can refer to Astivia and Zumbo (2015), Headrick (2002), and Headrick (2004) for an overview of the limits to the Fleishman (1978) method.
The first limitation of the Vale–Maurelli method is the restriction concerning the types of multivariate nonnormal data that it can generate. Foldnes and Grønneberg (2015) derived the probability density of Vale–Maurelli-transformed variables and found them to be contained within the family of Gaussian copula distributions. This brings into question whether or not the Monte Carlo studies that have relied on this method are simulating nonnormal data that are truly encountered in the real world or if they are restricted to a specific subset of multivariate distributions. A second limitation of this method pertains to the manner in which the intermediate correlation matrix is assembled. Equation 9 implies that the correlations must be calculated in a pairwise fashion, and there is no guarantee that this matrix will be positive definite once compiled. For such cases, Demirtas, Hedeker, and Mermelstein (2012) recommend calculating the “nearest matrix” as outlined in Higham (2002) and utilize said matrix instead.
Another limitation is that for certain values of skewness and kurtosis, the Vale–Maurelli transformation may not span the full [−1, +1] correlation range, limiting the effect size values that researchers can choose from. Headrick (2002, 2010) has shown that the type of coefficients obtained from the Fleishman (1978) method may place undue boundaries on the solution to Equation 9 and a higher order transformation, such as the fifth-order polynomial he derived, would be needed to alleviate this issue.
3. Multiple Solutions to the Vale–Maurelli Transformation: A Mathematical Description
An important feature of the Vale–Maurelli algorithm that has gone mostly unnoticed within the published literature is that the intermediate correlation shown in Equation 9 is defined as a third-degree polynomial, which, as per the Fundamental Theorem of Algebra, must have three roots (including those with multiplicities greater than 1). There is currently no description of these solutions, and there are no guidelines concerning which ones should be preferred over others when designing a Monte Carlo study. The purpose of this note is, therefore, to provide a full mathematical account of the multiple solutions to the Vale–Maurelli intermediate correlation equation, explore their properties, and offer recommendations to researchers who may be interested in using this method to generate multivariate, nonnormal data for simulation purposes.
Define
Note that since q is chosen by the researcher, it is always a predetermined constant.
Using the Cardano method (see Mollin, 1947, for a full description), it is possible to find closed-form expressions for the cubic polynomial roots. For this specific instance, the roots are
where
with
The expression
If
so that
If
so that z1is the only real-valued root, with
If
which implies
A back substitution to the original polynomial coefficients from Equation 9 results in:
The analytical form of the roots obtained via the Cardano method makes evident some of the conditions required to be able to successfully use the Vale–Maurelli transformation. The first condition is that the roots must be real-valued numbers. For the case where the discriminant is greater than 0, choosing the real solution would be the only sensible path for researchers wishing to use the intermediate correlation equation in their simulation methodology. For the other two cases, a second reasonable requirement would be to only select the solution (or solutions) that exist within the range of the Pearson correlation coefficient [−1, +1]. Determining which intermediate correlations are “valid” is equivalent to finding boundary conditions of the solutions to the cubic polynomial described in Equation 9 outside of the interval [−1, +1].
Consider the polynomial
The constant w takes the following forms depending on whether it is an upper or a lower bound. For the upper bound,
and for the lower bound,
Taking the case of the lower bound, by substituting the coefficients of Equation 9, one obtains
which implies that if
4. Multiple Solutions to the Vale–Maurelli Transformation: A Computer Simulation Exploration
The analytical description of the intermediate correlation equation discussed in the previous section shines a light on two important special cases. The first is that more than one real-valued solution is possible for Equation 9. Although it appears that researchers are more likely to find themselves in the instance where the discriminant
To the authors’ knowledge, there are no investigations within the published literature that describe the prevalence of these cases or that attempt to link them to any specific values of skewness and kurtosis. These properties of the Vale–Maurelli transformation can have several unintended consequences that could influence the results and conclusions of Monte Carlo simulations that rely on this algorithm. To begin with, they place practical restrictions on the combinations of higher order moments that researchers can simulate from beyond the quadratic relationship that limits the third and fourth moments. If certain values of skewness and kurtosis yield solutions to Equation 9 that are strictly greater than 1, then these values cannot be used in a simulation since the intermediate correlation matrix would be nonsensical. For the cases where more than one value within the valid correlation range is possible, it is becomes unclear which solution among the possible candidates should be selected. Take, for instance, the field of structural equation modeling (SEM) that has relied considerably on the Fleishman (1978) method and the Vale–Maurelli transformation to explore the robustness of various estimators, such as normal theory maximum likelihood (e.g., Curran, West, & Finch, 1996; Falk, 2017; Flora & Curran, 2004). Foldnes and Grønneberg (2017) derived closed-form expressions of the population Γ matrix of fourth-order moments (which is used in almost all robust corrections for SEM estimators) for Vale–Maurelli-transformed variables and discovered that the values of this Γ matrix depend on the intermediate correlations. If different solutions to Equation 9 imply different population Γ matrices, it becomes unclear whether or not certain solutions would cause this Γ matrix to behave in different and unexpected ways.
Unfortunately, it is impossible to find closed-form expressions for the skewness/kurtosis values that would yield either multiple valid solutions or solutions greater than 1 due to the fact that algebraically solving for the system (Equations 2 –5) yield fifth- and sixth-degree polynomials for which a closed-form solution cannot be found as per the Abel–Ruffini Theorem. To tackle this issue, an empirical computer investigation was conducted in the statistical programming language R (R Core Team, 2017).
A grid-search method was used over a large collection of possible combinations of skewness/kurtosis values and researcher-specified final correlation values of 0.1 and 0.5, which correspond to a small and large effect size (Cohen, 1992). The first step was generating a potential list of values, one for skewness and another one for the kurtosis, from −2 to 101.5 in intervals of 0.005 to mimic a large range of options for the third and fourth moments. The values of these lists were paired with one another to ensure every possible combination between the two were present, creating a long series of potential candidate values for the
Fleishman polynomial coefficients were obtained for the
Figure 1 shows the positive section of the skewness/kurtosis parabola defined by the Fleishman (1978) method. For pairs of variables

Combinations of skewness and kurtosis values for final correlations of 0.1 and 0.5 that yield solutions to the intermediate correlations outside of the [−1, +1] range.
The first noticeable difference between both panels is that the number of instances where the solutions are greater than |1| is considerably larger for the effect size of 0.5 than for 0.1. In both cases, it appears that the combination of low skewness and high kurtosis values results in intermediate correlations that violate the allowable range, although for the case of r = .5, the differences between the kurtosis values of Y1 and Y2 need to be substantially larger. For instance, the first pairs of
Figure 2 shows the second special case of the intermediate correlation equation where either two or three solutions were found within the [−1, +1] range. In this instance, the pattern is somewhat reversed from what was found in Figure 1, given that a larger combination of skewness and kurtosis values resulted in multiple valid solutions for the effect size 0.1 as opposed to the effect size of 0.5. It also seems that, whereas the combination of higher order moments for r = .1 was more spread out along the Fleishman (1978) parabola, for r = .5, these combinations of moments tended to cluster themselves at both the lower and higher ends of the kurtosis dimension, where combinations of lower

Combinations of skewness and kurtosis values for final correlations of 0.1 and 0.5 that yield multiple valid solutions.
5. Illustrative Example From the Published Literature,
Unidimensional marginal distributions with a skewness of 3 and kurtosis of 21 have been used within the social and behavioral sciences literature as a condition in Monte Carlo simulation studies testing “extreme” violations of the normality assumption (e.g., Berkovits, Hancock, & Nevitt, 2000; Curran et al., 1996; Nevitt & Hancock, 2000; Vallejo, Gras, & Garcia, 2007). In each case, multivariate nonnormal data were generated through the third-order polynomial method and correlated using the Vale–Maurelli transformation. This particular combination of values for higher order moments, however, falls within one of the special cases described in the previous sections where more than one intermediate correlation exists within the [−1, +1] range. More specifically, solving for the Fleishman coefficients and substituting them in Equation 1 define a random variable Y:
so that Y now follows a nonnormal distribution with population skewness of 3 and kurtosis of 21. For illustration purposes, take the case of a large effect size of
which, when solved for, yields the approximate roots
Figure 3 shows a small demonstration, similar to the study presented by Beversdorf and Sa (2011), where the power of the t test for the Pearson correlation is compared across conditions of bivariate normality and nonnormality

Power curves of the t test for the Pearson correlation coefficient under bivariate normality (none condition) and two intermediate correlation equations (−0.8898 and 0.7127). Population effect size (r) is set at 0.5 with sample sizes from 10 to 100.
Overall, the negative intermediate correlation influenced the simulation results so that the t test showed less power across all sample sizes when compared to its positive counterpart. The power differences across sample sizes were sometimes close to 0.1 between the negative and positive intermediate correlations, particularly around the sample size of 50 where the curve begins to form. The results also demonstrate that, even at the largest sample size of N = 100, the data generated using the negative intermediate correlation resulted in a power curve that does not reach the upper bound of 1, in contrast with the condition of bivariate normality or nonnormality using the positive intermediate correlation. No small sample bias of the correlation estimate was found in any of the nonnormal cases.
6. Conclusion
The Vale and Maurelli (1983) multivariate extension of the Fleishman (1978) method is one of the most widely used simulation algorithms within the simulations within the social and behavioral sciences. In spite of its popularity, the solution multiplicity of power polynomial transformations within the context of data-generating algorithms has received very little attention within the published literature. Although alluded to the studies by Steiger (2014), Astivia and Zumbo (2015), and Foldnes and Grønneberg (2015), the only in-depth treatment the authors have been able to find the empirical impact that these solutions have on simulation studies is in the doctoral work of Kraatz (2011), Luo (2011), and the article published by Astivia and Zumbo (2018). To the authors’ knowledge, a description or even the mere acknowledgment that the intermediate correlation equation also has multiple solutions has mostly gone unnoticed within the published literature. This note, therefore, provided a detailed account of this property and offers the potential reader some guidance concerning how to proceed when using this method.
For the cases, such as the one presented in the fourth section, where multiple valid solutions are possible, the general recommendation would be to investigate both cases and report any potential differences that may be found. Since the solution multiplicity of the Vale–Maurelli approach is a feature, and not an error, of the method itself, there are no immediate criteria to judge whether one solution is more proper or “valid” than another. For cases where all solutions to the intermediate correlation equation are outside the [−1, +1] range, it is perhaps advisable to make a note of it and use a different combination of marginal skewness and kurtosis values, since the intermediate correlation equation will no longer be a valid correlation equation and cannot be used in the initial step where multivariate normal data are generated.
Although other data generation methods are available in the literature, no one method can possibly encompass all types of nonnormal data and various options exist for applied researchers (Astivia & Zumbo, 2017). Methods such as Ruscio and Kaczetow’s (2008) iterative algorithm, Kowalchuk and Headrick’s (2010) multivariate g-and-h distributions, Mair, Satorra, and Bentler’s (2012) copula approach, or Foldnes and Olsson’s (2016) independent generator rely on known distributions with well-documented theoretical properties that circumvent the issue of needing to transform the data from normality to nonnormality. Although they are an important contribution to the literature, many of these methods do not have well-documented properties. To the best of the authors’ knowledge, for instance, the probability density function of these methods is not known. They can also be computationally intensive, such as the Ruscio and Kaczetow’s (2008) iterative algorithm or Kowalchuk and Headrick’s (2010) multivariate g-and-h distributions. Users of these methods in simulation studies need to consider some of these properties in their designs. Methods that rely on power transformations such as the Fleishman (1978) method, the Vale and Maurelli (1983) extension commented on here, or Headrick’s (2002) fifth-order approach are generally straightforward to implement, but since they rely on higher order polynomials, both for the marginal skewness and kurtosis as well as for the intermediate correlation, they can offer multiple solutions which may or may not provide sensible results.
There are several future research avenues presented in this note. A particularly important one is documenting the empirical impact that the multiple solutions to the intermediate correlation equation can have on applied Monte Carlo simulation research as shown in the fifth section. Yuan, Bentler, and Zhang (2005) have shown analytically how the fourth-order moment influences the standard error of quadratic statistics such as variances and covariances. For this particular case, the data were generated from Fleishman-transformed distributions, whose fourth moments are known, thanks to the work of Foldnes and Grønneberg (2017) in terms of intermediate correlations. By using different intermediate correlations, a differential effect of the power of the t test for the product–moment correlation became apparently contingent on which intermediate correlation was used. Extending this type of work to more complex scenarios involving multivariate statistical techniques would highlight the fact that, when using the third-order polynomial transform, researchers need to be aware of the solution multiplicity property of both of the Fleishman system shown in Equations 2 through 5 and the intermediate correlation Equation 9. As computer power becomes more accessible to researchers, simulation studies are becoming more prevalent within the published literature. It is, therefore, of utmost importance to understand the properties of these algorithms and how they can potentially influence a simulation design and the conclusions obtained from it.
Footnotes
Declaration of Conflicting Interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
