Through simulations, it was shown that either in balanced “one-way” random designs or in unbalanced “one-way” random designs, Sub-D estimates are in general more accurate than those provided by Anova-based estimators. Moreover, the Sub-D estimates exhibit less dispersion magnitude. Such estimates reveal to be also slightly more accurate than those provide by REML-based estimator, although this latter one presented dispersion with slightly less magnitude, which, indeed, is not bigger than 0.0261. In order to have somehow a robust tool which will allow us to infer over Sub-D’s efficiency in “one-way” random designs, this paper aims to deduce and discuss its MSE.
Searching for new contributions concerning the inference in Mixed Linear Models (MLM), Silva (2017) developed the new estimators for variance components called Sub-D and it’s corresponding improvement called Sub-DI. It’s relevant to emphasize that both estimators apply to MLM with an arbitrary number of variance components. It was proved that in MLM with two variance components, Sub-D and Sub-DI are equivalent, having better performance than the Anova-based method estimators and nearly the same results as the Likelihood-based estimators. More over, since Sub-D depends only on information retained by the eigenvalues of the design matrix and quadratic errors of sub-models (see Silva (2017), Silva and Fonseca (2018) and Ferreira et al. (2017)) it provides unbiased estimates whether the data are balanced or unbalanced in both crossed and nested designs, even having empty cells, which does not hold in case of Anova and Likelihood-based estimators.
Through simulations carried out in balanced “one-way” random designs it was shown that the Sub-D and Anova estimates as well as the corresponding dispersion magnitude are exactly the same (one may find the one-way random designs characterization in Silva (2017) or, for instance, Anderson (1975) and Anderson and Crump (1967)). Such estimates reveal more accurate than those provide by REML-based one, although the latter one presented dispersion with slightly less magnitude, more precisely, not bigger than 0.0261. By carrying out simulations in unbalanced “one-way” random designs it was shown that Sub-D still producing accurate estimates while those based on Anova and REML produce estimates with low accuracy (see Silva (2017)); also, it is worth highlight that the Sub-D estimates presents a slightly more dispersion than the REML ones.
Since the simulations studies carried out by Silva (2017) suggested that Sub-D has a good performance in “one-way” random designs (MLM with two variance components), better than those based on Anova methods, and somehow comparable to those based on REML methods, for the point of view of efficiency, it seems to be interesting to study its MSE. In fact, this problem has already been addressed by Klotz et al. (2012), although for likelihood-based estimators and only in balanced “one-way” random designs. Here we only set down the MSE of Sub-D, leaving the discussion regarding its efficiency versus the efficiency of Anova-based and likelihood-based estimators for the next issue.
Section 2 is dedicated to the Sub-D’s deduction (the background). In Section 3 the Sub-D’s MSE is introduced and discussed, while in Section 4 a practical example is explored. Finally, in Section 5 a final remark is made.
Background
We approach the random “one-way” design
where denotes the general mean, the observed random effect due to the th group of treatments, with mean zero and variance , and the random residual errors with mean zero and variance . It is assumed that and are independently observed.
Let and denote the dimension vectors of 1’s and 0’s, respectively. Let also denotes the matrices (matrices with rows and columns) and the identity matrices of dimension . Mor over, let denotes a random vector with mean and variance-covariance matrix .
We are also assuming that which is indeed an usual assumption, once the number of groups of treatment is largely small then the total number of sum of observations for all groups.
It must be remarked that when there is empty cells (which means no observation for certain group or groups of treatments) will not be a full rank matrix. Indeed, in such case it holds .
Since the parameters to be estimated do not depend on the fixed effect, Ferreira et al. (2017) and Silva (2017) thought it was convenient to remove the model dependence on the fixed effect, remarking that such an action cause no loss of information needed to estimate those parameters and consequently the reduction on the complexity of the model for the algebraic manipulation, as well as the bias in estimation process. The strategy followed is in all similar to the first phase of REML: the observations vector is projected on the orthogonal complement of , the vectorial subspace spanned by the columns of the main vector . Thus, following such a strategy, we found the following restricted model:
with and the matrix whose columns are the eigenvectors associated to the null eigenvalues of , that is the projection matrix onto , holding therefore
noting that and .
Discussing the rank of
Recall , with , and let .
With being a full rank matrix and so (that is ), according with Theorem 2.10 of Schott (1997), we will have
holding . On the other hand
holding . Then, .
Now, with not necessairelly a full rank matrix, that is , we have the following:
holding .
Thus, in both case we have that .
Estimator Sub-D
With , where , , is the matrix whose columns are the orthonormal eigenvectors associated to the th eigenvalue of , say , holding therefore , we may produce the new restricted model:
where and , since (notice has nonnull eigenvalues, say , with respective multiplicity , and one null eigenvalues with multiplicity ).
Now, with , the estimator Sub-D for in model Eq. (3) is given as
where , and , , the sub-models. See Silva (2017) for additional explanation.
Mean square error
For the several aspects of the efficiency of an estimator - namely Asymptotic efficiency, relative efficiency, etc - discussing the efficiency of an estimator goes through a strait analysis of its variance-covariance and/or MSE.
Let and , respectively, denote the expectation and variance-covariance matrix of a random variable (or vector) , and the trace of a real matrix (see Schott (1997)). The cross-covariance between the random variable (or vector) and is denoted as . Let also denotes the MSE of .
since and .
Since is an unbiased estimator according with Eq. (8), it follows that
with denoting the variance-covariance matrix of and a function fo and . We will handle it later in this section. It is clear that the unique unknown matrix in is . In what follows we prove that such a matrix is a diagonal one.
Proposition 1. Recall , . Then:
Proof Let , . The proof follows if we notice the variance of , , is given as
and the cross-covariance between and , for , given as
Since the columns of are the orthonormal eigenvectors of , the latest result is due to the following facts: , and the commutativity of the trace operator. For the variance and cross-covariance calculation we used the Theorem 9.18 of Schott (1997).
Now we resume this section main result: the MSE of stated at Eq. (9).
Proposition 2.Let
Then
(recall . See Silva and Fonseca (2018) for the proof).
Proof One should notice
with , and consequently
where
According with Eq. (9) and results in Eq. (3), and notice , it follows that
Thus, after a straightforward calculation the proposed result follows.
If we denote , for a natural number , with , the MSE of can be written as follows:
Due to its non-negativity, for fixed and , the magnitude of is directly connected to the parameter ; that is, the smaller the value of the smaller the value of the MSE of .
Numerical example
Recall the numerical example in Silva (2018) in which its is considered , , object produced on the ith of the 20 random chosen machines, with a continuous score recorded for each objects. The data (160 observations) was considered to be from the following random “one-way” designs:
where is the overall mean, each is the independent random effect due to machine and independent of the error (the error for th observation fo the group ). The data includes observations (sum of the observed objects for all the 20 machines).
The method Sub-D was used to infer about the variance for the machine’s effect as well as the one for the error effect; that is, the variance components for the considered design.
For such a model we have different eigenvalues:
It was found that , and for all and . It is also found that 0.8475, 10,5947 and 143.5984. Thus, the MSE for Sub-D in the corresponding design is:
Final remark
Sub-D have been vastly tested in mixed linear models, specifically thought simulations in balanced and unbalanced “one-way” random designs, nested and crossed “two-way” designs and nested “three-way” designs, as may be checked on Silva (2017), Silva (2017) and Silva and Fonseca (2018). Although the simulations suggest Sub-D has in general better performance than the Anova-based and likelihood-based estimators, we have yet no tools wherewith infer over its efficiency, so that the deduction of its variance-covariance matrix as well as its mean square errors seems to be very timely. In fact, discussing the efficiency of an estimator (on its several aspects) goes through its variance-covariance and/or MSE.
As it may be seen, the Sub-D MSE structure that its greatness is intrinsically linked to the number of the different eigenvalues () and the values of eigenvalues itself; that is the smaller the eigenvalues are, the smaller the values of (), and consequently the smaller the value of (see Eq. (14)).
The next issue will addressee the efficiency of Sub-D related to the ones of Anova-based and likelihood-based estimators.
Footnotes
Acknowledgments
This work is funded by National Funds through the FCT – Fundação para a Ciência e a Tecnologia, I.P., under the scope of the project UIDB/00297/2020 (Center for Mathematics and Applications).
References
1.
AndersonR. (1975). Designs and estimators for variance components. Statistical Design and linear Model, 1-30.
2.
AndersonR., & CrumpP. (1967). Comparisons of designs and estimation procedures for estimating parameters in a two-stages nested process. Technometrics, 499-516.
3.
FerreiraD.FerreiraS.NunesC.FonsecaM.SilvaA., & MexiaJ. (2017). Estimation and incommutativity in mixed models. Journal of Multivariate Analysis, 161, 58-67.
4.
KlotzJ.MiltonR., & ZacksS. (2012). Mean square efficiency of estimators of variance components. Journal of the American Statistical Association, 64, 1383-1402.
5.
SchottJ. (1997). Matrix analysis for statistics. John Wiley & Sons, New York.
6.
SilvaA. (2017). Variance components estimation in mixed linear models. Ph.D Thesis.
7.
SilvaA., & FonsecaM. (2018). The method Sub-D for variance componentes estimation in random one-way designs. International Journal of Multivariate Data Analysis, 1(4), 327-336.