Abstract
We propose a multivariate regression model to deal with multiple outcomes along with repeated measures in the context of longitudinal data analysis. Our model allows for flexible and interpretable modelling of the covariance structure within outcomes by using a linear combination of known matrices, while the generalized Kronecker product is employed to take into account the correlation between outcomes. We present maximum likelihood estimation along with extensions of the classical multivariate analysis of variance and multiple comparison hypothesis tests to deal with multivariate longitudinal data. The model and the associated multivariate hypothesis test are motivated by a prospective study conducted to compare three aesthetic eyelid surgery techniques, namely blepharoplasty, endoscopic forehead lift and endoscopic forehead lift associated with blepharoplasty. The effect of the techniques was assessed using measurements of a horizontal line through pupil centre and then three vertical lines, which go in direction to lateral canthus, middle pupil and medial canthus to the top of the brow. In this study, 30 female patients were randomly divided into three groups. Preoperative measurements were compared with postoperative measurements taken 30 days, 90 days and 10 years after the surgery. The presented multivariate model provided a better fit than its univariate counterpart. The results showed that the three surgery techniques tend to increase all considered outcomes in a long-term perspective, that is, from preoperative to 10 years postoperative evaluations. The only exception was for the outcome lateral eyebrow, for which the blepharoplasty had no significant effect.
Introduction
The periorbital region has received great attention in the field of aesthetic surgery. It can be seen through the evolution of rejuvenation techniques developed for this region (Ramirez (1994); Paul (2001)). On the evaluation of the patient's periorbital region, we should analyse the eyelid excess tissues, such as skin and adipocyte fat, as well as the eyebrow ptosis (McKinney et al., 1991). Based on this analysis, three diagnostics are possible: (a) the patient presents excess of eyelid tissues and eyebrow in the correct position; (b) the patient presents eyebrow ptosis without an excess of eyelid tissues; and (c) the patient presents both eyebrow ptosis and an excess of eyelid tissues. Thus, superior third face surgeries are recommended such as blepharoplasty (Naik et al., 2009), endoscopic forehead lift (Isse, 1994) or even a combination of both.
Often in biomedical studies, multiple outcomes are taken over time for a group of patients. Consequently, the joint analysis of multiple outcomes in biomedical studies has been of increased interest in the medical and statistical literature. Recent contributions include the analysis of ordinal and continuous outcomes (Grigorova and Gueorguieva, 2016), multivariate repeated measures and time to event data in crossover trials (Liu and Li, 2016), joint analysis of longitudinal and survival data (Andrinopoulou and Rizopoulos, 2016; Wen et al., 2016; Hagar et al., 2016) and multivariate disease mapping using linear models of coregionalization (MacNab, 2016; Botella-Rocamora et al., 2015; Martinez-Beneito, 2013).
Similarly to the aforementioned articles, we are interested in the analysis of multiple outcomes in the context of longitudinal data analysis. The study we shall describe in Section 2 was conducted to compare the effect of three aesthetic eyelid surgery techniques, namely blepharoplasty, endoscopic forehead lift and endoscopic forehead lift with blepharoplasty. In this study, 30 female patients were randomly divided into three groups. Thus, each group was assigned 10 patients. To evaluate the effect of each surgery technique, three measurements (medial, central and lateral) were made: one vertical measurement from the top of eyebrow to a horizontal line (through pupil centre) in medial portion, one vertical height from the top of eyebrow to the middle pupil and one from the top of eyebrow to the lateral portion of horizontal pupil centreline. Preoperative measurements were compared to postoperative measurements taken
The main data analysis goal is to assess the effect of the surgery techniques for all outcomes in a long-term perspective, that is,
The main goal of this article is to propose a multivariate regression model suitable to deal with the combination of repeated measures and multiple outcomes and applied such a model to analyse the dataset described in Section 2. Furthermore, we present maximum likelihood estimation and propose extensions of the classical multivariate analysis of variance (MANOVA) along with multiple comparison techniques to deal with multivariate longitudinal data.
When dealing with multiple outcomes in longitudinal studies, a general issue is how to specify the joint covariance matrix to take into account the within outcome correlation structure induced by the measures taken over time, as well as the correlation introduced by the multiple outcomes. In this article, we adopt the multivariate covariance generalized linear models (McGLM) framework (Bonat and Jørgensen, 2016), which provides flexible and interpretable modelling of the covariance structure. The within outcomes covariance matrix is specified for each marginal outcome using a linear combination of known matrices, while the joint covariance matrix is specified using the generalized Kronecker product (Martinez-Beneito, 2013; Bonat and Jørgensen, 2016). An advantage of this specification is that the univariate counterparts are easily identified as special cases of the multivariate model. Therefore, orthodox measures of goodness-of-fit as the log-likelihood value, Akaike and Bayesian information criteria can be used to compare the fit of the multivariate model with its univariate counterparts.
Given the recent developments in the McGLMs framework, the main contributions of this article are as follows (a) introducing a suitable specification of the McGLMs to deal with multiple outcomes in the context of longitudinal data; (b) presenting maximum likelihood estimation for the model parameters in contrast to the estimating function approach often used in the context of McGLMs; (c) proposing an extension of the standard MANOVA where we can explicitly model the covariance structure to take into account the correlation induced by the longitudinal structure; (d) extending standard multiple comparison test procedures to deal with multiple outcomes and longitudinal data; (e) providing
We present the dataset in Section 2. Section 3 presents the model and discusses its properties and components. Section 4 discusses the maximum likelihood estimation for model parameters. In Section 5, we present extensions of the MANOVA test and multivariate multiple comparison hypothesis tests. In Section 6, we apply the model to the data and present the main results. Finally, the main results are discussed in Section 7, including some directions for future investigations. The dataset and
Three measurements of the vertical height from the lateral canthus, middle pupil and medial canthus to the top of the brow (from ear to nose)
Three measurements of the vertical height from the lateral canthus, middle pupil and medial canthus to the top of the brow (from ear to nose)
The case study analysed in this article uses data collected in the Clinic's Hospital of the Paraná Federal University, Curitiba, Brazil. The study was conducted to assess the effect of three eyelid aesthetic surgery techniques, namely blepharoplasty, endoscopic forehead lift and a combination of both techniques. Thirty female patients were randomly assigned into three groups, that is, 10 patients for each surgery technique. A trained surgeon did the surgical procedures during the period from March to December in
The measurements were collected using a digital photo in the preoperative,
Box plots in Figure 2 suggested for the blepharoplasty cases, there is a slight decrease in the six response variables from the preoperative to the
Boxplots by response variables, surgery techniques, evaluation times and eyes
Boxplots by response variables, surgery techniques, evaluation times and eyes
In this section, we shall propose a multivariate regression model for dealing with multiple continuous outcomes in the context of longitudinal studies. Consider the case of
Let
where
is the generalized Kronecker product (Martinez-Beneito, 2013). The matrix
The key point to deal with multiple response variables in longitudinal studies is the specification of the covariance matrix within outcomes
where
An important issue that appears when dealing with general covariance models as the one presented in Equation 3.2 is the definition of the parameter space for the parameter vector
It is important to highlight that the matrix linear predictor is specified for each response variable. Thus, suppose without loss of generality that
The components of the matrix linear predictor for the outcome
in the exchangeable case. Similarly, in the inverse of Euclidean distance case, we have
where
Finally, to deal with variance heterogeneity in a regression model fashion, we specify the diagonal of the matrix linear predictor components as
To obtain the components of the matrix linear predictor for the entire response variable vector
where the operator
The correlation matrix
where
The model proposed in this section is a special case of the McGLMs obtained by using identity link function, constant variance function and identity covariance link function and can be easily fitted by the maximum likelihood method, as explained in Section 4.
In this section, we describe the likelihood approach used to estimate the model parameters. We divide the set of parameters into two subsets
where
where
Similarly, the score function for
By using similar calculations, we can show that the Fisher information matrix for
In a similar way, the
where
The cross-entry
Let
Let
Given the block diagonal structure of equation (4.5), the asymptotic variance of the maximum likelihood estimator of
where
The classical MANOVA is a huge topic in itself (Smith et al., 1962; Berndt and Savin, 1977). In this section, we explore the Wald test to propose an extension of the classical MANOVA test to deal with multiple responses in the context of longitudinal data. The main difference from our multivariate test to the classical MANOVA is that under our approach, we can model the covariance structure to explicitly take into account the correlation induced by the longitudinal design, while in the classical MANOVA the covariance matrix is completely non-specified, that is, we have to estimate all of its parameters.
In general, under the model in equation (3.1), the general linear hypothesis may be stated as
where
where
To assess the linear hypothesis specified in equation (5.1), we use the Wald statistics given by
which under the null hypothesis is asymptotically chi-square distributed with
To give a hint of the approach flexibility, we consider a simple bivariate case and a treatment with three levels (
where
The null hypothesis of the classical MANOVA test is stated by
Equivalently, in a matrix notation
where it is easy to see that
Given the
The
Finally, we can use similar ideas for joint testing the covariance parameters. For example, to assess the significance of the longitudinal effect over all response variables.
In this section, we apply the model proposed in Section 3 to analyse the eyelid dataset. In this application, for composing the linear predictor we have two covariates surgery techniques (
To specify the matrix linear predictor for each response variable, we consider the structures: variance heterogeneity, exchangeable and the inverse of Euclidean distance as described in Section 3. For comparison purposes, we also fitted a model considering independent observations. It is obtained by specifying the matrix linear predictor as an identity matrix. Finally, we fitted all the aforementioned models fixing the correlation parameters at zero, which correspond to fit separated models for each outcome. Table 1 presents the maximized log-likelihood values, degrees of freedom (
Maximized log-likelihood values (logLik ), degrees of freedom (df ), Akaike (AIC ) and Bayesian (BIC ) information criterion by fitted models
Maximized log-likelihood values (
The results in Table 1 show that for all measures of goodness-of-fit the models improved from the independent to the inverse of Euclidean distance and exchangeable covariance models in both univariate and multivariate specifications. It shows the relevancy to model the within covariance structure. Variance heterogeneity does not seem concerning. Thus, based on the log-likelihood value we conclude that the exchangeable structure provides the best fit. Such a conclusion is corroborated by the
For all within covariance structures, the multivariate model presented measures of goodness-of-fit better than its univariate counterparts. It highlights the importance of considering the multivariate model as proposed in this article. Hence, we opted to continue the data analysis by analysing the results of the multivariate model fitted using the exchangeable within covariance structure.
The results of the MANOVA test presented in Table 2 show that all effects considered in the model are highly significant.
Wald statistics
The MANOVA test showed the overall significance of the effects. Then, we extend the analysis by performing the test for each response variable. Results in Table 3 show that many of the effects composing the linear predictor are highly significant (
Wald statistics (
Table 4 shows parameter estimates, standard errors and Z-values for the components of the within covariance structure. The coefficients
Parameter estimates (
Parameter estimates (
It is interesting to note that the intraclass correlation is larger in the right eye than in the left eye for all response variables. The estimates and standard errors for the entries of the
All correlations are significantly different from zero and substantial in size. Furthermore, these correlations take into account the effect of all covariates and the longitudinal structure.
Finally, to address the main goal of this data analysis which is to assess the overall effect of the three eyelid aesthetic surgery techniques Table 6 presents all contrasts between evaluation times and surgery techniques, along with the associated Wald statistics and
Wald statistics
Concerning the effects presented in Table 6 is important to note that the differences from preoperative to
In order to further explore the results, Tables 7 and 8 presents the estimated differences and the associated standard errors, Z-statistics and
Estimates (
Estimates (
Regarding the blepharoplasty technique, results in Table 7 show that this technique has no effect on the outcomes Lateral canthus and Middle pupil for both eyes. For the outcome Medial canthus the difference from preoperative to
The dynamic for the techniques endoscopic forehead lift and endoscopic forehead lift with blepharoplasty is similar. In general, we have a significant effect of the surgery techniques, that is, the differences from preoperative to
We presented a statistical model to deal with multiple outcomes along with repeated measures in the context of longitudinal data analysis. The model was motivated for an experiment concerning the comparison of three aesthetic surgery techniques in relation to three outcomes, namely, lateral canthus, middle pupil and medial canthus. For modelling the correlation induced by the measures taken over time, we compared the fit of the variance heterogeneity, exchangeable and inverse of the Euclidean distance covariance structures. By using standard measures of goodness-of-fit, we concluded that the multivariate model combined with the exchangeable within covariance structure provided the best balance between goodness-of-fit and complexity. Furthermore, based on the presented multivariate model we extend the classical MANOVA and multiple comparison hypothesis test to deal with multivariate longitudinal data. Although, the model specification is simple and the fitting method amounts to finding the root for a set of non-linear equations, it also involves computational demanding tasks such as the Cholesky decomposition and matrix multiplication. Thus, in the case of large datasets, consisting of many outcomes and long follow-up, the computational cost may prevent routine use of the proposed model.
In our study the measurements were taken based on the horizontal line passing through the pupil centre, not in medial and lateral canthus (eye canthus) because these measurements change when blepharoplasty and endoscopic forehead lift are performed. So, to avoid a bias, a horizontal fixed line through pupil centre was determined. Lateral measurements are our focus in surgery evaluation, since they tend to increase. That is what we look for when surgery is made; especially in cases that endoscopic forehead technique has been used. Our results, showed that both the endoscopic forehead as well as the endoscopic forehead associated with blepharoplasty have a strong and significant effect on the Lateral canthus measures, even in a long-term perspective (
In cases of both techniques have been performed (endoscopic forehead and blepharoplasty), a lateral elevation of eyebrow was noted but it tends to be less than the elevation of lateral eyebrow endoscopic forehead surgery. We can explain this difference by a little inferior vertical vector after skin resection in blepharoplasty. Medial and central measurements tend to increase, especially in cases which endoscopic forehead technique has been used (associated or not to blepharoplasty). When forehead lift techniques are applied, a major lateral undermining (ligaments detach) is made, more than in the central part of the forehead. These findings have been noted in all patients, with a high tendency to increase lateral eyebrow elevation in cases in which forehead lift technique have been used. A long-term evaluation showed long lasting results and a lateral eyebrow elevation tendency in cases of endoscopic forehead lift (associated or not to blepharoplasty). When blepharoplasty was performed, as expected, vertical vectors have no significant elevation.
An additional feature of the multivariate model is the possibility to consider the six outcomes in one joint model and thus compute the correlation between outcomes. Although, the correlation between outcomes is not the main interest in our data analysis. It is still important to consider such a structure because it can potentially increase the power of the hypotheses tests associated with the regression coefficients and consequently increase the power of our analysis. Thus, we let as a future work to determine the gain in terms of power obtained by using the multivariate model and further investigate how it can be used to compute the sample size required to reach some determined power in the hypotheses tests, as discussed for example in (Li and McKeague, 2013). Another interesting extension consists of to allow more flexible models to deal with the longitudinal structure that are not linear covariance models as for instance models based on correlation functions as the exponential or spherical and the impact of such structures in the power of the multivariate hypothesis tests.
Supplemental material
Supplementary materials for this article are available from
Footnotes
Declaration of conflicting interests
Funding
The authors received no financial support for the research, authorship and/or publication of this article.
