Abstract
This article deals with the analysis of sensitivity to the non-ignorability of the dropout process in joint models (JMs). We investigate the behaviour of the maximum likelihood estimates for the longitudinal process in a neighbourhood of ignorability through the Index of Local Sensitivity to Non Ignorability (ISNI). Some concerns may arise since the ISNI is an absolute measure of change in parameter estimates induced by departures from the MAR assumption; for this reason, we introduce a relative index based on the ratio between the ISNI and a measure of its variability under the MAR assumption, highlighting potential interpretation and drawbacks of this approach. The local sensitivity of the JM and the performance of the relative index are discussed in a simulation study, by varying the number of repeated measurements per individual and the random effect covariance structure. The approach is also applied to a benchmark dataset on Primary Biliary Cirrhosis (PBC).
Introduction
Longitudinal studies often record two types of outcomes: repeated measurements for a response of interest and realizations of a time-to-event process describing individual participation to the study. An example can be derived from AIDS studies, where interest lies in the longitudinal evolution of markers such as the CD4 cells count, as well as in the time-to-AIDS or time-to-death. The longitudinal response may be directly related to the event process; after an event has occurred, longitudinal measurements may no longer be collected or considered relevant, therefore inducing dropout.
In many cases, the dropout process may depend on the unobserved values of the longitudinal response, corresponding to a not random missingness (Little and Rubin, 2002). Literature has so far concentrated on three modelling frameworks for the analysis of a longitudinal outcome subject to a (nonrandom) dropout process: Selection Models (SeMs), see e.g. (Diggle et al. 1994), Pattern Mixture Models (PMMs), (Little, 1993), and Shared Parameter Models (SPMs), (Follman and Wu, 1995). Even though these models can be quite flexible in the specification of the dropout mechanism, not too much trust must be placed on inferences due to potential sensitivity of parameter estimates to (unverifiable) modelling assumptions, see Little (1995), Rizopoulos et al. (2008a), Rizopoulos et al. (2008b) among others. Therefore, we need to investigate sensitivity of model parameter estimates to assumptions on the dropout mechanism; the majority of the proposals have focused on Selection and Pattern Mixture Models, see e.g. Troxel et al. (2004), Ma et al. (2005), Molenberghs (2005). Not too much have been done for the class of Shared Parameter Models, with the only exception of the recent work by Creemers et al. (2010).
In the present article, we consider a specific class of SPMs, referred to as joint models (JM), see Wulfsohn and Tsiatis (1997), where the longitudinal and the dropout processes may share a set of time-dependent covariates, as well as the corresponding fixed and random effects. In this framework, the time that a subject spends in the study is described by a proportional hazard model (Cox, 1972). Since the longitudinal evolution of the response of interest is measured with error, we assume that the risk of dropout is influenced by its (error-free) expected value, as well as by some baseline, time-invariant, covariates. As a sensitivity measure, we extend the Index of Local Sensitivity (ISNI) proposed by Troxel et al. (2004) and Ma et al. (2005), to joint models. The goal is to measure how much the maximum likelihood estimates are influenced by assumptions about the dropout mechanism. The main advantage of this approach is that it is based on quantities that can be calculated without currently fitting the full joint model.
By adapting the standard ISNI formulation to JMs, some concerns arise since ISNI is an absolute measure of change in parameter estimates induced by departures from the MAR assumption, see Troxel et al. (2004). We introduce a relative index based on the ratio between the ISNI and its standard error under MAR, and highlight potential interpretation and drawbacks of this formulation. To analyse the index behaviour in empirical applications and study the sensitivity of JM to non-ignorability, we review a benchmark dataset on primary biliary cirrhosis (PBC), see Murtaugh et al. (2002), where 312 patients have been considered to test for treatment effect on survival after adjusting for the longitudinal bilirubin levels. We also discuss the performance of absolute and relative indices through a simulation study where the number of subjects, the association structure between the longitudinal and the survival processes and the random effect covariance structure vary. This would help us analyze the proposed index behaviour when the model is correctly specified and when it is partially/globally misspecified.
The rest of the article is organized as follows: Section 2 describes the JM framework in details. Section 3 gives an insight on sensitivity analysis and introduces the ISNI formulation in JMs, while Section 4 entails the definition of the relative index. The sensitivity of JM and the empirical behaviour of the indices are discussed in Sections 5 for the PBC dataset and in Section 6 through a simulation study. Finally, Section 7 gives some concluding remarks and outlines potential developments.
The joint model
Let Yi(tij) represent a longitudinal continuous response recorded at occasion j = 1, . . ., ri for the ith subject (i = 1, . . ., n) and let yi(tij) be the corresponding observed outcome. Throughout the article, we will assume that Yi(tij) follows a linear mixed model of the form:
where
Our aim is at investigating the effect of potentially informative dropouts on parameter estimates for the longitudinal model. JMs represent an appealing solution to account for the association between the longitudinal and the survival processes. The literature on JMs for longitudinal data dates back at least to Faucett and Thomas (1996) and Wulfsohn and Tsiatis (1997); since then several authors, see among others Rizopoulos et al. (2009) and Rizopoulos and Ghosh (2011), have proposed extensions and different estimation methods. A nice overview of existing methods is given by Tsiatis and Davidian (2004). In this framework, the repeated measurements and the time-to-event are assumed to share a set of time-varying covariates as well as the corresponding fixed and (subject-specific) random coefficients, the latter inducing dependence in the longitudinal profile as well as between the two processes. More specifically, the risk of experiencing the event at time Ti, described through a proportional hazard model, Cox, 1972, depends on the expected (error-free) value of the longitudinal outcome at the same time, treated as a time-dependent covariate with regression parameter α.
The joint model, in the formulation of Wulfsohn and Tsiatis (1997), was originally proposed to account for the effect of time dependent covariates measured with error on the survival process. With this purpose, the authors postulated a linear mixed model for the time dependent covariate and introduced the error-free part (i.e., the expected value of the longitudinal covariates) in the hazard function for the event of interest. Over the years, this model has became popular also in the context of longitudinal data with dropout, see for instance Tsiatis and Davidian (2004) and Rizopoulos (2010), to mention just a few, as a generalization of shared parameter models, Follmann and Wu (1995).
The model can be expressed by a set of two equations, one for the longitudinal and the other one for the survival part:
where
Model (2.2) allows the dropout process to depend on covariates (the same or different than those affecting the longitudinal process) in the definition of the relative risk model, by including the covariate vector
A relevant issue concerns the meaning of a (near) null estimate for α. This leads to a missing completely at random (MCAR) model, (Little and Rubin, 2002), that is, to the assumption that the dropout mechanism, conditional on available covariates, does not depend on the longitudinal response, either observed or missing. In fact when α = 0 model parameters in the two submodels are distinct and the joint probability for the dropout and the longitudinal process can be factorized as follows:
The same factorization holds for the log-likelihood function with respect to the fixed effects in the longitudinal process. As we adopt a maximum likelihood approach to parameter estimation, we know that parameters estimates derived from maximizing the likelihood of the longitudinal process, that is p(Yi(tij)), yield maximum likelihood estimates that are valid both under MCAR and under MAR assumptions, i.e., under the hypothesis that the dropout mechanism depends on the observed responses only. No setup may describe, within this modelling formulation, a proper MAR mechanism. However, what we are actually comparing is the results of the mixed model fitted alone to the data to the results of the joint model. In that respect and because the linear mixed model alone produces valid results under both MCAR and MAR, it makes sense to talk about MAR in our settings. When α ! 0, on the other hand, the dropout process depends on both the observed and the unobserved response through mi (Ti) and parameter distinctiveness does not hold. In these cases, we are in presence of non-ignorabile dropout process.
The basic assumption of JMs is that of conditional independence, expressed by the following equations:
That is, conditionally on the random effects, the repeated measurements for the i-th individual are independent, and the same is true for the longitudinal and the survival processes. The random coefficients allow for within-individual dependence in the longitudinal process and for dependence between the longitudinal and the survival processes.
In this section, we present the general formulation of the log-likelihood and corresponding score vector and Hessian matrix in the case of JMs, needed for the sensitivity analysis described in Section 3.
By indicating with θ the vector of all model parameters, the log-likelihood function for a JM can be written as follows:
where
If a joint model parameterization is adopted, the survival function at time Ti is given by
The integral in (2.4) is solved using a Gauss-Kronrod numerical rule, see Kronrod (1964).
The score function, see e.g., Rizopoulos et al. (2009), takes the form:
where
For an index pair (u, v), indicating the (u, v)-th component of the parameter vector θ, and the i-th individual, the Hessian matrix takes the following
where
From a computational point of view, to calculate the score vector and the Hessian matrix, we apply Gauss–Hermite quadrature since the integrals with respect to
In the previous section, we have claimed that a joint model corresponds to a MNAR mechanism. To illustrate this, let us suppose that Ti denote the event (e.g., dropout) time for the i-th individual and let us indicate by
which depends on
In this section, we review the general formulation of ISNI, proposed by Troxel et al. (2004) and Ma et al. (2005), and further developed by Xie (2008) for longitudinal non-Gaussian responses and give a specific formulation in the case of JMs. A more detailed derivation can be found in Appendix A. This tool aims at investigating the local sensitivity of parameter estimates to departures from the MAR assumption, i.e., how much maximum likelihood parameter estimates for the longitudinal model are influenced by the hypothesis about ignorability of the dropout mechanism. Sensitivity could be also analyzed for the fixed effects in the time-to-event model; however, the focus here is on modelling the longitudinal response accounting for potential informative missing data. For this reason, the longitudinal model plays a primary role and sensitivity will be discussed for fixed effects in that model only. Ma et al. (2005) give an extended overview of the ISNI introduced by Troxel et al. (2004), and carry through this analysis for longitudinal data. In particular, they consider a selection model where the probability of dropout at a given time point is a function of the response measured at the same time and at the previous one. Our case is different and considers a special case of selection models, where the dropout and the longitudinal processes do not share the current response, but only the expected, error-free, part. This particular model has been mentioned in literature as ‘joint model’, see for instance Rizopoulos (2010). In many practicals cases, for instance when the actual response at the dropout time is not available, or it is measured with error, this modelling approach is particularly appealing. Moreover, as it has been pointed out for instance by Rizopoulos and Ghosh (2011), this approach is convenient from a survival analysis standpoint, when one would account for the presence of time dependent covariates measured with error; from a longitudinal standpoint, when the interest is in obtaining unbiased longitudinal parameter estimates; finally, when the focus is on both the processes. Ma et al. (2005) convincingly illustrated the nice features of ISNI for selection models. Motivated by this, the aim of the present article is to investigate how the ISNI can be adapted to work with shared parameter model and in particular with the joint model, which is one of the three main frameworks for dealing with dropout.
Let
where
For the sake of simplicity, we will indicate these quantities as
The expansion in (3.2) allows us to write the ML estimate of θ as a function of α:
The index of sensitivity to nonignorability is defined as the derivative of
Since our interest lies in the study of the sensitivity for the longitudinal parameter vector β, we report the corresponding specific index as follows:
The main advantage of this approach is that it actually does not require to fit the MNAR model, but only to compute the corresponding Hessian matrix at α = 0. Further, it may be noted that (3.4) and (3.5) yield to the following incremental limit:
which represents the ratio of the distance between the MNAR and the MAR estimates to the value of the nonignorability parameter as α → 0.
The approximation in (3.2) is a second order Taylor’s expansion of the likelihood similar to the one adopted by Siannis et al. (2005) to study the dependence between survival and the censoring mechanisms. In that case, the approximation is used to build confidence intervals for the parameter of interest and holds only for small values of the dependence parameter. In the present context, Equation (3.2) is used to derive the index of sensitivity to non-ignorability, which is the derivative of the maximum likelihood estimator for a → 0. The interest is on changes of the parameter estimate when we move away from a MAR (or MCAR) assumption, i.e., when α changes from 0. For this reason, the assumption of a small dependence parameter is not necessary. The ISNI definition is based on the Hessian matrix for the JM as α → 0, and it does not actually depend on the value of α, but only on the available data and the sample size. The estimates for α obtained by analyzed data give an idea of the strength of potential dependence between the response and the survival time distribution. For example, if the ISNI suggests that a large displacement in the longitudinal model parameter estimates may occur when α moves away from zero and the α parameter is large, this may imply that the adopted modelling structure could produce parameter estimates that are far from those obtained under MAR assumption and that care is needed to interpret JM results. While an estimate α ≠ 0 is not enough to state dependence, since the estimates depends on the adopted modelling formulation, a large value for the ISNI points out that model parameter estimates may be heavily influenced (in a sensitivity perspective) by the adopted assumptions about the missing data mechanism.
This interpretation is confirmed by Ma et al. (2005), where the ISNI is seen as a measure of the extent to which small departures from ignorability affect the MLE parameters. Along the article, we report the α estimates as an additional tool to quantify the impact that the assumptions on the dropout process suggested in the JM formulation have on results for the analyzed data.
As it can be noticed, Equation (3.5) represents an absolute measure of parameter change and, therefore, cannot be straight to interpret. To allow an easier interpretation, Troxel et al., (2004) consider the ratio of the ISNI to the standard error of the corresponding MAR estimate rescaled for the variability of the response(vy), stating sensitivity when this ratio is greater than 1. This is referred to as ‘sensitivity transformation’ (1/c, where
For this purpose, we propose to provide a variability estimate based on parametric bootstrap. Let us consider a dataset with longitudinal outcome
where
stating sensitivity when | isni |≥1.
The assessment of the sampling variability when the MAR assumption is true, could be of interest for different reasons: first, to study the precision of the ISNI estimate; second, it could lead to a valid alternative for a relative index of sensitivity. This should help us separate the ‘geometric’ feature of the ISNI (representing potential sensitivity of model parameter) from numerical changes due to the association structure as described by the JM.
As it can be easily observed by looking at Equation (2.2), the joint model structure is based on the assumption that some effects are shared by the longitudinal and the survival processes; therefore, we could observe sensitivity to nonignorable missingness even for an approximately null value for the ‘true’ α and for a modest amount of dropouts. Adopting the JM formulation, the β are computed through the joint likelihood, i.e., the score function for β depends, by definition, on the longitudinal and the survival information; the corresponding meaning obviously changes when compared to the estimate obtained from the longitudinal model under the MAR hypothesis. Therefore, we could not be able to distinguish between the sensitivity due to the dependence structure from the one due to effective changes in the meaning of the parameter vector when α moves away from zero and to numerical approximation. Even if expression (3.6) does not depend on the value of α, since the ISNI is defined as the incremental limit of the ML estimate, one would expect that, when the true model is MAR (or MCAR), i.e., when α = 0, parameter estimates are still sensitive to non-ignorability. In fact, some variability of the index could be also present when α is close to zero, for example due to parameter non-distinctness or numerical/model approximations. Therefore, we need to assess whether the observed ISNI value could be due to a strong departure of the MNAR estimates from the MAR ones, or rather to the sampling variability in the index.
Previous considerations suggest that an estimate of the ISNI variability calculated under the MAR assumption could be an appealing measure of the index variability, and the relative formulation
This example comes from the primary biliary cirrhosis (PBC) data collected by the Mayo Clinic from 1974 to 1984, see Murtaugh et al. (2002). PBC is a fatal liver desease characterized by inflammatory destruction of the small bile ducts within the liver, which may lead to cirrhosis of the liver. In this study, 312 patients were considered; 158 were randomly assigned to receive D-penicillamine and 154 placebo. By the end of the study, 140 patients (45%) died, 143 (46%) were alive and 9% were transplanted. We are interested in testing for treatment effect on survival after adjusting for the longitudinal bilirubin levels. The Kaplan-Meier estimate for the time to death is depicted in Figure 1.
By looking at the K-M estimate, the treatment does not seem to significantly affect the survival time; this may be due to the fact that the treatment effect changes with time. Furthermore, patients did not return to the study centers at prespecified time points to provide serum bilirubin measurements and thus we observe great variability between their visiting patterns. In particular, patients have on average 6.23 visits (standard deviation 3.77 visits), resulting in a total of 1945 observations.

Taking advantage of the randomization set-up of the study, we fit a joint model where we only correct for treatment. Given that the distribution of the observed bilirubin serum shows substantial skeweness, we consider its natural logarithm. Here, we model the longitudinal dependence of the the bilirubin serum and the survival of enrolled patients by employing the following model for the bivariate (longitudinal and survival) process:
where xi1 represents the visit time, xi2 the treatment, and xi1 × xi2 the interaction between treatment and time, while wi1 is the baseline treatment. It can be noticed that the parameter vector could be interpreted in a causal framework; in fact, the study is randomized and potential confounding factors are controlled by individual-specific, time-constant, random effects in
While a quite significant change in parameter estimates is experienced by the time and the treatment marginal effects and the application of the JM suggests a non-ignorable dropout mechanism (
It is interesting to note that with an estimate
As it is shown in Table 2, it is evident that bootstrap estimate of the ISNI standard error is not of the same order of the rescaled standard error of the MAR estimate. Therefore, this variability measure could account for the ‘geometric’ feature of the ISNI (representing potential sensitivity of model parameter estimates due to the shape of the likelihood surface in a neighbourhood of α = 0) and numerical changes due to sampling variability in the analyzed data and to approximation used in the JM estimation algorithm.
Absolute and relative ISNI for the PBC data set.
Standard errors of the MAR parameter estimates (rescaled for the response standard deviation σy) and the ISNI for the PBC data set.
To investigate the empirical behaviour of the ISNI and the proposed relative formulation in joint models, we performed the following simulation study.
Study design
We simulate a longitudinal response yi(tij) from a Gaussian distribution with mean
In this study, x1i is a sequence from 0 to K (maximum observation time) and represents the longitudinal measurement occasions, x2i is a Bernoulli variable with parameter 0.5, representing a treatment variable and [x1i × x2i] is the interaction between time and treatment. On the other hand, wi = x2i. Individual censoring times have been randomly drawn from an exponential distribution with mean chosen to result in about 50% censoring.
We consider N = 1000 samples of size n = 312 observed for K = 5 or K = 15 occasions. We simulate different scenarios by varying the longitudinal parameter values, the α values and the random effect covariance matrix. For the latter, we consider the following matrices:
and
While D1 is the estimated random effect covariance matrix for the PBC dataset, D2 describes a homoscedastic random effect covariance with a positive association between bi1 and bi2. The response variance and the survival covariate effect wi, as well as the baseline hazard h0(Ti), are fixed to the estimates for the PBC dataset, i.e., σ2 = 0.158 and γ = 0.07.
On each simulated dataset, we compute the ISNI in (2.3), the sensitivity transformation
Further, we perform a sensitivity analysis to model misspecification. We consider three potential cases of misspecification and fix α = 0:
The observed data are simulated according to the joint model
and the absolute and relative ISNIs are computed for parameter estimates for model (6.1); The longitudinal response is drawn from a skew-normal distribution, Azzalini (1985), with shape parameter equal to 5, while the ISNIs are computed for the JM with Gaussian response; The survival time is assumed to follow an exponential distribution with rate 0.5, while the sensitivity indexes are computed for the JM with survival time following a proportional hazard model.
The aim is to assess robustness of the ISNI to model specification; this is quite a major point, since the ISNI is known to measure departures from the MAR estimates only when the fitted model is the true one.
The simulation results are organized as follows. In Section 6.2 we explore the sensitivity under the JM parametrization. Section 6.3 gives some interpretational remarks, while Section 6.4 describes the ISNI behaviour when the model is misspecified.
Tables 3–5 show the median of the ISNI, the Troxel sensitivity transformation 1/c and the proposed relative
Simulation results. Median over 1000 datasets of ISNI, Troxel sensitivity transformation 1/c and the proposed relative
for different values of non-ignorability parameter α, random effects covariance matrices D1 and D2, number of repeated measurements K and true longitudinal parameter βTrue = (–0.7, 0.2, –0.1, –0.05).
Simulation results. Median over 1000 datasets of ISNI, Troxel sensitivity transformation 1/c and the proposed relative
Simulation results. Median over 1000 datasets of ISNI, Troxel sensitivity transformation 1/c and the proposed relative
Simulation results. Median over 1000 datasets of ISNI, Troxel sensitivity transformation 1/c and the proposed relative
Tables 3–5 highlight sensitivity for the intercept and the time effects. The observed sensitivity of the drug effect is negligible with ISNI values approaching to zero. This may be due to the fact that the drug effect is time-invariant. Moreover, it is worth noticing that the sensitivity of JMs decreases as the number of repeated measurements increases and that the random effect covariance matrix does not play a substantial role in parameter sensitivity. These empirical evidences could suggest that, if the JM is the ‘true’ model and as the number of repeated measures increases, the longitudinal fixed effect estimates are not substantially influenced by the presence of MNAR mechanisms.
As pointed out in Section 6.2, the ‘sensitivity transformation’ of Troxel et al. (2004) suggests a low sensitivity even when the non-ignorability parameter is far from zero. As it can be evinced by Equation (3.6), the ISNI variability takes into account both the variability of the MAR parameter estimates and the variability due to the dependence structure. Rather, 1/c considers the MAR parameter variation and may underestimate the effective sensitivity. The absolute ISNI, on the other hand, suggests sensitivity when α ≠ 0, but it might be misleading since it is an absolute measure and the corresponding values cannot be directly interpreted. In the simulation study, the proposed relative index (isni) leads to a clearer interpretation, since it provides a direct comparison of the ISNI and a measure of its variability in a neighbourhood of the MAR hypothesis. Thus, we compare the ISNI with its natural variation, calculated under α = 0; sensitivity is therefore evident for a strong curvature of the log-likelihood function or for a strong departure from MAR. Rather than a theoretical measure, isni is an empirical relative index, which is strongly linked to the analyzed data.
Local sensitivity under misspecification
The ISNI is known to depend on the parametric structure of the model. In this Section we show the sensitivity behaviour of JMs when the model is misspecified.
Table 6 suggests that the joint model structure and the longitudinal distribution do not seem to influence the JM sensitivity to non-ignorability. The survival distribution, on the other hand, may influence the indexes value. Therefore, it is suggested to check the assumptions on the survival model before handling a sensitivity analysis with the ISNI.
Simulation study for model misspecification. Median over 1000 datasets of ISNI, Troxel sensitivity transformation 1/c and the proposed relative
for source of misspecification relative to model structure (a), the longitudinal distribution (b) and the survival distribution (c), α = 0, βTrue = (0.7, 0.2, 0.1, –0.05) and random effects covariance matrix D1.
Simulation study for model misspecification. Median over 1000 datasets of ISNI, Troxel sensitivity transformation 1/c and the proposed relative
In this article, we have extended the notion of ISNI, see Troxel et al. (2004) and Ma et al. (2005), to the class of joint models, Wulfsohn and Tsiatis (1997). We have focused on changes registered in fixed parameters for the longitudinal process due to potential misspecification in the missing data mechanism and in the dependence structure.
We have proposed a relative index of local sensitivity, given by the ratio of the ISNI values to the corresponding standard error estimated via parametric bootstrap. In this way, we have provided an estimate of the sampling variability of the index when the MAR hypothesis is true, which could help define a further relative measure of parameter sensitivity to departures from the MAR assumption. This approach is based on the empirical evidence that, also when the MAR hypothesis is true, the ISNI may take non null values; therefore, this absolute measure of displacement could be compared to a measure of its sampling variability, which is, in the context of joint models, likely linked to parameter non-separability. The proposed relative ISNI (isni) seems to lead to a clearer interpretation of parameters sensitivity, while the classical ISNI and sensitivity transformation may be misleading, at least in some cases, when a joint model is adopted.
The data application and the simulation study have shown a slight sensitivity of model parameter estimates when the departure from MAR is not too large, and a decreasing effect of the assumptions about ignorability for the missing data mechanism as the length of the individual sequences increases. The random effect covariance structure does not seem to play a substantial role. In addition, the sensitivity with respect to model misspecification seems not to be meaningful, with the only exception of the misspecification for the survival distribution.
In conclusion, as further illustrated in a benchmark data example, the performed sensitivity analysis based on the proposed relative formulation can be helpful to properly account for the peculiarities of joint models.
Footnotes
Calculating the ISNI for the shared parameter models
The score vectors with respect to the longitudinal fixed effects and the association parameter for the joint model assume the form
and
The posterior distribution of the random effects is given by
We derive the ISNI for the longitudinal fixed covariate effects as follows:
where
and
while
and
