Abstract
The methodological advancements made in the field of joint models are numerous. None the less, the case of competing risks joint models has largely been neglected, especially from a practitioner's point of view. In the relevant works on competing risks joint models, the assumptions of a Gaussian linear longitudinal series and proportional cause-specific hazard functions, amongst others, have remained unchallenged. In this article, we provide a framework based on R-INLA to apply competing risks joint models in a unifying way such that non-Gaussian longitudinal data, spatial structures, times-dependent splines and various latent association structures, to mention a few, are all embraced in our approach. Our motivation stems from the SANAD trial which exhibits non-linear longitudinal trajectories and competing risks for failure of treatment. We also present a discrete competing risks joint model for longitudinal count data as well as a spatial competing risks joint model as specific examples.
Keywords
Introduction
Time-to-event data is observed in its simplest form as the time when a particular event happens or when the monitoring process halts (censoring time). A competing risks model (Hoel, 1972; Moeschberger and David, 1971; Altshuler, 1970) arises in the case where patients are at risk of one of multiple events, that is, multiple mutually exclusive outcomes. The occurrence of an event precludes all other events so that each observational unit is at risk of each of the
In most situations, additional data is available in the form of repeated measurements over time constituting a longitudinal data set. The joint modeling of the competing risks data and the longitudinal data (Hu et al., 2009; Elashoff et al., 2007) are often necessary since the missing longitudinal observations are not missing at random but rather due to the occurrence of one of the competing events. Ignoring this phenomenon, will bias the longitudinal estimates. If the treatment efficacy is being evaluated, then joint modeling enables us to evaluate the treatment effect in both the longitudinal and time-event endpoints simultaneously. Also, unobserved heterogeneity in the individuals due to some latent effect over time can be successfully accommodated through the joint modeling.
Computational frameworks for competing risks have been developed in recent years such as the CFC package (Sharabiani and Mahani, 2017) and the cmprsk package (Gray, 2014). The existence of these frameworks is essential for catalysing the use of competing risks models in practice.
For joint modeling of competing risks and longitudinal data, the R package joineR (Philipson et al., 2018) can be used. This package implements the model developed by (Williamson et al., 2008). This model is a cause-specific Cox proportional hazards regression model (for two competing events) and a linear mixed effects model with an assumed latent Gaussian association process. An accelerated failure time (AFT) model, a non-Gaussian distribution for the longitudinal model component or spatial data thus excludes the use of joineR. Generic Bayesian software platforms such as BayesX, STAN and JAGS can also be used to make an inference on competing risks joint models through simulation, like Markov chain Monte Carlo (MCMC). In this article we will not use simulation-based tools but rather approximate methods which have considerable computational advantage.
Practitioners depend largely on the availability of wieldy computational frameworks to ease the computational burden of implementing such joint models. On the flip side, too many different developments could confuse the potential users and exacerbate the hesitation to apply competing risks models. This is especially true when more complex models are needed and new packages are developed for each additional complexity. This is cumbersome for users since many different packages should be familiarized for common use. The exigency for a more cohesive approach to competing risks joint models is clear. We believe that we address this deficiency in this article.
To achieve our aim of presenting a cohesive computational framework for competing risks joint models, we are spurred to characterize competing risks models within the class of latent Gaussian models. This characterization enables us to use the integrated nested Laplace approximation (INLA) method proposed by (Rue et al., 2009). Our rationale for this endeavour is the computational efficiency and ability to incorporate various modeling elements available in the accompanying R-INLA package (Rue et al., 2011). The INLA framework has been used in various fields of application since its inception. In survival analysis, (Martino et al., 2011) provided the details for basic survival models and (Álvaro-Meca et al., 2013) presented an approach for competing risks models using R-INLA, while (Van Niekerk et al., 2019) presented the case of joint survival longitudinal modeling. Competing risks joint models with frailties (Martino et al., 2011), spatial structures (Bakka et al., 2018) and non-linear covariate effects (Lindgren and Rue, 2008) with non-Gaussian longitudinal submodels are all natural possibilities within our framework.
In the SANAD trial (Marson et al., 2007), the aim was to evaluate the efficacy of various treatments for seizure control in patients with epilepsy. Two treatment failure types, unacceptable adverse effects (UAE) and inadequate seizure control (ISC) were considered as competing events. From exploratory data analysis, the non-linear mean trajectory of the titration dosage over time for the two different treatments is clear and also noted by (Williamson et al., 2008). A piecewise linear model with two different slopes (with the changepoint at time
In Section 2 we present an introduction and necessary details of competing risks joint models in general which are then formulated as latent Gaussian models in Section 3. We also provide two specific examples of competing risks joint models which might arise in practice to give the reader an intuition for the proposed methodology. We apply the proposed method to the SANAD trial in Section 4 and a simulation study for a discrete competing risks joint model (longitudinal count data) is presented in Section 5. Finally, we conclude with a discussion in Section 6 on the proposed method and some exciting possibilities thereof.
Joint models for competing risks and longitudinal data
The applications of joint models have increased over the last few years. Essentially, a joint model is a statistical method to model two or more dependent endpoints/responses simultaneously. An example of joint models in spatial statistics are coregionalization models (Schmidt and Gelfand, 2003; Krainski et al., 2018) where there is dependency between multiple spatial fields. In survival analysis, the use of joint models to infer joint survival longitudinal models is very popular. In these cases, joint modeling is fundamentally correct since the time-event process and the longitudinal data per individual is inherently dependent since it is governed by the same biological process. Although joint models are popular in survival analysis with one event of interest, the potential for their application in more complicated survival models is yet to be explored. In this article, we focus on competing risks joint models with various realistic features such as multiple events of interest, spatial random effects and non-linear longitudinal trajectories. To this end, we present cause-specific competing risks joint models with proportional or non-proportional cause-specific hazard functions and non-Gaussian or Gaussian longitudinal series as latent Gaussian models. This venture allows the use of the fast and accurate INLA methodology Rue et al. (2009) for Bayesian inference of the model.
Cause-specific competing risks joint model
The competing risks joint model is used to make an inference for the biological process that underpins the longitudinal and competing risks data. For simplicity we assume
For the longitudinal marker
Denote the cause-specific hazard function for cause
where
where
Graphical illustration of the setup of the competing risks joint model
where
In this section we briefly present the details of latent Gaussian models (LGMs) and the integrated nested Laplace approximations (INLA) method for approximate Bayesian inference. If we can formulate competing risks joint models as LGMs and consequently use INLA for the inference, then we can fit these types of models very efficiently and with complex modeling components.
Latent Gaussian models and the integrated nested Laplace approximations method
Latent Gaussian models are a specific subset of hierarchical Bayesian additive models. This class comprises of well-known models such as mixed models, temporal and spatial models amongst many others. An LGM is defined as a model having a specific hierarchical structure, as follows:
The observed responses, The linear predictor is formulated as follows:
where The latent field A prior (possibly non-Gaussian),
From this hierarchical Bayesian formulation the joint posterior distribution is then given as follows:
Within this framework the joint posterior density (3.2) and subsequently the marginal posterior densities,
In this section we show that most competing risks joint models are indeed LGMs and can therefore be defined as one model. This enables the use of the INLA methodology for many different competing risks joint models, for example discrete competing risks joint models, spatial competing risks joint models, non-linear competing risks joint models and correlated competing risks joint models.
Consider the case with
Now define the stacked response list
From this construction the cause-specific competing risks joint model (in general, with various random effects such as spatial fields, spline effects and correlated models.) is an LGM and the joint posterior distribution of the latent field and the hyperparameters is as follows:
Since this proposition is abstract, we provide some specific examples of relevant competing risks joint models in the next section.
Specific examples
Next, we present some specific (non-exhaustive) examples of competing risks joint models that fit in this framework.
Example 1. Poisson competing risks random-effects joint model
Suppose that the longitudinal observations,
Now define the associated linear predictors as follows:
where
then this competing risks joint model is an LGM following Section 3.
Suppose that the longitudinal observations,
Now define the associated linear predictors as follows:
where
Now define
then this spatial competing risks joint model is an LGM following Section 3.
The SANAD trial (Marson et al., 2007) was a randomized trial study to investigate the effect of seizure control drugs in patient with epilepsy for whom carbamazepine (CBZ) was considered as the standard treatment protocol. Two treatment failure types, UAE and ISC were considered as competing events. The patients were randomly assigned to CBZ, lamotrigine (LTG), gabapentin, oxcarbazepine and topiramate. In the main analysis, LTG appeared to be superior to CBZ but since then various criticisms regarding this finding have risen mainly due to the difference in titration speed and therefore the dosage. In Williamson et al. (2008) this data was analysed using a competing risks random-effects joint model with a changepoint at time
Mean trajectories for CBZ and LTG treatment groups
Mean trajectories for CBZ and LTG treatment groups
We propose the following competing risks joint model with non-linear longitudinal trajectories for the SANAD trial data at time
where
with penalizing complexity priors (Simpson et al., 2017) for most of the hyperparameters, such that
The user-defined parameter
The estimated mean dosage trajectories for the two treatements, CBZ and LTG, on a log scale are presented in Figure 3. The effect of the joint model on the longitudinal trajectories is clear when compared to Figure 2, in which the dropout data is not used. The dropout process thus affects the longitudinal trajectories in a meaningful way. The estimated competing risks joint model is as follows:
Mean estimated dosage trajectories for the CBZ and LTG treatment groups in bold (with
credible intervals)
Mean estimated dosage trajectories for the CBZ and LTG treatment groups in bold (with
credible intervals)
with
The negative association between the cause-specific hazard functions of UAE and ISC, respectively, and the smoothed dosage level should be interpreted with the shape parameters,
Our findings are thus consistent with those of (Williamson et al., 2008) with the added advantage of the flexible estimated mean longitudinal trajectories for the two treatment dosages.
In this section we simulate longitudinal count data as well as time-event data with three competing risks (Causes
with age a discrete explanatory variable within the range
Results
In Figure 4 the longitudinal series and the mean estimated trajectory from a random walk order two model with precision
Simulated longitudinal data and estimated mean trajectory
Simulated longitudinal data and estimated mean trajectory
Estimates from the discrete competing risks joint model using R-INLA
From the estimates of the covariate effects in the cause-specific hazard functions we can deduce various other quantities of interest such as the cause-specific survival-like function and CIF etc. In Figure 5 the survival-like functions of the three risks are illustrated. Certain functions in the R-INLA framework can be used to estimate the marginal distributions of these quantities.
Cause-specific survival-like functions of the three risks with
credible interval for risks 1,2 and 3 (from top to bottom)
Here we presented a general method to apply a wide class of competing risks joint models using the R-INLA framework. The main stimulus for this research was the necessity for a proper non-linear competing risks joint model as motivated from the SANAD trial. Although we addressed this issue, we also developed a class of latent Gaussian competing risks joint models which enable the practitioner to use our method for various realistic situations which demand different types of competing risks joint models. We presented the details of two specific realistic situations, one being a competing risks model with three competing events and one longitudinal series of count data, the other a spatial competing risks joint model. The application of the non-linear competing risks joint model to the SANAD trial proved fruitful as this model properly captures the non-linear trajectory.
The possibility of a non-linear competing risks joint model based on a random walk order two model emerged through the R-INLA framework. The INLA methodology (Rue et al., 2009) has been proved to be a very efficient and popular computational platform for Bayesian models. The use of the R-INLA framework necessitated the formulation of the proposed model as a latent Gaussian model, which in turn facilitated the formulation of various other competing risks joint models that could arise in practical situations, such as multiple longitudinal series, discrete longitudinal distributions, spatial models and frailty models to name a few. Quantities of interest such as the cause-specific survival function and the all-cause survival function, the median survival time can be computed in R-INLA.
The possibilities of realistic competing risks joint models that can be inferred through the proposed methodology are vast and exciting. In particular, spatial/spatio-temporal random effects can be included in the cause-specific hazard functions and/or the longitudinal submodel to develop, for example, a spatial competing risks joint model. The works of (Hickey et al., 2018) on the SANAD trial, (Huang et al., 2010) on outlying values in the longitudinal submodel, (Rajeswaran et al., 2018) on multivariate longitudinal data and the model presented by (Elashoff et al., 2008) are some examples in the literature that can be applied with our approach through R-INLA. A thorough presentation of all the possibilities is not possible in this article but we believe that the general method presented herein catalyses future research, especially different instances of competing risks joint models motivated from a modeling perspective.
Footnotes
Declaration of Conflicting Interests
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Appendix
The code for this article is available at
