Abstract
Background:
Patient characteristics may predict the progression of Alzheimer’s disease (AD) and may moderate the effects of donepezil.
Objective:
To build a personalized prediction model for patients with AD and to estimate patient-specific treatment effects of donepezil, using individual patient characteristics.
Methods:
We systematically searched for all double-masked randomized controlled trials comparing oral donepezil and pill placebo in the treatment of AD and requested individual participant data through its developer, Eisai. The primary outcome was cognitive function at 24 weeks, measured with the Alzheimer’s Disease Assessment Scale-cognitive component (ADAS-cog). We built a Bayesian meta-analytical prediction model for patients receiving placebo and we performed an individual patient data meta-analysis to estimate patient-level treatment effects.
Results:
Eight studies with 3,156 participants were included. The Bayesian prediction model suggested that more severe cognitive and global function at baseline and younger age were associated with worse cognitive function at 24 weeks. The individual participant data meta-analysis showed that, on average, donepezil was superior to placebo in cognitive function (ADAS-cog scores, –3.2; 95% Credible Interval (CrI) –4.2 to –2.1). In addition, our results suggested that antipsychotic drug use at baseline might be associated with a lower effect of donepezil in ADAS-cog (2.0; 95% CrI, –0.02 to 4.3).
Conclusion:
Although our results suggested that donepezil is somewhat efficacious for cognitive function for most patients with AD, use of antipsychotic drugs may be associated with lower efficacy of the drug. Future research with larger sample sizes, more patient covariates, and longer treatment duration is needed.
INTRODUCTION
Alzheimer’s disease (AD) is a progressive neurodegenerative disorder characterized by increasing aggravation of memory and other cognitive functions [1]. AD is the most common cause of dementia accounting for 60 to 80% of the cases worldwide, while it affects 10% of the population aged 65 or older [2]. There are limited therapeutics approved for the treatment of AD and specific patient factors contributing to treatment efficacy are not well understood.
Donepezil, an acetylcholinesterase inhibitor (AChEI), is the first approved drug currently available to treat AD, thus has accumulated the most abundant evidence. Donepezil is a widely standard treatment for the entire spectrum of AD. A recent systematic review of donepezil shows small efficacy of donepezil for AD in cognitive function, global clinical states rated by a clinician, and activities of daily living, compared to placebo [3]. Other AChEIs, such as rivastigmine and galantamine, were approved for mild and moderate stages, while the N-methyl-D-aspartate receptor antagonist, memantine, was for moderate to severe stages. In June 2021, a new drug, aducanumab, was granted by the US Food and Drug Administration expedited approval. However, none of these therapeutic drugs are expected to cure AD [2], and there has been no breakthrough medication for AD for the past two decades.
The efficacy of donepezil may be different for different types of patients. Personalized medicine (also known as “stratified” or “precision medicine”) aims to find the best treatment for each patient, given the patient’s individual characteristics [4]. This approach may lead to better patient outcomes. For example, personalized medicine can target providing beneficial interventions to treatment-sensitive patients and avoid specific treatments to treatment-resistant or harm-sensitive patients. To predict outcomes at the patient level, identification of both prognostic factors (i.e., characteristics that predict an outcome independent of the treatment) and effect modifiers (i.e., characteristics that predict differential response to alternative treatments) is needed. A previous observational study of patients with AD prescribed AChEI treatments showed that younger age, lower instrumental activities of daily living (IADLs), concomitant use of antipsychotic drugs, non-steroidal anti-inflammatory drugs, acetylsalicylic acid, solitary living, higher education, and lower mean dose of AChEIs may decrease the response to AChEIs for moderate AD [5]. These characteristics may be potential prognostic factors and/or effect modifiers. Previous randomized controlled trials (RCTs) have also identified potential effect modifiers. For example, one RCT did not find evidence of an effect of donepezil on cognitive impairment in people with AD and comorbid depression [6]. Another randomized controlled pilot trial indicated an association between olfactory deficits and the better efficacy of donepezil on cognitive function for those with depression and cognitive impairment [7].
Although there are several models available for predicting general dementia risk according to individual patient characteristics [8, 9], there are few studies on the prognosis of AD, and none have predicted the relative treatment effect. A recent study developed a prediction model for the prognosis of cognitive function in AD using the data of The Alzheimer’s Disease Prediction of Longitudinal Evolution (TADPOLE). However, except for age and education, the covariates used are not readily available in usual clinical practice (e.g., neuroimaging, biomarkers, and genetic factors) [10]. In addition, prediction models based on observational studies are not able to accurately predict the relative treatment effect compared to no treatment because of existing confounders. Although the RCT is a strong study design to estimate treatment effects with reduced bias due to confounding factors, RCTs are commonly powered to detect average treatment effects. This means that a single RCT will usually lack the statistical power needed to accurately estimate personalized treatment effects. Individual participant data meta-analysis (IPD-MA) of several RCTs can both increase the power to identify effect modifiers and predict relative treatment effects simultaneously by combining information from multiple patient-level datasets [11]. However, in the previous IPD-MA of donepezil for AD, only the mean treatment effect size was estimated, and patient-level treatment effects were not estimated [12]. Another study analyzed IPD from several double-masked RCTs to predict factors associated with rapid or slow cognitive decline, but the effect modification in the treatment of AD by donepezil was not examined [13].
The aim of the current study is therefore dual. We aimed to utilize easily accessible patient-level characteristics 1) to build a prediction model for the placebo response, i.e., to allow us to predict the natural course of the disease progression of AD, and 2) to estimate relative treatment effects of donepezil at an individual patient level. Thus, this study aims to provide tools for everyday clinical practice, by mapping the patient-specific natural disease progression and quantifying the expected benefit from donepezil at the patient level. Findings may guide both individual personalized treatment and future development of AD medications.
METHODS
This systematic review has been registered in PROSPERO (registry ID: CRD42019149573). We followed the PRISMA-IPD statement [14] and the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) statement [15].
Eligibility criteria and search strategy
The eligibility criteria for the current IPD-MA were as follows: double-masked RCT; treatment of AD diagnosed according to standard international operationalized diagnostic criteria (e.g., NINCDS-ADRDA criteria); oral donepezil as monotherapy for 24 weeks or longer, delivered daily in dose ranging between 5 and 23 mg as licensed in the USA, EU, or Japan; control condition was pill placebo; cognitive function was assessed by a validated psychometric test (e.g., ADAS-cog); the studies were conducted by Eisai, the developer of donepezil. We had planned to include multi-arm RCTs so long as donepezil and placebo were compared and crossover trials in which the data of the first phase were available, but we found no such studies meeting the eligibility criteria. We set no limitation of language and publication year. We searched articles published by August 9th, 2021, in the Cochrane CENTRAL, Medline, and WHO ICTRP. The detail of the search terms is described in the appendix (Supplementary Table 11). Two independent researchers (KY and YL) identified the eligible studies.
Data collection
We requested the IPD including the pre-specified variables (see below, outcomes and candidate covariates) and study protocols of the identified studies through ClinicalStudyDataRequest.com (https://www.clinicalstudydatarequest.com). We checked the obtained data by comparing the summary statistics in the publications of each study.
Risk of bias of individual studies
Two independent raters (KY and YL) assessed the risk of bias with regard to the primary outcome (cognitive function at 24 weeks, see below) for each study with a revised Cochrane risk-of-bias tool for randomized trials (RoB2) where the following domains were assessed as high risk, some concerns or low risk: randomization process, deviations from the intended interventions, missing outcome data, measurement of the outcome and selection of the reported result [16]. Any disagreement was resolved by discussion and, when necessary, consultation with the other review team members.
Data availability bias of individual studies
We assessed data availability bias by comparing the combined standardized mean difference (SMD) in the change of the primary outcome (cognitive function, see below) within 24 weeks between the analyzed studies and the other eligible studies of which we could not obtain the IPD. We used the software Review Manager version 5.4.1 for the calculation of the combined SMD.
Outcomes
For all the outcomes, we used measurements at 24 weeks after the initiation of the treatment.
Primary outcome
The primary outcome was cognitive function, as measured with the total score of the 11-item Alzheimer’s Disease Assessment Scale-cognitive subscale (ADAS-cog) [17, 18], the Severe Impairment Battery (SIB) [19, 20], or the Mini-Mental State Examination (MMSE) [21]. When only SIB or MMSE was assessed, we transformed the SIB (preferred) or the MMSE total scores to the ADAS-cog total scores according to the conversion table based on an equipercentile linking study of the three cognitive scales [22]. ADAS-cog is the most widely used cognitive scale in clinical trials for dementia. It consists of 11 items: word recall, word recognition, constructional praxis, orientation, naming objects and fingers, commands, ideational praxis, remembering test instruction, spoken language, word-finding, comprehension. The ADAS-cog total score ranges from 0 to 70 with the higher score representing more impaired cognitive function.
Secondary outcomes
We had pre-specified two secondary outcomes. One was the global assessment as measured using the Clinician’s Interview-Based Impression of Change Plus Caregiver Input (CIBIC-Plus) [18, 23] or the Clinical Dementia Rating Sum of Boxes (CDR-SB) [24]. CDR-SB is the sum of the component scores in CDR, ranging from 0 to 18 with the higher score representing the more severe dementia [25]. When CIBIC-Plus was not administered but the data for CDR-SB scores were available, we transformed the change of CDR-SB scores from baseline to the CIBIC-Plus score according to the conversion table based on an equipercentile linking study for global assessment scales [26]. CIBIC-Plus is a standard global change rating scale for global function in the clinical trial for dementia. The score is derived from a semi-structured interview with patients and their caregivers. CIBIC-Plus score ranges from 1 to 7 with 1 corresponding to “markedly improved” and 7 corresponding to “markedly worse”.
Another secondary outcome was all-cause dropouts, which was assessed by withdrawals from the trial in the 24-week treatment period. We adopted this outcome as a surrogate outcome for the acceptability of the treatment.
Candidates for prognostic factors and effect modifiers
We collected the data of the following characteristics as pre-specified potential prognostic and/or effect modifiers based on the literature and clinical expertise:
Regarding baseline severity, when CDR-SB was not administered at baseline but CIBIS-Plus (Clinician’s Interview-Based Impression of Severity Plus Caregiver Input) was available, CIBIS-Plus scores were transformed to CDR-SB according to the conversion table based on an equipercentile linking study for global assessment scales [26].
Statistical analysis
Overview
For each outcome, we first developed a meta-analytical prediction model using only data from trial participants randomized to placebo. Next, we conducted an IPD-MA to estimate the relative treatment effects of donepezil over placebo as a function of patient characteristics. These two models combined can provide information about the expected natural progression of the disease (i.e., in the placebo arm), as well as the expected benefit due to drug therapy for each patient. We used this two-fold modelling strategy to best leverage the evidence provided by the studies to answer our research questions, without breaking randomization in the data.
Handling of missing data
Patients with missing covariates were excluded from the analyses because there were very few instances (15 patients, 0.5%). For partial missing in the items of ADAS-cog, SIB, MMSE, and CDR-SB, we used the ipsative mean imputation method assuming the average score of the remaining items when 20% or less of the data of the items were missing [30]. For the prediction model, we excluded patients with missing outcomes since we could not use them for assessing model performance (see below). Conversely, for the IPD-MA model we imputed missing outcomes assuming that they were missing at random. For imputations, we used multilevel joint modeling multiple imputation, where between-study heterogeneity was modeled by random effects [31]. The imputation model included all the predictors, treatment indicator, treatment-covariate interactions, and outcome measurements prior to week 24 (weeks 4, 6, 8, 12, 16, 18, and 20).
Prediction model for placebo response
We built a set of competing prediction models for each outcome of our interest using only data from patients on placebo. We explored the following competing modeling strategies: simple linear (logistic) regression; frequentist and Bayesian (generalized) linear mixed-effects model with random effects placed on the study intercept; ridge regression; random forest; gradient boosting machines; and support vector machines. We selected the final model for each outcome after evaluating each models’ performance using an internal-external cross-validation method (i.e., leave-one-study-out cross-validation) [32]. In this approach, each study was removed from the dataset, and the model was developed in the remaining studies. Then the model was evaluated in the left-out study. Finally, we cycled through all studies. As for measures of predictive performance, for continuous outcomes, we used the mean squared error (MSE) and coefficient of determination (R-squared) for observed versus predicted outcomes [32]. For binary outcome (i.e., all-cause dropouts), we used Area Under the Receiver Operating Characteristic Curve as a discrimination metric. After selecting the final models, we refitted using the whole dataset (i.e., all placebo patients from all studies).
Estimation of relative treatment effects (difference between outcomes with donepezil and placebo) through IPD-MA
For each outcome, we fitted a one-stage random effects IPD-MA model in the whole dataset [33]. The models included all aforementioned predictors as prognostic factors as well as treatment-covariate interactions. We used a linear mixed-effects model for the continuous outcomes (ADAS-cog and CIBIC-Plus) and a generalized linear mixed-effects model for the binary outcome (all-cause dropouts). We used informative prior distributions for the heterogeneity of the treatment effects (log odds ratios) in binary outcomes as provided in [34], for mental health indicator outcomes. Following recent recommendations [35], we incorporated shrinkage on all effect modifiers through a Bayesian LASSO. All covariates were standardized prior to fitting the models. After fitting the models, we reported the posterior estimates after reverting them to the original scale of the predictors, so that results were interpretable. Finally, using the developed models we estimated patient-specific treatment effects for all included participants as the sum of effect modification and average treatment effect [35]. We then generated histograms, to allow a visual inspection of the treatment effect heterogeneity in the included population. We assessed heterogeneity of the treatment effect among the included studies with τ2, which represents the estimated between-study variance of underlying true effects across studies.
Implementation details
We used the statistical software R version 3.4.3 for all the analyses. We fit all Bayesian models using the R package rjags 4.1.0 [36]. For multiple imputation, we used the R package jomo 2.7.1. The code used to perform the analyses is available at https://github.com/MikeJSeo/phd/tree/master/donepezil
When fitting both the prognostic as well as the IPD-MA models, we used five imputed datasets and ran three chains of 10,000 iterations each, with 1000 burn-in. We assessed convergence using the Gelman-Rubin diagnostics [37]. For all models, we used a vague prior distribution for the precision of continuous outcomes, i.e., Γ (0.001, 0.001). For regression coefficients, we used a Normal (0, σ2 = 1000) distribution. When applying Bayesian LASSO, a vague prior distribution was placed on variance parameter for Laplace prior, i.e., U (0, 5).
RESULTS
Included studies
The initial literature search identified 1,219 published articles and trial registries that were screened as potential candidates meeting the eligibility criteria (Supplementary Figure 1). Seventy full texts were checked for eligibility, 13 RCTs (n = 4,003) of which were deemed relevant for the current study. Of these, we obtained the IPD of 3,156 participants (1,838 with donepezil, 1,318 with placebo) from eight studies through ClinicalStudyDataRequest.com [38–45]. In one study [41], there were no data provided for the 23 patients who discontinued the trial before the trial period. The reasons for not providing data for the remaining five studies were as follows: trials conducted by another pharmaceutical company collaborating with Eisai at that time [46, 47], the different focus of the study aim (i.e., neuroimaging and neuronal markers) [48, 49], and limited to only early-stage AD [50]. We also obtained the protocols of six studies [39, 42–45]. The study duration was 24 weeks in seven studies [38–43, 45] and 54 weeks in one study [44] in which cognitive and global function was assessed also at 24 weeks. We used IPD of all the eight studies for the analyses of cognitive function outcome and global rating assessment outcome, and seven 24-week trials for the analyses of the all-cause dropout outcome.
All studies administered 5 mg/day or 10 mg/day of oral donepezil, usually with a dose-titration period in the intervention arms. Concomitant antipsychotic drug use was not allowed in four studies [38–40, 44] but allowed in the other four studies [41–43, 45]. The former studies included mild to moderately severe AD, while the latter included more severe cases. The SIB total scores in three studies [41, 45] and the MMSE total scores in two studies [43, 44] were transformed to the ADAS-cog total scores. Change of the CDR-SB scores from baseline in three studies [38, 44] was transformed to CIBIC-Plus scores. Regarding covariates, the CIBIS-Plus scores at baseline in three trials [41, 45] were transformed to the CDR-SB scores. The covariates which were available across all the included studies were age, sex, weight, concomitant antipsychotic drug use, concomitant medication other than antipsychotic drugs, ADAS-cog score, and CDR-SB score. Table 1 shows the baseline characteristics for each study.
Study characteristics of the eight included studies at baseline
SD, standard deviation; ADAS-cog, Alzheimer’s Disease Assessment Scale-Cognitive Subscale; CDR-SB, Clinical Dementia Rating Sum of Boxes. Quantitative data are given as mean (SD); categorical data are given as number (%). *There were no data provided for 23 patients who discontinued the trial before the trial period although 325 patients initially enrolled. †In the dataset, age ranged from 60 to 90 years, where less than 60 and over 90 years were recorded as 60 and 90 respectively. ‡In the dataset, weight ranged from 35kg to 115kg where less than 35kg and over 115kg were given as 35kg and 115kg respectively. §ADAS-cog total score was transformed from Severe Impairment Battery (SIB) total score. ||ADAS-cog total score was transformed from Mini-Mental State Examination (MMSE) total score. ¶CDR-SB score was transformed from Clinician’s Interview-Based Impression of Severity plus caregiver input (CIBIS-Plus) score.
Risk of bias of individual studies
Supplementary Table 1 shows the risk of bias assessment of the included studies. All but two trials were rated at low risk of bias for all domains. Two trials [38, 41] were rated at some concerns in risk of bias due to baseline imbalance in dementia severity.
Data availability bias of individual studies
The point estimate for the SMD in cognitive function within 24 weeks in the analyzed eight studies (–0.43 in IPD-MA and also in aggregate data meta-analysis) [38–45] was almost identical to that in the other five eligible studies (–0.42 in aggregate data meta-analysis) [46–50], suggesting no data availability bias.
Primary outcome: Cognitive function measured with ADAS-cog
The proportion of randomized participants with missing outcome data for the ADAS-cog total score at 24 weeks was 24%.
Prediction model for patients in placebo
We started by fitting all competing prediction models using the leave-one-study-out analysis. Results are shown in Supplementary Table 2. The linear mixed-effects model and the Bayesian linear mixed-effects model were the best-performing methods, with an MSE, 60.4 and R-squared of 0.69. Among the two, and in order to be consistent with the IPD-MA model, we selected the Bayesian prediction model as the final model for this outcome. We then refitted the selected model in all placebo patients. Table 2 shows the estimated coefficients of each. Most predictive covariates were baseline measurements for ADAS-cog (coefficient, 0.95, meaning 0.95 ADAS-cog points increase at 24 weeks per one ADAS-cog point increase at baseline; 95% credible interval (CrI), 0.91 to 0.99), baseline CDR-SB (0.36; 95% CrI, 0.16 to 0.55), and age (–0.11; 95% CrI, –0.17 to –0.05).
Estimated parameters of the prediction model for placebo response in Alzheimer’s Disease Assessment Scale-Cognitive Subscale (ADAS-cog) at 24 weeks
ADAS-cog, Alzheimer’s Disease Assessment Scale-Cognitive Subscale; CDR-SB, Clinical Dementia Rating Sum of Boxes.
Relative treatment effects: IPD-MA
Table 3 shows the estimates of each covariate (prognostic factor) and each treatment-by-covariate interaction (effect modifier) in the IPD-MA for the ADAS-cog total score. The average treatment effect of donepezil compared to placebo was –3.15 (95% CrI, –4.20 to –2.14). The most important potential effect modifier was concomitant antipsychotic drug use at baseline (2.00, meaning 2.00 ADAS-cog points increase at 24 weeks when taking donepezil compared to placebo; 95% CrI, –0.02 to 4.26), estimating an average lower treatment effect of donepezil, albeit with some uncertainty. Supplementary Figure 2 provides the distribution of patient-specific treatment effects in ADAS-cog scores among the analyzed participants, representing the estimated relative treatment effect by comparing donepezil to placebo for each participant. The distribution ranged from –5.18 to –0.04 (median, –3.19), indicating that donepezil was beneficial for cognitive function to some extent for the majority of patients. For some patients, donepezil was estimated to offer little benefit or even no benefit at all.
Estimated parameters from the individual participant data meta-analysis model regarding relative treatment effects (donepezil versus placebo) in Alzheimer’s Disease Assessment Scale-Cognitive Subscale (ADAS-cog) at 24 weeks
ADAS-cog, Alzheimer’s Disease Assessment Scale-Cognitive Subscale; CDR-SB, Clinical Dementia Rating Sum of Boxes. Heterogeneity (τ2), 1.07 (95% CrI 0.15 to 2.50).
Secondary outcome: Global function measured with CIBIC-Plus
The proportion of randomized participants with missing outcome data for the CIBIC-Plus score at 24 weeks was 24%.
Prediction model for patients in placebo
We found that all competing prediction models for this outcome performed poorly. The R-squared was negative, implying that the model prediction of the outcome was worse than a simple average of the outcome for each study (Supplementary Table 2). Thus, we deemed that these prediction models were not usable for this outcome, and we did not pursue them further.
Relative treatment effect: IPD-MA
Table 4 shows the estimated coefficients of all model parameters for the IPD-MA model for the CIBIC-Plus score. The average treatment effect of donepezil versus placebo was –0.42 (95% CrI, –0.54 to –0.30). The most important potential effect modifier was concomitant antipsychotic drug use at baseline (0.29; 95% CrI, –0.02 to 0.64), albeit with uncertainty. Supplementary Figure 3 shows the distribution of patient-specific treatment effects in CIBIC-Plus scores among the analyzed participants, ranging between –0.70 and 0.01 (median: –0.39). This means that although the treatment was beneficial for most, for some patients, donepezil did not improve CIBIC-Plus scores.
Estimated parameters from the individual participant data meta-analysis model regarding relative treatment effects (donepezil versus placebo) in Clinician’sInterview-Based Impression of Severity Plus Caregiver Input (CIBIC-Plus) at 24 weeks
ADAS-cog, Alzheimer’s Disease Assessment Scale-Cognitive Subscale; CDR-SB, Clinical Dementia Rating Sum of Boxes. Heterogeneity (τ2), 0.10 (95% CrI 0.00 to 0.27).
Secondary outcome: All-cause dropouts
We excluded one 54-week study [44] from the analysis for all-cause dropouts at 24 weeks because dropout at 24 weeks was not recorded in the study. The proportion of randomized participants with missing outcome data for the all-cause dropouts total score at 24 weeks was 0.2% (7 participants).
Prediction model for patients in placebo
As in CIBIC-Plus, we found that all prediction models performed poorly. For example, the leave-one-study-out gave an Area Under the Receiver Operating Characteristic Curve of only 0.53 (Supplementary Table 2). Similar to the case of CIBIC-Plus, we decided that our prediction models were not useful for this outcome.
Relative treatment effect: IPD-MA
Supplementary Table 8 (standardized covariate results) and Supplementary Table 9 (reverting standardized covariates to original scale) show the coefficient estimates of model parameters for all-cause dropouts IPD-MA. The average treatment effect in an odds ratio was estimated at 1.03 (95% CrI, 0.59 to 1.75). Supplementary Figure 4 shows the distribution of patient-specific treatment effects in all-cause dropouts among the analyzed participants ranged between 0.78 and 1.24 (median: 1.03) in the odds ratio scale. We did not find evidence of strong effect modifications for this outcome, and all estimated treatment effects were very uncertain.
Sensitivity analysis
We were unable to conduct the pre-specified sensitivity analysis because no included studies were rated as high risk in the overall risk of bias assessment (Supplementary Table 1). We compared predictions obtained from the different models (Bayesian linear mixed-effects, frequentist linear mixed-effects, and ridge regression models) and found very good agreement (Supplementary Figures 5 and 6). In addition, we only used the three studies where ADAS-cog was not transformed to fit a linear mixed-effects model and obtain predictions for this outcome and compared them with the predictions obtained from the full dataset. Results were broadly consistent (Supplementary Figure 7). We did not conduct a sensitivity analysis of these three studies to fit an IPD-MA model because patients in these studies had nearly no patients with concomitant antipsychotic drug use. Likewise, we only used the five studies where CIBIC-Plus was not transformed to fit an IPD-MA model. Results were consistent with the analysis obtained from the full dataset (Supplementary Table 10).
Interactive web application
To facilitate the use of our results in clinical practice, we developed an interactive web application (https://cinema.ispm.unibe.ch/shinies/donepezil/). This allows users to make predictions of absolute outcomes and patient-specific treatment effect for different combinations of baseline characteristics. More specifically, the application demonstrates the prediction for the ADAS-cog in placebo and provides an estimate of the relative treatment effect of donepezil over placebo. It also estimates the relative treatment effect in terms of CIBIC-Plus. We only included the prediction model for the ADAS-cog outcome because the prediction models for CIBIC-Plus and dropout performed poorly. In addition, we did not include the estimated treatment effects for dropout, since these were estimated with large uncertainty. The default baseline characteristics are set to the median covariate values of the patients. Figure 1 further illustrates three hypothetical cases, for a 75-year-old woman (Fig. 1A), for a woman taking an antipsychotic drug (Fig. 1B), and for a younger man with milder dementia (Fig. 1C).

Interactive web application for individual prediction and treatment effect estimation of cognitive function and global function severity. A) For a typical patient: 75-year-old woman with baseline ADAS-cog of 45.0 and baseline CDR-SB of 11 without antipsychotic drug use at baseline. B) For a patient with antipsychotic use at baseline: 75-year-old woman with baseline ADAS-cog of 45.0 and baseline CDR-SB of 11 with antipsychotic drug use at baseline. C) For a younger male patient with milder dementia: 65-year-old man with baseline ADAS-cog of 25.0 and baseline CDR-SB of 7 without antipsychotic drug use at baseline. In examples A through C, other factors are set to Weight, 62.0 kg, Use of any medication other than antipsychotics at baseline = Yes.
DISCUSSION
This study analyzed the IPD from more than 3,000 patients with AD who participated in double-masked RCTs comparing donepezil and placebo. For our primary outcome, cognitive function measured in ADAS-cog, we first developed a Bayesian model to predict placebo response, and then we estimated patient-level treatment effects of donepezil using an IPD-MA. Our prediction model for placebo response identified that baseline cognitive function, global functioning, and age are prognostic factors for cognitive function. Our IPD-MA results suggested that for most patients, donepezil is expected to yield some benefit in cognitive and global functioning, as compared to placebo. Conversely, for a minority of patients, our models suggested no benefit from donepezil. While we could not identify definitive evidence for heterogeneous effects of donepezil for AD in all outcomes, our results suggested that concomitant antipsychotic drug use at treatment initiation may be associated with a reduced effect of donepezil for cognitive and global function.
Prognostic factors associated with worse cognitive function after 24 weeks included more severe baseline cognitive and global function, and younger age. These findings were consistent with a previous study where the outcome was the change in cognitive function and dichotomized [13]. Our result that younger age for the same dementia severity predicted worse cognitive function was concordant with other previous findings [5, 51].
Regarding relative treatment effects of donepezil on cognitive function, although the magnitude of the clinically meaningful difference between the treatment arms has not been defined, the average between-group difference of ADAS-cog (–3.15 point in our analysis) may be clinically meaningful when accounting for the minimal clinically important change in the ADAS-cog total score is approximately three [52]. The magnitude of the treatment effect in our analysis was a little larger compared to that reported in the most recent Cochrane review (–2.02 in 5 mg and –2.81 in 10 mg of donepezil) [3] and a previous IPD-MA (–2.0 in 5 mg and –3.1 in 10 mg of donepezil) [12]. The differences may be due to differences in the included studies and the statistical methods used (e.g., handling of missing data). Cognitive function in five out of eight studies was not originally measured with ADAS-cog but transformed from SIB (three studies) or MMSE (two studies) to ADAS-cog using equipercentile linking [22]. We also used joint modeling multiple imputation for missing outcome data at 24 weeks in the same way across studies.
Potential effect modification by antipsychotic drug use, which was suggested in our analyses, is important because antipsychotic drugs are often prescribed off-label in clinical settings to deal with behavioral and neuropsychiatric symptoms such as agitation/aggression, delusion, and hallucination, which are frequently occurring in people with AD [53–55]. However, no antipsychotic drugs have been approved for the treatment of behavioral and psychological symptoms secondary to AD by the US Food and Drug Administration because of their serious side effects such as stroke and death [56]. Our findings of a smaller treatment effect on cognitive function for patients with concomitant antipsychotic drug use was in line with the secondary analysis of one RCT (the Clinical Antipsychotic Trials of Intervention Effectiveness–Alzheimer’s Disease study: CATIE-AD study) [57]. In this study, in which 64% of patients were taking AChEIs, atypical antipsychotic drugs were associated with greater deteriorated cognitive function compared to placebo. Another mixed-effects model analysis in an observational study of patients with AD prescribed AChEI treatment also reported that concomitant antipsychotic drug use predicted negative treatment response for AD with a 1.99 point lower MMSE total score [5]. A detailed neurobiological mechanism has not been identified, but the potential antipsychotic drug use effect modification may be because of the drug interaction between antipsychotics and AChEIs. This could be explained by the opposite influences of cholinergic and anticholinergic drug effects [58]. We should also interpret the results carefully with the consideration of confounding variables. For example, a decreased effect of donepezil may not be due to antipsychotic drug use itself, but rather due to concurrent behavioral and neuropsychiatric symptoms of AD. Previous studies showed the association between those symptoms and rapid cognitive decline in people with AD [55, 59], but potential effect modification by such symptoms on donepezil for AD is unclear. Future research is necessary to elucidate whether a potentially reduced effect of donepezil for AD was due to antipsychotic drug use itself or due to some underlying confounding factor.
Regarding relative treatment effects of donepezil on all-cause dropouts, one of our secondary outcomes was a surrogate outcome for the acceptability of the treatment. We could not identify evidence of treatment effect modification, which were estimated with large uncertainty. However, donepezil should be used for AD after due consideration of both risks and benefits to maximize the treatment effect because some side effects have been reported more frequently in patients taking donepezil compared to those taking placebo, although most of them are mild, such as nausea, vomiting, and diarrhea commonly [3].
Our study has limitations of note. First, we could not obtain the IPD from all the studies meeting the eligibility criteria but analyzed the IPD from eight (total n = 3,156) out of 13 eligible studies (total n = 4,003) in the comprehensive literature search. However, data availability bias was deemed small, as suggested by the similar SMD in cognitive function between the analyzed studies and the other eligible studies. Second, we included only studies conducted by Eisai co. Ltd, the manufacturer of donepezil. However, a previous study found no evidence suggesting a difference in the treatment effects of donepezil between pharmaceutical-company-sponsored studies and others [60]. Third, although we listed several potential prognostic factors and effect modifiers a priori, only a few variables were available for our analyses. This limitation is often seen in meta-analysis in practice, as studies usually collect different sets of covariates. Future research should examine the impact of those factors that we could not include on the prognosis of AD and the treatment effect of donepezil such as APOE ɛ4 allele. We could not build prediction models for our secondary outcomes (global function measured with CIBIC-Plus and all-cause dropouts) because of poor model performance. It may be because we could not include important predictors in the models. Fourth, we could not perform the analysis considering the specific type of concomitant antipsychotic or non-antipsychotic drugs because the dataset did not provide relevant information. Any medication other than antipsychotic drugs may have included drugs for physical or psychiatric symptoms. Future research is needed to elucidate which medication or comorbidities influence the treatment effect of donepezil for AD. Fifth, the condition of concomitant medication was different across the included studies. In particular, antipsychotic drugs were mainly used in the studies targeting patients with more severe AD. Future research should confirm the role of concomitant antipsychotic drug use as a potential effect modifier, especially in relation to AD severity. Finally, our 24-week cognitive outcome may be comparatively short for the chronic progression of AD. For example, the European Alzheimer’s disease consortium recommended an 18-month follow-up period for disease-modifying trials [61]. Future research is necessary to examine longer outcomes.
Notwithstanding the limitations, our study has several important strengths. First, this study analyzed IPD from over 3,000 patients with AD who participated in several double-masked RCTs comparing donepezil and placebo. Jointly analyzing the data from all these studies was achieved by linking three cognitive function scales and two global function scales through equipercentile linking. Furthermore, IPD-MA of double-masked RCTs allowed us to estimate relative treatment effect estimates without compromising randomization. Second, since this is an analysis based on IPD instead of aggregate-level data, we applied the same statistical method for all studies in the sample, including imputation for missing data. Moreover, the proportion of missing covariates was small. Third, we used an internal-external cross-validation method, to assess the generalizability of our findings in new settings. Following this procedure, we decided not to present prediction models for the secondary outcomes, but we found that the prediction model for the primary outcome had acceptable performance. Additionally, we developed an interactive web application (https://cinema.ispm.unibe.ch/shinies/donepezil/) to quantitatively demonstrate the prediction for cognitive function in placebo after 24 weeks. This accounts for the relative treatment effect of donepezil over placebo, based on individual patient characteristics, which would be useful in clinical settings for patients and clinicians to discuss their treatment options.
Conclusions
Our analyses of individual participant data from eight RCTs of donepezil for AD suggests that donepezil is beneficial for cognitive and global function for most patients with AD. Importantly, the efficacy of the drug may be different for different patient characteristics. Concomitant use of antipsychotic drugs may be associated with reduced efficacy of donepezil for AD in both cognitive and global function. Whether this reduction is due to the antipsychotic drugs or due to some confounding factor associated with taking antipsychotics (e.g., agitation/aggression) remains to be seen. Further studies with larger sample sizes, collecting more patient covariates such as APOE ɛ4 allele, with longer treatment duration, are needed to predict more precisely the natural disease course and the relative treatment effects of donepezil at the patient-level.
Footnotes
ACKNOWLEDGMENTS
We acknowledge Eisai Co. Ltd for providing us with the study data. Eisai Co. Ltd did not provide study design or critical input for the study. We acknowledge
for hosting the study data.
This study was supported in part by JSPS Grant-in-Aid for Scientific Research (Grant Number 22K19688) to Furukawa.
Cipriani is supported by the National Institute for Health Research (NIHR) Oxford Cognitive Health Clinical Research Facility, by an NIHR Research Professorship (grant RP-2017-08-ST2-006), by the NIHR Oxford and Thames Valley Applied Research Collaboration and by the NIHR Oxford Health Biomedical Research Centre (grant BRC-1215-20005). The views expressed are those of the authors and not necessarily those of the UK National Health Service, the NIHR, or the UK Department of Health.
Efthimiou is supported by project grant No.180083 from the Swiss National Science Foundation (SNSF).
This study did not receive any other specific grant from funding agencies in the public, commercial, or not-for-profit sectors.
