Abstract
Objective:
To guide clinicians in selecting the “next line” selective serotonin reuptake inhibitor (SSRI) for adolescents with treatment-resistant major depressive disorder, we sought to compare response rates among SSRIs in the Treatment of SSRI-Resistant Depression in Adolescents (TORDIA) study and to jointly model tolerability and efficacy for the specific SSRI comparisons.
Methods:
Efficacy and tolerability data for paroxetine, citalopram, and fluoxetine were extracted from the TORDIA study. Using a joint bivariate normal likelihood for response and tolerability (based on the maximum implied variance from the 95% credible intervals previously reported for the three SSRIs), a Monte Carlo pseudorandom sample (100,000 draws) was obtained, from which credible intervals, means, posterior tail probabilities, etc. were determined. Joint null hypotheses of no difference in efficacy and tolerability were then evaluated with regard to superiority of each SSRI over the others.
Results:
No significant differences in response were observed for citalopram compared with fluoxetine (p = 0.247) or for fluoxetine compared with paroxetine (p = 0.110), although citalopram trended toward being superior to paroxetine (mean difference: 0.2, p = 0.055). For efficacy–tolerability models, citalopram and fluoxetine were superior to paroxetine (p = 0.029 and p = 0.022, respectively) but did not differ between each other (p = 0.146).
Conclusions:
Joint efficacy–tolerability models suggest that citalopram and fluoxetine were statistically significantly superior to paroxetine while citalopram trended toward superiority over paroxetine in the efficacy model. These findings provide a more granular and practical evidence base for clinicians faced with treatment sequencing decisions in adolescents with SSRI-resistant depression.
Introduction
Two in five adolescents with major depressive disorder (MDD) fail to respond to initial treatment with a selective serotonin reuptake inhibitor (SSRI) (Emslie et al. 2002, 2009). In fact, current treatments produce remission in only 30% of adolescents (March et al. 2004, 2007; Curry et al. 2011). Moreover, adolescents with persistent depressive symptoms are more likely to be psychiatrically hospitalized and to attempt suicide; they are more likely to have impaired peer and family relationships, lower academic achievement, and are more likely to develop additional psychiatric morbidity. However, we have few studies to inform “next step” interventions for youth with SSRI-resistant MDD. The largest trial comparing antidepressant switching, the Treatment of SSRI-Resistant Depression in Adolescents (TORDIA) Study, randomized adolescents aged 12–17 years (n = 334) to four evidence-based treatments for 12 weeks (March et al. 2004). Patients were randomized to switch to (1) a different SSRI (paroxetine, citalopram, or fluoxetine); (2) a different SSRI + cognitive behavioral therapy (CBT); (3) venlafaxine; or (4) venlafaxine + CBT. Improvement was assessed with the Clinical Global Impressions-Improvement (CGI-I) scale (Guy 1976). CBT + medication switch (either to another SSRI or venlafaxine) was associated with a marked increase in response compared with simply changing medication, and no differences were detected between switching to venlafaxine relative to another SSRI. Longer-term follow-up of the TORDIA patients suggests that by 72 weeks >60% of youth reached remission and that the initial TORDIA treatment assignment did not predict remission rate or time to remission. Interestingly, patients who were switched to a second SSRI (as opposed to venlafaxine) experienced a more rapid improvement in depressive symptoms and suicidal ideation than those assigned to venlafaxine (Vitiello et al. 2011).
In TORDIA, post hoc analyses of response rates for the three SSRIs suggested no differences among the SSRIs (paroxetine, 19/50 [38.0%; 95% confidence interval (CI), 25%–52%]; fluoxetine, 41/84 [48.8%; 95% CI, 38%–60%]; citalopram, 19/34 [55.9%; 95% CI, 39%–73%]; χ 2 = 2.81, p = 0.25). However, these frequentist estimates may fail to detect group differences and suffer limitations with regard to small samples and distributional assumptions of symmetry. Over the past decade, advances in Bayesian modeling have facilitated analyses of efficacy and tolerability of psychopharmacologic interventions in youth (Cohen et al. 2012) and the impact of clinical characteristics on antidepressant treatment response (Fountoulakis et al. 2013).
Previously, we employed Bayesian models to evaluate placebo response in pediatric anxiety disorders (Strawn et al. 2017) and to analyze abandoned clinical trials (Strawn et al. 2018). This approach provides clinically significant insight into the efficacy and tolerability of treatments and, compared with traditional statistical approaches, requires fewer restrictive assumptions regarding the relative variance of treatment groups. Furthermore, other restrictions inherent to the data can be easily incorporated (e.g., bounds above 0).
In the decade since the TORDIA trial was completed, there has been increased attention given to the heterogeneity of SSRI treatment response in internalizing disorders and to SSRI-specific differences in tolerability (Dobson et al. 2019). In this regard, antidepressant response rates in adolescents with MDD vary substantially (March et al. 2004; Emslie et al. 2006, 2009), although contemporary work suggests that fluoxetine produces the greatest magnitude of antidepressant effect compared with placebo (Cipriani et al. 2016). In addition, pediatric patients who are treated with tricyclic antidepressants, duloxetine, and venlafaxine have higher rates of adverse effects than placebo (Cipriani et al. 2016). Taken together, these data underscore the need for evidence-based approaches for treatment-resistant depression in adolescents, particularly given that at least 50% of adolescents receiving antidepressant in acute or continuation treatment are nonadherent (Fontanella et al. 2011).
With these considerations in mind, we sought to comparatively evaluate SSRI response rates in the TORDIA study and then to jointly model tolerability and efficacy for the specific SSRI comparisons to guide clinicians in selecting the “next line” SSRI for adolescents with MDD who have failed to respond to an initial course of SSRI treatment of adequate dose and duration. Given previous randomized controlled trials of paroxetine in children and adolescents with MDD (Emslie et al. 2006), we hypothesized that response rates would be lower in paroxetine-treated patients (compared with those youth receiving fluoxetine or citalopram). In addition, prior reports highlight tolerability concerns in pediatric patients receiving paroxetine with 30% of patients withdrawing from one prior trial (Emslie et al. 2006). In the context of these concerns, we hypothesized that tolerability–response profiles for paroxetine would be inferior to citalopram or fluoxetine. Lastly, given the expansive evidence supporting the salutary effects of psychotherapy in adolescents with depressive disorders, we focused on the medication-related aspects of the TORDIA study.
Methods
Data extraction
We extracted summary response data (CGI-I score ≤2 and a CDRS-R decline ≥50%) from the TORDIA study (Brent et al. 2008) as well as pooled adverse event-related discontinuation data (discontinuation secondary to an adverse event) for paroxetine (κ = 5), citalopram (κ = 2), and fluoxetine (κ = 10) from network meta-analyses of these SSRIs in pediatric MDD to generate the joint distributions described herein. In addition, efficacy and tolerability data for the three SSRIs were extracted from the same network meta-analysis to validate the statistical approach (described in Response modeling below). However, it should be noted that the SSRI tolerability data (discontinuation due to an adverse event) that were extracted from the NMA represent tolerability data in adolescents with MDD, rather than SSRI-resistant MDD.
Response modeling
As response (i.e., efficacy) is inherently linked to tolerability and vice versa, a joint model of both variables is necessary to account for their interaction. Nonetheless, although these factors are related, the relative strength of the relationship is unknown. Therefore, the model specified allows us to examine a range of correlations between efficacy and tolerability, ranging from no dependence to a relatively strong relationship between the two. Moreover, a Bayesian inferential approach was selected given its advantages over the frequentist approach. Specifically, the Bayesian approach allows direct probability statements and comparisons for the unknown parameters, which enables simulation across potential dependencies even in the absence of data that would allow direct estimation of dependencies (i.e., evaluating covariance). In addition, the Bayesian approach obviates the need to adjust for multiple comparisons given the sequential relationship between inference and hypothesis testing (Kruschke 2015).
Tolerability outcomes were assumed to be log-normally distributed while response outcomes were assumed to be normally distributed given the results of large pediatric MDD trials (Emslie et al. 2009). We specified a joint bivariate normal likelihood for response and log tolerability. Sample variances were based on the maximum implied variance from the 95% credible intervals previously reported in youth with MDD for citalopram, paroxetine, and fluoxetine. A Monte Carlo (MC) pseudorandom sample (M = 100,000 draws) from the joint conditional posterior distribution was obtained, and credible intervals, means, posterior tail probabilities, etc. were computed from the MC sample of M values from the posterior of differences in means (Lancaster 2004; Greenberg 2008). We assumed that the observed mean response,
Combining this likelihood with an uninformative prior leads to a joint normal conditional posterior distribution for the means,
with mean vector
The use of the Bayesian approach allows multiple comparisons without Bonferonni correction. In this regard, the posterior distribution of the modeled parameters (e.g., average response rate) represents the complete implication of the data and is unaffected by subsequent testing (Kruschke 2015). This contrasts with the frequentist approach in which there is no separation between inference and testing. Thus, with a frequentist approach, each test necessitates a new estimate from the same data, whereas with the Bayesian inferential approach, the data are used to generate the posterior distribution and then, this distribution is used for hypothesis testing. This eliminates the need for multiple comparison adjustments (Kruschke 2015).
Results
Effectiveness of SSRIs
In terms of efficacy (response), the mean difference for citalopram compared with fluoxetine was 0.067 (p = 0.247), whereas the mean difference between fluoxetine and paroxetine was 0.104 (p = 0.11). Citalopram trended toward being superior to paroxetine in terms of mean difference (0.171, p = 0.055) with a posterior probability ratio of 17.3 to 1 against the null hypothesis. In other words, a patient would be 17 times more likely to respond to citalopram than to paroxetine. Regarding tolerability, discontinuation due to adverse events was extracted from a previously reported network meta-analysis of double-blind placebo-controlled trials of the three antidepressants (fluoxetine vs. citalopram: odds ratio [OR] = 0.69 [CrI: 0.24–3.50]; citalopram vs. paroxetine: OR = 0.93 [0.20–2.77]; and fluoxetine vs. paroxetine: OR = 0.78 [0.21–2.18]) (Cipriani et al. 2016).
Joint efficacy–tolerability comparisons
For efficacy–tolerability models, citalopram was superior to paroxetine regardless of ρ (ρ = 0, p = 0.029; ρ = 0.25, p = 0.041; ρ = 0.5, p = 0.047). Fluoxetine was superior to paroxetine at all values of ρ (ρ = 0, p = 0.022; ρ = 0.25, p = 0.030; ρ = 0.5, p = 0.035). Last, there were no statistically significant differences between fluoxetine and citalopram at any ρ (ρ = 0, p = 0.146; ρ = 0.25, p = 0.175; ρ = 0.5, p = 0.204) (Fig. 1).

Joint probability density for antidepressant-related improvement and tolerability (pooled discontinuation due to adverse events) in adolescents with major depressive disorder. Citalopram
Discussion
In models considering tolerability and efficacy jointly, citalopram and fluoxetine were both superior to paroxetine, whereas in the efficacy only model, citalopram trended toward superiority over paroxetine (p = 0.055). This finding is consistent with several lines of accumulating clinical experience and data. First, in the Food and Drug Administration meta-analysis, paroxetine was associated with greater suicidality than the other antidepressants (Hammad et al. 2006), whereas in another meta-analysis, fluoxetine and citalopram had superior pooled effect sizes than paroxetine (Bridge et al. 2007). Second, clinical trials of paroxetine in pediatric patients suggest that this antidepressant may be poorly tolerated (Keller et al. 2001; Wagner et al. 2004; Emslie et al. 2006). In one of these randomized controlled trials, younger patients had higher dropout rates and it was suggested that, in this trial, paroxetine doses were too high in younger patients (Emslie et al. 2006). Third, paroxetine has greater anticholinergic liability than other SSRIs and this affinity is comparable with desipramine (Nemeroff and Owens 2003). Fourth, paroxetine exhibits nonlinear pharmacokinetics in children and adolescents. Thus, the increase (or decrease) in plasma paroxetine concentration resulting from a dose increase (or decrease) is difficult to predict. Fifth, paroxetine clearance (maximum plasma concentration [Cmax] and exposure (i.e., area under the curve during a 24-hour dosing period [AUC24]) are remarkably variable, a phenomenon that is likely subtended by polymorphic cytochrome CYP2D6 activity (Findling et al. 2006a, 2006b). Sixth, compared with other SSRIs, paroxetine has a greater likelihood of discontinuation symptoms (Rosenbaum et al. 1998).
Although this is the first probabilistic efficacy or efficacy–tolerability comparison of SSRIs in treatment-resistant MDD and yields findings that are consistent with general recommendations, at least with regard to paroxetine use in pediatric patients, there are several important limitations. First, summary data—rather than patient-level data—were used. Because we lack patient-level data, we are unable to examine the impact of age or other patient characteristics on efficacy–tolerability differences. Second, we assumed log normally distributed outcomes within the data set. This assumption is informed by previously reported distributions of tolerability and efficacy in other antidepressant trials in both MDD and anxiety disorders, and from a central limit theorem standpoint, the sample means of moderately large samples are generally approximated by a normal distribution (even when data are not normally distributed). Third, the covariance,
Conclusions
The results from these analyses raise the possibility that for adolescents, fluoxetine and citalopram may have a superior joint efficacy–tolerability profile compared with paroxetine. Given the challenges in the design and execution of clinical trials in children and adolescents with treatment-resistant MDD, this study highlights the importance of Bayesian and alternative approaches to analyzing clinical psychopharmacologic trials. Finally, prospective studies and patient-level examinations of individual factors that may influence differences in SSRI tolerability–efficacy are needed to inform real-world treatment sequencing in clinical practice.
Clinical Significance
In these analyses, fluoxetine and citalopram are superior with regard to the efficacy–tolerability diathesis to paroxetine for adolescents with SSRI-resistant MDD. When considering an SSRI switch in an adolescent with treatment-resistant MDD, clinicians might consider citalopram or fluoxetine in lieu of paroxetine as the “next step.” Finally, although this report focuses on pharmacotherapy in adolescents with SSRI-resistant MDD, clinicians treating youth with depressive disorders should consider the critical role of psychotherapy in increasing the likelihood of remission (March et al. 2004), overall response (Brent et al. 2008; The Treatment for Adolescents With Depression Study (TADS) Team et al. 2009), and maintenance of response (Kennard et al. 2008; Curry et al. 2011).
Footnotes
Acknowledgment
The authors thank the Yung Family Foundation for their generous support of this work.
Disclosures
Dr. Strawn has received research support from Edgemont, Eli Lilly, Shire, Forest Research Institute, Lundbeck, and the National Instituted of Health (NIMH and NIEHS). He receives royalties from Springer Publishing for two texts, and has received material support from Assurex. Dr. Croarkin receives research support from the National Institute of Mental Health, NeuroStar Advanced Therapy, NeoSync, and Genesight/Assurex. Dr. Mills has no potential financial conflicts of interest.
