Background

Research suggests that a majority of randomized clinical trials (RCTs) on medical interventions may not be justified based on established evidence and thus constitute unjustified research. Justified clinical trials may be defined as trials designed around a clear hypothesis for which genuine uncertainty exists, with that uncertainty established through systematic reviews or network meta-analyses (NMA) of the existing evidence [1]. This is of relevance because the estimated cost of each additional piece of evidence in a series of RCTs increases across decades [2, 3]. Optimizing the number of clinical trials to a scientifically justifiable amount is therefore recommended to save resources, reduce the exposure of patients to less effective treatments, and allow for earlier uptake of treatment recommendations in practice [1].

Conditional power of NMA has been introduced as a concept to optimize trial designs, thereby contributing to the reduction of unjustified research [4–6]. Conditional power is the probability that updating existing inconclusive evidence in NMA with additional trial(s) will result in conclusive evidence, given assumptions regarding trial design, anticipated effect sizes, or event probabilities [7, 8]. A key issue when designing an RCT is to determine how large the sample size needs to be in order to achieve a desirable level of power given a predefined significance level α [7]. Further, some interventions may not achieve high levels of power within a single trial considered in isolation. In such situations, two or more RCTs in combination may be appropriate to form a cumulative synthesis of findings from RCTs addressing the same question [5, 6]. This situation may also arise if a direct treatment comparison of interest includes treatments known to be poorly tolerated in patients (e.g., due to known adverse events); in such cases, adding indirect evidence from future trials that include only better tolerated treatments may be more appropriate for the evidence to become conclusive. If a conditional power analysis suggests, for example, at least 80% conditional power (conventionally implying that trial(s) investigating a true effect will correctly reject the null hypothesis [9]), together with a reasonable required sample size, further research may be promising. Conversely, if such an analysis suggests, for example, less than 20% conditional power (conventionally regarded as a futility boundary, with values below it indicating that a trial is likely to be futile under the null hypothesis [10]), it may be recommended to refrain from further RCTs on a given intervention in order to save resources.

The present work aimed to estimate conditional power for NMA on antidepressant treatments. The analysis was based on a network of 502 RCTs on the acute treatment of adult major depressive disorder (MDD) conducted between 1979 and 2018, built on the published GRISELDA dataset [11] provided by Cipriani et al. [12]. Together, the network compares 21 antidepressants, considering outcomes such as efficacy in terms of symptom change on the Hamilton Depression Scale (HAMD) [13] and tolerability in terms of the dropout rate due to adverse events (Supplement 1, Fig. S1).

At the time of writing (October 2020), four ongoing RCTs registered on clinicaltrials.gov cover one or more of the aforementioned antidepressants and fit the inclusion criteria of the present dataset:

  • NCT04364997: intervention bupropion (BUP), escitalopram (ESC), mirtazapine (MIR), sertraline (SER), venlafaxine (VEN); planned sample size N = 400; estimated start and completion dates Jun-18 to Dec-22; Beijing Anding Hospital, China [14].

  • NCT03538691: intervention citalopram (CIT), duloxetine (DUL), escitalopram (ESC), fluoxetine (FLO), paroxetine (PAR), sertraline (SER), venlafaxine (VEN) versus placebo (PLA); planned sample size N = 1450; estimated start and completion dates Jul-18 to Sep-22; Otsuka Pharmaceutical Development & Commercialization, Inc. [15].

  • NCT04345471: intervention desvenlafaxine (DES) versus placebo (PLA); planned sample size N = 594; estimated start and completion dates May-20 to Dec-22; Mochida Investigational sites, Japan [16].

  • NCT04422652: intervention desvenlafaxine (DES) versus vortioxetine (VOR); planned sample size N = 600; estimated start and completion dates Aug-20 to Apr-26; H. Lundbeck A/S [17].

For example, one of the most recently introduced antidepressants is vortioxetine (VOR), approved in 2013 by the US Food and Drug Administration (FDA). The existing evidence on VOR comprises 17 RCTs (16 placebo-controlled RCTs, 1 head-to-head RCT) completed between 2007 and 2017 and published between 2012 and 2018 [18–34]. Based on this evidence, VOR has been shown to be more effective (standardized mean difference (SMD) -0.29 [95% CI -0.38 to -0.20]) but less tolerable (odds ratio (OR) 1.48 [95% CI 1.15 to 1.89]) compared to placebo, with the evidence becoming conclusive in 2009 (efficacy) and 2011 (tolerability), respectively. An ongoing phase IV, double-blind RCT (NCT04448431 [35]) started in August 2020 with an estimated completion date in April 2026. This RCT aims to compare the efficacy of VOR versus desvenlafaxine (DES) in 600 MDD patients who have tried one available treatment without achieving full benefit, with the primary outcome being the change on the Montgomery-Åsberg Depression Rating Scale (MADRS) from baseline to week 8. Based on current evidence, the comparison DES:VOR is inconclusive in terms of efficacy (SMD -0.06 [95% CI -0.19 to 0.08]) and tolerability (OR 0.80 [95% CI 0.54 to 1.18]), suggesting a slight yet inconclusive advantage for VOR compared to DES with respect to both outcomes. To estimate whether this advantage for VOR may turn into conclusive evidence, a conditional power analysis may support the decision as to whether the ongoing research on this comparison is promising or futile. This example shows how the present work may inform decision-makers and researchers regarding the expected clinical relevance of ongoing and future antidepressant RCTs that aim to challenge antidepressant treatment recommendations.

Methods

Data sources

A total of 535 RCTs (445 published trials, 90 unpublished trials) were identified on the acute treatment of MDD conducted between 1979 and 2018. Of these, 522 trials constituted the GRISELDA dataset [11] provided by Cipriani et al. [12]; an additional 13 trials [34, 36–47] were identified through our own literature search. Together, the network compares placebo (PLA) and 21 antidepressants: agomelatine (AGO), amitriptyline (AMI), bupropion (BUP), citalopram (CIT), clomipramine (CLO), desvenlafaxine (DES), duloxetine (DUL), escitalopram (ESC), fluoxetine (FLO), fluvoxamine (FLV), levomilnacipran (LEV), milnacipran (MIL), mirtazapine (MIR), nefazodone (NEF), paroxetine (PAR), reboxetine (REB), sertraline (SER), trazodone (TRA), venlafaxine (VEN), vilazodone (VIL), and vortioxetine (VOR). The supplementary appendix provides a PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) flow-chart [48] detailing the study selection process (Supplement 1, Fig. S1a, Tab. S1) and a complete list of the included studies (Supplement 1, Tab. S4).

Two outcomes were considered. The continuous outcome efficacy, defined as the symptom change on the Hamilton Depression Scale (HAMD) [13] and estimated on the standardized mean difference (SMD) scale, was available in 438 trials (99 direct comparisons) with a total sample size of N = 109'254 (median sample size N = 249 [range N = 7 - 821]). The binary outcome tolerability, defined as the dropout rate due to adverse events and estimated on the odds ratio (OR) scale, was available in 438 trials (99 direct comparisons) with a total sample size of N = 105'616 (median sample size N = 241 [range N = 3 - 657]). The final dataset, containing information on at least one of the two outcomes, consisted of 502 trials. Other commonly used outcomes related to the effectiveness of antidepressants, such as response and remission rates, were not considered due to well-known methodological difficulties arising from dichotomization, such as reduced statistical power and inflated effect sizes [49–52].

Study year was defined as the year of study completion, the year of publication, or the year of FDA drug approval, in this order of preference where available; preference was given to the year of completion because unpublished trials, by definition, have no year of publication [53]. The resulting study year range was 1977-2017.
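As an illustration of this precedence rule, a minimal R sketch is given below; the column names and values are hypothetical and do not reproduce the dataset.

```r
# Hypothetical trial records; column names and values are illustrative only
trials <- data.frame(
  year_completion  = c(2005, NA,   NA),
  year_publication = c(2007, 2010, NA),
  year_fda         = c(NA,   NA,   1998)
)

# Apply the stated precedence: completion > publication > FDA approval
trials$study_year <- ifelse(!is.na(trials$year_completion), trials$year_completion,
                     ifelse(!is.na(trials$year_publication), trials$year_publication,
                            trials$year_fda))

trials$study_year
#> [1] 2005 2010 1998
```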

Conditional power

Conditional power was estimated using the ConditionalPower package provided by Nikolakopoulou et al. [7, 8, 54] in R [55]. Briefly, for a comparison of interest, conditional power in NMA can be described as [7]:

$$ CP = \Phi \left(\frac{-z_{\alpha/2}\sqrt{C} - HM}{\sqrt{H^{N}\,\nu^{N}\,\left(H^{N}\right)^{\prime}}}\right) + \Phi \left(\frac{-z_{\alpha/2}\sqrt{C} + HM}{\sqrt{H^{N}\,\nu^{N}\,\left(H^{N}\right)^{\prime}}}\right) $$
(1)

where Φ denotes the cumulative standard normal distribution function, C represents the covariance matrix of the NMA (direct and indirect) effect estimates, the vector M contains the NMA (direct and indirect) effect estimates of the old pairwise meta-analyses together with the alternative effect sizes for the comparison of interest, the matrices H and H^N connect the NMA (direct and indirect) effect estimates to the pairwise (direct) effects derived from old and new trials, respectively, and the vector ν^N contains the variances of the pairwise (direct) effect estimates derived from new trials. The reader is referred to Nikolakopoulou et al. [7] for further details.
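As a minimal illustration of Eq. (1), the following R sketch reduces the matrices to scalars for a single comparison updated by one new trial. The function name and all numerical inputs are hypothetical, the variance approximation 4/N for an SMD under 1:1 randomization is a rough rule of thumb, and this is not the interface of the ConditionalPower package.

```r
# Conditional power of updating an existing pairwise summary with one new trial,
# i.e. a scalar special case of Eq. (1). All inputs are illustrative assumptions.
conditional_power <- function(mu_old,   # existing summary effect (e.g. SMD)
                              v_old,    # variance of the existing summary
                              m_new,    # anticipated true effect for the new trial
                              v_new,    # anticipated variance of the new trial estimate
                              alpha = 0.05) {
  w_old <- 1 / v_old
  w_new <- 1 / v_new
  c_upd <- 1 / (w_old + w_new)                       # C: variance of the updated estimate
  e_upd <- (w_old * mu_old + w_new * m_new) * c_upd  # H M: expected updated estimate
  sd_nw <- sqrt(w_new) * c_upd                       # sqrt(H^N v^N (H^N)'): SD from new trial
  z     <- qnorm(1 - alpha / 2)
  pnorm((-z * sqrt(c_upd) - e_upd) / sd_nw) +
    pnorm((-z * sqrt(c_upd) + e_upd) / sd_nw)
}

# Example: inconclusive existing evidence (SMD -0.10, variance 0.01) updated by one new
# trial with anticipated effect -0.10 and variance 4 / N under 1:1 randomization (N = 600).
conditional_power(mu_old = -0.10, v_old = 0.01, m_new = -0.10, v_new = 4 / 600)
```

With no existing evidence (w_old set to zero), this expression collapses to the standard power formula for a single two-arm trial, which may help in interpreting the terms.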

Conditional power was estimated across a range of possible sample sizes (N = 1 - 5000), assuming 1:1 randomization between treatment arms. Results were reported in terms of two conditional power indices quantifying sample sizes (see the sketch following this list):

  • NCP=80%: Sample size at 80% conditional power, which conventionally implies that a trial investigating a true effect will correctly reject the null hypothesis 80% of the time and will report a false negative (i.e., commit a type II error) in the remaining 20% of cases [9].

  • NCP=20%: Sample size at 20% conditional power, which conventionally may be regarded as a futility boundary, with values below it indicating that a trial is likely to be futile under the null hypothesis [10].
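Building on the scalar sketch above, the two indices can be read off a conditional-power curve computed over the range of sample sizes; all inputs again are illustrative assumptions, not network estimates.

```r
# Scan candidate total sample sizes N = 1..5000 and locate the two indices.
n_grid <- 1:5000
cp <- sapply(n_grid, function(n)
  conditional_power(mu_old = -0.10, v_old = 0.01, m_new = -0.10, v_new = 4 / n))

n_cp20 <- n_grid[which(cp >= 0.20)[1]]  # NCP=20%: smallest N reaching the 20% futility boundary
n_cp80 <- n_grid[which(cp >= 0.80)[1]]  # NCP=80%: smallest N reaching 80% conditional power
c(NCP20 = n_cp20, NCP80 = n_cp80)
```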

Three parameters were considered for each outcome of interest:

  • Trial design: The main analysis considered a trial design with a ratio of direct to indirect evidence (r) of r = 1/0, indicating that conditional power for each treatment comparison was assessed by updating the network with one new trial contributing direct evidence regarding the comparison of interest and no new trials contributing indirect evidence. A sensitivity analysis estimated conditional power for two additional ratios, r = 1/1 and r = 1/2. The ratio r = 1/1 indicates that the network was updated with one new trial contributing direct and one new trial contributing indirect evidence regarding the comparison of interest (41 possible combinations per comparison), whereas r = 1/2 indicates one new trial contributing direct and two new trials contributing indirect evidence (820 possible combinations per comparison; see the sketch following this list). Results were reported in terms of the optimal trial design for each comparison, i.e., the design with the smallest NCP=80%.

  • Effect size: The main analysis considered anticipated treatment effects (fxy) set equal to the relative effect estimates (i.e., the relative effects between competing treatments of interest) observed in the network (fxyN). A sensitivity analysis was conducted to estimate conditional power at alternative effect sizes (fxy = 0.01, 0.1, 0.2, 0.3, 0.5, 0.8) in terms of Cohen’s d (small effect d = 0.2, moderate effect d = 0.5, large effect d = 0.8) [56].

  • Event probability: The main analysis considered anticipated event probabilities (pc) set equal to the average event probabilities observed in the entire network (pcN). For the outcome efficacy, anticipated average event probability (pcN = 0.17) was calculated in terms of the proportion of change on the HAMD of at least 4 points (number of trials with change ≥4 points divided by the number of trials with change <4 points) corresponding to Cohen’s d = 0.5 [57]. For the outcome tolerability, anticipated average event probability (pcN = 0.08) was calculated in terms of the proportion of dropouts (total number of dropouts divided by the total sample size in the network) [7]. A sensitivity analysis was conducted to estimate conditional power at alternative event probabilities in terms of small to large event risks (pc = 0.01, 0.1, 0.2, 0.3, 0.5).
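As a rough sketch of how these sensitivity parameters may be varied, the snippet below (reusing the hypothetical conditional_power() function from above, with purely illustrative inputs) shows the number of indirect-evidence combinations per comparison under the assumption of 41 candidate indirect comparisons, and a loop over the alternative anticipated effect sizes.

```r
# Trial design: number of combinations when adding indirect evidence, assuming 41
# candidate indirect comparisons per comparison of interest.
choose(41, 1)   # r = 1/1: 41 combinations
choose(41, 2)   # r = 1/2: 820 combinations

# Effect size: conditional power at the alternative anticipated effect sizes (Cohen's d),
# keeping the illustrative existing evidence and a fixed new-trial size of N = 1000.
f_xy <- c(0.01, 0.1, 0.2, 0.3, 0.5, 0.8)
sapply(f_xy, function(d)
  conditional_power(mu_old = -0.10, v_old = 0.01, m_new = -d, v_new = 4 / 1000))
```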

Conditional power is typically estimated for direct comparisons observed in the network [7]. The antidepressant network, however, contains only 99 direct comparisons out of a total of 231 comparisons. It was therefore considered of clinical interest to include all competing treatment comparisons in the network. For this purpose, dummy connections (with a sample size of 1) were created for treatment comparisons not directly observed in the network and subsequently included in the analysis. The dummy connections did not affect the relative treatment effects, as assessed by the Pearson correlation between the effect estimates from the original and the dummy-augmented networks (efficacy r = 0.999, tolerability r = 0.995) (Supplement 1, Fig. S1d). Between-trial heterogeneity was assumed to be equal to that observed in the original NMA.
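A minimal sketch of this dummy-connection step is given below; the data-frame columns, the treatment pair, and the correlation vectors are purely illustrative and do not reproduce the actual dataset.

```r
# Append a dummy two-arm record (sample size 1 per arm) for a comparison without direct
# evidence, so that the comparison can be included in the conditional power analysis.
network <- data.frame(
  study  = c("trial_001", "trial_002"),
  treat1 = c("FLO", "FLO"),
  treat2 = c("PLA", "SER"),
  n1     = c(150, 120),
  n2     = c(148, 118)
)
dummy <- data.frame(study = "dummy_CLO_VIL", treat1 = "CLO", treat2 = "VIL", n1 = 1, n2 = 1)
network_ext <- rbind(network, dummy)

# Check that the dummies leave the relative treatment effects essentially unchanged,
# e.g. via the Pearson correlation of effect estimates with and without dummies
# (the vectors below are made up for illustration).
effects_orig  <- c(-0.29, -0.15, -0.08)
effects_dummy <- c(-0.29, -0.16, -0.08)
cor(effects_orig, effects_dummy, method = "pearson")
```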

All results reported in the article can be found in the supplementary appendices (Supplement 1 & 2). The data set used in the analysis is provided in comma-separated values (CSV) format (Supplement 3).

Results

Existing evidence

The cumulative evolution of conclusive evidence in the antidepressant network across decades is illustrated in Fig. 1 for the two outcomes efficacy and tolerability. Since 2017, no new conclusive evidence has been observed. As of 2020, the ratio of comparisons with conclusive versus inconclusive evidence was approximately half as large for the outcome efficacy (ratio = 0.41; conclusive N = 67 versus inconclusive N = 164) as for tolerability (ratio = 0.82; conclusive N = 104 versus inconclusive N = 127).

Fig. 1

Evidence across study year. Bar plots illustrating the cumulative sum of comparisons with conclusive versus inconclusive evidence across study year with respect to the two outcomes efficacy and tolerability. The total number of treatment comparisons is 231

Conditional power main analysis

The estimated strength of conditional power across all comparisons with inconclusive evidence is illustrated in Fig. 2, based on the main analysis considering anticipated effect sizes set equal to fxyN and anticipated event probabilities set equal to pcN. The figure further demonstrates how the two conditional power indices quantifying sample sizes were derived, i.e., the sample sizes at 20% and 80% conditional power (NCP=20%, NCP=80%). Across all comparisons with inconclusive evidence, the required sample sizes at 80% conditional power (NCP=80%) were estimated to be approximately twice as large for efficacy (median N = 1586, range N = 894 - 4190) as for tolerability (median N = 791, range N = 521 - 1246). By contrast, sample sizes at the futility boundary of 20% conditional power (NCP=20%) were estimated to be comparable between outcomes (efficacy median N = 250 [range N = 49 - 485], tolerability median N = 198 [range N = 40 - 320]) (Table 1). The relation between the two indices, NCP=20% and NCP=80%, for each individual comparison is detailed in Fig. 3. Finally, the network graphs depicted in Fig. 4 summarize the sample sizes (NCP=80%) needed to achieve sufficient conditional power. To translate these indices to the level of individual antidepressants, the medians of the two indices, NCP=20% and NCP=80%, were computed across all inconclusive comparisons involving each individual antidepressant. The antidepressants with the smallest median sample sizes were CLO, LEV, MIL, NEF, and VIL with respect to both outcomes (Fig. 4). This is plausible, as these antidepressants (or rather the associated comparisons) are the ones for which current direct evidence is sparse. Thus, although the overall strength of the estimated conditional power differed between outcomes, with efficacy being weaker than tolerability, the relative strength of conditional power across individual treatment comparisons was comparable (Pearson r = 0.81). The supplementary appendix provides details on the conditional power for each individual comparison (Supplement 1, Tab. S2 and Supplement 2).
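A sketch of this aggregation step, using a hypothetical long-format table of per-comparison results (values made up for illustration), could look as follows.

```r
# Hypothetical per-comparison results: each row is one inconclusive comparison with its
# estimated sample size at 80% conditional power.
res <- data.frame(
  treat1 = c("CLO", "CLO", "VIL", "NEF"),
  treat2 = c("VIL", "MIL", "LEV", "MIL"),
  n_cp80 = c(950, 1020, 880, 1100)
)

# Median NCP=80% across all inconclusive comparisons involving each antidepressant.
long <- data.frame(drug = c(res$treat1, res$treat2), n_cp80 = rep(res$n_cp80, 2))
aggregate(n_cp80 ~ drug, data = long, FUN = median)
```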

Fig. 2

Conditional power. Box plots illustrating conditional power (CP) across all comparisons with inconclusive evidence as a function of sample size with respect to the two outcomes efficacy and tolerability. Whiskers of the box plots extend to the most extreme data values. Horizontal red dashed lines indicate 20% and 80% conditional power at which sample sizes (NCP=20%, NCP=80%) were estimated. Results are shown based on the main analysis considering a trial design ratio of r = 1/0, anticipated alternative effect sizes equal to the network estimates (fxyN), and anticipated event probabilities equal to the average network event probability (pcN)

Fig. 3

Sample size. Heat map illustrating sample size at 20% (NCP=20%) (lower triangles) versus 80% conditional power (NCP=80%) (upper triangles) for individual comparisons with respect to the two outcomes efficacy and tolerability. The colormap is log-scaled for better visibility. Comparisons with conclusive evidence are marked (white). Results are shown based on the main analysis considering a trial design ratio of r = 1/0, anticipated alternative effect sizes equal to the network estimates (fxyN), and anticipated event probabilities equal to the average network event probability (pcN)

Fig. 4

Network graphs. Network graphs illustrating treatment comparisons with inconclusive evidence with respect to the two outcomes efficacy and tolerability. Circle size is proportional to the actual sample size. Line width is inversely proportional to the sample size at 80% conditional power (NCP=80%), such that thicker connections indicate smaller required sample sizes and thus greater conditional power. Line thickness is log-scaled for better visibility. Results are shown based on the main analysis considering a trial design ratio of r = 1/0, anticipated alternative effect sizes equal to the network estimates (fxyN), and anticipated event probabilities equal to the average network event probability (pcN). See the supplementary appendix for graphs of the original network (Supplement 1, Fig. S2)

Table 1 Conditional power

Conditional power sensitivity analyses

The sensitivity analysis varying the trial design ratio of direct to indirect evidence (r) suggested that adding indirect evidence may considerably increase conditional power and consequently reduce the required sample sizes. Compared to a trial design ratio of r = 1/0, trial design ratios of r = 1/1 and r = 1/2 reduced the median sample sizes (NCP=80%) by median percentage changes of -24% and -35% for efficacy and -7% and -15% for tolerability, respectively (Table 1).

By contrast, the sensitivity analysis assessing varying anticipated effect sizes suggested that the impact of fxy on the strength of conditional power was small. Considering effect sizes larger than the network estimates (fxyN) (e.g., Cohen's d = 0.8, which is arguably unrealistic) would increase sample sizes by up to 5% (efficacy) and 3% (tolerability), whereas smaller effect sizes (e.g., Cohen's d = 0.01) had essentially no impact on sample sizes (0% efficacy, -1% tolerability) (Table 1).

Last, the sensitivity analysis assessing varying event probabilities suggested a relatively larger impact of pc on the strength of conditional power. However, given the average event probabilities observed in the current evidence (efficacy pcN = 0.17, tolerability pcN = 0.08), substantially larger event probabilities can hardly be considered realistic (Table 1). The supplementary appendix provides details on all sensitivity analyses (Supplement 1, Fig. S3, Tab. S3).

Discussion

The recent NMA by Cipriani et al. [12] provided evidence regarding the ongoing debate on the effectiveness of antidepressant treatment. Today, two years after the publication of that NMA, the question arises whether additional RCTs updating the evidence would pay off. Currently ongoing RCTs [14–17] may contribute to answering this question, but final results can only be expected after their estimated completion (completion dates 2022 - 2026). It may therefore be of clinical interest to estimate the probability that the current research will lead to updates in treatment recommendations, or whether it may be considered unjustified.

Overall, the present findings suggest that the probability of achieving new conclusive evidence going beyond the current evidence on antidepressant treatment recommendations is low. Although sufficient conditional power may be obtained for a majority of the evaluated treatment comparisons (Fig. 4), there are substantial limitations in terms of both the required sample sizes and the expected effect sizes.

Considering the median sample size of the four ongoing RCTs (range N = 400 - 1450) [14–17], the sample sizes required according to the present analysis to achieve the conventionally recommended power of at least 80% [9] were estimated to be more than double (tolerability) or even three times (efficacy) that size, and the planned sample sizes may not even exceed the estimated futility boundaries (Table 1). Although sample sizes may be reduced using optimized trial designs including additional indirect evidence, the associated research costs of conducting multiple trials may not pay off.

It should be noted that the present work is limited in its evaluation of optimal trial designs with respect to the relation between direct and indirect evidence. Nikolakopoulou et al. [54] demonstrated how decisions on future trials may be supported by conditional power analyses considering not only 'different ratios of the number of trials' contributing direct versus indirect evidence, as done in the current work, but also 'different ratios of the sample size between trials' assessing direct versus indirect information. An extensive analysis assessing these ratios is feasible in small networks or may be applied to selected treatment comparisons of interest based on a priori hypotheses. The large treatment space of the present network, however, did not allow for such extensive sensitivity analyses for practical reasons, namely the processing time and the exponentially growing number of results. Future research should therefore consider the present findings as an approximation to a more detailed breakdown of the evidence.

Compared to the impact of trial designs on reducing sample sizes, the impact of varying effect sizes or event probabilities may be assumed to be of less practical importance; this is because trial designs can be experimentally modified, whereas effect sizes and event probabilities are inherently limited by the existing evidence on the various treatments. In particular, considering the well-known overall small effect sizes for efficacy of antidepressants in the conclusive treatment comparisons (i.e., drug-placebo differences with a median Cohen's d = 0.3 [57]) and the even smaller effect sizes in the so far inconclusive relative treatment comparisons (median Cohen's d < 0.1 [57]) (Supplement 1, Tab. S2), the clinical relevance of additional trials aiming to challenge current antidepressant treatment recommendations may be low. In other words, it may be questioned whether any additional RCTs on antidepressant treatment can challenge the current treatment recommendations.

Referring to the example in the introduction, the present results may be applied to judge the conditional power of the ongoing RCT (NCT04448431 [35]) aiming to compare the efficacy of VOR versus DES. Although the current evidence may suggest a trend towards an advantage of VOR over DES in terms of both efficacy and tolerability (Supplement 1, Fig. S1), the probability of achieving conclusive evidence at reasonable sample sizes is low. The present analysis suggested required sample sizes to achieve at least 80% conditional power (NCP=80%) of N = 1670 and N = 733 for efficacy and tolerability, respectively (Fig. 3). These estimated sample sizes are considerably larger than the planned sample size of N = 600 [35]. Indeed, the planned sample size of N = 600 corresponds to approximately 56% (efficacy) and 74% (tolerability) conditional power (Supplement 2) and may thus be considered too low to reach new conclusive evidence in an updated NMA.
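For orientation, the conditional power at a planned sample size can be read off an estimated conditional-power curve, for example by linear interpolation; the curve values in the sketch below are made up for illustration and do not reproduce the Supplement 2 results.

```r
# Interpolate conditional power at the planned sample size from an estimated curve
# (x = total sample size, y = conditional power; values are illustrative only).
n_grid   <- c(200, 400, 600, 800, 1000, 1670)
cp_curve <- c(0.15, 0.38, 0.56, 0.66, 0.72, 0.80)

approx(x = n_grid, y = cp_curve, xout = 600)$y  # conditional power at N = 600
```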

The above-mentioned example demonstrates the importance of a priori conditional power analyses if it is the aim of an RCT to challenge current treatment recommendations. Based on the information available for the ongoing RCTs, it is unclear whether a priori conditional power analyses have been performed. The results expected after completion of the ongoing RCTs will show whether a priori conditional power analyses could have contributed to improved trial designs and thus saved resources in terms of clinical trial costs.

It should, however, be made clear that the ongoing RCTs may focus on primary aims other than challenging current antidepressant treatment recommendations. In other words, they may not have been intended to be conditionally powered for possible future updating of NMAs, but may indeed be sufficiently powered as stand-alone trials. As discussed by Salanti and Nikolakopoulou [58], when an NMA is deemed inconclusive and future trials are to be planned, specific recommendations about what sort of trials should be planned are required. Trials can be planned to reduce the risk of bias in particular comparisons, to explain heterogeneity, or to inform outcomes for which evidence is imprecise. When the aim is to include the planned trial in an updated NMA later on, trials are not to be considered as stand-alone trials but may be seen as sequential additions to the existing evidence. The power and findings of individual trials are thus not of primary interest; rather, the conditional power of the NMA once the new trial is added and the resulting summary effect are of importance. Consequently, when an NMA is deemed inconclusive because of imprecision, sample size calculations should be based on the conditional power of an updated NMA.

With this in mind, the present work should not be misunderstood or lead to a possible misuse of conditional power analyses. Weber et al. [59] raised a fundamental question regarding the use of conditional power analyses by asking whether "it is appropriate to gain power for an updated NMA by in- or decreasing the number of planned future trials while manipulating the power of each of the individual planned future trials?" The authors argued that traditional methods of power analysis remain preferable because drug licensing is based on stand-alone RCTs. Regardless of whether one or multiple trials are planned, trials planned using conditional power may require different sample sizes (smaller or larger) than those planned using traditional power analysis aimed at achieving stand-alone conclusiveness. In other words, "individual RCTs should always be designed to satisfy their objectives and stand-alone studies (should not be) substituted by a meta-analysis of trials of inadequate size" [60].

Conclusions

In conclusion, the present analysis may inform decision-makers and researchers in planning future antidepressant trials in MDD. The results suggest that new conclusive evidence leading to potential updates of antidepressant treatment recommendations can hardly be achieved at reasonable trial scales. The main limitations on the use of the presented conditional power analysis arise from the large sample sizes estimated to be required in future trials as well as from the well-known overall small effect sizes of antidepressant treatments. These findings may be of importance for evaluating the clinical relevance and justification of ongoing or future RCTs on antidepressant treatments in MDD.