Introduction

Major depressive disorder (MDD) is a mood disorder which can lead to a persistent feeling of persistent sadness and loss of interest [1]. The lifetime prevalence of MDD is between 10–20% [2,3,4]. Recent estimates in 204 countries and territories found that the global prevalence and burden of MDD increased by 27.6% in 2020 due to the COVID-19 pandemic [5]. MDD is the most serious disease in disability-adjusted life years (4.3%), and is estimated to be the leading cause of morbidity worldwide by 2030 if such trend continues [6]. The etiology of MDD is multifactorial, and social, cultural, genomic, aging and other underlying biological factors all play a role [7,8,9,10,11,12].

Both pharmacological and non-pharmacological treatments are effective for MDD [13], however, antidepressant drugs (ADs) remain the mainstay of treatment in primary and secondary medical institutions [14]. Amongst ADs, there are many different agents are available, including tricyclic antidepressants (TCAs), monoamine oxidase inhibitors (MAOIs), selective serotonin reuptake inhibitors (SSRIs), serotonin-noradrenaline reuptake inhibitors (SNRIs), and other newer ADs. The use of ADs is growing globally, especially in high-income countries [15], mainly due to the increasing consumption of SSRIs and newer ADs [16, 17]. SSRIs are generally better tolerated than TCAs, though the difference in efficacy is slight or negligible [18]. However, head-to-head comparisons have provided inverse findings. Duloxetine, for example, may have the edge over SSRIs in terms of efficacy [19, 20]. In addition, individual SSRIs and SNRIs may have varied outcomes [21]. SSRIs are first-line treatments for MDD [22]; however, these drugs work slowly and, in some patients, may not even work [23]. Escitalopram, one of SSRIs, is the representative of antidepressants currently used in terms of safety and efficacy [20, 23].

Recently, new randomized controlled trials (RCTs) of escitalopram in the treatment of MDD are pouring out, which were conducted in different circumstances [24,25,26], and the integration effects of these studies was ambiguous. Therefore, it is urgent to determine the true effect size of escitalopram for clinicians and clinical pharmacists. The aim of this present meta-analysis was to evaluate the efficacy of escitalopram in alleviating the acute symptoms of MDD, and to investigate the acceptability and adverse effects (AEs) of escitalopram in comparison with other ADs.

Methods

This meta-analysis was performed according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines [27]. A review protocol with search strategy was registered in the PROSPERO (CRD42022364229).

Search strategy

Electronic databases, including Medline/PubMed, EMBASE, the Cochrane Library, CINAHL, and Clinical Trials.gov, were searched to identify relevant studies from inception to July 10, 2023. No restrictions on language, publication status or gender were imposed. The reference lists of all included articles were also manually searched to identify any potential studies that might qualify.

Selection criteria

Only RCTs were included. Participants who were 18 years or older with a primary diagnosis of MDD were eligible. Studies prior to the 1990s may have used ICD-9, DSM-III/DSM-III-R or other diagnostic criteria. Later studies were more likely to have used criteria of DSM-IV or ICD-10. Studies using Research Diagnostic Criteria or Feighner criteria were included. However, ICD-9 criteria cannot be operationalized, so studies using ICD-9 were excluded. Studies in which no more than 20% of the participants might have bipolar depression were included.

Experimental intervention drug is escitalopram (as monotherapy). Comparator intervention drugs are other ADs for MDD, including TCAs, heterocyclic ADs, SSRIs, and newer ADs (SNRIs, MAOIs, newer agents, and non-conventional ADs such as herbal products). There were no restrictions on dose, frequency, intensity, and duration. Other types of psychotropic agents, such as anxiolytics, anticonvulsants, antipsychotics or mood-stabilizers, were excluded. Depressive patients with severe concomitant diseases, Axis I or II disorders were also excluded. Studies were excluded only if data were not provided at the time of meta-analysis.

The primary outcome was number of participants who responded to treatment, showing a reduction of at least 50% on the Hamilton Depression Scale (HAM-D) or Montgomery-Asberg Depression Rating Scale (MADRS), or any other depression scale, or score less than 2 on CGI-Improvement. The HAM-D was preferred to judge response when more than one criterion was provided. Secondary outcomes included number of participants who achieved remission, scores of change from baseline to the time point in question, acceptability and tolerability. The cut-off point for remission was preset to be score (1) less than 7 on the 17-item HAM-D or less than 8 for all the other longer versions of HAM-D, or (2) less than 12 on the MADRS, or (3) less than 2 on CGI-Severity. The HAM-D was preferred to judge remission when two or more criteria were provided. Change scores from baseline to the time point in question (early response, acute phase response, or follow-up response as defined above) were provided based on HAM-D, MADRS, or any other depression scale. We adopted a looser form of intention-to-treat (ITT) analysis, namely all the participants with more than one post-baseline measurement were represented by their final observations. Acceptability was assessed by total dropout rate, dropout rates due to inefficacy, and dropout rates due to AEs. Tolerability was assessed by total number of patients experiencing at least one AEs, total number of participants experiencing Deaths and suicide. In order to avoid missing any relatively rare yet important AEs, in the data extraction phase, we collected all AEs data reported in the literature and discussed methods for post-hoc summarization.

All titles and abstracts were checked by two reviewers independently (JY and XS) to determine if they met the rough inclusion criteria. All the studies rated as possible candidates by either of the two reviewers were added to the preliminary list. All the full-text articles in the preliminary list were then inspected independently by two reviewers (CW or XL) to determine whether they met the strict inclusion criteria. Any discrepancies were resolved by discussion, or adjudication of a third reviewer (MM).

Data extraction

Data were collected by two reviewers (JY and CW) using a Microsoft Excel spreadsheet. The extracted data included: author, year of publication, sample size, age, study duration, dose, diagnostic criteria, outcome measures, response criteria, remission criteria rate, overall discontinuation rate, discontinuation rate due to AEs. At the end of the data extraction phase, all key extracted data were reviewed and quality checked by the same two reviewers. Any discrepancies were resolved by discussion at first; and then brought to a third author (MM) for resolution, as required.

Risk of bias assessment

Two authors (JY and YC) independently used the Cochrane “Risk of bias” (ROB 2.0) tool to assess the methodological quality of the included trials [28].

Statistical analysis

Data analyses were performed using RevMan 5.4 (Copenhagen: The Nordic Cochrane Centre, The Cochrane Collaboration, 2020). To evaluate heterogeneity, we used the I2 statistic (with I2 > 50% indicating significant heterogeneity) [29] and Cochran’s Q P value (with P < 0.05 indicating significant heterogeneity). Risk ratio (RR) with 95% confidence interval (CI) was described by categorical data. Standardized mean difference (SMD) with 95% CI was used for continuous outcomes.

Publication bias was evaluated by visual inspection of funnel plot [30]. We performed subgroup analyses to determine whether the results were influenced by the different types of control groups (other SSRIs or newer ADs). Sensitivity analyses were conducted to evaluate the robustness of the synthesized results by excluding studies whose dropout rate was greater than 20%.

Results

Study selection and characteristics

The preliminary search yielded 109 references of potentially eligible studies. After exclusion of studies that were not relevant (mainly for reviews were or non-randomized studies), a total of 30 RCTs were included in this present review (Fig. 1) [24,25,26, 31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57].

Fig. 1
figure 1

Flowchart of study selection

The basic characteristics for all included studies are displayed in Table 1. In the presentation of the following analyses, a post-hoc decision was made to present all SSRIs (with sub-totals) together in one group, and SNRIs and newer ADs (without sub-totals) together in another group. Sixteen trials (53.3%) compared escitalopram with another SSRI and fourteen (46.7%) compared escitalopram with a newer AD (venlafaxine, bupropion, duloxetine, agomelatine, vilazodone and desvenlafaxine). Neither trials comparing escitalopram with TCAs or MAOIs. Among the total 30 included studies, 28 studies were multicenter, randomized, double-blind trials, the other 2 was randomized, open-label trial. Of all the 30 studies, 17 had an overall high risk of bias, 11 had some concerns of bias, and 2 had a low risk (Figure S1).

Table 1 Characteristics of Included Studies

The effects of interventions in efficacy, acceptability and tolerability are presented below. The results are reported by comparison (dividing SSRIs from newer ADs). AEs are only reported when statistically significant.

Number of patients who responded to treatment

Acute phase treatment (6 to 12 weeks)

There was a statistically significant difference with escitalopram being more effective than other SSRIs (RR 0.88, 95% CI 0.82 to 0.95, I2 = 33%; 14 studies, 4111 participants) (Fig. 2). There was a statistically significant difference with escitalopram being more effective than newer ADs (RR 0.90, 95% CI 0.83 to 0.97, I2 = 43%; 11 studies, 3663 participants) (Fig. 3).

Fig. 2
figure 2

Failure to respond at endpoint (6–12 weeks): Escitalopram versus other SSRIs

Fig. 3
figure 3

Failure to respond at endpoint (6–12 weeks): Escitalopram versus newer ADs

Early response (1 to 4 weeks)

There was no statistically significant difference with escitalopram being more effective than other SSRIs (RR 1.02, 95% CI 0.93 to 1.11) (Figure S2) or newer ADs (RR 0.97, 95% CI 0.87 to 1.08) (Figure S3).

Follow-up response (16 to 24 weeks)

There was no statistically significant difference between escitalopram and other SSRIs (RR 0.83, 95% CI 0.66 to 1.05, I2 = 0%) (Figure S4). And there was no statistically significant difference between escitalopram and newer ADs (RR 0.78, 95% CI 0.51 to 1.19) (Figure S5).

Number of patients who achieved remission

Acute phase treatment (6 to 12 weeks)

There was statistically significant difference between escitalopram being more effective than other SSRIs (RR 0.89, 95% CI 0.81 to 0.99, I2 = 69%) (Fig. 4), however, there was no statistically significant difference with escitalopram being more effective than newer ADs (RR 0.96, 95% CI 0.91 to 1.01, I2 = 36%) (Figure S6).

Fig. 4
figure 4

Failure to remission at endpoint (6–12 weeks): Escitalopram versus other SSRIs

Follow-up remission (16 to 24 weeks)

There was no statistically significant difference between escitalopram being more effective than other SSRIs (RR 0.84, 95% CI 0.55 to 1.27) (Figure S7) or newer ADs (RR 0.82, 95% CI 0.61 to 1.10) (Figure S8).

Mean change from baseline (6 to 12 weeks)

Escitalopram was found to be more efficacious than other SSRIs in reduction of depressive symptoms (SMD -0.13, 95% CI -0.19 to -0.06, I2 = 34%) (Figure S9) or newer ADs (SMD -0.41, 95% CI -0.81 to -0.02, I2 = 97%) (Figure S10).

Tolerability-Total number of patients experiencing at least one side effect

There were statistically significant differences between escitalopram and other SSRIs in terms of tolerability (RR 0.93, 95% CI 0.89 to 0.97, I2 = 0%) (Fig. 5). However, there were no statistically significant differences between escitalopram and newer ADs in terms of tolerability (RR 0.97, 95% CI 0.93 to 1.01, I2 = 29%) (Figure S11).

Fig. 5
figure 5

Subjects with at least one TEAE: Escitalopram versus other SSRIs

Sensitivity analysis

The trials whose dropout rates were greater than 20% were excluded in the sensitivity analysis. Referring to other SSRIs, a dropout rate greater than 20% was found for three studies comparing escitalopram with citalopram [37, 49, 50], one with fluoxetine [40] and one with paroxetine [36]. Among newer ADs, a dropout rate greater than 20% was found for all the three studies comparing escitalopram with bupropion [31, 32, 52], two with duloxetine [48, 55], two with agomelatine [26, 39], one with desvenlafaxine [53], one with vilazodone [25], and one with venlafaxine [35]. Three studies had only one arm reporting a dropout rate greater than 20% [38, 42, 43]. Therefore, sensitivity analysis was conducted only for the comparisons between escitalopram and other SSRIs.

Results from the sensitivity analyses remained in favor of escitalopram, not only when studies whose dropout rate was greater than 20% in both arms were ruled out (RR 0.81, 95% CI 0.68 to 0.97, I2 = 68%; 7 studies, 1893 participants; Figure S12), but also when studies whose dropout rate was greater than 20% in only one arm were additionally ruled out (RR 0.71, 95% CI 0.51 to 0.98, I2 = 71%; 4 studies, 1062 participants; Figure S13). Therefore, results from these sensitivity analyses did not materially change the main findings, suggesting that the pooled analyses were robust.

Publication bias

Funnel plot of the studies enrolled in the meta-analysis demonstrated no significant asymmetry by visual inspection, therefore, the outcome of failure to respond at 6–12 weeks (escitalopram vs. other SSRIs) was not affected by publication bias (Figure S14).

Discussion

Thirty studies were included in this review. Escitalopram was superior to other SSRIs or newer ADs for the acute phase treatment of MDD in terms of efficacy (citalopram, fluoxetine, desvenlafaxine and duloxetine) and acceptability (Paroxetine and duloxetine). A quarter of the included trials used citalopram as the comparator and only a few trials per comparison were found for most of the remaining ADs (with the exception of duloxetine, fluoxetine, paroxetine and bupropion), which limits the power of this study to detect moderate but clinically meaningful differences between the drugs. The randomized evidence collected in the datasets for this review was sufficient to detect differences in early response to treatment (after two weeks of intervention). However, checking the data reported in the studies included in this meta-analysis, the question on comparative efficacy of early onset response has not been resolved and remains a controversial issue.

To the best of our knowledge, our work is the most comprehensive review to date of the comparative efficacy, acceptability and tolerability of escitalopram and other ADs for the treatment of MDD. The sample size in our study was larger than that of similar systematic reviews comparing an AD versus other ADs [58, 59]. Escitalopram is a relatively new compound and the quality of psychiatric trials may have improved over the past few years. By using a broader scope and advanced statistical methods, our findings provide more certainty about results than previous reviews which assessed similar research questions [20, 60]. Although these studies also focused on RCTs to assess the comparative safety and efficacy of drugs, we employed meta-analyses which enabled us to use a more comprehensive evidence base including head-to-head trials.

It has long been argued that placebo-controlled trials are required to adequately demonstrate the efficacy of newer ADs [61], however, receiving placebo in RCTs increased the chances of dropout and decreased the absolute response of participants to active ADs [62]. In the case of ADs, it may be more appropriate to conduct trials using an active comparator (chosen from the most effective and better-tolerated treatments available) [63]. Therefore, in this review, we included only head-to-head trials comparing between escitalopram and other active treatments. Because the literature search was comprehensive, it might be impossible that some studies had not been identified.

Our study demonstrated that escitalopram was superior to other ADs for the acute phase treatment of MDD in terms of efficacy, acceptability and tolerability. The mechanism action of escitalopram to improve MDD especially in the acute phase may be characterized by the increased subcortical network-ventral attention network connectivity [64], lower plasma kynurenine levels and resting-state regional activity in the left dorsolateral prefrontal cortex [65].

There are some limitations in this review. At First, although the sample size was larger, most studies still do not report adequate information on randomization and allocation concealment. For example, outcomes that were clearly relevant to patients and clinicians, in particular, patients' and their caregivers' attitudes to interventions, their ability to resume work and normal social functioning, were not reported in the enrolled studies. Furthermore, information on randomization and allocation concealment was occasionally lacking, which may be due to reporting in the text than real defects in study design. At last, the reports of the outcomes in the included studies were often unclear or incomplete and the figures used for the analyses were not easy to understand. And sometimes there were some inconsistencies between published data and unpublished data on the websites of pharmaceutical industries. In order to make up for these limitations, we evaluated the risk of bias in the results of trials, and preferred to report meta-analyses restricted to trials with low risk of bias [66,67,68].

In terms of AEs profile, we found that different ADs showed different tolerability profiles, which is an important issue from a clinical point of view, and the outcomes of this study are consistent with previous findings [69, 70]. However, a full description of tolerability profile of drugs cannot rely solely on randomized evidence [71, 72]. In addition, AEs were inconsistently reported in the included studies in our review, which hampers cross-study comparisons. The reporting of AEs needs to be standardized, and more consideration should be given to patients' subjective experience of medication. During the evidence-based decision-making process, clinicians should consider and inform patients of different AEs profiles among ADs, therefore, the issue on tolerability is clinically important. However, it has been shown that RCTs might not be the most effective tool for identifying possible causal relationship between ADs and even severe adverse events (SAEs) [73]. This applies to class-related AE, but might also apply to each specific compound. The more information that is pooled together in a systematic review and meta-analysis, the more precise and accurate is the estimate [74]. We are also aware of the possibility that a number of RCTs comparing escitalopram with other ADs are currently underway [75]. With more reliable and longer-term studies, the real impact and burden of the newer ADs on treated patients in terms of tolerability will be known.

Moreover, clinicians should take dosage and duration into consideration when administering drug therapy. In this review, nearly all studies used dosages and durations within the therapeutic range. Sixteen studies used a flexible-dose regimen and the remaining fourteen used a fixed-dose one. Among the included studies, there was no evidence of imbalance in terms of dosage, duration, or disease severity in favor of the investigational drug.

Conclusion

Our study demonstrated that escitalopram appears to be suitable as first-line antidepressant treatment for moderate to severe MDD. Escitalopram was superior to other ADs for the acute phase treatment of MDD in terms of efficacy, acceptability and tolerability. However, there is insufficient evidence to detect a difference between escitalopram and other ADs in early response or follow-up response to treatment of MDD. Sponsorship bias may lead to overestimation of treatment effects; therefore, results reported for comparative efficacy should be treated with caution.