Introduction

Depressive disorders remain one of the leading causes of disease burden worldwide. The most recent and comprehensive statistics available, dating to 2019, rank these disorders as the second and 13 leading causes of years lost due to disability and disability-adjusted life years, respectively. Furthermore, from 1990 to 2019, their prevalence has remained fundamentally unchanged [12•].

Despite extensive research, the exact etiology and pathophysiology of depression remain elusive. Multiple theories have been proposed, including the monoamine hypothesis, which suggests dysregulation of serotonin, dopamine, and noradrenaline pathways in depressed individuals [14].

Currently, pharmacological treatment of depression primarily relies on antidepressants, particularly selective serotonin reuptake inhibitors (SSRIs), serotonin-noradrenaline reuptake inhibitors (SNRIs), and tricyclic antidepressants. It is thought that these drugs work first and foremost by increasing the synaptic levels of monoamines [5], thus falling within the framework of the monoamine hypothesis.

The exact definition of what does and does not constitute a psychostimulant is not widely agreed upon; namely, atomoxetine (included in this review as a psychostimulant) is usually considered to be a non-stimulant ADHD treatment [30]. However, for the purposes of this review, this class encompasses several drugs that, in increasing synaptic dopamine and noradrenaline levels [4, 7, 11, 21, 30], produce overlapping effects, such as aiding attention, reducing sleepiness and fatigue, and increasing alertness, and therefore share therapeutic indications [25].

Having been established that monoamine dysregulation is significantly implicated in the pathogenesis of depression and that psychostimulants influence the metabolic pathways of monoamines, not unlike most antidepressants, it would follow that treatment of depressed patients with psychostimulants might constitute an effective approach. Moreover, on a symptomatic level, the mood-elevating, cognitive-enhancing, and wake-inducing effects of these drugs are in stark contrast with common depressive symptoms, which further corroborates a potential therapeutic use for these drugs.

As previously expanded on, MDD is a prevalent and burdensome disease, which makes having effective treatment options of special importance. Nevertheless, antidepressants have been shown to only be moderately more effective than placebo [5] and even to be comparable in efficacy to “active” placebos [20]. This paradigm allows for two approaches: either novel, more effective, drugs are developed, or older drugs are repurposed. This review explores the latter option.

Experimental Procedures (Methods) 

Criteria for considering studies

Type of Studies

We included randomized controlled trials with at least double blinding, in which participants have been randomly allocated to an experimental group and a placebo control group. Only studies published in English and after the year 2000 were considered.

Participants

Studies were included if they involved adult (> 18 years old) men and women, with a primary diagnosis of major depressive disorder, as defined by the DSM (any edition) or the ICD (any edition). A decision was made to exclude studies which included participants with a diagnosis of bipolar disorder, because it is not within the objectives of this review to assess the efficacy of augmentation in this subset of depressed patients, as the distinct nature of bipolar disorder would merit its own review. It was also decided that studies focusing on depressed patients diagnosed with diseases for which psychostimulant therapy is formally indicated (ADHD, narcolepsy, and obstructive sleep apnea, for example) or with severe fatigue or apathy-causing diseases (notably, AIDS, cancer, Parkinson’s disease, and dementia) should be excluded, as any benefit of psychostimulant therapy cannot necessarily be attributed to its effect on the pathophysiology of depression.

No restrictions were placed regarding location, setting, and number of participants.

Intervention

This review only included trials in which antidepressant therapy (with antidepressants (ADs) of any class and in any regimen) was augmented with a psychostimulant and a placebo (as a control intervention). Thus, the comparison can be formulated as: “Psychostimulant vs placebo augmentation of antidepressant therapy.”

Psychostimulants considered for the purpose of this review were atomoxetine, armodafinil, dextroamphetamine, lisdexamfetamine, methylphenidate, and modafinil.

Outcome Measures

Change in Depression Severity from Baseline to End-of-Study

Continuous measures of change in symptom severity, assessed by adequately validated scales specific to depression, were included. As such, measures determined by assessment instruments not specific to this disease, such as the CGI scales, were excluded. Whenever a study measured the outcome using more than one appropriate scale, this review used the values derived from the scale most frequently used in the included studies. If the scales were equally frequent, a decision was made to select the most appropriate option.

Remission at End-of-Study

This review also included dichotomous measures of remission, defined by the categorization of the scores of the scales. As with the previous outcome, if a study included different definitions of remission, this review considered the definition most frequently applied in the included studies. A decision aiming to select the most appropriate option was made whenever the definitions were equally frequent.

Search Methods for Identification of Studies

Electronic Searches

Searches were carried out from 11th February to 6th July 2023 on PubMed, Web of Science Core Collection, and ClinicalTrials.gov. On PubMed, the query used was as follows: (“depression” [MeSH Terms] OR “depression” OR “depressive disorder” [MeSH Terms] OR “depressive disorder” OR “depressive disorders” OR “depressive disorder, major” [MeSH Terms] OR “major depressive disorder”) AND (“modafinil” [MeSH Terms] OR “modafinil” OR “armodafinil” OR “atomoxetine hydrochloride” [MeSH Terms] OR “lisdexamfetamine dimesylate” [MeSH Terms] OR “lisdexamfetamine dimesylate” OR “lisdexamfetamine” OR “dextroamphetamine” [MeSH Terms] OR “dextroamphetamine” OR “atomoxetine hydrochloride” OR “atomoxetine” OR “methylphenidate” [MeSH Terms] OR “methylphenidate”). On Web of Science Core Collection, the query used was as folllows: (ALL = “depression” OR ALL = “depressive disorder” OR ALL = “depressive disorders” OR ALL = “major depressive disorder”) AND (ALL = “modafinil” OR ALL = “armodafinil” OR ALL = “lisdexamfetamine dimesylate” OR ALL = “lisdexamfetamine” OR ALL = “dextroamphetamine” OR ALL = “atomoxetine” OR ALL = “atomoxetine hydrochloride” OR ALL = “methylphenidate”). ClinicalTrials.gov was searched using “depression” as the “Condition or disease term” and the name of each psychostimulant as the “Intervention/treatment” term.

Searching Other Resources

No other searches were made.

Data Collection and Analysis

Selection of Studies

In accordance with the inclusion criteria, the identified studies were initially screened based on the information provided in the title and in the abstract. Following screening, the full text of the initially included studies was assessed for eligibility and reasons for exclusion were provided. This process was carried out by two authors (DJ and FN).

Data Extraction and Management

Using a data extraction form, the information was extracted solely by two authors (DJ and FN). For every included study, whenever possible, the following data was retrieved: methodological characteristics of the trial, total number of participants, number of male and female participants, inclusion and exclusion criteria, diagnostic criteria applied in diagnosing depression, characteristics of the intervention and the comparison, outcomes assessed, and treatment effects for the relevant outcomes.

Assessment of Risk of Bias in Included Studies

Risk of bias was accessed using the Revised Cochrane risk-of-bias tool for randomized trials [31]. Five domains were taken into consideration: randomization process, deviations from intended interventions, missing outcome data, measurement of the outcome, and selection of the reported result. Each domain was assessed as either low-risk, some concerns, or high-risk of bias. Based on these judgements, an overall assessment of bias for each included study was made.

Measures of Treatment Effect

Continuous Data

As different scales were used to measure the same outcome, the standardized mean difference was used, along with its 95% confidence interval.

Dichotomous Data

Odds ratios and their 95% confidence intervals were used.

Unit of Analysis Issues

Studies with Multiple Treatment Groups

If a study had more than two arms, with one of them being placebo control and the other interventions with different doses of a same psychostimulant, then the formulae presented in Sect. 6.5.2.10 of the Cochrane Handbook for Systematic Reviews of Interventions [13•] were used in order to combine treatment groups and create a single “Psychostimulant vs placebo augmentation” comparison.

Dealing with Missing Data

For continuous data, whenever the standard deviation was not provided, but instead a standard error of the mean was indicated, the standard deviation was calculated by multiplying the standard error of the mean by the square root of the sample size. When possible, in the cases in which a standard error of the mean was also not provided, the standard deviation was obtained through the calculations described in the Cochrane Handbook for Systematic Reviews of Interventions [13•].

For dichotomous data, an intention to treat analysis was performed.

Assessment of Heterogeneity

Statistical heterogeneity was assessed using the I-squared statistic, which indicates the percentage of the variability in effect estimates that is due to heterogeneity rather than chance [13•].

Assessment of Reporting Biases

Reporting biases were assessed through funnel plot inspection and, if the meta-analysis included ten or more studies, a Begg's Rank correlation test of funnel plot asymmetry [3]. The decision to not test for funnel plot asymmetry if the meta-analysis included less than ten studies stems from the fact that the power of this test is significantly lower in meta-analyses with few studies [13•].

Data Synthesis

Statistical analysis was done using Review Manager 5.4.1.

In the analysis of both continuous and dichotomous outcome measures a random-effects model was used. For the continuous outcome, the meta-analysis reported the standardized mean difference and its 95% confidence interval. In the case of the dichotomous outcome, the odds ratio and its 95% confidence interval were presented.

Results

Description of Studies

Results of the Search

The electronic search produced a total of 3152 references. Of these, 929 were eliminated based on being duplicates. The remaining 2223 were analyzed by title and abstract and, at this point, 2182 references were excluded. Next, the 41 articles’ full texts were assessed for eligibility and 12 of them were included. Richards et al. [28] describes two separate trials, hence the study was divided into Richards et al. [28] and Richards et al. [29], raising the number of included studies to 13. This selection process is schematically represented in Fig. 1. Reasons for exclusion are detailed in the “Excluded studies” section.

Fig. 1
figure 1

Flow diagram of selection process

Included Studies

Thirteen studies were included [1, 6, 8, 9, 15, 17, 19, 23, 26, 28, 29, 32] (Table 1).

Table 1 Study characteristics

All included studies were prospective, double-blind, placebo-controlled, parallel-group randomized trials. Two were single-center [15, 17], while the other 11 were multi-center. Publication years ranged between 2003 [6] and 2017 [29].

All studies included male and female patients with a diagnosis of MDD according to the DSM-IV [1, 6, 8, 9, 15, 19, 23] or the DSM-IV-TR. Only two trials exclusively included elderly participants [15, 17].

A majority of studies (7/13) only included patients with a partial-response to antidepressant therapy [6, 8, 9, 19, 28, 29, 32], and one study had a population of both non- and partial-responders to antidepressant therapy [23]. The other five trials did not account for previous therapeutic response.

A minority of studies (2/13) restricted inclusion to patients who displayed significant fatigue and sleepiness [8, 9].

Regarding interventions, of the thirteen trials, four evaluated modafinil [1, 6, 8, 9], four evaluated methylphenidate [15, 17, 23, 26], one evaluated atomoxetine [19], and another four evaluated lisdexamfetamine [28, 29, 32].

Nearly all studies had two conditions, antidepressant plus psychostimulant and antidepressant plus placebo. The exceptions were Lavretsky et al. [17], which added a third condition, psychostimulant plus placebo, and Richards et al. [29], which established five conditions, one of them being antidepressant plus placebo and the other four being antidepressant plus different doses of the same psychostimulant.

The duration of the intervention ranged from 4 [23] to 16 weeks [17].

Three studies did not present data for the mean change in depression severity from baseline to end-of-trial using the assessment instruments deemed pertinent [8, 17, 19]. Of those that did present such data, six used MADRS [9, 26, 28, 29, 32], one used Ham-D-24 [15], one used Ham-D-21 [23], one used Ham-D-17 [1], and one used both Ham-D-21 and Ham-D-17 [6].

All but three studies [6, 26, 29] assessed remission rates. Of the ten studies which considered remission, two defined it as a Ham-D-24 score of less than 7 [15, 17], two defined it as a Ham-D-21 score of less than 8 [8, 23], two defined it as a Ham-D-17 score of less than 8 [1, 9], two defined it as a MADRS score of less than 11 [28], one defined it as a Maier and Philipp core mood severity subscale score of less than 5 with no single item scored upwards of 1 [19], and one gave two definitions, either a MADRS score of less than 11, or a Ham-D-17 score of less than 8, [32].

Excluded Studies

Of the forty-one full-text articles assessed for eligibility, twenty-nine articles were excluded. Of these, thirteen were excluded on the basis of not being full articles, but instead publications related to poster sessions. Nine articles were not included for being ClinicalTrials.gov Study Registers of other studies being assessed for eligibility, and therefore duplicates. Other reasons for exclusion were concurrent treatment of participants with sleep deprivation [2], being a retrospective pooled analysis of two already included studies [10], being a secondary analysis of already included studies [16, 22, 27], including patients in full remission [18], and not defining the diagnostic criteria applied in diagnosing depression [24].

Risk of Bias in Included Studies

Risk of Bias Arising from the Randomization Process

Given that few studies published detailed information on allocation sequence generation and concealment, judgements were necessary. It was decided, based on each trial’s context, that all studies randomly generated and adequately concealed allocation sequences. No studies showed relevant baseline differences between groups. All studies were therefore judged to be at low risk of bias arising from the randomization process (Fig. 2).

Fig. 2
figure 2

Traffic light plot of risk of bias

Risk of Bias Due to Deviations from Intended Interventions

All publications included were, per inclusion criteria, double-blinded. However, nearly half of the studies (6/13) used per-protocol analysis instead of the more appropriate intention-to-treat or modified intention-to treat analyses. In each of these studies, only a small fraction of participants were excluded, thereby lowering the potential for substantial impact on the results. They [8, 9, 28, 29, 32] were therefore judged to raise some concerns of bias due to deviations from intended interventions, while the other seven studies were assessed as low risk (Fig. 2).

Risk of Bias Due to Missing Outcome Data

Only in Abolfazli et al. [1] was data available for nearly all participants (defined as availability of data from at least 95% of participants). Two studies [23, 32] imputed missing data using the inappropriate last-observation-carried-forward approach and another two studies [6, 19] made no mention of strategies for dealing with missing data. These four studies were judged to be at high risk of bias, as the nature of depression makes it probable that drop-outs are tied to disease severity, thus requiring an analysis that adequately accounts for bias. Since nearly all data was available in Abolfazli et al. [1], it was deemed as low risk, as were the remaining eight studies, which accounted for bias using mixed-effects models (Fig. 2).

Risk of Bias in Measurement of the Outcome

Symptom scales used for measuring outcomes were adequately validated across all studies and there was no evidence that measurement procedures differed between groups or that outcome assessors were unblinded. Hence, all studies were assessed as low risk of bias in measurement of the outcome (Fig. 2).

Risk of Bias in Selection of the Reported Result

Through a non-systematic online search, only five analysis plans previous to the study start date were retrieved. As no relevant differences existed between these plans and their respective studies [17, 28, 29, 32], they were considered to be at low risk of bias. The other eight studies raised some concerns because no such plans were found (Fig. 2).

Effects of Interventions

Change in Depression Severity from Baseline to End-of-Study

Ten trials were found to present data for the mean change in depression scores between the beginning and the end of the trial. In spite of this, only nine were included in the meta-analysis, as Patkar et al. [23] provided neither standard deviations, nor information which could be used to ascertain standard deviations.

Data pooled from the nine trials indicates that augmentation of antidepressant therapy with psychostimulants significantly improves depression scores when compared to placebo augmentation (N = 1827; SMD =  − 0.18; 95% CI (− 0.36, − 0.01); p = 0.04, I2 = 65%) (Fig. 3).

Fig. 3
figure 3

Forest plot for change in depression severity from baseline to end-of-study

Patkar et al. [23] reported a greater mean decrease in depression scores in the psychostimulant augmented group (− 6.9 vs − 4.7), but the effect was deemed not significant (between-group effect: F1,47 = 1.24, p = 0.22).

Remission at End-of-Study

All of the ten trials that published remission rates were included in the meta-analysis. Psychostimulant augmentation was found to be somewhat better than placebo augmentation at achieving remission, but the effect was not significant (N = 1709; OR = 1.30; 95% CI (0.97, 1.75); p = 0.08, I2 = 32%) (Fig. 4).

Fig. 4
figure 4

Forest plot for remission at end-of-study

Discussion

This review set out to evaluate the efficacy of psychostimulant augmentation of standard antidepressant therapy in depressed patients by focusing on its effects on two distinct, yet related outcomes, change in depression scores and remission rates.

The main findings indicate a statistically significant improvement in depression scores among individuals who received psychostimulants in conjunction with antidepressants, suggesting that this therapeutic approach may lead to a notable reduction in depressive symptoms. However, in terms of remission rates, this review failed to demonstrate a statistically significant increase in the psychostimulant augmentation group.

These findings imply that while the intervention may be beneficial in reducing depressive symptoms, it does not necessarily lead to a greater proportion of individuals achieving remission from the disease. As such, these results, albeit limited, suggest a role for psychostimulants in the symptomatic relief of depressed patients.

Some limitations must be considered when interpreting the completeness and applicability of the evidence presented due to the following reasons: (1) the population samples used in the included studies differed, notably in regards to response to previous antidepressant therapy; (2) studies aimed at patients with certain comorbidities were excluded; (3) only six psychostimulants were considered for this review and no eligible studies were found to evaluate the use of two of them (armodafinil and dextroamphetamine); (4) there were no studies assessing the augmentation of antidepressants not belonging to the SSRI and SNRI classes; (5) the duration of the intervention differed considerably across studies; (6) the scales used for measuring the continuous outcome were inconsistent, as were the definitions of remission; and (7) only studies published in English and after the year 2000 were considered.

Quality of the Evidence

The quality of the evidence was assessed using the GRADE approach. The grading of each outcome is presented in the summary of findings table (Table 2).

Table 2 Summary of findings

Potential Biases in the Review Process

As with all systematic reviews, there is the possibility of bias, particularly reporting bias. The present review focused exclusively on studies published in English after the year 2000, without reaching out to authors to address missing data or incorporating unpublished data. These methodological decisions make this review particularly susceptible to reporting bias. Notwithstanding, there was no indication of significant asymmetry through funnel plot inspection for both outcomes, which suggests the absence of reporting bias. The test for funnel plot asymmetry [3] conducted for the “Remission at end-of-study” outcome corroborated this (p = 0.1797).

Conclusions

In the present context of high depression prevalence, relatively low efficacy of classic therapeutic strategies, and lack of new drugs targeting the disease, the application of novel alternative regimens is more relevant than it has ever been. There is evidence to suggest that, as with other diseases, successfully treating depression, particularly in its early stages, leads to better outcomes and, as such, a case can be made for the use of augmentation strategies earlier on in the disease and not only in patients with treatment resistant depression.

This review showed that there is significant evidence that psychostimulant augmentation relieves the overall symptomatic burden of depression and that its impact on remission rates remains inconclusive. Therefore, the findings support the clinical use of certain psychostimulants for disease severity reduction.

However, due to the limitations found in the literature, it is fair to say that not only are more trials necessary, but also that these newer trials should aim to study, possibly give priority to, psychostimulants less researched in this context. Moreover, it is important that further research be in the form of larger trials with longer intervention and follow-up periods.

Additionally, outcome assessment instruments and the definitions for remission varied substantially between studies, which raised important difficulties in pooling data. For this reason, moving forward, it is important that some degree of uniformity be achieved, ideally through the adoption of a predefined set of pertinent scales and definitions to be used across studies.