Introduction

For patients with hormone-receptor-positive advanced breast cancer, endocrine therapy is the recommended initial treatment, and aromatase inhibitors (AIs) are the preferred option for postmenopausal women because they elicit prolonged disease control compared with tamoxifen [13]. Treatment options for patients who experience disease progression after nonsteroidal AI therapy are limited. Recently, the combination regimen of everolimus, an mTOR inhibitor, and exemestane, a steroidal AI, was approved for use in this setting based on significant improvement of progression-free survival (PFS) versus exemestane alone [4]. Other options include exemestane alone, fulvestrant (an estrogen-receptor down-regulator), megestrol acetate, and chemotherapy [13]. Current guidelines recommend continued endocrine therapy for responsive disease; however, clinical questions regarding optimal sequencing of therapy remain [13].

Exemestane has shown activity in patients with advanced breast cancer as a second- and third-line agent following nonsteroidal AI treatment [5]. In patients who had disease progression after first-line letrozole or anastrozole (n = 249), exemestane provided a clinical benefit rate of 27 % with a median PFS of 3.4 months [5]. Even after two prior endocrine treatments in the advanced breast cancer setting (n = 60), exemestane demonstrated a clinical benefit rate of 38 % [6].

Robust clinical evidence favoring the use of everolimus plus exemestane rather than exemestane alone was demonstrated in BOLERO-2, a phase 3 study in postmenopausal women with hormone-receptor-positive, human epidermal growth factor receptor 2 (HER2)-negative advanced breast cancer progressing/recurring on/after nonsteroidal AI treatment [4, 7]. In this study, median PFS was more than doubled in the everolimus plus exemestane group (7.8 months vs 3.2 for exemestane alone, by local assessment; hazard ratio [HR] = 0.45; P < 0.0001). Central assessment of median PFS confirmed the clinical benefit with everolimus plus exemestane (11.0 months vs 4.1 for exemestane alone; HR = 0.38; P < 0.0001). In addition, PFS benefits were consistent across patient subgroups defined by age, race, presence of visceral metastases, and prior chemotherapy.

A clinical trial of fulvestrant 250 mg in postmenopausal women with hormone-receptor-positive advanced breast cancer progressing on nonsteroidal AI treatment showed a similar duration of disease control as with exemestane (median time to progression [TTP] = 3.7 months in both treatment groups; HR = 0.963; P = 0.6531) [8]. However, a clinical trial of fulvestrant 250 mg versus fulvestrant 500 mg in patients progressing after endocrine treatment demonstrated a significant, albeit modest, improvement in median PFS in the fulvestrant 500-mg group (5.5 vs 6.5 months, respectively; HR = 0.80; P = 0.006) [9]. In patients previously treated with an AI, the magnitude of median PFS benefit with 500 versus 250 mg fulvestrant was less than in patients previously treated with an antiestrogen.

The currently available clinical evidence data support the use of both everolimus plus exemestane and fulvestrant in postmenopausal women with advanced breast cancer progressing on or after nonsteroidal AI therapy. However, the limited clinical data on direct comparisons of these agents present a challenge to optimization of disease management. Although the apparent magnitude of PFS benefit with everolimus plus exemestane exceeds that with fulvestrant, such empirical cross-trial comparisons have limited validity. In such cases, treatment decisions may be guided by an indirect comparison, which can estimate relative efficacy parameters of different treatment regimens across clinical studies. For example, a recent indirect treatment comparison for the efficacy of bevacizumab plus interferon-alpha-2a versus tyrosine kinase inhibitors as first-line treatment in metastatic renal cell carcinoma showed that the two treatment regimens were similar with regard to PFS [10]. Guidelines for conducting and interpreting indirect treatment comparisons exist to standardize and support this area of research [1113]. This network analysis was conducted to compare the efficacy of everolimus plus exemestane versus fulvestrant in patients with advanced breast cancer who are eligible for further endocrine therapies.

Methods

Evidence base: search strategy

As everolimus (plus exemestane) and fulvestrant 250 mg have both been compared with exemestane monotherapy, an indirect comparison was possible through the common exemestane arms in the studies. However, comparisons involving fulvestrant 500 mg, and everolimus plus tamoxifen, required the identification of additional studies with common treatment/comparator arms to complete the network. Studies connecting everolimus and fulvestrant were identified through systematic reviews of treatments for advanced or metastatic breast cancer found from a database search of the Cochrane Library (CDSR, DARE, and HTA, 2010–2012), EMBASE, and MEDLINE. Search terms included breast or mammary and disease descriptors (cancer, neoplasm, oncology, tumor, malignancy, carcinoma, adenocarcinoma, or sarcoma) as well as metastasis, advanced, secondary, recurrent, inoperable, disseminated, or incurable. Search terms for treatments included everolimus or Afinitor as well as SDZ-RAD, rad001, or 159351-69-6. Recent studies were identified through Web site searches (National Horizon Scanning Centre, NICE, FDA, EMA, ASCO, ESMO, ISPOR, ECCO, EBCC, and SABCS) conducted March 8–9, 2012.

Eligible studies were assessed for quality based on seven items: appropriate randomization; adequate concealment of treatment allocation; groups similar at the onset of the study in terms of prognostic factors, care providers, participants, and outcome assessors blind to treatment allocation; unexpected imbalances in dropouts between groups; evidence to suggest that more outcomes were measured than reported; intent-to-treat analysis; and appropriate methods used to account for missing data.

Quality assessments and assumptions

Data extracted from each trial included design, methodology, statistical methods, patient characteristics, and outcome measures. Subgroup data, whenever possible, were analyzed for relevance. The primary analysis was for relative efficacy (PFS/TTP) between everolimus and fulvestrant in the full patient population of the studies included in the network, using the local assessment of PFS from BOLERO-2. Secondary analyses included a subgroup analysis by prior AI treatment and an additional analysis for everolimus plus exemestane versus fulvestrant using the BOLERO-2 central assessment of PFS. Using the connector studies, exploratory analyses of the relative efficacy of everolimus plus tamoxifen versus fulvestrant were conducted. Additionally, the network meta-analysis of the full patient population was performed for the relative efficacy of everolimus plus tamoxifen versus fulvestrant using the Howell study [14] in place of the Paridaens study (which was a comparison of tamoxifen versus exemestane and needed the SoFEA/EFECT studies to complete a network for comparison versus fulvestrant 250 mg) as an additional sensitivity analysis [15]. The Howell study was identified following the systematic review, and therefore did not have data extracted in as much detail as the other studies and was not assessed for quality.

In conducting these analyses, several assumptions were made regarding the data from the individual studies. For the primary analysis, PFS and TTP were assumed to be similar because disease-specific death events were included in each endpoint in the studies analyzed. Moreover, as the survival time in advanced breast cancer is shorter than in early stage disease [9, 16], the majority of deaths were assumed to be disease-specific.

Statistical analysis

All treatments evaluated in the primary analysis were connected by one or more studies. Data consisted of the HRs for PFS/TTP and the 95 % confidence intervals (CI) for each of the trials as extracted from published reports. The log HRs and their precision (reciprocal of the variance) were calculated and input into the model. For the CONFIRM study, the HRs for the subgroup with prior AI therapy were calculated from a digitized copy of the published forest plot [9]. Accuracy was checked against calculations for known results (all patients); resulting differences on the HR scale were 0.005 or less. As not all studies included outcome information on patients with prior AI therapy, the only comparisons available were for everolimus plus exemestane, exemestane alone, and fulvestrant 250 and 500 mg.

The indirect treatment comparison for relative efficacy between everolimus (with exemestane or tamoxifen) and fulvestrant used a Bayesian approach as described by Welton et al. [17]. Specifically, a Bayesian fixed-effect model was used to describe the log HRs. In this type of comparison, evidence is synthesized from aggregate data arising from randomized, controlled studies. Exemestane was adopted as the reference treatment because it is used in clinical practice and is the treatment with the most information in the network; basic parameters of the model are the log HRs with respect to exemestane. The log HR from each study was assumed to be normally distributed, with the log HR mean equaling the true log HR observed in each study and the variance equaling the observed variability in each study. The Markov chain Monte Carlo (MCMC) software WinBUGS (version 1.4.3) was used to estimate the log HRs for each possible pairwise comparison. For each outcome measure, the analysis was performed with three chains, each run for a series of 20,000 burn-in simulations to allow convergence. Convergence was checked using the Gelman–Rubin statistic, and a visual inspection demonstrated satisfactory convergence by 20,000 iterations. A further 20,000 updates were run for each chain, and estimates were obtained from those updates. Median HRs and the 95 % credible interval (CrI) are presented for the HRs. If the 95 % CrI does not include 1, this suggests that there is evidence of a difference between the treatments.

Results

Network analysis evidence base

A total of seven studies were identified that could be used in a network analysis. Of these, only six were used to form the basis of a network analysis, with the seventh [14] used as an alternative for an additional sensitivity analysis. The evidence network with the connections between the comparators included in the primary analyses is shown in Fig. 1. Details of the individual study designs are provided in Table 1 [4, 5, 79, 14, 15, 18]. All patients in the 7 trials were postmenopausal and had locally advanced or metastatic disease. All trials reported on PFS/TTP outcomes. The TAMRAD study reviewed disease progression data locally, whereas the BOLERO-2 study included both local and central review. The other studies did not specifically state that a central review was performed. In addition, all but the open-label Paridaens and TAMRAD trials were double-blind.

Fig. 1
figure 1

Network of trials used to perform the analysis

Table 1 Details of the trials included in the network analysis

In general, the overall patient demographics and disease characteristics were similar (Table 2) [4, 5, 8, 9, 14, 15, 18]. There were slight differences in the CONFIRM population versus the other primary studies regarding prior treatments before study entry. Although all patients in CONFIRM had prior endocrine therapy, prior AI therapy was not an inclusion criterion, whereas the other primary studies (except the Paridaens and Howell studies) had this constraint. However, information from the subgroup of patients who had prior AI treatment was available for analysis from the CONFIRM study (as noted in the statistical analysis), and was used in a comparison with the other studies’ patient populations who had prior AI therapy. The Paridaens and Howell studies also differed in other prior treatment eligibility requirements. The Paridaens study permitted prior chemotherapy but not endocrine therapy as first-line treatment in the advanced setting, whereas in the Howell study, first-line treatment for metastatic disease was not allowed and information on patients with prior AI therapy was not available. Hazard ratios for PFS/TTP in each study used in the analyses are shown in Table 3 [4, 5, 8, 9, 14, 15, 18, 19].

Table 2 Patient demographics and disease characteristics at baseline for each study
Table 3 Individual trial hazard ratios used in analysis

Analysis of efficacy

In the primary analysis, the results suggest that everolimus plus exemestane is more efficacious for PFS/TTP than both fulvestrant 250 (HR = 0.47; 95 % CrI 0.38–0.58) and 500 mg (HR = 0.59; 95 % CrI 0.45–0.77) (Table 4). The exploratory analysis of the other pairwise comparisons (using the Paridaens study) found no evidence of a difference between everolimus plus tamoxifen and fulvestrant 250 or 500 mg (Table 4). The analysis using the BOLERO-2 PFS data based on central review suggested that everolimus plus exemestane remained more efficacious for PFS/TTP than fulvestrant 250 mg and fulvestrant 500 mg (HR = 0.40; 95 % CrI 0.31–0.51 and HR = 0.50; 95 % CrI 0.37–0.67, respectively; Table 4).

Table 4 Efficacy of everolimus (EVE) plus exemestane (EXE) versus fulvestrant (FUL) and other pairwise comparisons in the total patient population

The results of a sensitivity analysis based on an alternative network, which substituted the Howell study for the Paridaens study, were generally consistent with the analysis based on the original network, although this exploratory analysis suggested that everolimus plus tamoxifen is more efficacious for PFS/TTP than fulvestrant 250 mg (HR = 0.46; 95 % CrI 0.29–0.71) and 500 mg (HR = 0.57; 95 % CrI 0.35–0.92).

Prior aromatase inhibitor therapy

In patients who received prior AI therapy, based on local assessment of PFS from BOLERO-2, everolimus plus exemestane was more efficacious for PFS/TTP than fulvestrant 250 and 500 mg (HR = 0.47; 95 % CrI 0.38–0.58 and HR = 0.55; 95 % CrI 0.40–0.76, respectively; Table 5). Using the PFS data from the centrally reviewed BOLERO-2 study did not substantially change the results using the locally reviewed PFS data; everolimus plus exemestane remained more efficacious for PFS/TTP than fulvestrant 250 and 500 mg (Table 5).

Table 5 Efficacy of everolimus (EVE) plus exemestane (EXE) versus fulvestrant (FUL) and other pairwise comparisons in patients with prior aromatase inhibitor therapy

Discussion

Direct comparative clinical evidence for optimal treatments following a nonsteroidal AI in postmenopausal women with hormone-receptor-positive advanced breast cancer is limited [8, 20]. In this setting, clinical studies of endocrine monotherapy or combinations of endocrine agents have demonstrated limited efficacy [8, 20], and the optimal sequence of endocrine treatment remains unclear. In contrast, combining endocrine agents with newer targeted agents (e.g., everolimus) to block cell signaling pathways known to interact with the estrogen receptor provided increased clinical benefits versus endocrine monotherapy [4, 7]. In the absence of head-to-head clinical studies, an indirect comparison of clinical evidence can be used to provide estimates of relative efficacy and inform treatment decisions.

In this network meta-analysis, everolimus plus exemestane was shown to provide an improved PFS/TTP compared with fulvestrant alone at either 250 or 500 mg. Everolimus plus exemestane also provided greater clinical benefit versus fulvestrant 250 and 500 mg in the subgroup of patients who had prior AI therapy. The results from this network analysis are supported, in part, by the clinical evidence from an earlier network meta-analysis, which reported that fulvestrant 500 mg was expected to be at least as efficacious as exemestane alone in postmenopausal women with advanced breast cancer who progressed on endocrine therapy [21], and everolimus plus exemestane was more efficacious than exemestane alone in this setting [4]. Furthermore, in this earlier meta-analysis, fulvestrant plus anastrozole did not increase clinical benefits (TTP and overall survival) versus anastrozole alone in patients with breast cancer at first relapse [22]. However, this patient population was dissimilar (premenopausal patients were included and very few patients received prior AI therapy) to the patients in the current analysis.

The Decision Support Unit of National Institute for Health and Clinical Excellence reports that network meta-analyses may suffer from problems of unobserved effect modifiers, as do pairwise meta-analyses [23]. However, provided the constituent trials are unbiased and were conducted in similar patient populations, both analyses should result in unbiased estimates of treatment effects and are superior to observational studies because they are based on randomized comparisons [23]. However, there are limitations inherent to these types of indirect comparisons. The small number of included studies precluded a quantitative exploration of heterogeneity among trial results. Heterogeneity is an important aspect in indirect analyses, and PFS (used in this analysis to assess comparative efficacy) is affected by patient prognosis at baseline. An important feature of this methodology is that no assumptions are made regarding trial-specific baselines and that between-trial heterogeneity is set to zero, thus assuming homogeneity of the underlying true treatment effects. The CONFIRM trial did have a different patient population than the other trials; however, the subgroup analysis of patients who had prior AI therapy ameliorated those differences. The patient populations from the Paridaens and Howell studies were also different in terms of prior endocrine therapies, which can bias the analysis, although they only affect the exploratory comparison of everolimus plus tamoxifen versus fulvestrant and do not affect the primary comparison of everolimus plus exemestane versus fulvestrant. In addition, the CONFIRM and EFECT studies did not have data on HER2 status. Inclusion of a sizable proportion of patients with HER2-positive disease may lower efficacy results compared with the other studies that had a low proportion of patients with HER2-positive disease.

Limitations specific to this network analysis include the use of both double-blind and open-label trials and a combination of two different outcome measures—PFS and TTP. Although PFS and TTP can have dissimilar results because deaths are not typically included in TTP, the two outcome measures had overlapping criteria for disease-specific deaths or deaths within 6 months of last tumor assessment as an inclusion event in the network studies. Survival time in advanced breast cancer is shorter than in early stage disease [9, 16]; therefore, the majority of on-study deaths were assumed to be disease-specific, and the analysis assumed that PFS and TTP would yield similar results.

In conclusion, this indirect treatment comparison suggests that everolimus in combination with exemestane may be more efficacious than fulvestrant 250 mg (HR = 0.47; 53 % risk reduction of PFS event) and 500 mg (HR = 0.59; 41 % risk reduction of PFS event) in postmenopausal women with hormone-receptor-positive, advanced breast cancer that has relapsed or progressed on/after a nonsteroidal AI. Future trials should evaluate a selective combination of these therapies to define an optimal treatment strategy.