Introduction

A sizable risk of late recurrence exists in estrogen receptor (ER)-positive breast cancers after completion of 5 years of adjuvant tamoxifen therapy [1, 2]. This justifies consideration of extended endocrine treatment in ER-positive early-stage cancers. Letrozole after 5 years of tamoxifen was found to reduce recurrence and improve overall survival [1], demonstrating that some micrometastatic foci retain sensitivity to endocrine therapy even after 5 years of tamoxifen. Continuing tamoxifen to 10 years has also demonstrated an overall survival benefit compared to 5 years of tamoxifen [2]. However, the small absolute benefit should be weighted against the potential side effects of prolonged endocrine therapy [3]. Available molecular markers do not identify patients who are at high risk for late recurrence (after 5 yrs) who would be the best candidates for extended endocrine therapy. While the assays of Recurrence Score and PAM50 predict prognosis for over 10 years, they are most prognostic in the first 5 years and have limited predictive values for late relapses [4, 5]. Both tests rely on measuring genes associated with proliferation and estrogen receptor signaling [4, 5]. However, the combination of these markers in these tests is fixed and does not allow detection of time-varying effect or interactions between markers. Given the independent prognostic and predictive values of proliferation and estrogen signaling, we examined in a time-dependent way whether a combination of proliferation, measured by the Mitotic Kinase Gene Expression Score (MKS), and ER-related gene expression, measured by an estrogen-related gene expression score (ERS), could improve the ability of these variables to predict early versus late relapses in women with ER-positive breast cancer. This information could help in selecting patients for extended adjuvant endocrine therapy based on their persistent risk for late relapses.

Methods

Tumor samples

Publicly available Affymetrix gene expression data from 683 patients (cohort 1; GSE6532 [6] GSE9195 [7] GSE17705 [8] and GSE12093 [9]) with ER-positive breast cancers treated with 5 years adjuvant tamoxifen and 559 ER-positive, untreated, node-negative breast cancers (GSE2034 [10] GSE7390 [11] GSE11121 [12] GSE5327 [13] GSE2990 [14] and GSE6532 [6]) were analyzed. Duplicated patients were removed from the different datasets. We also assessed an independent series of ER-positive patients treated with adjuvant tamoxifen for 5 years (cohort 2, n = 282, GSE26971 [15]) and 58 ER-positive patients treated with neoadjuvant letrozole for 3 months for which gene expression profiles are available at baseline, 14 and 90 days after beginning of the neoadjuvant treatment (GSE20181 [16]). In this series, patients with a volume reduction higher than 50% over the 3-month treatment period were considered as clinical responders.

Although of limited sample size, cohort 2 was added to our analysis to provide a qualitative confirmation of our findings. Multivariate analysis of this series was not feasible due to the small number of late events (n = 18). Gene expression data used in this study were publicly available and ethical approvals have been obtained as reported in the original publications to which the datasets referred.

Description of molecular marker used for analysis

The proliferation score was calculated as the average expression of 12 mitotic kinases to produce the MKS as previously described [17]. The ERS was defined as the average expression of the four genes from the ER-associated group of the Oncotype DX test (Additional file 1: Table S1) [18]. None of these gene scores was developed or optimized on the current data and therefore our results represent an independent validation of their predictive and prognostic values. To avoid over-fitting of an arbitrary marker threshold to the data, median values were calculated over the combined series of untreated patients and cohort 1 of the tamoxifen-treated patients, and used to define high (that is, above the median) and low (that is, below the median) expression groups. A similar approach was used in cohort 2. By combining these two metrics, four biomarker groups were created (low-MKS/high-ERS; low-MKS/low-ERS; high-MKS/high-ERS and high-MKS/low-ERS).

In order to assess the robustness of our findings, we also tested other measures of proliferation and ER signaling (Additional file 1: Table S1) [19]. We assessed the ability of four previously reported gene signatures to predict early versus late relapse including the PAM50 subtypes [19], the GGI (Genomic Grade Index) [14], the MammaPrint [20] and the three-gene classifier (ER, human epidermal growth factor receptor 2 (HER2) and Aurora Kinase A (AURKA) [21] (Additional file 2: Supplementary Methods).

Statistical analysis

Statistical analyses were performed using BRB-ArrayTools v4.1.0 - Beta_3 Release [22] and R software v2.9.0. All statistical tests were two-sided. Distant event-free survival was the primary endpoint. For each biomarker, the proportional hazards assumption for Cox regression model was assessed based on the scaled Schoenfeld residuals.

In order to assess the time-dependent prognostic value of our biomarkers groups, we examined recurrences rates in three distinct time intervals, including 0.0 to 2.5 years, 0.0 to 5.0 years and 5.0 to 10.0 years. These intervals were selected because they correspond to clinically important decision points. Patients at high risk for relapse after 5 years of tamoxifen (that is, late relapse) may be the optimal candidates for extended adjuvant endocrine therapy. The period from 0.0 to 2.5 years before the conventional switch to aromatase inhibitors is when the selection of the upfront endocrine treatment (tamoxifen versus aromatase inhibitors) is made. Landmark analysis of the Kaplan-Meier estimates by biomarker groups in different time intervals was used. The estimates of the hazard function by biomarker groups were performed using the 'muhaz’ R package (v 1.2.5) [23]. The univariate and multivariate hazard ratios were estimated using Cox regression analysis by the Wald test.

Results

Patient characteristics and correlation between clinical and molecular variables

DEFS information was available for 673 of the 683 patients who received adjuvant tamoxifen in cohort 1. The median follow up was 9.15 years. DEFS information was available for 553 of the 559 patients who did not receive systemic adjuvant therapy and their median follow up was 9.9 years. The treated (cohort 1) and untreated patient cohorts were significantly different in age, tumor (T) size, nodal status and histological grade (Table 1).

Table 1 Summary of patient and tumor characteristics

There was no association between the four molecular groups (low-MKS/high-ERS, low-MKS/low-ERS, high-MKS/high-ERS and high-MKS/low-ERS), nodal status or age. However, high-MKS cancers had higher grade and larger T size (T2/T3). The majority of HER2-positive tumors (77.5%) were in the high-MKS/low-ERS group.

Time-varying prognostic values of clinical and molecular variables in adjuvant tamoxifen-treated patients

We assessed the prognostic values of clinical and molecular variables for 10 years DEFS using Cox analysis in tamoxifen-treated patients (cohort 1) and tested the assumption of proportional hazards for each model (Table 2). Age, T size and nodal status did not violate the proportional hazards assumption but some molecular variables showed violation of the proportional hazard assumption indicating that their prognostic values varied over time.

Table 2 Univariate Cox analysis of clinical variables and molecular markers for tamoxifen-treated patients (cohort-1)

We divided the time axis into less than 5 and 5 to 10 years and estimated the hazard ratios (HRs) separately for each variable in both time periods. High-proliferation tumors had a higher risk of relapse during the entire period, but the risk decreased significantly over time. HER2-positive tumors had a higher risk of relapse only during the first 5 years. The ERS also had a strong time dependent effect (P = 0.0008 over 10 years). The HRs for recurrence of low- versus high-ERS groups in the first 2.5 years, from 2.5 to 5.0 years, and >5.0 years were 8.36 (95% CI 3.37, 21.7), 1.93 (95% CI 1.07, 3.48) and 0.91 (95% CI 0.50, 1.66), respectively. The four-group model obtained by combining MKS and ERS had the strongest time-dependent effect (P <0.0001). Kaplan-Meier analysis of the four biomarker groups is represented in Figure 1 for the tamoxifen-treated patients (Figure 1A). The time-dependent effect of biomarkers was qualitatively similar in node-negative (Figure 1B) and -positive (Figure 1C) tumors.

Figure 1
figure 1

Kaplan-Meier estimates of distant event-free survival (DEFS) by marker groups in tamoxifen-treated and untreated patients. Kaplan-Meier estimates of 10-year DEFS of the four biomarker groups obtained by combining low and high MKS (mitosis kinome score) and ERS (estrogen-related score). Results are plotted separately for tamoxifen-treated (all patients) (A), tamoxifen-treated (node negative) (B), tamoxifen-treated (node positive) (C), and tamoxifen-untreated patients (D). loP, low MKS; hiP, high MKS; loER, low ERS; hiER, high ERS.

Time-varying prognostic values of clinical and molecular variables in untreated patients

We performed a similar analysis in patients who received no adjuvant therapy (Additional file 3: Table S2). Age and T size did not violate the proportional hazards assumption over 10 years. The low-ERS group had a constant, slightly higher risk of recurrence than the high-ERS group in the first 5 years (HR 1.90, 95% CI 1.29, 2.80; P = 0.001) without a significant violation of the proportional hazard assumption (P = 0.127). High-proliferation tumors (that is, high histological grade, high-MKS, high-GGI, luminal B) had a higher risk of relapse over the entire 10-year period, but the risk decreased significantly after the first 5 years. There was a trend for a higher relative rate of relapse in low proliferation tumors after 5 years (Additional file 4: Figure S1). In the absence of endocrine therapy, the four-group model had only a trend for time-dependent effect (P = 0.065) (Figure 1D).

Prediction of early and late relapse in tamoxifen-treated patients

Among the tamoxifen-treated patients, cancers with low proliferation and low ERS were associated with a continuously increasing risk of relapse over 10 years (Table 2), that was higher between 5 to 10 years than in 0 to 5 years (HR 3.36, 95% CI 1.29, 8.73; P = 0.013) (Additional file 4: Figure S2A). Low proliferation and high-ERS tumors had an excellent prognosis at 10 years, even after stopping tamoxifen at 5 years (Figure 1). In the high-proliferation group, ERS had an even stronger time-dependent prognostic value (P <0.0001). Among high proliferation cancers, the HRs for the low- versus high-ERS groups in the time periods 0.0 to 2.5, 2.5 to 5.0, and >5.0 to 10.0 years were 11.8, 95% CI 2.83, 49.3; P = 0.0007, 1.80, 95% CI 0.88, 3.69; P = 0.110, and 0.23, 95% CI 0.08, 0.61; P = 0.003), respectively indicating very high risk of early relapse among high-proliferation low-ESR cancers (Additional file 4: Figure S2A). This was similar in node-negative (Figure 2A) and -positive (Figure 2B) patients. To test if this effect was independent of the threshold used, we also examined the time dependence when ESR was assessed by tertiles or as continuous variables. The lowest tertile of ERS compared to the highest showed a higher risk of early relapse within 0 to 5 years (HR 3.93, 95% CI 1.99, 7.75; P <0.0001) and a lower risk of late relapse between 5 to 10 years (HR 0.09, 95% CI 0.01, 0.70; P = 0.021) (Additional file 4: Figure S3). ERS analyzed as a continuous variable produced similar results, indicating a robust finding.

Figure 2
figure 2

Time-dependent assessment of estrogen-related score (ERS) in the high mitosis kinase score (MKS)-group in both tamoxifen-treated and untreated patients. Landmark analyses of the Kaplan-Meier estimates of distant event-free survival (DEFS) for highly proliferative tumors (high-MKS) according to ERS groups (high and low) during the first 2.5 years (left side of each panel), from 2.5 to 5 years (center of each panel) and from 5 to 10 years (right side of each panel) are shown. Results are plotted separately for tamoxifen-treated, node-negative (A), tamoxifen-treated, node-positive (B) and tamoxifen-untreated, node-negative (C) patients.

These findings also indicate that the prognostic value of ER-associated genes for late recurrence is different in low- and high-proliferation cancer groups (Additional file 4: Figure S2A). In the 5- to 10-year interval, Cox analysis including MKS and ERS as continuous variables and their interaction term showed that they were each independently significant (P = 0.035 and P = 0.048, respectively). The interaction term was also significant (P = 0.027). In the 0.0- to 2.5-year interval, almost all recurrences occurred in the high-MKS/low-ERS group (HR 6.53, 95% CI 3.48, 12.2; P <0.0001) compared to other groups (Additional file 4: Figure S4B). In this high-risk group the nodal status was not significantly prognostic (P = 0.098) (Additional file 4: Figure S4F), but in the other molecular groups enriched for more endocrine-sensitive tumors, almost all relapses were observed in the node-positive cohort (HR 6.56, 95% CI 1.83, 23.5; P = 0.0008) (Additional file 4: Figure S4E). The average ERS of high-MKS tumors that relapsed at 0.0 to 2.5, 2.5 to 5.0 and >5.0 to 10.0 years was progressively and significantly higher in late relapses (Additional file 4: Figure S5B).

Results of a Cox multivariate analysis for the 0.0- to 2.5- and 5.0- to 10.0-year time periods are presented in Table 3. After adjustment for clinical variables, high-MKS/high-ERS (HR 4.59, 95% CI 1.56, 13.5; P = 0.006) and low-MKS/low-ERS groups (HR 3.53, 95% CI 1.21, 10.2; P = 0.021) remained at higher risk of late relapses than the low-MKS/high-ERS group.

Table 3 Multivariate Cox analysis of clinical variables and marker groups in tamoxifen-treated (cohort 1) and untreated patients

Prediction of early and late relapse in untreated patients

ERS was not predictive of early versus late relapse when proliferation (MKS) was low (Additional file 3: Table S2). In the high-MKS group, there was a constantly higher risk of relapse for the low-ERS group over 10 years (HR 1.80, 95% CI 1.18, 2.73); P = 0.006) (Figure 2C and Additional file 3: Table S2). In the first 2.5 years, the high-MKS/low-ERS group had a higher risk of relapse than the other groups combined (Additional file 4: Figure S4A) and the HER2-positive patients were also at significantly higher risk of relapse. Cox multivariate analysis showed that none of the variables was prognostic for late relapse (Table 3). When the average ERS was compared between tumors relapsing at 0.0- to 2.5-, 2.5- to 5.0- and 5.0- to 10-year time intervals we did not observe any significant difference by proliferation group (Additional file 4: Figure S5C and S5D).

A combination of proliferation and ER-related genes can further risk-stratify patients for early and late relapse within conventional prognostic risk categories assigned by PAM50 and Mammaprint

To test the robustness of our observations, we performed the same analysis substituting the genomic grade index for MKS as a proliferation marker and the luminal gene score from the PAM50 classifier for the ERS as a measure of estrogen-related genes. Results using the combination of these markers also showed higher risk of early relapse in highly proliferative, low estrogen-score cancers and substantial risk for late relapse in highly proliferative and high estrogen-score tumors (Additional file 4: Figure S6). We also noted that ESR1 mRNA expression levels did not correlate strongly with either the ERS or luminal gene scores, an indication that these scores capture molecular information not provided by ESR1 expression levels alone (Additional file 4: Figure S7).

Furthermore, the four biomarker groups based on MKS and ERS could further risk-stratify patients for early (<5 years) and late relapses (within 5 to 10 years) within prognostic risk groups assigned by three first-generation genomic prognostic predictors, including the three-gene model (Figure 3A and 3B), luminal A and B molecular subtypes by PAM50 (Figure 3C and 3D) and low- and high-risk groups by Mammaprint (Figure 3E and 3F).

Figure 3
figure 3

Time-dependent assessment of combinations of different markers of proliferation and estrogen receptor (ER)-related genes. Landmark analyses of the Kaplan-Meier estimates of distant event-free survival (DEFS) during the first 5 years (left side of each panel) and from 5 to 10 years (right side of each panel) are shown for tamoxifen-treated patients according to four biomarker groups by combining mitosis kinase score (MKS) (low and high) and estrogen-related score (ERS) (high and low) in ER-positive/human epidermal growth factor (HER)2-negative low proliferation tumors (A) and high-proliferation tumors (B) by the three-gene model [21]; in Luminal A (C) and luminal B (D) by PAM50; and in low risk (E) and high risk (F) by Mammaprint. loP, low MKS; hiP, high MKS; loER, low ERS; hiER, high ERS.

Confirmation of results in an independent cohort of patients treated with adjuvant tamoxifen

The patient characteristics of an independent series of ER-positive patients treated with adjuvant tamoxifen are described in Table 1. The median follow up was 7.2 years. The median cutoff point was applied to MKS and ERS. The time-varying pattern of distant relapse was qualitatively similar to that observed in the previously described cohort (Figure 4A-C). In the high-proliferation group the risk of relapse for the high-ERS group was lower in the first 5 years (P = 0.057) and higher after 5 years (P = 0.072) (Figure 4D). We also confirmed that in the first 2.5 years the majority of the relapses occurred in the high-MKS/low-ERS groups in node-negative but not in node-positive patients (Additional file 4: Figure S8).

Figure 4
figure 4

Prognostic values of the four biomarker groups in the second cohort of tamoxifen-treated patients. Kaplan-Meier estimates of 10-year distant event-free survival (DEFS) of the four biomarker groups obtained by combining low and high mitosis kinase score (MKS) and estrogen-related score (ERS) in all patients (A) and node-negative patients (B). Annual hazard rates for distant recurrences by biomarker-group estimates using kernel-based methods for all patients (C). Landmark analyses of the Kaplan-Meier estimates of DEFS during the first 5 years (left side of the panel) and from 5 to 10 years (right side of the panel) is shown for the high-MKS groups by ERS (high and low) (D). loP, low MKS; hiP, high MKS; loER, low ERS; hiER, high ERS).

Predictive values of proliferation and ER-associated genes in tumors treated with neoadjuvant letrozole

We evaluated the relationship between MKS and ERS and clinical and molecular response to neoadjuvant letrozole (n = 58). Molecular response was defined as downregulation of the MKS, as a measure of proliferation, at 14 and 90 days during neoadjuvant therapy. Among cancers with high proliferation (that is, high MKS) at baseline (n = 26), the clinical response rates were 47% (7/15) and 100% (11/11) for the low-ERS and high-ERS groups, respectively (P = 0.013). High-ERS cancers also had a more significant downregulation of proliferation at 14 (P = 0.028) and at 90 days (P = 0.045) compared to low-ERS tumors in response to letrozole therapy (Figure 5). These findings confirm lesser endocrine sensitivity of low-ERS cancers and are consistent with the observation of high early relapse rates among the high MKS/low ERS cancers.

Figure 5
figure 5

Predictive value of the estrogen-related score (ERS) in the high-proliferation group after neoadjuvant letrozole. The dynamic change of proliferation measured by mitosis kinase score (MKS) at baseline, 14 and 90 days in patients with baseline high proliferation treated with neoadjuvant letrozole in the high-ERS (A) and low-ERS (B) groups (red line = clinical response; blue line = non responder; green line = clinical information not available). (C) Box plot describing the proliferation at 14 and 90 days in the high-ERS and low-ERS groups. A comparison between MKS values in different groups was performed by Wilcoxon rank-sum test.

Discussion

The potential time-dependency of prognostic and predictive variables is well-known [11, 24, 25], but it is rarely studied purposefully. In this paper we examined the time-dependent interaction of proliferation [17] and ER-related gene expression [18, 26] with the risk of early versus late recurrence in ER-positive early-stage breast cancer. We show that the high risk of relapse of highly proliferative tumors decreases over time in both untreated and tamoxifen-treated patients [11, 24]. We also discovered a strong time-dependent prognostic interaction between the ERS and proliferation in tamoxifen-treated patients. The low risk of early recurrence in high-ERS, high-proliferation cancers during the first 5 years of active therapy converts into a higher risk of recurrence after 5 years. Highly proliferative cancers with strong ER signaling (high-MKS/high-ERS) show the greatest absolute increase for risk of late relapse after completion of 5 years of endocrine therapy, and therefore may be the best candidates for extended endocrine treatment. Among the low-proliferation cancers, the low-ERS group had constantly higher risk of relapse over the 10-year period compared to low-proliferation, high-ERS cancers. Extended endocrine therapy in this group also deserves further study. The time-dependent interaction between proliferation and estrogen signaling in predicting recurrence may explain why biomarkers that do not take into account such information (PAM50, Oncotype DX) do not predict late recurrences well [4, 5].

The majority of the relapses within the first 2.5 years on tamoxifen occurred in the high-proliferation/low-ERS group. In a series of ER-positive patients treated with neoadjuvant letrozole, this group also had low clinical response rates and had minimal decrease in proliferation metrics [27]. These data suggest that the high-proliferation/low-ERS group is enriched in tumors intrinsically resistant to both tamoxifen and letrozole and in part explain why the use of upfront aromatase inhibitors (AIs) was not superior to the sequence of tamoxifen followed by an aromatase inhibitor in unselected patients [28, 29]. Clinical trials could be designed and investigate specific strategies to overcome intrinsic resistance in these molecular groups. Additionally, node-negative and node-positive tumors show a different pattern of relapse in this early period, indeed almost all the recurrences in the molecular groups different from the high-proliferation/low-ERS group occur in node-positive cancers (P = 0.0008). The observation is consistent with findings in the BIG1-98 trial that showed a benefit from upfront letrozole compared to tamoxifen only in node-positive tumors during the short median follow up of 2.5 years [30].

Highly proliferative tumors relapsing after adjuvant tamoxifen had a progressively higher ERS score in late recurrences compared to early recurrences, but the same association was not observed in untreated tumors. One can speculate that early relapses are most common in tumors intrinsically resistant to endocrine treatment, while late recurrences occur also in tumors that could have relapsed earlier without treatment, but relapse was delayed due to adjuvant therapy. Our observation could also explain why in different reports the risk-reduction is modest around 20% for upfront AIs compared to tamoxifen [31], while it increases to 30% when tamoxifen is switched to an AI after 2 to 3 years [31], and the risk reduction is up to 50% when the AI is administered after 5 years of tamoxifen [1].

Our results are consistent with other reports and show that tumor size and nodal status remain independent prognostic factors. Clinical and molecular information, particularly nodal status should be used in combination for decision-making about extended endocrine treatments [32].

We acknowledge that the different characteristics of patients in the tamoxifen-treated and untreated cohorts call for caution in any comparison between the two groups. Also, our analysis is not based on data from randomized clinical trials but from prospectively assembled cohorts of cases for prognostic marker studies. The cohorts have been annotated and published in peer-reviewed journals [21], and the validity of clinical information has been demonstrated in several different studies [6, 8, 33]. The generalizability of our results was assessed in an independent cohort of tamoxifen-treated patients, and also supported by recently presented data by other groups [34, 35].

Conclusions

Our findings suggest that patients with highly proliferative (high MKS or high GGI) and high-ER breast cancers (high expression of ER-related genes both adopted from Oncotype DX or from PAM50) have the highest absolute risk for late relapse tumors, despite the fact that their risk of recurrence is modest in the first 5 years on tamoxifen. These tumors could be the best candidates for extended endocrine therapy. Patients with low proliferation and low-ER-related genes also remain at risk for late relapses, and benefit from extended endocrine treatment with aromatase inhibitors should be also assessed.

Early relapses (in the first 2.5 years) are highest in highly proliferative/low-ERS cancers, in particular in node-negative tumors, and this group also had minimal response to neoadjuvant letrozole. These findings suggest that this molecular group is enriched in tumors intrinsically resistant to endocrine therapy (both tamoxifen and letrozole) and in part can explain why upfront AIs were not superior to the sequence of tamoxifen followed by an AI in unselected patients.

Overall, the combined use of markers of proliferation and ER-related genes can inform the design of future adjuvant trials and assist in the selection of patients for extended endocrine treatment.