Introduction

The classification of breast cancer tumours by gene expression analysis into intrinsic subtypes (Luminal A, Luminal B, human epidermal growth factor receptor 2-enriched [HER2-E], and Basal-like), is well-established1,2,3. Genomic testing is recommended as a complement to conventional risk assessment in postmenopausal patients with equivocal risk of recurrence4,5. PAM50 intrinsic subtyping and risk of recurrence (ROR) score, initially developed by Parker et al., are included in the Prosigna© Breast Cancer Prognostic Gene Signature Assay3,6. This is approved and validated for postmenopausal women with oestrogen receptor-positive/HER2-negative (ER+/HER2−) tumours allocated to 5 years of endocrine therapy, and ROR score provides prognostic information7,8,9,10,11.

The prognostic value of PAM50 subtypes and ROR score in premenopausal women remains unclear. Previous studies have indicated that PAM50 and ROR score are applicable also for premenopausal women12,13,14,15. Moreover, some of these studies also demonstrated a possible predictive effect of PAM50 subtypes for tamoxifen and chemotherapy benefit12,13. Surrogate classification of breast cancer tumours into Luminal ASurrogate Classification, (SC) and Luminal BSC, using immunohistochemistry (IHC)/in situ hybridisation (ISH), was used in clinical settings before multigene assays were broadly implemented for prognostication and guiding decisions about adjuvant treatments. However, surrogate classifications have shown poor concordance to the corresponding intrinsic subtypes16,17,18,19, resulting in suboptimal risk estimation for patients with ER+/HER2− tumours.

In the SBII:2pre trial, premenopausal women were randomised between 2 years of adjuvant tamoxifen or no adjuvant systemic therapy (control), irrespective of hormone-receptor status and thus one third of the patients had ER-negative tumours. Availability of long-term follow-up data (>30 years) and preserved archival tumour tissues from the participants allowed us to assess the long-term effects across gene expression profiles and subtypes in this cohort.

The primary aim of this study was to investigate the prognostic value of PAM50 intrinsic subtypes and ROR score in premenopausal patients with ER+/HER2− tumours. The secondary aims were to compare luminal PAM50 and St. Gallen 2013 surrogate subtypes and to evaluate if luminal PAM50 subtypes can be used to predict tamoxifen benefit in premenopausal patients.

Results

Sample availability and cohort characteristics

Gene expression analyses were successfully performed for 220 and 217 tumours (and the corresponding number of patients) in the control and tamoxifen arms, respectively (Fig. 1). The median follow-up time for those with no breast cancer events regarding breast cancer-free interval (BCFi) and overall survival (OS) were 28 and 33 years, respectively. Patient and tumour characteristics for the entire cohort and for the ER+/HER2− cohort are presented in Table 1.

Fig. 1: Flow chart of the included patients.
figure 1

aAvailable ROR score categories in n = 219 and n = 216 patients in the control (no systemic treatment) and tamoxifen arm, respectively, due to n = 1 missing nodal status for one patient in each treatment arm. bDefined accordingly: LumASC, low Ki67 (<20%) and high PR (≥20%); LumBSC, high Ki67 (≥20%) and/or low PR (<20%). Cases with missing re-evaluated PR data were substituted (n = 2 in the control arm) with previously available IHC data for PR. ER oestrogen receptor, HER2 human epidermal growth factor receptor 2, IHC immunohistochemistry, Lum Luminal, PR progesterone receptor, ROR risk of recurrence, SC surrogate classification, TAM tamoxifen.

Table 1 Patient and tumour characteristics for the whole study cohort (n = 560) by study arm and for the ER-positive/HER2-negative subgroup.

The proportions of Luminal A, Luminal B, HER2-E, and Basal-like intrinsic subtypes by PAM50 (n = 437) were 44%, 19%, 17%, and 20%, respectively (Supplementary Fig. 1). The median ROR score was 56 and the proportions among patients with available nodal status classified into the low, intermediate, and high ROR categories (n = 435) were 10%, 24%, and 66%, respectively.

Prognostic value of PAM50 subtypes

Cumulative incidence curves for BCFi and OS by PAM50 subtypes are presented in Fig. 2a, b for all patients and in Fig. 2c, d for patients with ER+/HER2− tumours. After the maximum follow-up period, patients with ER+/HER2− and Luminal BPAM50 tumours had a higher cumulative incidence of breast cancer events than patients whose tumours were categorised as Luminal APAM50 (hazard ratio [HR]BCFi: 1.56, 95% confidence interval [CI] 1.09–2.22, P = 0.01). The results were similar for the period 0–10 years (HRBCFi: 1.93, 95% CI: 1.27–2.93, P = 0.002; Table 2). The results also indicated increased overall mortality for patients whose tumours were Luminal BPAM50 as compared to Luminal APAM50 (maximum follow-up: HROS: 1.49, 95% CI: 1.05–2.12, P = 0.03; 0–10 years: HROS: 2.55, 95% CI: 1.56–4.17, P < 0.001). The results were similar after adjusting for other clinicopathological variables (Table 2) and for all included patients irrespective of hormone-receptor status (Supplementary Table 1). Additionally, cumulative incidence curves for recurrence-free interval (RFi) with essentially the same results are depicted in Supplementary Fig. 2a–d.

Fig. 2: Cumulative incidence curves for BCFi and OS by PAM50 subtypes.
figure 2

(a, b) All included patients, (c, d) patients with ER-positive/HER2-negative tumours, and (e, f) patients with available intrinsic PAM50 and surrogate subtyping by St. Gallen 2013. Overall P-values from log rank test, Gehan’s version for BCFi, for maximum follow-up and for different time intervals. BCFi breast cancer-free interval, HER2-E human epidermal growth factor receptor 2-enriched, Lum Luminal, OS overall survival, SC surrogate classification.

Table 2 Cox regression uni- and multivariable models for BCFi and OS by PAM50 subtypes and St. Gallen 2013 surrogate subtypes for different time intervals in patients with ER-positive/HER2-negative tumours.

Agreement and prognostic effect of luminal PAM50 and St. Gallen 2013 surrogate subtypes

In the agreement analyses (ER + /HER2 − cohort, n = 207), 67% and 33% were assessed as Luminal APAM50 and Luminal BPAM50, respectively. The corresponding figures for St. Gallen 2013 surrogate subtypes were 26% and 74%, respectively. In total, 58% (90/154) of patients classified as Luminal BSC were classified as Luminal APAM50 (Table 3).

Table 3 Distribution of luminal subtypes according to PAM50 and St. Gallen 2013 surrogate subtyping (n = 207) and corresponding agreement analyses (percentage and kappa [κ] statistic).

The difference in outcomes between the four different combinations of the St. Gallen 2013 surrogate and PAM50 subtypes (Luminal BSC/Luminal BPAM50 [n = 64], Luminal BSC/Luminal APAM50 [n = 90], Luminal ASC/Luminal BPAM50 [n = 5], and Luminal ASC/Luminal APAM50 [n = 48]) is illustrated in Fig. 2e, f and Supplementary Fig. 2e, f. After 10 years of follow-up, patients with tumours classified as Luminal BSC/Luminal APAM50, had better prognosis than those with tumours classified as Luminal BSC/Luminal BPAM50 (HRBCFi: 0.52, 95% CI: 0.33–0.83, P = 0.006; HROS: 0.37, 95% CI: 0.21–0.66, P = 0.001; Table 2). A proportion (91/207) of the cohort was at risk for breast cancer events at 20 years of follow-up, hence the prognostic effects became weaker with long-term (>30 years) follow-up and the results were similar in multivariable analyses (Table 2).

Predictive value of luminal PAM50 subtyping for tamoxifen benefit

After 10 years of follow-up, a beneficial effect of adjuvant tamoxifen was observed in patients with ER+/HER2− and Luminal APAM50 tumours (HRBCFi: 0.41, 95% CI: 0.23–0.74, P = 0.003; Fig. 3a); however, not for patients with Luminal BPAM50 tumours (HRBCFi: 1.19, 95% CI: 0.63–2.27, P = 0.59; Fig. 3c). Hence, the effect of tamoxifen was threefold better in patients with Luminal APAM50 tumours as compared with those with Luminal BPAM50 tumours (interaction: HRBCFi: 0.34, 95% CI: 0.14–0.83, P = 0.02). Similar results were observed for OS (Table 4 and Fig. 3b, d) and in the multivariable analyses (Supplementary Table 2), but was not as evident after maximum follow-up. Corresponding cumulative incidence curves for RFi are presented in Supplementary Fig. 3. When selecting all patients with luminal PAM50 tumours, regardless of ER/HER2 status (n = 274), the findings after 10 years of follow-up were essentially the same (interaction: HRBCFi: 0.45, 95% CI: 0.21–0.96, P = 0.04)).

Fig. 3: Cumulative incidence curves for BCFi and OS according to treatment arm.
figure 3

(a, b) Patients with LumA and (c, d) LumB tumours by PAM50. P-values from log rank test, Gehan’s version for BCFi, for maximum follow-up and for different time intervals. BCFi breast cancer-free interval, Lum Luminal, OS overall survival, TAM tamoxifen.

Table 4 Cox regression models for BCFi and OS by luminal PAM50 subtypes, tamoxifen treatment, and PAM50 subtype by treatment interaction in patients with ER-positive/HER2-negative tumours.

Prognostic value of ROR score (ER+/HER2− subgroup)

Among all patients (ER+/HER2− subgroup, n = 236), the distributions of low, intermediate, and high ROR score categories were: 13%, 27%, and 60%, respectively. The outcomes are illustrated in Fig. 4 and in Supplementary Fig. 4. For all patients with ER+/HER2− tumours, high vs low ROR score was associated with worse outcomes after 10 years of follow-up (HRBCFi: 2.36, 95% CI: 1.18–4.72, P = 0.02; Table 5). This effect was less pronounced after maximum long-term follow-up (HRBCFi: 1.70, 95% CI: 1.01–2.85, P = 0.04). The corresponding results for OS and the multivariable analyses are presented in Table 5 and in Supplementary Table 3, respectively.

Fig. 4: Cumulative incidence curves for BCFi and OS by ROR score categories for patients with ER-positive/HER2-negative tumors.
figure 4

(a, b) all patients (c, d) node-negative, and (e, f) node-positive patients. Overall P-values from log rank test, Gehan’s version for BCFi, for maximum follow-up and for different time intervals. BCFi breast cancer-free interval, ER oestrogen receptor, HER2 human epidermal growth factor receptor 2, OS overall survival, ROR risk of recurrence.

Table 5 Cox regression univariable models for BCFi and OS by ROR score categories for different time intervals for patients with ER-positive/HER2-negative tumours in all patients, node-negative, and node-positive (1–3 positive nodes) subgroups of patients.

Stratified by nodal status, the distributions of ROR score categories were: node-negative (N0) (n = 60): 48%, 22% and 30%, N+ (1–3 positive nodes, n = 123): 2%, 42% and 57%, respectively. High vs low ROR score yielded 2.5- and 1.7- fold increased incidence of breast cancer events in N0 patients after 10 years of follow-up and maximum follow-up, respectively (Table 5). Due to small sample size (n = 2), the low ROR category was omitted in the analysis of N+ (1–3 positive nodes) patients and the results indicated that intermediate ROR score was associated with lower incidence of breast cancer events than high ROR score (Table 5).

Discussion

This study demonstrates that PAM50 subtypes and ROR score could provide long-term prognostic information in premenopausal patients, and indicates a possible tamoxifen-predictive effect by luminal intrinsic subtyping after 10 years of follow-up. The Prosigna© Breast Cancer Prognostic Gene Signature Assay, which can use formalin-fixed paraffin-embedded tissues in a decentralised mode20, is validated in postmenopausal women7,8. Our results demonstrated that premenopausal patients with Luminal BPAM50 as compared with Luminal APAM50 tumours, had a > 1.4-fold higher incidence of breast cancer events and death after long-term follow-up.

Our results confirmed the suboptimal agreement between luminal intrinsic and surrogate subtyping16,17,18,19, and we demonstrated that over 50% of Luminal BSC tumours were re-classified as Luminal APAM50. These results are in agreement with the report by Viale et al. using BluePrint/MammaPrint17. Importantly, the re-classification in our study was translated into improved prognosis and this emphasises the possibility of overestimating the risk for breast cancer events in patients with surrogate Luminal B tumours, which could affect treatment decisions.

Two years of adjuvant tamoxifen was previously reported to be associated with a long-term survival benefit and reduction of breast cancer events for ER-positive patients by the SBII:2pre trial21,22. In this study, during the first 10 years of follow-up, a tamoxifen-treatment effect was seen in patients with Luminal APAM50 tumours, but not in those with Luminal BPAM50 tumours. However, the evidence of treatment interaction was weak in the subsequent follow-up time periods. This indicated that the luminal PAM50 subtype could be a predictive marker for tamoxifen benefit in addition to ER status. Our finding was observed regardless of ER and HER2 status by IHC/ISH, suggesting the PAM50 subtyping could potentially be clinically used upfront to stratify premenopausal patients to tamoxifen therapy. The luminal PAM50 subtypes have previously been reported to be associated with benefit from 5 years adjuvant tamoxifen treatment in premenopausal women (n = 398); however, no separate treatment effects in patients with Luminal A and B were reported12. Yu et al. presented a long-term beneficial effect of 5-year tamoxifen treatment in postmenopausal women with Luminal APAM50 tumours, and the effect attenuated over time in patients with Luminal BPAM50 tumours23.

This study demonstrated better long-term prognosis for patients with low vs high ROR score. However, the trial was underpowered to define the prognostic value of ROR score by nodal status. In general, our cohort had more aggressive tumour characteristics and the outcome for node-negative patients with low ROR score was worse compared with postmenopausal women in the validation studies7,8,20. It remains unclear if premenopausal patients with low ROR score are potential candidates for abstaining adjuvant chemotherapy. Data from the TAILORx trial indicated a beneficial effect of adjuvant chemotherapy in patients ≤50 years and a recurrence score of 16–2524 and the results from the RxPONDER demonstrated that node-positive premenopausal women with a recurrence score ≤25 did benefit from additional chemotherapy25. These results are emphasising that further studies of multigene assays including premenopausal patients are warranted.

A strength of this study is the long-term follow-up data and the fact that it is based on a trial including only premenopausal women randomised to tamoxifen monotherapy vs control (systemically untreated patients). Moreover, reassessments of the progesterone receptor and Ki67 were performed for the surrogate subtypes and a well-established gene expression method was used. A limitation of this study is that the quality of old preserved tissues may result in uncertainty, especially regarding Ki67 assessment26. However, the required RNA quantity is minimal, and quality check of the RNA assured that gene expression output data were reliable. Further limitations include that the duration of endocrine treatment was shorter than current recommendation and, the power of this study was low due to the limited number of included patients. Even though we demonstrated that PAM50 subtyping and ROR score could separate premenopausal into groups with different risks of recurrence and death, it remains unclear if these can be used for de-escalation of adjuvant chemotherapy27.

In conclusion, PAM50 subtypes and ROR score provided independent prognostic information after long-term follow-up. After 10 years of follow-up, the re-classification of Luminal BSC tumours into Luminal APAM50 was associated with a lower incidence of breast cancer events. Moreover, the tamoxifen effect was associated with the Luminal A intrinsic subtype, independent of ER status.

Methods

Study population

The patients in this study were included in the SBII:2pre study, which randomised 564 premenopausal women between 2 years of adjuvant tamoxifen or no systemic treatment. Inclusion and exclusion criteria have been described previously and demonstrated long-term beneficial effect of tamoxifen treatment21,22,28,29. Patients were classified as premenopausal until one year after menstrual periods had stopped according to the study protocol (Supplementary Reference 1). The cohort included in this study is illustrated in Fig. 1.

Compliance with ethical requirements

Oral informed consent was obtained from all participants included in the SBII:2pre trial, and approval was given by the ethical committees in Lund and Linköping, Sweden. The oral consent was verified by a signature of the investigator in the registration form which was sent to the coordinating centre. The follow-up study was approved by the ethical committee of Lund (Dnr LU 2015/350) for extended follow-up as well as for genomic analysis (Dnr LU 2017/97). Biobank approval was cleared for all involved pathology departments.

Study endpoints and follow-up data

The primary endpoints were BCFi including any of the following first events: local, regional, or distant recurrence; contralateral breast cancer (invasive or ductal cancer in situ); or breast cancer-related death (data cut-off Nov 30 2016)22, according to the DATECAN recommendation30. The secondary endpoint was OS and follow-up data were retrieved from the Swedish Causes of Death Register (data cut-off 10 December 2020). In sensitivity analysis, we additionally reported on RFi excluding contralateral breast cancer events30. Results for maximum follow-up and the two time intervals 0–10 years and >10 years were reported.

Tumour characteristics and assessments of progesterone receptor (PR) and Ki67 status

Archival formalin-fixed paraffin-embedded (FFPE) tissues (n = 520) from breast tumours of the study participants were collected. Reassessments of PR (n = 464) and Ki67 (n = 463) status using whole tissue sections were performed according to Swedish national guidelines by a breast pathologist (UK)31. Data on ER, Nottingham histological grade (NHG), and HER2 was available as described previously21,28,29,32,33. These assessments were performed retrospectively and independently by two national reference breast pathologists28,33.

Gene expression analyses

1–5 sections (10 µm thick) from FFPE tissue with invasive breast carcinoma, were used to extract RNA (AllPrep DNA/RNA FFPE kit (Qiagen Cat:80234, Hilden, Germany)). Gene expression analysis was performed according to the manufacturer’s instructions using the NanoString Breast Cancer 360TM assay on an nCounter® SPRINT Profiler instrument (NanoString Technologies)34.

Housekeeping gene geomean quality control (QC) categorised samples as PASS/BORDERLINE (≥202) or FAIL (<202) with 91% (437/479) of the samples passing QC. PAM50 genes were normalised to the PAM50 housekeeper gene geomean. The correlation between the observed scaled expression for the PAM50 genes and a centroid for each of the four subtypes was then determined. The subtype with the greatest correlation value defined the intrinsic subtype. In the ROR score, a weighted sum of the proliferation score, the four subtype correlations and tumour size were used to calculate a score between 0 and 100. The categorisation of ROR score was determined based on nodal status according to the following definitions: N0 low: 0–40, intermediate: 41–60, high: 61–100, N1 (1–3 positive nodes); low: 0–15, intermediate: 16–40, high: 41–100, N2 (≥4 positive nodes); high: 0–10020.

Surrogate subtyping

The ER+/HER2− breast cancer tumours were classified as Luminal ASC or Luminal BSC according to the St. Gallen 2013 guidelines: Luminal ASC; low Ki67 (<20%) and high PR (≥20%), Luminal BSC; high Ki67 (≥20%) and/or low PR (<20%)35.

Statistical analyses

Cumulative incidence curves were used to illustrate outcomes for patient subgroups. The estimates for BCFi take the competing event death without a preceding breast cancer event into account. In analyses of RFi, also contralateral breast cancer as first event was treated as a competing risk. Evidence against equality of two or more cumulative incidence curves was evaluated using the log rank test. We used a trend version of the test for ordered groups and a modified version, derived by Geskus36, for comparison of cause-specific cumulative incidence curves (BCFi and RFi). Cox regression models, stratified for region, were used to estimate HRs with 95% CIs Cause-specific Cox regression was used for the endpoint BCFi, censoring the follow-up at time of death for patients who died without a registered breast cancer event. Similarly, the follow-up time was censored at the time of death without a preceding breast cancer event or at the diagnosis of contralateral breast cancer as first breast cancer related event in analyses of RFi. Proportional hazards assumptions were in general not met in analyses of long-term follow-up. The corresponding HRs should therefore be interpreted cautiously as average effects over time. Our way of handling this problem was to also calculate the relative effects with the follow-up restricted to 10 years. The evidence against proportional hazards, as measured by Schoenfeld’s test, was in general much lower for the two intervals 0–10 years and 10+ years compared to the evidence in analyses of maximum follow-up.

Log rank tests of prognostic or predictive effects are presented in figures whereas tests based on Cox regression models are presented in tables. For Cox models, we present both effects relative to a chosen reference category, with 95% CIs and P-values, and an overall Wald test of each factor. Multivariable analyses of PAM50 subtype and ROR categories were adjusted for established prognostic factors, but since nodal status and tumour stage are included in the definition of ROR, these factors were excluded from the set of adjustment variables in analyses of the independent prognostic effect of ROR categories.

For prognostic differences between luminal PAM50 and surrogate subtyping, a variable including the four combinations of Luminal APAM50, Luminal BPAM50, Luminal ASC and Luminal BSC tumours, was created. Percentage agreement and kappa (κ) statistics were used in agreement analyses. To evaluate the differential effect of tamoxifen benefit in luminal PAM50 tumours, a Cox model was fitted including an interaction variable between luminal PAM50 subgroup and treatment arm. The prognostic effect of ROR score was studied in ER+/HER2− patients and also stratified by nodal status.

The results are presented in accordance with the Reporting Recommendations for Tumour Marker Prognostic Studies (REMARK) where applicable37,38. All statistical tests were two-sided, and a P-value <0.05 was considered statistically significant. No adjustment for multiple testing was performed. All calculations were performed using IBM SPSS, version 25.0 (IBM Corp., Armonk, NY, USA) and the cumulative incidence curves were drawn using STATA, version 17.0 (StataCorp LLC, College Station, TX, USA).

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.