Introduction

Breast cancer (BC) is the most frequent cancer and the fourth cause of death by cancer in women in developed countries [1]. Rates of survival with breast cancer have increased significantly all over the world in the past decades [2]. In Spain, it has been estimated that 32,825 new BC cases were diagnosed in 2018, with an incidence rate (Age-standardised [World]) of 75.4 cases per 100,000 women-years [3]. Main factors associated with prognosis in breast cancer are stage (enclosing tumour size and local infiltration, lymph nodes and metastasis), tumour grade of differentiation, histological type and presence/absence of hormone and Her2/neu receptors [2].

Although paclitaxel has been available since 1993, breast cancer treatment has considerably improved from 1995 onwards as conservative surgery has become widely performed. Systemic therapy has switched from CMF (cyclophosphamide, methotrexate and 5-fluorouracil) to regimens containing anthracycline [4], routine searches for oestrogen and progesterone receptors has allowed for the introduction of tamoxifen, as well as identification of tumours positive to Her2/neu has enabled targeted therapy with trastuzumab [5], which was authorised in the European Union on August 28, 2000. In this regard, identification of intrinsic subtypes (i.e. luminal A, luminal B, Her2-positive and basal-like) has allowed for agreement in general recommendations for first-line therapy in early-stage breast cancer [6]. Survival has thus enhanced and 5-year survival is now around 85% [7,8,9]. In spite of the improvement in systemic therapy, tumour stage remains a key prognosis factor in breast cancer patients [10], which reinforces the relevance of early diagnosis. The American Joint Committee for Cancer (AJCC) has recently suggested a pathological prognostic score combining tumour stage, grade and hormonal/Her2 receptors [11], which has been proved effective as survival predictor [12, 13].

Studies carried out in population-based cancer registries covering 17% of the Spanish population reported that women with BC diagnosed in Spain had 86.0% 5-year age-adjusted relative survival if diagnosed between 1995 and 1999 [7, 14] with no improvements for women diagnosed from 2000 to 2007 [7]. That study, however, included women diagnosed well before the extension of anthracycline-based chemotherapy and the introduction of endocrine therapy with tamoxifen and anti-Her2 treatment with trastuzumab. The main goal in this study is to analyse the relative survival with BC in women diagnosed in 2008–2013—after new treatments were generalised—and to ascertain the current effect that tumour characteristics such as grade, stage or subtype have on survival as well as the new AJCC-pathological prognostic score.

Methods

Setting and patients

The breast cancer MCC-Spain follow-up study is a prospective cohort study of incident breast cancer cases diagnosed between 2008 and 2013 in 18 hospitals located in 10 Spanish provinces (Asturias, Barcelona, Cantabria, Girona, Gipuzkoa, Huelva, León, Madrid, Navarra and Valencia). 1738 women between 20 and 85 years old with newly diagnosed primary breast cancer were recruited in the context of a case–control study. The identification of incident cases was carried out by active search through periodic visits to the relevant hospital departments (i.e. gynaecology, oncology, general surgery, radiotherapy and pathology departments). All the cases that were included had histological confirmation and included all the malignant BC (International Classification of Diseases 10th Revision (ICD-10) [15] code C50) and frequent breast cancers in situ (ICD-10: D05.1, D05.7), without a malignant history of BC, diagnosed between 2008 and 2013 in the selected hospitals. Patients were residing in the catchment areas of hospitals for at least 6 months before recruitment. A detailed description of methods and sample characteristics has been published elsewhere [16, 17] and then, 98% recruited patients have been followed until 2017/2018 (N = 1685). Only patients signing an informed consent were recruited; the informed consent also asked them for their permission for later consulting their medical records during the follow-up.

Information obtained at recruitment

When the patients were recruited, medical records were reviewed in order to gather information on age, date of diagnosis, tumour characteristics, including tumour size, lymph nodes, presence of metastases, grade of differentiation. Furthermore, oestrogen (ER), progesterone (PgR) and human epidermal growth factor receptor 2 (HER2) status were assessed and then classified as positive (+), negative (−) and unknown. Specifically, ER and PgR were considered positive if 1% or more of neoplastic cells showed nuclear immunohistochemical (IHC) staining. HER2 was considered positive (overexpressed) if graded 3 + on IHC performed, and all other grades (0 to 2 +) were considered negative unless fluorescence in situ hybridization of 2 + cases confirmed increased gene copy number. Molecular subtypes of breast carcinoma is classified into luminal A (ER + and / or PR + , HER2-, Ki-67 < 14%), luminal B (ER + and / or PR + , HER2-, or ER + and / or PR + , HER2-, Ki-67 ≥ 14%), HER2 + (ER-, PR-, HER2 +) and basal-like or triple negative is a tumour which does not meet any pathologic criteria for positivity of oestrogen receptor, progesterone receptor, or ERBB2 (ER-, PR- and HER2-) [6]. Tumour stage was based on the AJCC standards [18]. Tumour size was classified as T1: ≤ 2 cm, T2: > 2 cm and ≤ 5 cm, T3: > 5 cm, T4: any size with direct extension to chest wall and/or skin; lymph node involvement was classified as N0: no pathologically proven positive lymph nodes, N1: 1–3 positive nodes, N2: 4–9 positive nodes, N3: ≥ 10 positive nodes. Tumour grade was categorised as I: well differentiated, II: moderately differentiated, III: poorly differentiated, according to the modified Bloom and Richardson grading system [19]; when Bloom and Richardson grade was unavailable, other similar classifications were used as it has been proved they are highly reliable [20]. First-line therapy was also recorded, including type of surgery (conservative or mastectomy), margins (free or invaded), chemotherapy regime, radiotherapy, endocrine therapy, Her2-targeted therapy; for all systemic therapies, whether they were adjuvant or neoadjuvant was also recorded.

Follow-up

In 2017 and 2018, medical records were reviewed to obtain the last date the patient was attended to in t the health system and to ascertain current patient’s vital status. For patients that had not have been at hospital in the 3 months before their medical record review, the National Death Index (Índice Nacional de Defunciones) was consulted [21]. This index is a database recording date of death for all people who have died in Spain; it is hosted on the Ministry of Health web page and can only be accessed with specific authorisation for research purposes. Last date of search in the National Death Index was in December 2018.

Statistical analysis

Deaths by any cause were considered events and patients alive in their last contact with hospital and not included in the National Death Index were considered censored. Time of follow-up was considered as the time from diagnosis to death or to the last contact in hospital in alive patients. Relative survival, defined as the ratio of the observed survival in our study population and the expected survival in general population Spanish women with the same age as cases in the year of diagnosis, was estimated using Ederer II method [22]; for this purpose, age-specific mortality rates from 2007 to 2018 obtained from the Spanish National Institute for Statistics (INE) were used as reference. 3-, 5- and 8-year survival Kaplan–Meier estimates were obtained. Weibull regression adjusting by age, hospital, grade and stage was used to investigate prognosis factors; its results are displayed as hazard ratios with their 95% confidence intervals. Groups that included less than 25 people were excluded from this analysis. Stata 14/SE package was used in the analyses.

Results

Characteristics of patients

We followed 1685 women out of 1738 breast cancer patients (98%), who had primary breast cancer between 2007 and 2013. Patient-years of follow-up were 10,931 person-years. The maximum period for breast cancer follow-up was nine and a half years (from 13th July 2007 to 22nd March 2018) and 206 patients died in the follow-up; 5-year survival was 90.7%.

The mean age at diagnosis was 56.5 ± 12.6-year-old and 65% were postmenopausal. Other main characteristics of the patients are shown in supplementary Table 1.

Table 1 Relative survival probability 3, 5 and 8 years after breast cancer diagnosis according to tumour characteristics

Tumour characteristics

Regarding the characteristics of the tumours, the most usual histological type was ductal (75.7%), followed by lobular (6.5%). About half of cancers were T1 (52%) and lymph node negatives (52%). Only 42 women (2.5%) had metastasis at the time of diagnosis.

Concerning intrinsic subtypes, 997 (59.2%) could be classified as luminal A-like, 331 (19.6%) as luminal B-like, 81 (4.8%) as Her2 (non-luminal)-like and 130 (7.7%) as basal-like. Grade of differentiation could not be obtained from medical records in 481 patients (28.5%). Almost 31% of the cancers were moderately differentiated while poorly differentiated accounted for about 21% cancers. Hormone receptor status was available for most cases; 83% were oestrogen receptor positive, 73% progesterone positive and 17.4% were Her2 positive (Table 1).

Relative survival

5-year relative survival with breast cancer was 93% (95% CI 92 – 94) (Fig. 1a). Table 1 shows 3-, 5- and 8-year relative survival according to tumour characteristics. Women diagnosed in stage I had the same survival probability than women with the same age without breast cancer (i.e. 100% relative survival) even after 8 years of follow-up. At the same time, relative survival decreased with follow-up time in women diagnosed in more advanced stages: 3-, 5- and 8-year relative survival were 98%, 95% and 93% for breast cancer diagnosed in stage II, 94%, 88% and 81% in women diagnosed in staged III and 63%, 40% and 24% in those diagnosed in staged IV (Table 1 and Fig. 1b). Relative survival also decreased as grading got less differentiated [8-year relative survival: 99% in well differentiated tumours, 93% in moderately differentiated and 88% in poorly differentiated (Table 1 and Fig. 1c)]. Relative survival after 8 years of follow-up was 94% for luminal A-like breast cancers, 88% for luminal B-like, 82% for Her2 non-luminal) and 74% for basal-like cancers (Table 1 and Fig. 1d).

Fig. 1
figure 1

Relative survival in Spanish women with breast cancer: a Overall survival, b survival according to TNM staging, c survival according to grading, d survival according to intrinsic subtype

Prognostic factors on overall mortality

Age, hospital, stage and grade-adjusted hazard ratios on association between survival and tumour characteristics and first-line treatment are displayed in Table 2. Age, premenopausal status, tumour size, nodal infiltration and presence of metastasis were significantly associated with overall mortality. Hazard ratios increased with tumour stage, being 1 (reference) for T1 and 1.62 (1.04 – 2.52) and 2.04 (1.13 – 3.68) for T2, T3, respectively. Breast cancers negative for oestrogen or progesterone receptors behaved worse than their opposite. Luminal A- like and luminal B-like tumours had similar prognosis; hazard for Her2 (non-luminal) cancers were 50% higher than for luminal A-like (hazard ratio = 1.50; 95% CI 0.87 – 2.59), and basal-like tumour’s hazard was three times that of luminal A-like (hazard ratio = 3.50; 95% CI 2.31 – 5.30). Grading showed a dose–response association with mortality, with hazard ratios 1 (reference) for well differentiated cancers, 1.48 (0.86 – 2.54) for moderately differentiated and 2.42 (1.41 – 4.17) for poorly differentiated. To further explore whether intrinsic subtype adds prognosis value to tumour stage, we studied the effect of stage by stratifying for intrinsic subtype; results in Fig. 2a show that higher stages had higher hazard ratios, whatever the intrinsic subtype. However, when stratifying the effect of intrinsic subtype for tumour stage, we found that intrinsic subtype had no effect at all in patients in stage I, while Her2 (non-luminal) and basal-like tumours had worse prognosis than luminal-like tumours in patients with stage II and, especially, stages III and IV (Fig. 2b).

Table 2 Characteristics of the tumour and their association with survival
Fig. 2
figure 2

Hazard ratios for overall survival showing the interaction between TNM stage and intrinsic subtype. a Hazard ratios for TNM stage stratified by intrinsic subtype, b hazard ratios for intrinsic subtype stratified by TNM stage

Pathologic prognostic score in women without neoadjuvant therapy

Figure 3 displays the association between the AJCC- Pathologic Prognostic Score and overall survival in women without neoadjuvant therapy. Hazard ratios stepped up as the score rose, being 2.31, 4.00, 4.94, 7.92, 2.26, 14.9 and 58.9 for scores IB, IIA, IIB, IIIA, IIIB, IIIC and IV, respectively. These results are somewhat limited because of their wide confidence intervals. This information appears stratified by menopausal status in supplementary material, where hazard ratios increased faster in premenopausal women (Supplementary Table 2).

Fig. 3
figure 3

Kaplan–Meier survival estimates according to the AJCC pathological prognostic score

First-line treatment for patients

Table 3 shows the characteristics of the treatment. Conservative surgery was performed in 1231 (73.1%) patients and mastectomy in the remaining 454 (26.9%). Surgical margins were negative in 66% patients. Radiotherapy was used in 1158 women (68.7%), while adjuvant or neoadjuvant chemotherapy was administered to 50.13% patients. Adjuvant or neoadjuvant hormone therapy was used in 1023 women (60.7%), immunotherapy in 11.1% and Her2-targeted therapy in 152 patients (9.0%).

Table 3 First-line treatment and relative survival probability 3, 5 and 8 years after diagnosis

Hazard ratios on association between survival and first-line treatment are displayed in Table 4. Type of surgery (mastectomy/conservative surgery) and surgical margins were not associated with survival. Among systemic treatments, adjuvant therapy (whether chemo-, radio- or hormone-therapy) was associated with lower mortality and both neoadjuvant and palliative treatment were associated with worse prognosis (Table 4).

Table 4 First-line treatment and their association with survival

Discussion

In spite of recent improvements in breast cancer systemic therapy, our results confirm that tumour stage and grading are major prognosis factors regarding overall survival with breast cancer. Among components of TNM staging system, tumour size greater than 50 mm (i.e. T3 or T4) more than doubled the risk of dying, while N3 nodal involvement and presence of metastasis had a huge effect on mortality. The pathological prognostic score suggested by the AJCC in 2017 combining anatomic characteristics (i.e. TNM staging) with biological characteristics (i.e. grading and hormonal/Her2 receptors) strongly correlated with survival.

TNM staging has largely been the main factor regarding survival with breast cancer, which led the AJCC to suggest its Anatomic Stage Groups [11]. Our results on TNM components are consistent with recent studies still proving that TNM classification is a main contributor to survival [10, 23,24,25]. For instance, a large study with data from Netherlands Cancer Registry reported hazard ratios over 2.5 for T3 or T4 tumours and 4.0 for N3 tumours in women diagnosed between 2006 and 2012. These hazard ratios were similar to those obtained in the same registry for women diagnosed between 1999 and 2005 [10], which reinforces TNM staging relevance for survival has not changed with recent treatment improvements, thus strengthening the importance of early diagnosis.

Recent developments, however, have focused on tumour biology, making the identification of breast cancer subtypes via differentiation grade or presence of hormonal and Her2/neu receptors possible, heading to specific therapies according to tumour histological and molecular characteristics [6, 26,27,28]. Our results also confirmed higher survival for women with breast cancer positive to either oestrogen or progesterone receptors and negative to HER2 receptors, as well as worsening prognostic as grading increased from well to poorly differentiated tumour. In this regard, the new AJCC Pathological Prognostic Score [11] has been suggested by combining both anatomic staging and biological markers, thus refining the previous AJCC Anatomic Stage Groups [12], which is now intended only for regions were biological markers are not routinely available [11]. In our cohort, the AJCC Pathological Prognostic Score had large prognostic value with hazard ratios swiftly rising as the Pathological Prognostic Score increased. In addition, this score presents a higher predictive ability in premenopausal women. This can be explained by (i) the rate at which the cancer develops—tumours could be faster and receive poorer prognosis in premenopausal women—and (ii) the fact that a younger woman, i.e. premenopausal, with breast cancer is more likely to die because of the tumour itself, as opposed to postmenopausal women, who have more chances to die due to other causes. Of note, the analysis of Pathological Prognostic Score should have been restricted to women without neoadjuvant therapy as this score is based on the pathological TNM staging, which is obtained after surgery.

Adjuvant chemotherapy, radiotherapy or endocrine therapy was associated with better prognosis in our cohort, which agrees with previous reports [4, 29,30,31]. Neoadjuvant therapy, however, was associated with lower survival. Results on systemic therapy—survival relationship cannot be straight forwardly interpreted as, even after adjusting for stage and grade, there is still room for confounding by indication as women with more severe cancer are more likely treated with more aggressive therapies. For instance, neoadjuvant therapy is not usually recommended for low proliferating cancers and hormone receptor-positive, HER2-negative, grading I–II cancers [26]; therefore, its high hazard ratios could be due to residual confounding as it is frequently indicated in cancers with worse prognostic.

Our study has some limitations. Firstly, the sample size—although enough for general results—is small for studying specific subgroups or interactions. Secondly, basal information was obtained from medical records, which could have partial information on some variables, leading to some amount of missing data. On this subject, it is noteworthy that missing categories did not follow a clear pattern in our results as missing data in TNM components behaved almost as the reference category (i.e. good prognostic), while missing data in grading performed as poorly differentiated tumours. Therefore, it is not possible for us to attribute any data omission in medical records to its irrelevance. Thirdly, our sample could be biased towards cancers with better prognostic as women with advanced disease could have refused to participate, thus underrepresenting women in stage IV; had this occurred, it could have overestimated relative survival; however, this problem should have not biased hazard ratios estimates. Finally, the multicentric design of our study is both a limitation and a strength. A limitation as routine clinical practice could vary from hospital to hospital, eventually increasing random variation; we dealt with this problem by adjusting all our analyses for hospital. Nevertheless, it is also a strength as it obtained a fair representation of usual clinical practice in Spain about 2008–2013. An additional strength is the active prospective follow-up, carried out via three complementary sources: medical records, phone calls to the patient and National Death Index consultation.

Conclusions

In conclusion, both TNM staging and histological/molecular biomarkers are associated with overall survival in Spanish women with breast cancer; when both are combined in the AJCC Pathological Prognostic Score, its prognostic value vastly improved in women without neoadjuvant therapy.