Breast cancer, the prevailing cancer type for females worldwide, affects women across all ages and ethnicities, making it a necessity to continuously advance treatment and diagnosis regimes. In the past year, the global incidence of breast cancer counted 2.1 million newly diagnosed cases with mortality reaching 40,000 patients per year in the United States and 138,000 in Europe alone.1,2,3,4,5

Evaluating the response to chemotherapy with the goal to improve disease-free and overall survival has become an important milestone in monitoring the treatment of breast cancer patients and for clinical research.6,7,8 Neoadjuvant chemotherapy, a well-established method of treatment for a select group of patients, allows radiographic, clinical, and pathological monitoring of tumor response before to the definite operation.9,10,11,12,13,14,15 One of the main purposes of neoadjuvant treatment modalities is to observe directly its effects on tumor response in a relatively short timeframe, allowing the subsequent pathological response evaluation. In fact, pathological assessment of the surgical specimen is considered the most accurate method for chemotherapy evaluation.8,12,16 Previous studies have shown that pathological complete remission (pCR) strongly correlates with improved overall (OS) and recurrence-free survival (RFS).16,17,18,19,20,21,22,23 Although pCR, due to this correlation, has been considered the most important surrogate end point for clinical trials, the prognostic value depends on its definition, which varies depending on the used evaluation method.13,19,24,25 Current recommendations suggest the use of a histological classification for the assessment of the pathological response but do not favor a particular grading system.26 For international clinical trials, systems that include pretreatment cellularity, such as the Miller-Payne or the Pinder system, have been recommended.20,26 Additional response classifications have been established and used, such as Chevallier and Sataloff score, and more recently the residual cancer burden (RCB) score. As one of the developed scores for the evaluation of pCR, the RCB has demonstrated reliable reproducibility, a wide range of significant prognostic pathological elements, and prognostic power across various breast cancer subtypes and chemotherapy regimens.24,27,28,29,30,31,32 There are several advantages of the RCB score compared to other methods. The RCB score is based on the mathematical model and, as opposite to other methods segregating response into groups, RCB is an exact deliverable with a precisely calculated score. Its main advantage is the potential to identify a patient subgroup with an excellent prognosis despite (minimal) residual disease (RCB I).

So far, the RCB score has only been used to a limited extent by independent researchers outside of the MD Anderson Cancer Center.15,27,33,34,35,36 In the following study, we investigated the prognostic impact of the RCB score with special emphasis on RFS in a breast cancer collective treated at a single oncology unit and analyzed at a single pathology department.

Methods

Study Design

We performed a retrospective analysis of 184 patients with early or locally advanced breast cancer, who underwent neoadjuvant therapy at the Division of Oncology, Department of Internal Medicine, University Hospital, Medical University of Graz, Austria, before undergoing surgical treatment in curative intent at three departments of surgery all related to our local oncology network (Hospital Graz Sued-West, Brothers of Charity Hospital and Elisabethinen Hospital Graz, Austria). These 184 patients were drawn from 213 patients, who had undergone surgery at these three surgical departments after neoadjuvant therapy between January 1, 2011 and December 31, 2016. We excluded 29 patients who (1) underwent neoadjuvant endocrine therapy (n = 23), (2) progressed during neoadjuvant therapy (n = 3), (3) had sentinel lymph node excision before neoadjuvant therapy (n = 2), and (4) did not have available follow-up data (n = 1). For all patients, the pathology procedure and diagnosis was performed at the Department of Pathology, Hospital Graz Sued-West, including prospective assessment of the RCB score and class and estrogen and progesterone receptor, HER2 (human epidermal growth factor receptor 2), and Ki67 labeling index (Ki67 LI). We retrospectively ascertained demographic variables, clinicopathological data from the biopsy prior neoadjuvant therapy, the postoperative pathology report, and follow-up data after surgery (local recurrence, distant metastasis, second mammary, and nonmammary tumors, death) via investigation of the local electronic healthcare databases, paper charts, and telephone interviews. All patients underwent a structured postsurgical follow-up, with regular clinical visits and selected imaging studies every 3 months during the first 3 years, every 6 months during the subsequent 2 years, and yearly thereafter for a maximum duration of 10 years.

The date of definitive surgery was defined as the baseline data. Primary endpoint of this study was 5-year RFS, defined as a composite of local recurrence from breast cancer, distant metastasis from breast cancer, or death-from-any-cause, whatever came first, during the first 5 years of follow-up. The primary endpoint was ascertained retrospectively and was adjudicated by two oncologists (HDM & MB). Metachronous contralateral breast cancer occurring during follow-up was not counted as primary endpoint event. The secondary endpoint was 5-year OS, defined as the time between surgery and death from any cause or censoring alive, respectively.

Pathology

Pretherapeutic Procedure

Before therapy, all cases were histologically diagnosed by core needle biopsy on which also ER, PR, HER2, and Ki67 LI were analyzed by immunohistochemistry. For the assessment of ER and PR the Allred score, for HER2 the 2013 ASCO/CAP guidelines were used.25,26,27,37 Equivocal results for HER2 on immunohistochemistry were further evaluated by in situ hybridization using DDISH (Roche-Ventana®, Ventana, CA). The Ki67 LI was assessed by estimation using 10% steps except for the area between 10% and 20% for which 5% steps were used.38 If a distinction between 10% steps was not possible, a range for Ki67 LI (e.g., 20–30%) was included in the pathological report. Immunohistochemistry and in situ hybridization were performed on an automated stainer (BenchMark Ultra™, Roche-Ventana®). Histological typing and grading were performed according to WHO.39 The carcinomas also were categorized in molecular subgroups based on the four immunohistochemical analyses.37,40,41 When luminal tumors were diagnosed, only selected tumors with either higher stage or luminal B-like tumors with higher proliferation were selected for neoadjuvant chemotherapy.

Postoperative Procedure

All surgical specimens were intraoperatively submitted to the pathology department without prior fixation. The margins were inked, the specimens sliced at 4–5 mm, and subsequently assessed grossly. If required by close distance of the tumor from the margins, they also were assessed by frozen section. After fixation in 4% buffered formalin for at least 12 h, specimens after breast-conserving therapy were totally embedded in serial order from the mammillary to the peripheral end using either usual or gross cassettes. From mastectomy specimens, the tumor bed was usually subtotally embedded. If no residual tumor was grossly visible, the tumor bed was completely embedded. The pathology report included the histopathological tumor type and grade, status of the surgical margins, ypTNM classification, including vascular and lymph vascular invasion, and the regression grading using the RCB score. The RCB score was assessed by using the RCB calculator on the MD Anderson website and reported including the particular class.17,27,42 If residual tumor was present, the analysis for ER, PR, HER2, and Ki67 LI was repeated.

Statistical Methods

All statistical analyses were performed with Stata® 15.0 by FP (Stata Corp., Houston, TX). Continuous variables were reported as medians [25th–75th percentile] and count data as absolute numbers (%). The distribution of continuous variables between two groups was compared with rank sum-tests, whereas the association between two categorical variables was investigated with chi2 (χ2) and Fisher’s exact tests. Simple and multiple linear regression models were fitted to examine the association between covariates and the continuous RCB score. Median follow-up time was estimated with the reverse Kaplan–Meier estimation.43 RFS and OS were computed with Kaplan–Meier estimators, and RFS and OS functions were compared between two or more groups using log-rank tests. The association between prognostic variables, such as the RCB scores and classes, respectively, and the hazards of recurrence or death were calculated using uni- and multivariable Cox proportional hazards models. The RCB was investigated as (1) a continuous variable (RCB “index” from 0 to 4), (2) a binary variable with an empirical prespecified cutoff at the 75th percentile of the RCB distribution (in the absence of validated cutoffs for the RCB “index”), and (3) as a 4-level categorical variable (RCB “classes” 0, 1, 2, and 3). The proportionality of hazards assumption was assessed by fitting interactions between the continuous RCB index and linear follow-up time. The discriminative performance of the RCB indices and classes, respectively, toward 5-year RFS were quantified with Harell’s c-index.44 Potential effect modifiers (“interactions”) of RCB with clinically relevant subgroup variables were investigated by fitting interactions between the RCB score and classes, respectively, and selected covariates within a Cox model.

Results

Analysis of the Neoadjuvant Treatment Phase and Definitive Surgery

A total of 184 female patients were included in the analysis. The median age of the cohort at diagnosis was 54 years [25th–75th percentile: 47–64] (Table 1). Histologically, the majority of the 184 invasive carcinomas were of no special type and grade 3 (n = 122, 66%) with a median Ki67 LI of 35% [15–60]. One third (n = 61, 33%) of the carcinomas were HER2 amplified, and 49 carcinomas (27%) were triple-negative. Ten (5%) patients had been previously diagnosed with breast cancer. After diagnosis, 169 (92%) patients were treated with sequential anthracycline-taxane neoadjuvant chemotherapy and the remaining 15 (8%) with other neoadjuvant chemotherapy regimens. HER2 directed therapy was given as indicated (Table 1). All analyses were performed in the entire cohort (n = 184) and in the group of patients treated with anthracycline and taxane neoadjuvant chemotherapy (n = 169), and there were no differences in the main findings. Therefore, we only show data for the entire cohort. Primary definitive surgery was breast-conserving therapy in 118 (64%) and mastectomy in 66 (36%) patients. Axillary lymph-node dissection and sentinel node biopsy were performed after neoadjuvant therapy in 157 patients (85%) and 27 (15%) patients, respectively.

Table 1 Baseline characteristics of the study population (n = 184)

Analysis of RCB

The median RCB score was 1.57 with a range from 0 to 4.94. Corresponding RCB classes are reported in Table 1, and outcomes by intrinsic breast cancer subtype in Supporting Table 1. In linear regression, the strongest uni-variable predictors of a low RCB score were triple-negative and HER2-positive subtypes and higher Ki67 LI, whereas higher age predicted for a higher RCB score (Table 2). In a multivariable regression model, only breast cancer subtypes and Ki67 LI prevailed as independent predictors of RCB (Supporting Table 2).

Table 2 Simple linear regression models investigating predictors of the RCB score (n = 184)

Analysis of the Primary Endpoint: Recurrence-free Survival (RFS)

Median follow-up (with follow-up truncated at 5 years) was 4 years; 75% of patients were followed for at least 3 years. During this interval, we observed 38 recurrences, and 28 patients died (n = 43 RFS events). The 1-, 2-, 3-, and 5-year RFS estimates were 93%, 86%, 80%, and 73%, respectively (Supporting Figure 1). The observed recurrences (n = 38) included local recurrence (n = 10; including skin and lymph nodes), distant lymph node metastasis (n = 1), visceral metastases (n = 17; cerebral, liver, gastrointestinal), and nonvisceral metastases (n = 10; bone, lung, pleural). In addition, we observed contralateral breast cancer in five patients and deaths from other causes, including other malignancies in three patients, which were not included in the analysis.

Average RCB scores were significantly higher in the 43 patients who developed recurrence and/or died during follow-up compared with the 141 who remained event-free (median index: 2.34 vs. 1.39 points, rank-sum p < 0.0001). In a uni-variable Cox regression, higher RCB scores were associated with worse RFS (hazard ratio (HR) per 1 point increase = 1.80, 95% CI 1.44–2.24, p < 0.0001, Table 3).

Table 3 Predictors of 5-year recurrence-free survival (RFS) univariable Cox regression

Five-year RFS estimates were 40% (95% CI 20–59) in the 46 patients with an RCB index above the 75th percentile of its distribution (cutoff: 2.41 points), and 84% (95% CI 76–89) in the 138 patients with an RCB score equal or below this cutoff (log-rank p < 0.0001, Fig. 1).

Fig. 1
figure 1

Kaplan–Meier recurrence-free survival (RFS) functions by RCB score. Patients were dichotomized into two groups based on having an RCB score > (n = 46) or ≤ (n = 138) the 75th percentile (Q3) of its distribution. The Q3 cutoff is at 2.41 RCB index points. Numbers below the Kaplan–Meier plot represent a risk table, with the number of RFS events occurring within the respective interval report in round brackets

The RCB class was also strongly associated with RFS in univariable Cox regression (Table 3), but the RFS experience of patients with RCB classes 0–2 did not appear to differ substantially at 5 years (Fig. 2). The strongest other univariable predictors of RFS were residual tumor and residual axillary lymph node metastases as determined by the ypTNM categories, which are by definition components of the RCB (Table 3).

Fig. 2
figure 2

Kaplan–Meier recurrence-free survival (RFS) function by RCB class. Numbers below the Kaplan–Meier plot represent a risk table per RCB class, with the number of RFS events occurring within the respective interval report in round brackets

Analysis of RCB’s Predictive Performance, Time-Dependent Impact on RFS, and Potential Subgroup Effects

The continuous RCB score achieved a Harell’s c-index of 0.70 for discriminating between patients who experienced the primary endpoint during follow-up or not. The corresponding c-index of the RCB class variable was slightly lower (Harell’s c = 0.67). Importantly, we did not observe evidence for non-proportionality of hazards according to the RCB score (interaction HR with linear follow-up time = 1.00, p = 0.896). This is consistent with the hypothesis that the adverse prognostic impact of the RCB score on RFS prevails throughout follow-up period. Indeed, HRs for RFS per 1 point increase in the RCB score was 1.83 (95% CI 1.22–2.74, p = 0.003) for the first year after definitive surgery, 1.72 (95% CI 1.35–2.18, p < 0.0001) for the first 3 years after surgery, and 1.80 (95% CI 1.44–2.24, p < 0.0001) for the first 5 years after surgery, respectively. Finally, in an exploratory hypothesis-generating analysis, we investigated whether the adverse impact of RCB on RFS may be different across selected clinically important subgroups. We found that the association between higher RCB score and worse RFS was consistent across different molecular subtypes, as well as for tumor grade (Supporting Table 3). However, the adverse prognostic impact of RCB as indicated by the RCB score on RFS was more pronounced in patients with a high Ki67 LI, in the elderly, as well as in those that could not receive sequential anthracycline-taxane based chemotherapy.

Analysis of the Secondary Endpoint: Overall Survival

The 1-, 2-, 3-, and 5-year overall survival (OS) estimates were 97%, 91%, 88%, and 80%, respectively (Supporting Figure 2). Five-year OS estimates were 61% (95% CI 10–40) in the 46 patients with an RCB score > 3rd quartile and 86% (95% CI 76–92) in the 138 patients with an RCB score ≤ 3rd quartile (log-rank p < 0.0001; Fig. 3). Similar to the RFS analysis, the adverse impact of the RCB class on OS was driven by class 3 patients, whereas survival outcomes for class 0–2 patients were relatively similar (Supporting Figure 3). In Cox regression, both higher RCB scores and classes emerged as prognostic factors for worse OS (Supporting Table 4).

Fig. 3
figure 3

Kaplan–Meier overall survival (OS) function by RCB score. Patients were dichotomized into two groups based on having an RCB score > (n = 46) or ≤ (n = 138) the 75th percentile (Q3) of its distribution. The Q3 cutoff is at 2.41 RCB score points. Numbers below the Kaplan–Meier plot represent a risk table, with the number of OS events occurring within the respective interval report in round brackets

Discussion

In this study, we retrospectively evaluated the feasibility of the RCB scoring and classification and its impact on prognosis of patients treated with neoadjuvant systemic therapy. Because neoadjuvant systemic therapy has significantly improved survival and is the approach of choice in the selected high-risk population of localized primary breast cancer, this analysis is of great importance in the real-world clinical setting. Our cohort shows a mixture of plausible incidence of high risk breast cancer patients (Table 1: TNBC 27% and HER2-positive 33%), offering comparable distributions of these molecular subtypes to the RCB validation cohorts (HER2 pos. 1–53%, triple-negative breast cancer 21–29%) published by Symmans et al.17,27,32 Although our study showed some limitations, we found that the prognostic impact of RCB was consistent over time and an even distribution of hazards made it to an extent universally applicable.

Several differences to the original validation cohort need to be addressed. Patients who received less than 75% of the prescribed cycles of chemotherapy, as well as patients with previous invasive breast cancer and possible genetic predisposition for reoccurring breast cancer, were included in our study. Furthermore, we did not differentiate according to chemotherapy administered, because most patients had received an anthracycline- and taxane-based chemotherapy. Presently, recurrences are only roughly categorized, because a broad spectrum of different types of events was observed in our relatively small cohort. By extending the population cohort, we will be able to achieve a better categorization of the recurrences in future. However, we strongly believe that the prognostic impact of RCB would thereby remain unchanged. In addition, because in the original study by Symmans et al.27 clinical (c-) stage and age were not independent prognosticators for the overall cohort, in relation to RCB, we decided not to analyze them separately. Moreover, we chose a 5-year primary endpoint, resulting in a shorter follow-up period compared with the original validation cohort, although notably we used the date of surgery and not the date of the initial histological diagnosis to calculate RFS.27 This shorter follow-up might limit our results and, in particular, may lead to a falsely improved disease-free survival for luminal subtypes, because these recurrences tend to occur later.45 However, it has been shown that pCR and also RCB play a greater role in aggressive breast cancer subtypes.29,31 Therefore, despite these limitations of the study, we were able to show stable results with only little variation to Symmans cohort concerning the prognostic impact of RCB.23,24

The strength of this study is undoubtedly the consistency regarding the systemic oncological treatment, including the pathological processing, which could only be achieved through the comparatively small cohort. All patients were treated within the Division of Oncology, Department of Internal Medicine at the University Hospital under the same protocols and standards of treatment. All surgical specimens were processed in the Department of Pathology in Hospital Graz Sued-West using a standardized protocol for grossing and embedding and standardized cancer reports, including RCB, which was assessed prospectively according to the protocol and the website developed by Symmans et al.17,27,42

In our study, we were able to demonstrate that nodal status after chemotherapy, tumor size, and number of positive lymph nodes at diagnosis showed, in accordance with other research, a strong impact on RFS and OS.29,30,41 Because all of these factors are included in the calculation of the RCB, their individual impact has positively influenced the prognostic value of RCB on RFS and OS.27 Furthermore, we were able to show that the prognostic difference between the low RCB classes is small and does not reach statistical significance. This finding suggests that the prognostic impact of RCB on RFS is mainly driven by the highest RCB class, which has been previously observed by Symmans et al.27 for the HER2-amplified population, in which the prognostic impact between RCB classes I and II was not significant. However, our cohort was relatively small, and it can be reasonably speculated that a larger sample size with a longer follow-up and consequently higher event rate would have rendered the small numerical differences in survival outcomes between patients with RCB class 0–1 and RCB class 2 statistically significant. Otherwise, the adverse impact of higher RCB scores on RFS was consistent across several clinical subgroups but more pronounced in patients with high Ki67 LI, suggesting that residual disease from highly proliferative tumors may be more detrimental than residual masses from slowly proliferating tumors. This finding concurs with other research.46,47

We were able to show that the strongest uni-variable predictors of a lower RCB score were high risk molecular subtypes of breast cancer (triple-negative breast cancer, HER2-amplified, and breast cancer with high Ki67), which in general tend to have a greater risk for recurrence but also higher rates of pCR in response to neoadjuvant chemotherapy.45 This could be explained by the fact that these more aggressive molecular subtypes are more vulnerable to cytotoxic treatment and can be targeted by specific drugs.48,49 In these breast cancer subtypes, neoadjuvant chemotherapy and targeted treatment leads primarily to a better response, which is better reflected by RCB compared to the pCR rate.7 For these more aggressive molecular subtypes, pCR at surgery is particularly associated with longer RFS and OS, even comparable to less aggressive molecular breast cancer subtypes.7 In addition, incomplete pathological response with similar prognostic impact as pCR can be easily measured and determined using RCB.31

Concerning less aggressive molecular subtypes, higher age confounds with statistically higher probability for hormone receptor-positive breast cancer, which generally has a better overall prognosis but benefits less from taxane-based neoadjuvant chemotherapy.5 The combination of patients’ higher age and a high RCB score is associated with worse prognosis (Table 3 and Supporting Table 2). This observation may help facilitate the decision toward omitting neoadjuvant chemotherapy in older patients with hormone receptor positive disease, knowing that they have worse prognosis. Thus, these patients could be selected for novel neoadjuvant approaches.

Our analysis provides a scientific confirmation of the RCB and an excellent basis for the risk stratification of patients in daily routine practice, both in the follow-up and in selection of patients for novel systemic approaches and clinical studies in the future. The risk stratification based on the RCB provides a better tool to identify patients who need less attention in the follow-up but also selects patients with dismal prognosis who are candidates for postneoadjuvant clinical studies. We were able to show that RCB can be used in routine clinical setting, where it demonstrates valuable prognostic impact for disease-specific outcome in breast cancer patients following neoadjuvant therapy. Our findings reveal reliable results concerning the prognostic impact compared to the original investigators from MD Anderson. Although our cohort showed the RCB as highly significant, stable, prognostic parameter, it cannot serve as confirmation for long-term prognosis, particularly for patients with hormone receptor-positive disease. Therefore, it is warranted to further investigate the potential of RCB for guiding the intensity and duration of follow-up or patient selection for clinical trials.