Introduction

Over the last decade, neoadjuvant systemic therapy (NST) has gained considerable therapeutic importance and has been extended to include patients with operable node-positive breast cancer. Although current evidence shows no improvement in survival for these patients, the benefits of locoregional downstaging and response-driven adjuvant therapy continues to drive this shift [1]. The use of NST is not only associated with increased frequency of breast-conserving therapy but also de-escalation of axillary surgery in patients with limited nodal disease [2]. Neoadjuvant therapy, particularly chemotherapy in patients with more aggressive breast cancer subtypes, often converts clinically node-positive (cN+) disease to pathologically node negative (ypN0) [3]. This can therefore be effectively managed with limited lymph node removal with sentinel lymph node (SLN) surgery alone or in combination with imaging-guided localization of the biopsy-proven node (targeted axillary dissection) resulting in lower rates of lymphoedema and other complications [4]. Uncertainties and controversies remain regarding the ideal dose, intensity, duration of proposed NST and treatment options, which potentially leads to a significant amount of heterogeneity in clinical practice between different countries and institutions. For the treating physicians, evidence-based standardization of these practices is critically important. A better understanding of these factors could influence not just the outcome, but also the cost and convenience of the regime. This is essential in the era of quality and value-based medical decision making.

The international multicenter phase-III OPBC-03/TAXIS trial (ClinicalTrials.gov Identifier: NCT03513614) was designed to assess the optimal locoregional management of the axilla in patients with cN+ breast cancer, including patients with residual nodal disease following NST [5]. Its main objective is to show that the combination of tailored axillary surgery (TAS) and axillary radiotherapy (ART) is non-inferior to the current standard of axillary lymph node dissection (ALND) in terms of disease-free survival in the era of effective systemic therapy and extended regional nodal irradiation. The TAXIS study protocol is unique inasmuch as its pragmatic design allows inclusion of patients both in the neoadjuvant and adjuvant setting according to the preferences of the treating physicians and institutions. Therefore, it provides an excellent opportunity to study patterns and trends in the use of NST in different institutions across Europe. The aim of this study was to evaluate the use of NST in patients with clinically node-positive breast cancer in Europe to assess the need for international standardization of NST.

Methods

This prospective observational cohort study was preplanned within the pragmatic randomized controlled international multicenter phase-III TAXIS trial (OPBC-03/SAKK 23/16/IBCSG 57-18/ABCSG-53/GBG 101; ClinicalTrials.gov Identifier: NCT03513614) to assess trends in use of NST [5]. Patients with cN+ breast cancer were included, defined as nodal disease detected by palpation or imaging at the time of initial diagnosis and histologic or cytologic confirmation of both the primary tumor and lymph node metastasis. According to the pragmatic design, patients can be included in the upfront surgery as well as in the neoadjuvant setting, with mandatory confirmation of residual nodal disease at the time of surgery in the latter setting. Patients with American Joint Commission on Cancer (AJCC) stage IV, cN3c or cN2b breast cancer, contralateral or other tumor malignancy within 3 years, prior axillary surgery except SLN biopsy, or prior axillary radiotherapy were excluded. The patient population in the present study was a priori defined to include the first 500 consecutive randomized patients who were included from August 2018 to June 2022.

The trial was approved by the local ethics committees and was performed in accordance with the requirements of the national regulatory authorities. Written informed consent was obtained from all patients.

Systemic therapy

In line with the pragmatic TAXIS trial protocol, type of systemic therapy was left to the discretion and preference of the treating physicians and institutions. All drugs used for adjuvant systemic anticancer treatment (if indicated) were locally chosen according to international and/or local guidelines including the sequence of systemic therapy in relation to surgery (neoadjuvant versus adjuvant setting). All drugs used for adjuvant systemic anticancer treatment were systematically recorded. Planning of study visits followed local practice in frequency, interval, and duration. Adjuvant patients were defined as patients who did not receive NST. Investigators were encouraged to enroll all eligible patients consecutively without selecting patients and tumors according to the likelihood of not achieving complete pathological response (pCR) to maintain TAXIS study eligibility.

Endpoints

Primary endpoint for this study was the rate of patients undergoing NST (proportion of entire TAXIS patient population that underwent NST) [5]. Secondary endpoints included the rate of patients undergoing NST by country, by study site, by stage, and by intrinsic subtype defined by the expression of hormonal receptors (HR) and human epidermal growth factor receptor 2 (HER2).

Statistical analysis

This project reflects an interim analysis of the TAXIS trial that was pre-planned after 500 patients were randomized (one third of the total sample size). It was planned to gain relevant insight on the use of adjuvant and post-neoadjuvant systemic treatment, which, in turn, may have an impact on the primary endpoint of the main trial (disease-free survival).

Continuous endpoints were summarized using median and interquartile range (IQR) and compared between treatment arms using Wilcoxon rank-sum tests. Categorical endpoints were summarized using frequency counts and percentages and compared between treatment arms using Fisher’s exact tests. Logistic regression was applied to investigate the influence of year of administration of neoadjuvant treatment. Two-tailed tests with a significance level of 0.05 were used. No adjustment was made for multiple testing and all analyses are considered exploratory. All analyses were performed using R version 4.2.1.

Results

Patient demographics and tumor characteristics in adjuvant versus neoadjuvant setting are shown in Table 1. A total of 750 patients undergoing NST were screened and consented to the TAXIS study, baseline characteristics shown in Supplementary Appendix 1. However, 250 patients were screening failures due to exclusion criteria at the time of surgery, including pathologic complete response (pCR) in 182 patients, no SLN identified in 10, no radiologically identified clip in 38, and other reasons in 20 patients. The baseline characteristics of the 182 patients who were excluded due to nodal pCR are shown in Supplementary Appendix 2. The remaining 500 patients were randomized for the TAXIS trial and included in the present sub-study. Median age was 57 years (interquartile range [IQR]: 48–69 years). Most patients were postmenopausal (68.4%) with grade 2 and 3 HR+HER2-breast cancer with a median tumor size of 28 mm (IQR 20–40). Patients were recruited from 44 breast centers in six countries in Europe. The largest volume of patients was recruited from Switzerland (n = 335; 67%), followed by Hungary (n = 99; 19.8%).

Table 1 Overall patient and tumor characteristics by adjuvant vs neoadjuvant treatment

Of these 500 patients, 165 patients (33%) were treated with NST, and 335 patients (67%) underwent upfront surgery. Following surgery, a total of 243 patients (48.6%) underwent adjuvant chemotherapy. Patients who underwent neoadjuvant chemotherapy (NACT) were significantly younger and more likely to be premenopausal and having a triple negative or HER2 positive breast cancer than patients who underwent upfront surgery (Table 1).

Among the 165 patients treated with NST, a total of 151 had NACT, 24 received neoadjuvant endocrine therapy (NAET) and 42 immunotherapy (e.g., anti-HER2 therapy; Table 2). While 100 patients were treated with NACT alone, 13 received only NAET and one patient was treated with neoadjuvant double HER2-blockade without chemo- or endocrine therapy.

Table 2 Type of neoadjuvant treatment

Of 335 patients who did not receive NST, 193 (57.6%) received adjuvant chemotherapy, 102 (30.4%) endocrine therapy alone and 40 patients (12%) received no adjuvant systemic treatment. Of the patients who received adjuvant chemotherapy, 126 (53%) were also treated with other adjuvant treatment modalities.

The use of NACT was significantly influenced by receptor status (Table 3). Among patients with triple negative breast cancers (TNBC) and HER2+ cancers, a significantly larger proportion underwent NACT compared to the HR+/HER2-patients, and significantly more patients with AJCC stage 3 disease received NACT compared to patients with stage 2 disease (33.5% vs. 18.9%; p = 0.007).

Table 3 The use of neoadjuvant systemic therapy by intrinsic subtype

Use of NST varied significantly across countries (Table 4) (p < 0.001). Austria had the highest overall number of patients undergoing NST and Hungary the lowest. The use of NACT across countries did not differ significantly in stage 2 (p = 0.2) disease, but differences were significant for stage 3 disease (p = 0.008). Finally, the administration of NST increased significantly over the years (OR 1.42 95% CI 1.18–1.71; p < 0.001). The proportion of patients receiving NST more than doubled from 20 to 46.7% from 2018 to 2022 (Table 5, Supplementary Appendix 3).

Table 4 The use of neoadjuvant systemic therapy by country
Table 5 The administration of neoadjuvant treatment by year

Discussion

Careful patient selection for NST is a key component of best practice in breast cancer management. The present study shows that while there was a significant increase in the use of NST, there was also substantial heterogeneity by country and by study site, primarily in patients with HR+/HER2-breast cancer. Several findings were expected. For example, premenopausal patients were more likely to be treated with NACT in comparison to postmenopausal patients. It has been well established that chemotherapy in premenopausal women under the age of 50 can improve disease-free survival [6, 7]. Recent exploratory analyses using multigene assays have suggested that the use of chemotherapy was associated with some benefit for women 50 years of age or younger with node-negative disease and midrange Oncotype DX 21-gene recurrence score of 16 to 25, as well as most patients with 1–3 positive nodes and a recurrence score ≤ 25 [8]. Despite these studies only including patients where chemotherapy was administered in an adjuvant setting, this data is often extrapolated to favor the use of NST worldwide. The use of multigene assays in the neoadjuvant setting is slowly gaining popularity. Oncotype DX is the most extensively studied assay in this context, with higher rates of pCR in patients with a high recurrence score compared to patients with low to intermediate scores [9]. Furthermore, in node-positive patients, higher recurrence score results were significantly associated with the likelihood of pCR in the axilla [10]. However, we did not collect data on the use of genomic tests to refine indications for neoadjuvant therapy in this study and, hence, were not capable of evaluating the proportion of patients undergoing NACT based on genomic high risk.

Interestingly, in our study, only 13 patients were solely treated with NAET (tamoxifen 31% and aromatase inhibitors 61%). The sample size is too small to perform a formal comparison between baseline and tumor characteristics of patients who underwent NAET alone and patients who underwent neoadjuvant chemotherapy. However, we added a table as appendix 4 showing the baseline characteristics by use of NAET or neoadjuvant chemotherapy, indicating the expected selection of patients with small G1-2 hormonal receptor positive tumors. Unfortunately, data collection on treatment duration was incomplete at the date of data cut-off and was thus not reported. The use of NAET may gain popularity in the future, as the use of multigene assays in the neoadjuvant setting have been shown to successfully predict cancer response to NAET [11]. The largest of these trials, the TransNEOS trial prospectively validated the role of recurrence score testing in predicting clinical response after 6 months of neoadjuvant letrozole in patients with ER+/HER2-breast cancer. The authors showed that low Oncotype DX recurrence scores are associated with higher clinical response rates in these patients [12].

Among patients treated with NST in our study, the majority had HR+/HER2-breast cancer (61%). This does not seem to fully reflect European NST practice given that some investigators may have selected these patients for pre-registration in the study because patients with triple negative and HER2+ breast cancer have a higher likelihood of pCR, which excludes them from the TAXIS trial. This will be discussed more in detail in the limitation section below. When comparing patients who underwent upfront surgery to patients who had NST, the ones with triple negative and HER2+ breast cancer were more likely to receive NST compared to primary surgery. This is expected given that HER2-positivity mandates the use of targeted HER2 therapy, which is usually combined with chemotherapy [13], and associated with an increased pCR rate [14, 15]. This is an important prognostic marker as a pCR following NST has been shown to translate into a sustained benefit in event-free survival [14]. Furthermore, response-driven chemotherapy became standard care in women with residual triple negative and HER2+ disease following NST after publication of the landmark trials showing a relevant benefit for the administration of adjuvant capecitabine and trastuzumab emtansine, respectively [16, 17]. Therefore, patients with these subtypes received standard NST except for the beginning of the study.

Austria had a high proportion of patients undergoing NST recruited to the study (55%). Participating sites are members of the Austrian Breast & Colorectal Cancer Study Group (ABCSG). The ABCSG has facilitated standardization of diagnostics and therapy in breast cancer throughout Austria providing patients with the latest and best possible treatments including participation in clinical trials, which may have accentuated the use of NST in Austria. Finally, there was a difference observed in the increased use of NACT across countries in AJCC stage 3 disease, but not in stage 2 disease. In the present study, patients with stage 2 disease were more likely to have smaller breast cancers and therefore the use of NACT was less likely to improve the ability to conserve the breast by local down-staging, which may have contributed to that finding.

NST primarily consisted of NACT and was increasingly used over the study period; in fact, its use more than doubled from 2018 to 2022. According to the pragmatic trial design, indications for systemic therapy including timing (neoadjuvant versus adjuvant) were left at the discretion of the local investigators. This was necessary to ensure applicability of the generated data to the participating institutions, while in explanatory trials, uniformity of treatment regimens is achieved by standardization in the study protocol. Despite this pragmatic approach, a significant increase in the use of NACT was observed in every country except for Lithuania and Italy with the lowest numbers of patients. This reflects the change in practice in many centers where NACT became increasingly well received by patients and clinicians alike over the last few years. As we entered the era for de-escalation of surgical treatment in patients with breast cancer, NACT is increasingly used to reduce the tumor size allowing for breast conservation and better aesthetic results and for downstaging the axilla in node positive patients [2]. Following NACT, up to 60% of patients with HER2+ and 48% with TNBC breast cancer with initially node-positive disease showed a pCR in the axilla [3]. With the implementation of the SLN procedure and targeted axillary dissection to determine nodal pCR, and low axillary recurrence rates without ALND, and its endorsement by clinical guidelines, NACT gained further popularity over the last few years since these patients who converted to clinically node-negative following NACT can be spared ALND [18,19,20,21,22,23,24,25].

Limitations

Our study excluded a large number of patients without residual axillary disease following NST as pCR screening failures. Importantly, investigators were encouraged to pre-register every eligible patient consecutively irrespective of intrinsic breast cancer subtype. However, we found a differing percentage of patients with screening failures by country (Fig. 1a) and by study site (Fig. 1b). In addition, the over-representation of patients with HR+/HER2-disease (79.4%, Table 1) further suggests selection bias toward pre-registration of patients with a lower likelihood of pCR, which, in turn, reduced the number of screening failures that were not reimbursed by the patient fee. As in most pragmatic trials, the risk spectrum of included patients is significant since the majority of patients who undergo the procedure under investigation- in this case axillary dissection-should be included. Skepticism among clinicians may hamper acceptability of such protocols at both ends of the risk spectrum. For example, patients with low volume disease burden may be selectively omitted because surgeons may consider axillary dissection as overtreatment in patients with only one or two positive nodes that otherwise fulfil the Z0011 criteria. This is particularly applicable to patients without palpable disease when nodal metastases are detected solely by ultrasound. On the other hand, surgeons may be reluctant to omit axillary dissection in patients with residual palpable disease after neoadjuvant chemotherapy who are randomized into the axillary radiation arm. To account for that potential selection bias, method of detection of nodal disease (palpable versus imaging only) is used as stratification factor in the TAXIS trial. Furthermore, the lower number of patients included outside of Switzerland limits the generalizability of these findings. Moreover, the analysis might be biased as not all breast centers in these countries included patients into the trial.

Fig. 1
figure 1

a Accrual and screening failures per country. b Accrual and screening failures

Conclusion

NST in patients with luminal breast cancer is adequately represented in the TAXIS population. However, substantial heterogeneity in the use of NST in patients with HR+/HER2-breast cancer exists in Europe. While stringent guidelines are available for the use of NST in triple-negative and HER2+ breast cancer, the presented data show the need for the development of and adherence to such well-defined recommendations also for patients with HR+/HER2-breast cancer.