Introduction

As a result of early detection and improved treatment, survival after breast cancer has improved significantly. Consequently, an increasing number of women is in need of follow-up care after curative treatment [1]. The main aim of follow-up is the early detection of local (LR) or regional recurrences (RR) and secondary primary tumours [2]. The incidence of first recurrence is influenced by prognostic factors such as age, grade, nodal involvement, hormone receptor status and treatment of the primary tumour [3, 4]. Patients with LR or RR have a higher risk of developing distant metastasis (DM) and have worse survival compared to patients without LR or RR [5,6,7,8,9]. Information on the pattern of site-specific first and second recurrence after LR or RR is lacking and the effect of prognostic factors such as the disease-free interval (DFI) after a first LR or RR on the development of subsequent recurrences is not well documented. Earlier studies showed a peak in first recurrence hazard approximately two years after the primary tumour [10,11,12]. Some studies also demonstrated a second peak between 4–9 years after treatment, mainly in ER-positive patients [10,11,12,13,14,15,16]. However, these studies lack contemporary treatments [11] or report single-institution data [12,13,14,15,16]. Also the pattern of subsequent recurrence after the first is not yet analysed.

The pattern of recurrence risk and prognostic factors for the development of subsequent breast cancer recurrences can provide valuable information for informed clinical decision-making and patient centred follow-up. Using a population-based cohort of women with follow-up data of ten years after treatment for primary invasive breast cancer, we aimed to (1) analyse the occurrence and timing of not only first, but also subsequent LR, RR and DM, (2) identify prognostic factors for first and subsequent LR, RR and DM and (3) identify consequences of these patterns for tailoring of follow-up.

Patients and methods

Data collection and study population

Data originate from the nationwide population-based Netherlands Cancer Registry (NCR), for which data are collected directly from patient medical records by a team of specially trained data managers. All data are registered in accordance with national and international coding rules, and include patient, primary tumour and treatment characteristics, as well as data concerning tumour recurrences within ten years following diagnosis.

Women with primary invasive breast cancer diagnosed in Dutch hospitals in 2003 were included (N = 10,356). Eligibility criteria were stage I–III breast cancer, no previous or synchronous cancer, no direct extension to chest wall or skin, surgical treatment in a Dutch hospital and no neo-adjuvant therapy. Patients with macroscopic residue after surgical treatment or microscopic residue without adjuvant treatment were excluded.

End-points

Site of recurrence was classified as follows: (1) local—any epithelial breast cancer or ductal carcinoma in situ (DCIS) in ipsilateral breast tissue, or in skin and subcutaneous tissue of the ipsilateral thoracic wall, (2) regional—breast cancer in ipsilateral lymph nodes or contralateral lymph nodes if axillary lymph node dissection was performed, or (3) distant—breast cancer in any other location except the contralateral breast [17].

Only patients with LR or RR as first recurrence were considered at risk for subsequent LR, RR or DM. Subsequent recurrences after DM were not taken into account, as DM is considered incurable and further LR or RR has no consequence. For patients with synchronous recurrences, the most life-threatening site of recurrence was taken as endpoint: in case of synchronous LR and RR, the recurrence was registered as RR; when DM was diagnosed with LR or RR, DM was registered. Within the NCR, tumours detected within three months after diagnosis of the previous tumour were considered synchronous. Therefore, follow-up time started three months after diagnosis of the primary tumour. The time to first recurrence or disease-free interval (DFI) was measured from three months after diagnosis of the primary tumour to date of detection of recurrence at any site. DFI to second and third recurrence was measured from detection of the previous recurrence to detection of subsequent recurrence. Follow-up time was censored at ten years after start of follow-up. In addition, survival analyses were performed per type of recurrence for survival after the primary tumour, after first and after second recurrence.

Prognostic factors

Variables were selected based on literature and availability of the data. Estrogen receptor (ER) and progesterone receptor (PR) status were combined into one prognostic factor (ER or PR positive, ER and PR negative) [18]. Age at diagnosis of the primary tumour (<40, 40–49, 50–75, >75), size (≤2, >2–5, ≥5 cm), grade (I, II, III), histological type (ductal, lobular or other), multifocality (yes, no), lymph node status (negative, 1–3, 4–9 nodes), hormone receptor status, type of surgery (breast conserving (BCS) or mastectomy), chemo-, endocrine and radiation therapy (all yes or no) of the primary tumour were assessed as prognostic factors for first recurrence. The same patient, primary tumour and treatment characteristics, DFI and treatment of the recurrence (surgery, chemo-, radiation, and endocrine therapy) were analysed as prognostic factors for subsequent recurrence.

Statistical analysis

The number of recurrences at specific recurrence sites were compared using the Wilcoxon rank sum test. We identified prognostic factors for site-specific first recurrence and subsequent recurrences after LR or RR using extended Cox regression analysis. A minimum hazard ratio (HR) of 1.1 was used for inclusion in the multivariable analysis. To avoid overfitting, an effective sample size of ten per estimated parameter was used, excluding the ones with the smallest effect first [19]. Variables that were significant when testing the proportional hazards assumption by means of Schoenfeld residuals and non-parallel on the log–log plot of the DFI were included as time-varying coefficients in the extended Cox model. Cox regression was also used for modelling the survival after the primary tumour and the first and second recurrence. With the logrank test, the equality of the survival functions was tested.

Prognostic factors with missing values were multiple imputed using a chained equation approach [20]. It was assumed that missing values occurred at random, which validated the use of imputation. All tests were two-sided, and probability values of <0.05 were considered statistically significant. All analyses were performed in STATA® 14.0.

Results

After exclusion, 9342 patients were included in the analyses. Table 1 summarizes the characteristics of the study population. Median age at diagnosis was 58 years (range 20–96). The majority of patients had primary tumours ≤2 cm (60%), grade II disease (44%) and no nodal involvement (61%). Median follow-up was ten years (interquartile range (IQR) 6.5–10.0), in which 27% of the patients died. Forty imputed datasets were pooled using Rubin’s rules and showed healthy convergence.

Table 1 Patient, tumour and primary treatment characteristics categorized by subsequent recurrence

Patterns of first recurrence

Recurrence occurred in 1853 patients (20%) of which 362 patients (20%) had LR, 148 (8%) RR and 1343 (72%) DM (Fig. 1). For the entire cohort, the risk of recurrence was highest in the second year after diagnosis (3.9%; 95% CI 3.5–4.3), with a median DFI of 3.3 years (Fig. 2a). A second peak was present around year 8. However, as there was overlap in the 95% CIs, this finding was not statistically significant. For LR and DM separately, a similar risk pattern was present. The risk of DM was 3% (95% CI 2.6–3.3) during the second year, with a median DFI of 3.2 years. The annual risk of LR showed a double peaked pattern with a risk of 0.7% (95% CI 0.5–0.8) in year two and 0.4% (95% CI 0.2–0.5) in year eight. Median DFI was 3.6 years (Fig. 2b). The annual risk of RR showed a single peak of 0.3% (95% CI 0.2–0.4) in year two and decreased thereafter. Median DFI was 3.1 years (Fig. 2b). Patients with a recurrence around year 8–9 (second peak) had on average a better differentiated primary tumour, positive hormone receptor status and received more often breast conserving therapy with radiation therapy, compared to patients with an early recurrence around year two. Supplementary Figures S1–12 illustrate the patterns of first recurrence stratified by the different prognostic factors. The peak in risk around year two was most pronounced for patients with grade III tumours (Fig. S4) or negative hormone status (Fig. S5), while the pattern of first recurrence showed a more gradual decline in patients with ER-positive tumours.

Fig. 1
figure 1

Flow chart of first and subsequent recurrences. Abbreviations: LR local recurrence, RR regional recurrence, DM distant metastasis

Fig. 2
figure 2

Hazard of a all first recurrences, and b first LR and RR during 10 years of follow-up. Abbreviations: LR local recurrence, RR regional recurrence, DM distant metastasis

Patterns of second and third recurrence

After LR, 102 out of 362 patients (28%) developed a second recurrence (1.1% of total population). Of those, 10 patients had another LR (10%), 12 patients RR (12%) and 80 patients DM (78%). The risk of subsequent recurrence after LR reached its maximum of 15% (95% CI 11–20%) in the first year after previous LR (Fig. 3a). A second peak was present in the eighth year after previous LR (6.4%; 95% CI 0.0–13.6). However, as there were only 7 events after a previous LR after year 7, this finding was not statistically significant. Median DFI after LR was 1.1 year (IQR 0.3–2.5). Ninety-five percent of all subsequent recurrences after first LR occurred between 5 weeks and 7 years.

Fig. 3
figure 3

Hazard of subsequent (second) recurrence after a first LR, and b first RR. Abbreviations: LR local recurrence, RR regional recurrence, DM distant metastasis

Seventy-four out of 148 patients (50%, 0.8% of total population) had subsequent recurrence after first RR; eight patients had LR (11%), three patients had another RR (4%) and 63 patients DM (85%). The proportion of patients with recurrence after RR was significantly higher than the proportion of patients with recurrence after LR (p < 0.001). The risk of second recurrence after RR was highest in the first year (2.7%; 95% CI 1.8–3.6) (Fig. 3b). Median DFI after RR was 1.1 year (IQR 0.6–2.2). After RR, 95% of all subsequent recurrences occurred between 6 weeks and 5 years after diagnosis of the first.

Eighteen patients (55%, 0.2% of the total population) with two previous LR or RR (N = 33) developed a third recurrence. Seven patients (39%) had two previous LRs, two patients (11%) had two previous RRs and nine patients (50%) had previous LR and RR.

Prognostic factors for first and subsequent recurrences

In univariable analysis, age at diagnosis, size and grade of the primary tumour, lymph node status, multifocality and chemotherapy were statistically significant prognostic factors for first recurrence. Multivariable analysis demonstrated that >3 positive lymph nodes was the most important prognostic factor for occurrence of overall first recurrence (HR 3.7, compared to N0), followed by age <40 years and tumour size >5 cm. Younger age, negative hormone status, BCS and no endocrine therapy were most influential factors for first LR, and besides these, also primary tumour size and radiation therapy for first RR (Table 2). Tumour grade II/III and >3 positive lymph nodes displayed the highest HRs for first DM. With 102 observed events after first LR and 74 observed events after first RR, the number of variables in the multivariable analyses for subsequent recurrence after LR and RR was set to ten and seven, respectively (Table 3). Prognostic factors for recurrence after LR were larger primary tumour size, positive lymph nodes and higher grade. Larger primary tumour size, multifocality and shorter DFI to first RR were prognostic factors for recurrence after RR. With only 33 patients at risk and 18 events, we did not have enough power to identify prognostic factors for a third recurrence.

Table 2 Prognostic factors for site-specific first recurrence during 10 years of follow-up in women with stage I–III invasive breast cancer (n = 9342)
Table 3 Prognostic factors for subsequent recurrence after previous LR or RR

Survival after the primary tumour and after recurrence

Ten-year survival after the primary tumour differed significantly (p < 0.001) with 82% for women without recurrence, compared to 61, 41 and 20% for women with local, regional or distant recurrence, respectively (Supplementary Figure S14). There was also a significant difference in survival after the first (p < 0.001) and second recurrence (p = 0.021). The ten-year survival after a first local, regional or distant recurrence was 47, 31 and 5%, respectively (Supplementary Figure S15). After a first local or regional recurrence, the ten-year survival was 21, 15 and 9% for local, regional or distant second recurrences, respectively (Supplementary Figure S16).

Discussion

We investigated the pattern of site-specific recurrence and identified prognostic factors for first and subsequent recurrences during a follow-up of ten years using data from 9342 women treated for primary invasive breast cancer. The pattern of first recurrence was comparable for LR, RR and DM with a major peak in the second year after starting follow-up. The pattern, as well as identified prognostic factors for overall first recurrence, seemed to be dominated by the high percentage of DM. When recurrence was analysed according to site, a difference in identified prognostic factors was present. The hazard of subsequent recurrences after LR and RR both declined towards the end of follow-up. The risk of developing a second recurrence was significantly higher after RR than after LR. And after a second recurrence, more than half of the women were diagnosed with a third.

The pattern of first recurrence is consistent with previous studies [10,11,12, 16]. As more aggressive tumours recur earlier, individuals with these type of tumours are thereby censored, leaving only patients with tumours that grow more slowly and have more favourable characteristics. This leads to an early peak in the recurrence risk and keeps long-term recurrence rates much lower [21]. In our data, a second peak in the hazard of recurrence was present between year eight and nine. This pattern was present in the different recurrence types as well as in different subgroups (see Supplementary Figs. S1–13). However, as there were only 299 events between the years 7–10 and the 95% CIs around the hazard showed overlap, this finding is not statistically significant. Jatoi et al. [11] observed a second peak around year five and hypothesized influence from interval censoring as patients were followed clinically with regular intervals in this period. Yin et al. [12] report a second recurrence peak near the 9.5th year and attributed the bimodal pattern to tumour dormancy: a state in which tumour cells are present, but disease progression not clinically apparent [22]. Demicheli et al. [13] reason that biological characteristics are responsible for the distinctive pattern in tumour recurrence. Our findings confirm this, as patients with late recurrence (second peak) showed more favourable patient and tumour characteristics than patients with early recurrence. Besides more favourable characteristics, a previous study also found better survival for patients with longer DFIs during five years of follow-up [9]. As the follow-up period of patients with late recurrence was much shorter than follow-up for patients with early recurrence, we could not compare the amount of subsequent recurrences between the groups.

The peak in risk was most pronounced for patients with grade III tumours or negative hormone status, and more gradual in patients with ER-positive tumours. In this study, ER-positive patients received endocrine therapy for a maximum of five years. Present guidelines recommend extended endocrine treatment beyond five years [23]. This could delay or prevent most of the ER-positive recurrences that constitute the late recurrence peak. Additionally, efforts to improve adherence to endocrine therapy and the introduction of aromatase inhibitors will likely flatten the second recurrence peak [24]. In hormone receptor-positive breast cancer, late recurrences after >20 years can occur [25], as hormone-positive tumours have slower doubling times and might be suppressed for a prolonged time by endocrine therapy [26]. Also, there may be influence of immune surveillance in controlling progression [27]. Although patients with first recurrence eight to nine years after starting follow-up were more often hormone receptor positive, there was no significant difference in percentage receiving endocrine therapy when comparing patients with early and late recurrences.

Consistent with previous studies, younger age, larger primary tumour and higher grade were important prognostic factors for first recurrence [4, 8, 10]. The HRs of the prognostic factors for subsequent recurrence after LR or RR were higher than those for first recurrence, as can be explained by the higher incidence. With a highest HR of 3.7 for first recurrence (>3 positive nodes, compared N0), it is impossible to appoint one or just a few prognostic factors that can be used for risk stratification. Thrift et al. [28] state that in the case of relative risk, factors should be higher than ten for good prediction of individual risk. In the absence of a ‘perfect predictor’, multiple factors need to be taken into account for risk prediction and subsequent follow-up.

With only 33 patients at risk and 18 events after two previous LRs or RRs, it was hard to identify prognostic factors for third recurrence. The number of events determines the statistical strength of a multivariable analysis. A commonly used rule of thumb based on simulation studies [29,30,31], is a ratio of at least ten to twenty events per explaining variable to maintain the validity of the model. However, there are others suggesting that this rule is too conservative [32].

Strengths of this study include the large sample size, a follow-up duration of ten years and inclusion of an unselected patient cohort from the population-based NCR, representative for the majority of breast cancer patients. By extracting data in retrospect from patient medical records, time of recurrence diagnosis was determined with high accuracy. This is important in avoiding bias in the shape of hazard [33]. We used extended regression, while the proportional hazards assumption did not hold for several variables, as could be expected with long-term follow-up [34]. Changes and improvements in diagnostic procedures and treatment have resulted in a sustained decline in breast cancer recurrence [35], which means data from 2003 may not be representative for the current risk of recurrence. Because regular registration of Human Epidermal Receptor-2 (HER-2) status initiated in 2005, this prognostic factor could not be included in the analysis. However, as trastuzumab treatment for HER-2-positive patients is currently standard practice, the prognostic value of HER-2 in models is less important [4]. Inclusion of more recurrence characteristics than site and treatment, could likely result in better risk prediction of subsequent recurrence than models largely based on primary tumour characteristics. As it is hard to distinguish second primary breast cancer from contralateral recurrence, only ipsilateral and distant recurrence were taken into account in the analysis, which might have resulted in underestimation of the actual recurrence rate [36]. In addition, even though second primary tumours are of importance for follow-up, the focus of this study was on recurrence of the primary tumour and their corresponding characteristics. Although subsequent recurrence does occur in a small subset of patients diagnosed with DCIS, these were excluded from the current analysis [37, 38]. During the follow-up period, 27% of the patients died. The 10-year survival after the primary tumour, as well as after the first and second recurrence differed significantly per recurrence type. For more details on factors influencing 5-year survival after recurrence, the reader is referred to Witteveen et al. [9]. Competing risk analysis was not considered relevant as breast cancer specific mortality was assumed to be only possible after DM, and further recurrence after DM were not taken into account. Time to subsequent recurrence was counted from diagnosis of first recurrence, instead of three months afterwards, as the time until second recurrence is on average much shorter (median DFI 1.1 years, compared to 3.3 for first recurrence). This means almost half of all possible second recurrences will be diagnosed in the first year of follow-up, in which most patients already have more hospital visits due to prolonged treatment or aftercare. Also, more than 80% of the second recurrences after LR or RR, and almost 80% of third recurrences are DM, of which early detection is not aim of regular follow-up care, as earlier detection will not (yet) result in better prognosis.

This study is, to the best of our knowledge, the first analysis that takes site-specific first and also subsequent recurrence after LR and RR into account. The pattern of first recurrence was similar for LR, RR and DM with a major peak in recurrence risk around the second year post-diagnosis. The hazard of subsequent recurrence was higher if the first recurrence was RR compared to LR. As most risk factors only have modest effects, multiple risk factors need to be taken into account for risk prediction and subsequent follow-up decisions. Although the percentage of patients with first recurrence that develop a second recurrence is high, the percentage among all breast cancer patients remains very low (1.9% during ten years). As almost 50% of the second recurrences will be diagnosed in the first follow-up year, combined with the low absolute number of second recurrences, more intensive follow-up for detection of subsequent recurrence is not likely to be (cost-)effective.