The incidence of breast cancer has substantially increased worldwide during the last few decades, and it is also currently the most common female malignancy in Korea.1,2 As the overall survival rate for patients with breast cancer has steadily increased over time, a large number of breast cancer survivors are undergoing regular follow-up visits.3,4 The estimated number of breast cancer survivors in the USA was more than 4 million women in 2022.5

Current guidelines recommend against using imaging or laboratory tests to detect distant metastasis in asymptomatic patients with breast cancer after their initial treatment.6,7 These recommendations are based on the results of randomized clinical trials that showed no survival improvement with the use of intensive surveillance for distant metastasis.8,9 However, as acknowledged in the guidelines, there is an urgent clinical need to re-assess this issue since previous trials were conducted two decades ago when diagnostic tools were less accurate and therapeutics were less effective.6

Despite the current recommendations, real-world data show that breast cancer patients regularly undergo one or more imaging or laboratory tests to screen for distant metastasis.10,11,12 The frequent use of tests may also arise from the belief that earlier therapeutic intervention for distant metastasis may lead to survival benefits, based on the studies that have reported successful new therapeutics in metastatic breast cancer patients.13,14,15,16

In this study, we report the results of a multicenter retrospective study conducted by the Korean Breast Cancer Society Survivor Research Group (KBCS-SRG) that aimed to investigate the relationship between the use of frequent tests and survival for patients with breast cancer. We collected the real-world data of patients with breast cancer treated at 11 teaching hospitals in Korea during the period when subtype-based therapeutics, such as aromatase inhibitors or trastuzumab, were all approved for clinical use.

Patients and Methods

Patients

We retrospectively reviewed the clinicopathologic records of 4130 patients who underwent surgery for breast cancer between January 2010 and December 2011 from the 11 hospitals in South Korea. Patients with carcinoma in situ lesions, male breast cancer, recurrent breast cancer, metastatic breast cancer, and synchronous or metachronous malignancies in other organs were excluded. There were no restrictions on patients’ ages. Initial breast cancer was staged according to the American Joint Committee on Cancer (AJCC) cancer staging manual, the eighth edition.17 The information on the hormonal receptor status was collected as one parameter rather than separately for estrogen receptor and progesterone receptor. Therefore, patient classification was only possible by the anatomic staging system. Ki-67 expression level was categorized as high or low on the basis of each institution’s cutoff value. Positive hormone receptor (estrogen and/or progesterone receptors) was defined as ≥ 1% of stained cells or Allred scores above 2 on the immunohistochemistry assay. Human epidermal growth factor receptor type 2 (HER2) was evaluated with anti-HER2 antibodies, and patients with equivocal results underwent fluorescence in situ hybridization or silver-enhanced in situ hybridization tests. Among patients with HER2-positive breast cancer, those with tumor size of > 1.0 cm or metastatic axillary lymph nodes were treated with the HER2-targeted therapy according to the insurance coverage indications of the Korean National Health Insurance. High-volume centers were defined as those treating > 500 patients with breast cancer annually. The study was approved by the institutional review board of each institution. All procedures were conducted following the Declaration of Helsinki, and the requirement for informed consent was waived.18

Intensity of Surveillance Exams

We gathered the data on the number of imaging tests conducted during the distant metastasis-free survival period after surgery. The use of various imaging tests depended on each physician’s consideration. The imaging modalities included bone scans, chest computed tomography (CT), abdomen ultrasonography (USG), and abdomen CT, all of which are commonly performed tests for surveillance in Korea.10 Imaging tests that were performed for symptoms unrelated to breast cancer were also included. Laboratory tests and plain X-rays were excluded in our analysis as these tests show low sensitivity to detect distant metastasis compared with other tests.

Patients were divided into two groups on the basis of the median value of measured screening intensity, calculated by dividing the total number of exams by the distant metastasis-free survival (DMFS) interval. We excluded 18Fludeoxyglucose-positron emission tomography (18F-FDG/PET) from our study to minimize selection bias, as the Korean National Health Insurance covers it only when distant metastasis is suspected on other imaging modalities.

Definition of Recurrence and Survival Period

Locoregional recurrence events included ipsilateral breast tumor recurrence and recurrence at the ipsilateral chest wall, mastectomy scar, and regional lymph nodes, such as ipsilateral axillary, supra-/infra-clavicular, or internal mammary lymph nodes. Distant metastasis referred to any recurrence beyond the locoregional recurrence, such as metastasis to bone, lung, liver, and distant lymph nodes.

DMFS was defined as the interval between the date of surgery and the date of histologic confirmation of distant metastasis or the date of the last follow-up. When the biopsy was not feasible, DMFS was determined as the period until the date of clinical diagnosis that led to the initiation of palliative treatments. Breast cancer-specific survival (BCSS) was defined as the time from surgery to the death from breast cancer or breast cancer treatment-related complications.

Statistics

The continuous variables were analyzed with the Mann–Whitney U test, and categorical variables were analyzed with Pearson’s χ2 test. The Log-rank test was used to compare the survival curves derived from the Kaplan–Meier method. Cox proportional hazards regression model was used for multivariable analysis and to estimate the adjusted hazard ratio. Variables with a two-sided p < 0.10 in the univariate analysis were analyzed in the multivariable analysis, and those that showed multicollinearity were excluded. Missing data were treated using a complete-case analysis approach. Statistical significance was set at p < 0.05. All analyses were conducted using SPSS, version 27.0 (SPSS, INC., IBM, Armonk, NY, USA), and the Kaplan–Meier curves were drawn using GraphPad PrismTM, version 9.0 (GraphPad Software, San Diego, CA, USA).

The propensity scores were estimated using a logistic regression model. Nearest-neighbor matching method was used for 1:1 matching without replacement using a caliper width of 0.10 standard deviation of the logit of the propensity scores. Variables that were differently distributed between the two groups were included in matching analysis. Propensity score matching was performed using the “MatchIt” R package (version 3.6.3).

Results

Patients

A total of 4130 patients with breast cancer who met the inclusion criteria were included in the study. Table 1 shows the clinicopathologic characteristics of the subjects and the use of imaging tests in these patients. As presented in Table 1, more than half of the patients had T1 (54.7%) and/or N0 (62.1%) status, and 3038 (73.6%) patients were treated with breast conservation surgery. The median frequency of surveillance during the DMFS period was 1.38 exams per year [interquartile range (IQR): 0.77–2.27]. During the median follow-up of 110 months, 175 (4.2%) patients experienced locoregional recurrence: 26 (14.9%) patients had distant metastasis concurrently and 33 (18.9%) patients developed distant metastasis later. A total of 301 (7.3%) patients developed distant metastasis during the follow-up period, among which 242 patients had no locoregional recurrence history. At the time of diagnosis of metastasis, 76 (25.2%) patients experienced symptoms related to distant metastasis.

Table 1 Demographic and clinicopathological characteristics of patients

Surveillance Intensity and DMFS

We estimated the association between the intensive metastasis screening and time to distant metastasis by using the survival data of the 301 patients who eventually developed distant metastasis during the follow-up. As expected, highly intensive surveillance for distant metastasis showed significantly shorter distant metastasis-free survival among the 301 patients [p < 0.001, hazard ratio (HR) 1.62; confidence interval (CI) 1.29–2.04; Fig. 1a]. The difference in median time to distant metastasis between the two groups was 15.3 months (28.8 months for intensive surveillance vs. 44.1 months for less-intensive surveillance). We also observed that the initial site of metastasis was associated with the degree of lead-time. As shown in Fig. 1b–d, bone metastasis showed the most prolonged difference in median DMFS for metastasis surveillance (30.0 vs. 52.0 months, p <0.001), followed by lung metastasis (24.9 vs. 39.7 months, p = 0.004). Intensive metastasis surveillance was not significantly associated with DMFS in patients who developed liver metastasis (median 21.9 vs. 34.3 months, p = 0.051).

Fig. 1
figure 1

Kaplan–Meier curves showing distant metastasis-free survival among patients who developed distant metastasis. Patients who experienced distant metastasis after breast cancer surgery at the 11 hospitals in South Korea were retrospectively analyzed. The Kaplan–Meier curves show DMFS for 301 patients who eventually developed distant metastasis (a) and patients with bone (b), lung (c), and liver metastasis (d). p values were calculated by using the log-rank test, and the hazard ratio was calculated with univariate Cox-regression analysis. DMFS distant metastasis-free survival, CI confidence interval

Surveillance Intensity and BCSS

Next, the whole study population (n = 4130) was grouped into the high-intensity surveillance (High-IS) or the low-intensity surveillance (Low-IS) groups on the basis of the median value of the screening frequency of all patients (Supplementary Fig. 1). The median frequency of the tests was 2.26 (IQR, 1.81–3.12) and 0.77 (IQR, 0.51–1.00) exams per year for the High-IS group and the Low-IS group, respectively. As shown in Fig. 2a, the High-IS group showed significantly worse BCSS compared with that of the Low-IS group [p < 0.001, hazard ratio (HR), 3.13; 95% CI 2.32–4.21]. Since more patients with advanced stages underwent frequent imaging studies, we stratified the patients by their tumor stage and observed consistent results (Fig. 2b–d, Table 2). Moreover, the statistically significant association between the surveillance intensity and survival was shown after stratification by patient’s age, hormonal receptor status, or HER2 overexpression status (Fig. 3a–f).

Fig. 2
figure 2

Kaplan–Meier curves showing breast cancer-specific survival. Data on patients who underwent breast cancer surgery between January 2010 and December 2011 at the 11 hospitals in South Korea were retrospectively analyzed. The Kaplan–Meier curves show BCCS for all patients (a) and patients with each anatomic stage (bd) according to the intensity of surveillance. p values were calculated by using the log-rank test, and the hazard ratio was calculated with univariate Cox-regression analysis. BCCS breast cancer-specific survival, Low-IS low-intensity surveillance, High-IS high-intensity surveillance, CI confidence interval

Table 2 Univariate and multivariable analyses for breast cancer-specific survival
Fig. 3
figure 3

Kaplan–Meier curves showing breast cancer-specific survival according to age and tumor subtypes. Data on patients who underwent breast cancer surgery between January 2010 and December 2011 at the 11 hospitals in South Korea were retrospectively analyzed. The Kaplan–Meier curves show BCSS after stratification according to age at the surgery (ab), hormone receptor (cd), and HER2 receptor (ef). p values were calculated by using the log-rank test, and the hazard ratio was calculated with univariate Cox-regression analysis. BCCS breast cancer-specific survival, Low-IS low-intensity surveillance, High-IS high-intensity surveillance, CI confidence interval, HER2 human epidermal growth factor receptor type 2

Multivariable Cox proportional hazards regression model of all patients showed that the screening intensity was not associated with BCSS after adjusting for relevant variables (p = 0.263, HR 1.27; 95% CI 0.84–1.93, Table 2). Moreover, the observed difference in the BCSS can be the result of uneven distribution of well-known risk factors between the two groups, since features associated with worse outcomes, such as younger age, advanced stages, or high histologic grade, were associated with the High-IS group (Table 3). Notably, the High-IS group was also significantly associated with high-volume institutions, molecular subtypes, and the use of systemic therapies. Thus, to adjust for these confounding effects, we performed a 1:1 propensity score matching analysis. Among the 4130 patients, a total of 1698 patients were included in the propensity score matching analysis with well-balanced variables (Table 3). In this analysis, we observed that the intensity of surveillance tests for metastasis was not related to significant survival differences (Fig. 4).

Table 3 Clinicopathological characteristics between the two groups before and after propensity score matching
Fig. 4
figure 4

Kaplan–Meier curves showing breast cancer-specific survival after 1:1 propensity score matching. We performed 1:1 propensity score matching analysis to adjust for confounding effects of clinicopathologic variables of original cohort. A total of 1698 patients were yielded and there was no significant difference in BCSS according to the intensity of surveillance. p value was calculated by using the log-rank test, and the hazard ratio was calculated with univariate Cox-regression analysis. BCCS breast cancer-specific survival, Low-IS low-intensity surveillance, High-IS high-intensity surveillance, CI confidence interval

Discussion

The present retrospective analysis of the 4130 breast cancer patients from 11 teaching hospitals in Korea shows the frequent use of imaging studies in breast cancer patients in real-world practice. Additionally, our data demonstrate that the use of intensive surveillance for breast cancer patients was not associated with an improved survival outcome despite the use of currently available therapeutic modalities.

Current guidelines that recommend against the use of routine imaging tests to detect distant metastasis in asymptomatic patients with breast cancer stemmed from the results of two randomized trials reported in 1994.6,7,8,9 The Interdisciplinary Group for Cancer Care Evaluation (GIVIO) randomized 1320 patients with breast cancer into the intensive surveillance group and the control group.8 The intensive surveillance group routinely conducted bone scan, liver sonography, chest x-ray, and laboratory tests, while the control group only conducted clinically indicated tests. With a median follow-up of 71.0 months, they reported that intensive surveillance had no significant impact on survival outcomes and quality of life. Del Turco et al. also randomized 1243 patients and showed that intensive surveillance was significantly associated with earlier detection of distant metastasis, while the overall survival was comparable between the two groups.9 Additionally, Kokko et al. prospectively randomized 472 patients who underwent surgery between 1991 and 1995, and reported that intensive surveillance tests were not associated with improved disease-free or overall survival but did increase surveillance costs by 2.2 times.19 However, it is important to note that these trials were conducted before the era of recent advancements in therapeutic regimens and highly sensitive imaging modalities.20

Jung et al. have shown that, among various postoperative imaging studies, only mammography showed prognostic implication based on the insurance reimbursement data for patients who were treated between 2002 and 2010.21 However, the limitation of this study was that the intrinsic nature of the insurance reimbursement data prohibited the investigators from collecting detailed data on the demographic and clinicopathologic features and the recurrence data. Furthermore, we have previously shown the lack of survival differences according to the intensity of metastasis surveillance before developing distant metastasis in 398 patients who were treated between 2000 and 2015 in a single institution.22 In the present study, we demonstrate that the use of frequent imaging tests to screen distant metastasis in patients with breast cancer was not related to improved BCSS based on a multi-institution dataset consisting of detailed clinicopathologic information and long-term follow-up. Our findings are particularly noteworthy that the patients included in the present study were treated with contemporary standard treatment that was unavailable during the pivotal trials conducted decades ago.

Our data also show that, despite the current guidelines, many physicians prescribe various imaging tests to detect distant metastasis in patients with breast cancer after their initial treatment for real-world clinical practice. This practice, which may vary among regions or institutions, can reflect the common belief that earlier detection of distant metastasis might improve the survival outcomes of breast cancer patients.10,11,12,23,24,25 Another reason for performing various imaging tests for metastasis surveillance is to provide reassurance for disease-free breast cancer patients.26,27 However, it is worthwhile to note that an intensive surveillance approach may also cause harms to the patients. Patients often experience anxiety and fear during their follow-up exams, and intensive surveillance programs result in an increased financial burden to the healthcare system.19,24,28,29 Most importantly, earlier detection of distant metastasis by intensive use of imaging studies can increase the duration of palliative treatment without meaningful survival gain, an effect known as the lead-time bias.22,30,31 Interestingly, our data demonstrate that the use of intensive metastasis surveillance resulted in significantly earlier detection of lung and bone metastasis but not of liver metastasis. Lung and bone metastasis in breast cancer are associated with more indolent clinical progression compared to the liver metastasis that shows poor post-metastasis survival similar to the brain or multiple organ metastasis.32 These findings indicate that patients who develop lung or bone metastasis can experience a significantly longer duration of the lead-time bias effect when they undergo intensive surveillance.

Moreover, early detection with higher sensitivity exams may enable accurate and timely treatment as numerous ongoing trials investigating adjuvant therapeutics have shown promising results.13,14,15,16,33,34,35 Thus, in an era of novel imaging modalities and treatment regimens, continuous investigations will be necessary to establish concrete evidence on whether intensive surveillance leads to prolonged survival or merely contributes to prolonged lead-time bias without providing any survival benefits.

The current study has several limitations. First, the nature of the retrospective study carries an inherent selection bias. As observed in our results, patients with more aggressive tumor features tended to undergo intensive screening exams. A large-scale randomized-controlled study is currently being conducted in Japan to determine the impact of intensive follow-up on survival outcomes, and we expect that the results of this study would provide a more definitive answer to this issue.36 Another limitation of this retrospective study was that we were not able to collect accurate information on metastasis-related symptoms. Symptomatic patients with more advanced stage of metastasis may have conducted intensive surveillance before diagnosis. Moreover, we could not determine whether the cause of death was directly owing to breast cancer or to complications from treatment, such as sepsis caused by an infection during chemotherapy. However, given the relatively low incidence of deaths from treatment complications in patients with breast cancer, it might have a minimal impact on our results.37 Second, we did not review in detail whether the imaging tests were conducted for screening of breast cancer metastasis or for other medical purposes. However, it is important to note that imaging tests performed for other medical diseases can also identify breast cancer metastasis. Thus, including all tests conducted for various diseases during the disease-free period would be appropriate for the purpose of our study. Third, as we excluded male patients with breast cancer from the study, our results may not fully represent the real-world data. Finally, although our dataset represents a relatively recent cohort for whom most of the contemporary adjuvant treatment options are provided, there are some novel therapeutic options that were not available, such as CDK4/6 inhibitors or pertuzumab.16,33 The implication of these recently developed therapeutic options in breast cancer with distant metastasis should be further explored in the future.

Conclusions

In this multi-institutional retrospective study involving 4130 breast cancer patients who were treated between 2010 and 2011, the use of frequent imaging studies to detect distant metastasis resulted in earlier detection of distant metastasis, especially for lung and bone metastasis. However, intensive surveillance was not associated with prolonged BCSS.