Introduction

Breast cancer is the most frequently diagnosed malignancy and is currently considered the second most common death-related malignancy in females [1]. Previous statistical studies have shown that 266,120 women in the United States suffer from breast cancer within one year, resulting in a 15.4% mortality rate associated with breast cancer [2].

Axillary lymph node status (ALNS) is a long-standing concern in clinical practice and research as it is one of the most important prognostic indicators of distant metastases from breast cancer [3]. Sentinel lymph node (SLN) biopsy significantly improves axillary lymph node (ALN) staging. Simultaneously, using haematoxylin and eosin (H&E) staining and immunohistochemistry (IHC), we further delineated ALNS. According to the size of the tumour lymph node metastases, we divided them into macrometastases (Nx > 2.0 mm), micrometastases (Mic > 0.2 mm; ≤ 2.0 mm) and isolated tumour cells (ITC < 0.2 mm), which were categorised by the American Joint Committee on Cancer (AJCC) Cancer Staging ManualIn. Rather than the traditional dichotomy (i.e., ALN-negative or ALN-positive), ITC and Mic introduce a semi-quantitative nodal continuum that distinguishes traditional classifications. With the incidence of ITC ranging from 7 to 19% and that of Mic ranging from 8 to 45%, each of which has a high incidence rate [4,5,6,7,8,9].

The International Breast Cancer Study Group (IBCSG) 23–01 trials suggest that axillary lymph node dissection (ALND) should be contraindicated in patients with one or more Mic (≤ 2.0 mm) in SLN  when the tumours ≤ 5 cm. Provided that the patient was treated with conventional whole-breast radiotherapy and systemic adjuvant therapy, there will be no adverse impact on survival [10]. Moreover, analogous research includes the American College of Surgery in Oncology (ACOSOG) Z0011 trial, which indicates that 1–2 SLN-positive patients can be exempted from ALND with breast-conserving surgery followed by radiotherapy and adjuvant therapy [11]. In addition, an ongoing randomised clinical trial (the SERC trial) has the potential to further compare survival disparity in non-ALNDs across ALNS; unfortunately the trial has not yet reached a conclusion [12]. Accordingly, there is a continuous controversy about the survival and treatment of patients with axillary lymph node metastases especially for ITC and Mic.

Simultaneously, internal mammary lymph nodes (IMLN), as the second most important regional lymph node in breast cancer following ALN, are well known adverse prognostic factors and are associated with a significantly lower survival rate in ILMN-positive patients [13]. Yang et al. argued that neglecting IMLN clearance and systemic radiotherapy could result in a survival benefit. Also, it has been demonstrated that systemic adjuvant therapy for patients with IMLN is still relevant for long-term survival [14, 15]. However, there are few studies on IMLN, with the associated risk factors for prognosis not yet known.

In summary, although ALN and IMLN have been investigated by numerous specialists, the strategies for surgery and adjuvant treatment remain controversial and the risk factors for survival and prognosis remain unclear. Therefore, in this study, we used the Surveillance, Epidemiology, and End Result Program (SEER) database to identify the contribution of ALNS and IMLN to breast cancer and constructed nomogram models to predict patient survival, with the final score helping clinicians decide on the appropriate treatment strategy.

Methods

Data source

The data source for this study was the SEER database, which is a publicly available, comprehensive database of demographic information for selected US states and counties (approximately 35% of the US population), containing millions of patients, all from 18 local cancer registries (http://seer.cancer.gov/about/overview.html). The SEER database provides details on the epidemiological characteristics of patients, stage and grade of the primary tumour, varying treatment strategies, and follow-up appointments. Following a rigorous vetting and application process, we were granted access to the SEER database data and waived the right to informed consent (reference number 14492-Nov2020) [16].

We obtained demographic information from the SEER*Stat 8.4.0 software for 348,604 patients with breast cancer (the main components included ICD-O-3:8500/3, 8520/3, 8522/3, 8523/3) from 2004 to 2015. Furthermore, from the SEER, we further extracted the following: age, sex, race, grade, laterality, pathology, stage, T stage, regional nodes examined, oestrogen receptor status (ER-status), progesterone receptor status (PR-status), tumour extension, chemotherapy, radiation, surgery, ALNS, IMLN, and survival outcomes. Based on the survival data to which we had access, we further categorised patients’ survival outcomes into overall survival (OS), which is the time from the commencement of the patient's treatment strategy for any cause leading to death and breast cancer specific survival (BCSS), which is the time from the commencement of the patient's treatment until death due to breast cancer disease. The inclusion criteria were principally patients with a malignant pathological type and unilateral, primary, or single breast cancer. Moreover, the exclusion criteria were post-neoadjuvant therapy, distant metastases, recurrent tumours, multiple primary breast cancer lesions, coexistence with other tumours, and incomplete medical information. Data was collected according to the inclusion and exclusion criteria, and 279,078 people were enrolled for future evaluation and were randomly separated into a training set and a validation set (Fig. 1).

Fig. 1
figure 1

Flowchart for screening patient information derived from the SEER database. The patient details obtained contained a large number of missing data or data unsuitable for the study at the present, which also included CS size N = 21,043 and ALNS N = 15,210

Statistical analysis

We randomly divided the dataset into a training set, which was primarily used to construct the prediction model, and a validation set, which was primarily used for internal validation of the constructed nomogram. The applications used for the analysis and calculation of data were SPSS 22.0, with methods including the chi-square test, Kaplan–Meier method, log-rank test, univariate and multivariate analyses, and R studio 4.2.0 for the construction of nomograms (packages: rms), receiver operating characteristic (ROC) curves (packages: riskRegression and survival), calibration curves (packages: rms and survival), and decision curve analysis (DCA) curves (packages: ggDCA, rms, and survival).

The chi-square test was applied to evaluate the correlation between the training and validation sets and to verify baseline information for clinical classification. OS and BCSS curves were plotted and compared using the Kaplan–Meier method and log-rank test. The log-rank test was employed for univariate analysis, while Cox regression was utilized to construct the multivariate regression model. Hazard ratios (HRs) and 95% confidence intervals (CIs) were used for the training set. In the univariate analysis, factors with p < 0.05 were incorporated into the multivariate regression model for further analysis. Conversely, factors with p < 0.05 in the multivariate analysis were screened for nomogram construction. The predictive accuracy of the nomogram was demonstrated using a ROC curve and quantified by the area under the curve (AUC). Calibration plots were generated using the bootstrap method with 300 replications, and DCA assessed the clinical practicality and positive net benefits.

Results

Patients and pathological characteristics

We ultimately collected clinical characteristics from 279,078 breast cancer patients from 2004 to 2015 and randomly divided the patients into a training set (195,355) and a validation set (83,723) at a ratio of 7:3 (Supplement1). The main clinicopathological features and treatment strategies are summarised in Table 1. Among the 195,355 patients, 1295, 9236, and 1129 were in the ITC, Mic, and IMLN groups, respectively.

Table 1 Comparison of baseline information for breast cancer patients with different ALNS

In the training set, the median age of the patients was 60–64 years, with the majority being over 50 years (ITC: 978, 75.5%; Mic: 6669, 72.2%). Furthermore, patients were predominantly G2 in tumour grading (ITC: 642, 49.5%; Mic: 4532, 49.1%), with only a small number being G1 (ITC: 264, 20.4%; Mic: 1903, 20.6%). G4 (ITC: 5, 0.4%; Mic: 24, 0.2%) was predominantly undifferentiated, which is clinically extremely rare. In the N0 and ITC groups (N0: 134,037, 97.6%; ITC: 1254, 96.8%), most patients had localized staging; however, unlike in the Mic and Nx groups, all patients were staged regionally. There was no apparent difference in T-stage between patients in the ITC and Mic groups; however, in the N0 and Nx groups, N0 patients were mainly in the T1-stage while Nx patients were predominantly in the T2-stage. The vast majority of patients in the N0 group were positive for estrogen receptor (ER) and progesterone receptor (PR) status (ER+: 115,063, 83.8%; PR+: 110,430, 73.1%); however, a large number of patients in the ITC (ER−: 1132, 87.4%; PR−: 989, 76.4%), Mic (ER−: 8031, 87.0%; PR−: 7129, 77.2%), and Nx (ER−: 37,490, 78.9%; PR−: 32,100, 67.5%) groups tested negative. Approximately half of the patients in the ITC and Mic groups received radiotherapy and chemotherapy; however, in the N0 and Nx groups, more patients received radiotherapy, and only a small proportion opted for chemotherapy. The number of patients choosing ALND was higher in the Nx group (25,895, 54.5%) than in the N0 (13,772, 10.0%), ITC (156, 12%), and Mic (1992, 21.6%) groups. Since patients with IMLN were allocated to the Nx group, their clinicopathological features were not further described. Overall, the clinical characteristics were similar in the ITC and Mic groups, but in terms of treatment strategy, the two groups were more akin to N0 and were treated more favourably than the Nx group.

Survival analysis and prognostic risk factors

In the training set of all patients, the three-year OS was 92.8% and BCSS was 96.5%; moreover, the five-year OS was 88.3% and BCSS was 94.4%, while the ten-year OS and BCSS were 81.7% and 92.05, respectively. Furthermore, ALNS (OS: HR: 1.203, 95% CI 1.179–1.228, p < 0.001; BCSS: HR: 1.290, 95% CI 1.253–1.328, p < 0.001) was associated with OS and BCSS in both univariate analysis and multivariate Cox regression analysis (Table 2, Supplement 2), and we used Kaplan–Meier analysis to illustrate the survival differences (Fig. 2A, B, Table 3). There was no significant difference in survival curves between the N0 and ITC groups in terms of OS and BCSS, but both were higher in the OS group than in the Mic group; concurrently, patients in the ITC group had better survival than did those in the Mic group (OS: p < 0.001; BCSS: p < 0.001).

Table 2 Univariate and multivariate analysis conducted for all patients
Fig. 2
figure 2

Kaplan–Meier curve were analysed for OS versus BCSS in overall patients with different ALNS and IMLN. OS A of total patients in the ALNS was compared with BCSS B for survival between groups. Comparison between OS (C) and BCSS (D) for different metastatic states in patients with IMLN

Table 3 Kaplan–Meier comparing survival in individual groups

In the ITC group, the three-year OS was 96.1% and BCSS was 98.2%; furthermore, the five-year OS was 92.4% and the BCSS was 97.1%, and the ten-year OS was 87.2% and the BCSS was 95.5%, respectively. Univariate analysis was used to screen for five variables associated with OS and three variables associated with BCSS (Supplement 3), namely age (HR: 2.099, 95% CI 1.356–3.250, p = 0.001), chemotherapy (HR: 0.539, 95% CI 0.385–0.754, p = 0.001), and radiation (HR: 0.659, 95% CI 0.488–0.889, p = 0.006) (Table 4), while in BCSS only T-stage (HR: 1.907, 95% CI 1.351–2.692, p < 0.001) was determined to be associated with survival.

Table 4 Univariate and multivariate analysis with ITC group

In the Mic group, the three-year OS was 94.9% and BCSS was 97.3%; furthermore, the five-year OS was 90.6% and BCSS was 95.0%. The ten-year OS was 84.1% and 92.4%, respectively. Univariate analysis was used to ultimately screen and thirteen variables associated with OS and ten variables associated with BCSS, respectively (Supplement 4). Multivariate Cox regression analysis showed that age (HR: 2.172, 95% CI 1.881–2.510, p < 0.001), grade (HR: 1.292, 95% CI 1.194–1.399, p < 0.001), T-stage (HR: 1.426, 95% CI 1.321–1.538, p < 0.001), ALN (HR: 0.891, 95% CI 0.805–0.986, p = 0.026), ER (HR: 0.836, 95% CI 0.707–0.989, p = 0.037), PR (HR: 0.632, 95% CI 0.562–0.710, p < 0.001), tumour extension (HR: 1.568, 95% CI 1.296–1.897, p < 0.001), chemotherapy (HR: 0.470, 95% CI 0.422–0.524, p < 0.001), radiation (HR: 0.861, 95% CI 0.775–0.975, p = 0.006), and surgery (HR: 1.169, 95% CI 1.040–1.313, p = 0.009) were associated with OS and age (HR: 2.636, 95% CI 2.289–3.036, p < 0.001). Furthermore, grade (HR: 1.208, 95% CI 1.118–1.305, p < 0.001), T-stage (HR: 1.322, 95% CI 1.226–1.424, p < 0.001), PR (HR: 0.690, 95% CI 0.600–0.793, p < 0.001), tumour extension (HR: 1.649, 95% CI 1.364–1.995, p < 0.001), radiation (HR: 0.776, 95% CI 0.690–0.849, p < 0.001), and surgery (HR: 1.124, 95% CI 1.002–1.260, p = 0.046) was associated with BCSS (Table 5).

Table 5 Univariate and multivariate analysis with Mic group

Across the IMLN groups, Kaplan–Meier analysis revealed a distinct difference in OS and BCSS for patients with different IMLN (Fig. 2C, D). Moreover, the three-year OS was 78.9% and BCSS was 83.0%; the five-year OS was 69.6% and BCSS was 75.6%. The ten-year OS was 60.7% and BCSS was 69.3%, respectively.

We used univariate analysis to screen for ten variables associated with OS and ten variables associated with BCSS (Supplement 5), namely age (HR: 1.268, 95% CI 1.015–1.584, p = 0.037), grade (HR: 1.313, 95% CI 1.118–1.543, p = 0.001), T-stage (HR: 1.363, 95% CI 1.195–1.555, p < 0.001), PR (HR: 0.689, 95% CI 0.535–0.887, p = 0.004), grade (HR: 0.391, 95% CI 0.310–0.494, p < 0.001), radiation (HR: 0.737, 95% CI 0.604–0.899, p = 0.003) and IMLN status (HR: 1.455, 95% CI 1.277–1.657, p < 0.001) were determined to be associated with OS when using multivariate Cox regression analysis. Furthermore, in BCSS grade (HR: 1.397, 95% CI 1.155–1.690, p = 0.001), T-stage (HR: 1.455, 95% CI 1.248–1.697, p < 0.001), PR (HR: 0.636, 95% CI 0.478–0.847, p = 0.002), chemotherapy (HR: 0.487, 95% CI 0.367–0.646, p < 0.001), radiation (HR: 0.778, 95% CI 0.619–0.978, p = 0.031), and IMLN status (HR: 1.613, 95% CI 1.380–1.885, p < 0.001) were determined to be associated with survival (Table 6).

Table 6 Univariate and multivariate analysis with IMLN metastasis group < 

Construction and validation of nomogram for predicting survival

Within the training set of patients, independent prognostic risk factors associated with OS and BCSS were incorporated into the findings of clinical investigations, and multivariate Cox regression analysis was incorporated into the nomogram for 3-, 5-, and 10- year survival prediction. To better interpret the nomogram, we first drew a vertical line for each variable that corresponded to the score. Then, we added all the points and drew a vertical line over the total score to find the survival rates at 3, 5, and 10 years. The accuracy of the nomogram predictions was verified ROC, calibration, and DCA curves (Fig. 3A–G).

Fig. 3
figure 3

Construction of nomograms predicting 3-, 5-, and 10-years OS and BCSS in patients with ALNS versus IMLN in training set. Nomogram used to predict survival of ALNS vs IMLN. According to the patient information where each clinicopathological feature corresponds to a point at the top of the chart, the sum of all variables corresponds to a total point, and the bottom line perpendicular to the total point is the 3-, 5- and 10-years OS or BCSS. Predicting survival of OS (A) and BCSS (B) in total patients. Predicting survival of OS (C) and BCSS (D) in Mic group patients. Predicting survival of OS (E) and BCSS (F) in IMLN group patients. Predicting survival of OS (G) in ITC group patients

In the training set, combining previous research and clinical practice (according to previous studies, sex, race, and side of the patient were excluded), nine variables (age, grade, T-stage, radiation, chemotherapy, surgery, ER, PR, and lymph node status) associated with OS (Fig. 3A) and BCSS (Fig. 3B) were included in the nomogram for the prediction of all patients. In the ITC group, according to clinical research and recommendations for breast cancer therapy, three variables (age, radiation, and chemotherapy) associated with OS (Fig. 3G) were included. In the Mic group, eight variables (age, grade, T stage, radiation, chemotherapy, surgery, ER, and PR) associated with OS (Fig. 3C) and six variables (age, grade, T-stage, radiation, surgery, and PR) associated with BCSS (Fig. 3D) were included. In the IMLN group, seven variables (age, grade, T stage, PR, radiation, chemotherapy, and internal mammary lymph node status) associated with OS (Fig. 3E), and six variables (grade, T stage, PR, radiation, chemotherapy, and internal mammary lymph node status) associated with BCSS (Fig. 3F) were eventually included. A Nomogram was developed to predict overall survival versus disease-free survival at 3, 5, and 10 years. Among the cohorts, the AUC values indicated a satisfactory assessment of the model, whereas the ROC curve indicated that the model had excellent clinical prediction and credibility (Table 7, Fig. 4, Supplement 7). The calibration curves showed high concordance between the predicted and actual probabilities for overall survival and disease-free survival for 3, 5, and 10 years (Fig. 5) and high concordance in the validation set (Supplement 6). The DCA curves demonstrated that the nomogram was a greater predictor of OS and BCSS for patients at 3, 5, and 10 years (Fig. 6), with high concordance in the validation set (Supplement 8).

Table 7 AUC from different subgroups at 3-, 5- and 10- years
Fig. 4
figure 4

The ROC of the discriminant nomogram in training set, and the AUC values for each subset are shown in Table 7. ROC curves of OS (A) versus BCSS (B) in total patients. ROC curves of OS (C) versus BCSS (D) in Mic group patients. ROC curves of OS (E) versus BCSS (F) in IMLN group patients. ROC curve of OS (G) in ITC group patients

Fig. 5
figure 5

Verifying the predictive superiority of nomogram in training set, the X-axis of the calibration curve represents the predicted probability, the Y-axis represents the actual probability, and the dotted line represents the predictive ability of the calibration curve. Calibration curves of OS (A) versus BCSS (B) in total patients. Calibration curves of OS (C) versus BCSS (D) in Mic group patients. Calibration curves of OS (E) versus BCSS (F) in IMLN group patients. Calibration curve of OS (G) in ITC group patients

Fig. 6
figure 6

Decision curves used to predict ALNS vs. IMLN in training set, represented by the lines are shown. DCA curves of OS (A) versus BCSS (B) in total patients. DCA curves of OS (C) versus BCSS (D) in Mic group patients. DCA curves of OS (E) versus BCSS (F) in IMLN group patients. DCA curve of OS (G) in ITC group patients

Discussion

ALN metastasis and IMLN metastasis in breast cancer are well-known prognostic decision risk factors; moreover, ITC and Mic have a higher occurrence, but the treatment strategy and prognosis prediction for ALNS are still debated. While previous studies on ALN and IMLN have been conducted, others have had relatively limited sample volumes or lacked evidence of predictive survival [7,8,9, 17]. Furthermore, according to the latest available research, there are no established nomograms for predicting ALNS in breast patients. The nomogram is a simplified numerical stacking method that can assist clinicians in decision-making regarding treatment. In a retrospective review of 279,078 patients with breast cancer from the SEER database, clinicopathological characteristics, prognostic indicators, and survival were assessed, and a nomogram was constructed to predict OS and BCSS at 3, 5, and 10 years based on the results of the multivariate analysis.

Across the different nomogram prediction models, tumour stage and tumour extension were not incorporated into the construction of the nomogram because of conceptual overlap with T-staging. Similarly, in ALN, there was no clear distinction between SLN ≤ 2 or SLN ≤ 3 and the impact on clinical prognosis and treatment decisions in previous studies; thus, it was excluded in the construction of the nomogram.

Among prospective prognostic factors for patients with breast cancer, ALNS is an independent risk factor. However, Houvenaeghel et al. showed that ITC and Mic had no additional negative implications for OS and BCSS in a study of 8001 patients [18]. This contradicts our conclusion that patients with Mic had a worse prognosis than N0 patients did in terms of both OS and BCSS (Fig. 2A, B). However, no difference in survival was observed between patients in the ITC and N0 groups in this study. This is in accordance with the results of Herbert et al., who found that ITC had no adverse impact on OS or BCSS [19]. However, it is intriguing to note in Fig. 3A, B that the prediction of survival in patients with breast cancer was worse in the N0 group than in the ITC group. This may be due to several reasons: firstly, the higher number of individuals in the N0 group with more homogeneous clinicopathological features than in the ITC group; secondly, the wide time span during which lymph node biopsy techniques have improved, leading to better detection of ITC [5, 24]; finally, the fact that usually SLN ≤ 2 are performed (Table 1), which still does not exclude the possibility of misdiagnosis in the N0 group. The prognosis of ALN in breast cancer varies, particularly in mice, distinguishing between N0 and Nx. This difference necessitates the development of a more standardized treatment [9].

Prior experience with the ITC group is limited, and the therapeutic approach is usually consistent with that of the N0 group. One study demonstrated no difference in the recurrence-free survival between patients in the ITC group who underwent ALND and those in the non-ALND group [20]. Concordant with the outcomes of our study, ALND did not improve the OS and BCSS of patients, while adjuvant treatment was found to improve the OS of patients. A study by Maaskant-Braat et al., through a multivariate analysis of 9038 collected patients, demonstrated that receiving systemic therapy when ITC was present (OR 1.5, 95% CI 1.05–2.15) or Mic (OR 10.7, 95% CI 8.56–13.27) could enhance patient survival. Additionally, receiving adjuvant therapy increased the 5-year BCSS of patients, a finding that is consistent with our conclusion [21, 22]. Previous studies have suggested that age and tumour size are correlated with ITC prognosis. In contrast to the present study, in which age and tumour size (p = 0.08) were still considered important prognostic factors for OS and BCSS, we only found a correlation with tumour size (Table 4) [23]. Therefore, to study BCSS, we needed a broader sample to construct the nomogram.

Substantial evidence of Mic as a significant risk contributor has been demonstrated in previous research, as illustrated in the study by Yvette Andersson et al., comparing 5-year specific survival with BCSS in the N0 group (94.1% v 96.9% and 79.6% v 87.1%) [24]. Chen et al. also performed a multivariate analysis of patients with Mic using the SEER database and found that sex, ER, PR, lobular histology, grading, age, and T stage were significant prognostic risk factors [25]. Furthermore, Hetterich et al. compared adjuvant chemotherapy in 540 patients with Mic and discovered that adjuvant chemotherapy administered to the N0 group versus the Mic group did not improve OS or BCSS [5]. Compared with the present study, in addition to our evidence of the therapeutic benefit of chemotherapy in both OS and BCSS, we refined our prognostic hazard indicators and predicted OS and BCSS for patients based on clinicopathological characteristics and varying therapeutic regimens using a nomogram.

The most controversial area is whether micrometastases deserve ALND, as opposed to other ALNS. One study has shown that in cases of mice, exemption from ALND increases the 5-year recurrence rate [7]. Houvenaeghel et al. also demonstrated that waiving ALND increased recurrence and mortality following prolonged follow-up and that the procedure's conditions needed to be rigorously managed [27]. Meanwhile, studies by A. Suyoi et al., Viviana Galimberti et al., and Tsai-Wei Huang et al. have indicated that dispensing with ALN is secure for mice with a light tumour burden. Hennigs A et al. also demonstrate that the incidence of ALND is decreasing when the criteria for inclusion in ACOSOG Z0011 are met. Accordingly, we identified a preference for ALND in patients with a high axillary burden [28,29,30,31,32]. Furthermore, Collins et al. described how lymph node oedema from ALND far exceeds the recurrence rate [33]. In the aforementioned study, we also concluded that ALND could be eliminated in mice. First, adjuvant therapy has a higher value than ALND does, for which chemotherapy is particularly crucial. Second, in the nomogram, we discovered that ALND reduces the survival rate, which may be due to severe postoperative complications (Fig. 3C, D).

With regard to lymph node metastasis in the breast, the inevitable implication of IMLN metastasis was shown by Qi et al. to be significantly more pronounced in patients with positive ALN; however, the effect of IMLN on OS and BCSS was not found [34]. With regard to therapeutic aspects, it only proved that the prognosis of clinically diagnosed IMLN metastases was better than that of distant and IMLN metastases detected by fine-needle aspiration biopsy [35, 36]. Notably, some specialists have used mathematics to predict the prognosis of patients with IMLN metastases [37]. In contrast, our review focused more on the outcomes of patients with IMLN and quantified the importance of radiotherapy and chemotherapy in conversion to IMLN treatment. We demonstrated the significance of surgery in both OS and BCSS on univariate analysis (Table 6), but no relationship with survival was found on multivariate analysis, principally because fewer than 10 patients with IMLN metastases underwent IMLN dissection for statistical analysis. Therefore, further investigations are needed to determine the survival benefit of IMLN dissection for patients.

ALNS as a significant independent prognostic risk factor for breast cancer has been of major concern to scholars. Nevertheless, there is a paucity of available research on ITC, Mic and IMLN, as well as a lack of uniformity in treatment and long-term prognosis in the clinic. The present review constructs a nomogram about diverse ALNDs for a new therapeutic reference strategy for breast cancer patients that present with lymph node metastases. For example, for patients with Mic, which the treatment is more controversial, clinicians can apply the nomogram to predict long-term OS and BCSS in accordance with the patient's actual condition. With survival prediction, combined with clinical realities, it assists the clinician in making a decision about whether this patient should be treated with step-up or step-down therapy. An individualised therapeutic strategy for patients to maximise clinical survival benefit and achieve nomogram application.

However, there is still a limitation in the current study. First, the different molecular subtypes of breast cancer play an important role in determining the prognosis of patients; nonetheless, in the present study, due to database limitations, we were only able to retrieve data on ER and PR, while we could not further discuss HER-2 overexpression. Second, owing to the sample of patients in the ITC group itself, the ITC patients included in the SEER database all presented relatively well-characterised clinicopathology and survival outcomes, with greater survival rates, resulting in an AUC value that did not reach the best value interval in the final validation of the nomogram. Finally, it is interesting to note that we found that hormone receptor positivity in all subgroups indicated a better prognosis; however, it was not feasible to ascertain whether they benefited from endocrine therapy because of the limited information on treatment in the database. Consequently, we provide clinicians with a reference for managing different lymph node states, although there are still aspects that need to be addressed by clinical patients for further research.