Background

Axillary lymph node (ALN) status is a critical prognostic factor in the therapeutic plan of breast cancer patients because it can determine the extent of surgery and assess the necessity of chemotherapy or radiotherapy [1,2,3,4]. Axillary lymph node dissection (ALND) is used to assess ALN status, provide accurate staging of axillary lymph nodes and eliminate potential metastatic lymph nodes, and it is the standard surgical method for patients associated with greater lymph node burden [3,4,5,6]. However, ALND can cause severe complications, such as shoulder dyskinesia and arm lymphedema, which can have a negative impact on quality of life [7,8,9,10,11,12]. Hence, avoiding excessive ALND becomes a pressing issue. The results of the ACOSOG Z0011 trial showed no statistically significant differences between the ALND and no-ALND groups in terms of local recurrence rate and 10-year overall survival rate in patients with < 3 axillary lymph node metastases [13]. Patients with ≥ 3 axillary lymph node metastases (high axillary nodal burden) are more likely to have local recurrence, and are thus considered suitable for neoadjuvant chemotherapy or ALND [14,15,16,17]. Therefore, preoperative prediction of lymph node metastasis and identification of patients with high axillary nodal burden are crucial to determine the appropriate therapeutic management.

Preoperative imaging, including ultrasound (US), mammography, computed Tomography, magnetic resonance imaging, positron emission tomography etc., has become increasingly important and more widely used in assessing ALN metastasis in patients with breast cancer [18,19,20,21]. Compared with other imaging modalities, ultrasound is more cost-effective, non-invasive and reproducible [14, 15, 22]. However, Hieken et al. showed a false positive rate of 79.8% for suspicious axillary ultrasound results according to pathological examination [23]. Therefore, it is insufficient to assess ALN burden by axillary ultrasound alone. Fortunately, the US characteristics of primary breast lesion are reported to be obviously associated with high ALN metastasis [2425]. In addition, certain clinicopathological features of patients with breast cancer are related to ALN metastasis [26,27,28]. The purpose of this study was to integrate the ultrasonic features of lymph nodes and primary lesions with clinicopathological characteristics to identify independent predictors and develop a model to predict high-burden lymph node (≥ 3) in patients with breast cancer and prevent unnecessary ALND.

Materials and methods

This study design followed the international regulations according to the Declaration of Helsinki. Our research was approved by the Ethical Committee of the Affiliated Hospital of Nantong University (2022-K108-01) and Affiliated Hospital of Jiangsu University (KY2021K1213), and written informed consent was obtained from participants.

Patient enrolment

Patients who underwent breast surgery and US examination at the Affiliated Hospital of Nantong University (centre 1, n = 1232) and at the Affiliated Hospital of Jiangsu University (centre 2, n = 566) between January 2012 and December 2016 and between January 2020 and March 2022, respectively, were deemed eligible for this study (n = 1798). Patients were included if they: [1] had clinical N1 or N0; [2] had a preoperative breast US and axillary US performed within two weeks of surgery, which recorded US characteristics of the primary breast tumour and axillary lymph nodes; [3] had breast surgery and axillary lymph node dissection; [4] had pathology which documented the number of axillary lymph node metastasis; [5] and had a pathology tumour size that was T1 or T2. Exclusion criteria were as follows: [1] patients with a history of other malignant tumours and ipsilateral axillary surgery history; [2] patients who underwent preoperative chemotherapy, radiotherapy or immunotherapy and [3] absence of clinicopathological or US information. Finally, 347 patients were included in this study (Fig. 1).

Fig. 1
figure 1

Flowchart of the study population

US analysis

All patients included in this study underwent a preoperative breast and axillary US examination within two weeks of surgery using the GE LOGIQ E9 with a linear array transducer (12–15 MHz). Patients were positioned flat with bilateral arms raised to fully expose the bilateral breasts and axilla. US parameters, such as gain, depth, focal length, etc., were adjusted to enable a clear display of the primary lesion. The primary lesion or lymph node used for the assessment was situated in the central part of the ultrasound screening. Subsequently, we scanned the lesion from multiple angles and acquired images of the primary lesion and lymph nodes. The images were then stored for further analysis.

US features of the primary tumour and ALNs were observed independently by two radiologists who were blinded to any information that could interfere or bias with their task. Potential disagreements or differences were arbitrated by a third experienced radiologist to reach a consensus. The US features of the primary tumour were analysed, including quadrants (upper outer, upper inner, lower outer and lower inner quadrants), margins (circumscribed or non-circumscribed), orientation (parallel or non-parallel), shape (regular or irregular), attenuation (weak or not) and calcification (with or without). Furthermore, axillary US measured the longitudinal to transverse ratio (< 2 or ≥ 2) and cortical thickness (< 3 mm or ≥ 3 mm), and determined the absence of hilum (yes or not).

Clinicopathological analysis

Surgical-histopathologic data included the number of metastatic lymph nodes (< 3 or ≥ 3), T stage (≤ 2 cm or 2–5 cm), histological grade (I, II or III) and the expression of estrogen receptor (ER), progesterone receptor (PR), CerbB-2, P53 and Ki-67 from histopathology reports. ER and PR positivity were defined as the expression of greater than 1% [29], while Ki67 positivity was defined as the expression of greater than 14% [30]. CerbB-2 receptor of 3 + in HE or 2 + in gene amplification was defined as human epidermal growth factor receptor 2 (HER-2) positive [29, 31].

Statistical analysis

Statistical analysis was conducted using the SPSS software (ver. 24.0; SPSS Inc., Chicago, IL, USA), the MedCalc software (ver.19.07) the R software (ver 4.0.1). The enrolled patients were randomly divided into the training group and the verification group according to the ratio of 7:3. X2 tests or Fisher’s exact test were performed between the training and the validation groups. Univariate logistic regression analysis was used to identify factors that could significantly affect the training group. Multivariate logistic regression analysis was applied to determine independent predictors of the number of axillary lymph node metastasis and incorporate them into the model. The area under the area under curve (AUC), accuracy (ACC), sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) was used to assess model discrimination. Delong test was conducted to compare different diagnostic models across the training and validation cohort for nomogram, cortical thickness, lymphatic gate, and T stage by MedCale. Finally, a calibration diagram was drawn to evaluate the ability of calibration and the fit of the model was assessed by the Hosmer–Lemeshow goodness-of-fit test.

Results

Baseline characteristics

Table 1 shows the characteristics of the research population. Patients were divided into the training group (n = 243) and the verification group (n = 104) with a ratio of 7:3. Fifty-six (23.0%) and 30 (28.8%) patients had ≥ 3 lymph node metastases of primary breast cancer in the training and validation cohorts, respectively. There were no significant differences in the US characteristics and clinicopathological parameters between the two groups. The mean ages of the training and validation groups were 55.84 ± 10.74 and 55.54 ± 11.66, respectively.

Table 1 Characteristics of the training and validation cohorts

Univariate and multivariate analyses

In the univariate analysis, variables that were significantly associated with ≥ 3 lymph node metastases included cortical thickness (p < 0.001), longitudinal to transverse ratio (p = 0.001), absence of hilum (p < 0.001), T stage (p = 0.002) and Ki-67 (p = 0.039) (Table 2 and 3). The remaining factors were not found to be significant for the identification of high-burden lymph nodes (all p > 0.05). In the multivariate logistic regression analysis, cortical thickness (p = 0.001), absence of hilum (p = 0.042) and T stage (p = 0.012) are shown in Table 4. Ki-67 and longitudinal to transverse ratio were not independent predictors. Cortical thickness, absence of hilum and T stage were considered as independent predictors of HBN, and these parameters were then incorporated into the predictive model to create a nomogram (p < 0.05).

Table 2 Univariable analysis ultrasound features of lymph nodes in the training cohort
Table 3 Univariable analysis of ultrasound and clinicopathological features of primary tumor in the training cohort
Table 4 Comparison of the multivariable in the training cohort

Development of the nomogram

Based on the results of the multivariate logistic regression analysis, cortical thickness, absence of hilumand T stage were incorporated to create a nomogram (Fig. 2). The nomogram had an AUC of 0.749 (95% CI: 0.676–0.823), sensitivity of 71.4%, specificity of 75.9%, PPV of 67.7% and NPV of 81.1% (Table 5). The AUCs of cortical thickness, lymphatic gate and T stage were 0.690 (95% CI: 0.609–0.770), 0.639 (95% CI: 0.551–0.728) and 0.621 (95% CI: 0.539–0.703), respectively (Fig. 3). The AUC of model was greater than the AUCs of cortical thickness (p = 0.003), lymphatic gate (p < 0.001) and T stage (p < 0.001) (Table 5). The C-index of this model was 0.749 (95% CI: 0.677–0.820). The Hosmer–Lemeshow-Goodness-of-Fit test had a p-value of 0.995 and the calibration plot is shown in Fig. 3. Through bootstrap validation, the C-index of the nomogram was considered 0.68. The decision curve analysis (DCA) showed good net benefits in the training set in Fig. 4.

Fig. 2
figure 2

Nomogram for predicting axillary lymph node ≥ 3 metastasis

Table 5 The ROC analysis of cortical thickness, hilum, T stage and the model in the training cohort
Fig. 3
figure 3

Development and validation of a nomogram to predict axillary lymph node ≥ 3 metastasis. a Receiver operating characteristic curves of the model. b Calibration plot of the model. In the calibration plot, the dotted line at a 45° angle represents perfect calibration

Fig. 4
figure 4

Decision curve analysis for the nomogram model in the training set

Validation of the nomogram

The validation group model consisted of 104 patients. The AUC of the prediction model for the validation group was 0.783 (95% CI: 0.685–0.881) (Fig. 5 and Table 6). The C-index of the validation group was 0.783(95% CI:0.685–0.881). The AUC of model was greater than the AUCs of cortical thickness (p = 0.009), lymphatic gate (p = 0.001) and T stage (p = 0.007) (Table 6). The P-value of the Hosmer–Lemeshow-Goodness-of-Fit test was 0.783. The DCA had good net benefits in the validation group (Fig. 6).

Fig. 5
figure 5

Validation of the nomogram to predict axillary lymph node ≥ 3 metastasis. a Receiver operating characteristic curves of the model in validation cohort. b Calibration plot of the model in validation cohort. In the calibration plot, the dotted line at a 45° angle represents perfect calibration

Table 6 The ROC analysis of cortical thickness, hilum, T stage and the model in the validation cohort
Fig. 6
figure 6

Decision curve analysis for the nomogram model in the validation set

Discussion

Although US examination is currently one of the most widely used and important imaging technologies, it is not completely accurate in predicting high-burden lymph nodes [32, 33]. Predicting high lymph node burden can guide individualised treatment strategies with respect to the application of neoadjuvant chemotherapy and selection of the type of axillary surgery (SLNB vs. ALND) [34]. Therefore, prediction of lymph node metastasis and identification of patients with high axillary lymph node loads are both essential and challenging processes. The main strength of this study is that we successfully integrated US and clinicopathological features of lymph nodes and primary lesion ultrasound and establish a nomogram that could predict a high axillary lymph node burden.

In our study, the cortical thickness and lymphatic hilum of lymph nodes and the T stage of the primary lesion were found to be independent predictors of high-burden lymph nodes. Based on these three parameters, we established a nomogram to predict high-burden node (HBN), and our results showed that its AUC was 0.749, i.e. a satisfactory predictive value. In the nomogram, cortical thickness was more important. The point of cortical thickness, which was ≥ 3 mm, was 100 points, and thus greater than the lymphatic hilum of lymph nodes and T stage. The US characteristics of the lesion proved impossible to identify high-burden lymph nodes. However, according to the studies performed by Torstenson and Ansari, the distance between the tumour and nipple and the distance between the tumour and skin were significantly correlated with positive lymph node metastasis [35, 36]. In Yi’s study, the distance from the nipple was interconnected with high-burden lymph nodes [24]. Since this was a retrospective study, the US report failed to count the distance from the primary lesion to the nipple, which may have caused a degree of impact on the diagnostic efficiency. To our knowledge, only a limited number of studies have combined ultrasound of axillary lymph nodes, ultrasound of primary lesions and clinicopathological characteristics. Our study could comprehensively evaluate the relationship between these three parameters and high burden lymph nodes metastasis.

There are still some limitations in our research. First, this study was a retrospective study. In the axillary ultrasound report, we mainly focused on lymph nodes with large size or potential malignancies, while some lymph nodes with small size but abnormal morphology were ignored. Consequently, there may be sample selection deviation. Second, due to the small sample size, the low-burden group included a significantly greater number of cases compared to the high-burden group, which may influenced our results. Third, we did not statistically analyse the blood flow in the lymph nodes and the primary lesion. In addition, the clinical and pathological factors used in developing the nomogram were obtained postoperatively, making it challenging to directly predict the burden of lymph nodes before surgery. However, the study added information about ultrasound results, and future research may require additional studies incorporating MRI, molybdenum target and clinically assessable pathological factors before surgery and establish a preoperative prediction model. Therefore, future studies with a larger sample size and more comprehensive characteristics are expected to confirm the clinical application of the present model.

Conclusion

In conclusion, we established a nomogram integrating the US and clinicopathological features of axillary lymph nodes and primary breast lesions to predict high-burden lymph nodes metastasis. Lymph node cortical thickness, lymphatic hilum and T stage were found to be important indicators for predicting high-burden lymph nodes metastasis, and these parameters are expected to be helpful in clinical practice in terms of reducing unnecessary ALND and identifying the patients requiring neoadjuvant chemotherapy.