Introduction

Axillary lymph node (ALN) staging remains critical in the management of patients with breast cancer; it helps determine the clinical stage, treatment plan, and prognosis [1]. The standard of reference and the most accurate way for evaluating ALN status in breast cancer patients is surgical lymph node biopsy, either axillary lymph node dissection (ALND) or sentinel lymph node biopsy (SLNB) [2, 3]. SLNB, which has been associated with less morbidity, has gradually become the dominant surgical ALN evaluation procedure in patients with clinical node-negative primary operable breast cancer [4,5,6,7,8,9].

However, SLNB is not a risk-free procedure, which is related to about 6% axillary lymph edema rate and up to 10% morbidity risk [10, 11]. Non-invasive lymph node evaluation methods that result in lower morbidity than surgical ALN biopsy without compromising disease control remain the goal of modern imaging studies. The non-invasive imaging modalities for assessing ALNs are rapidly evolving, and magnetic resonance imaging (MRI) has been one of the potentially promising tools. A meta-analysis revealed the pooled diagnostic sensitivity and specificity of MRI to detect ALN metastasis in patients with breast cancer were 0.77 (95% confidence interval [CI] 0.750.80) and 0.90 (95% CI 0.890.91), respectively [12]. Recent studies showed the negative predictive value (NPV) was around 80% and accuracy was 70–80% for MRI to detect ALN metastasis in breast cancer patients [13,14,15,16,17,18,19,20,21,22,23,24,25,26], which is acceptable but remained space to improve.

In the past 10 years, breast cancer diagnosis and treatment have been greatly influenced by the concept of different intrinsic subtypes [27], which have different patterns of disease presentation, metastatic spread, and response to treatment [28, 29]. However, there is limited information about the diagnostic accuracy of breast MRI in the prediction of ALN metastasis regarding different intrinsic subtypes. We hypothesized that the intrinsic subtype might influence the diagnostic accuracy of MRI in the prediction of ALN metastasis, and by combining pre-operative clinicopathologic factors, intrinsic subtypes, and MRI findings, we could further identify some relatively low-risk groups of patients with invasive breast cancer and improving the diagnostic performance of MRI for ALN.

Material and methods

Patients

Women with primary operable invasive breast cancer who underwent breast surgery during the period of January 2009 to December 2021 were retrospectively recruited in this cohort study. Patients were systemically excluded from the study if there was no pre-operative MRI, non-invasive breast cancer, received a neoadjuvant treatment (chemotherapy, hormone therapy or radiotherapy), or had a locoregional recurrence, or if the axillary status was not mentioned in the MRI report or pathology report. The clinicopathologic factors gathered from the database include age, tumor location, biopsy method, pathologic tumor size, histology, tumor grade, status of estrogen receptor (ER), progesterone receptor (PR), human epithelial growth factor receptor 2 (HER-2) expression, and Ki-67 percentage. The study was approved by our Institutional Review Board and granted a waiver of informed consent.

Diagnostic accuracy of breast MRI to predict ALN metastasis in different intrinsic subtypes

Diagnostic performance parameters (sensitivity, specificity, positive predictive value (PPV), NPV, and accuracy) were calculated for breast MRI. In the subgroup analysis, these diagnostic performance parameters were calculated for each intrinsic subtype using immunohistochemistry (IHC) surrogate markers [30]. The subtypes were Luminal A (ER > 1% positive, PR ≥ 20% positive, HER-2 negative, Ki-67% ≤ 14%), Luminal B1 (ER > 1% positive, and/or PR < 20% positive, HER-2 negative, Ki-67% > 14%), Luminal B2 (ER > 1% positive, and/or PR > 1% positive, HER-2 positive, Ki-67% ≤ 14%), HER-2 (ER and PR negative, HER-2 positive), and triple-negative breast cancer (TNBC) (ER/PR/HER-2 negative).

MRI protocols

MR imaging was performed with a 3.0 Tesla MRI machine (Siemens MAGNETOM Verio, Munich, Germany). All patients were imaged in the prone position with both breasts placed into a dedicated 16-channel breast coil. MR imaging protocols included the following: bilateral axial turbo-spin-echo fat-suppressed T2-weighted imaging (TR/TE 4630/70 ms; field of view 320 mm; slice thickness 3 mm; number of excitations (1), axial turbo-spin-echo T1-weighted imaging (TR/TE 736/9.1 ms; field of view 320 mm; slice thickness 3 mm; number of excitations (1). Dynamic contrast-enhanced MR images (DCE-MRI) were obtained with a three-dimensional fat-suppressed volumetric interpolated breath-hold examination (VIBE) sequence with parallel acquisition once before and five times after a bolus injection of gadobenate dimeglumine (0.1 mmol/kg). Both breasts were examined in the transverse plane at 60 s intervals in each phase of the dynamic studies. The dynamic MRI parameters were as follows: TR/TE 4.36/1.58 ms; field of view 320 mm; slice thickness 1 mm.

Evaluation of axillary lymph nodes via MRI

Three radiologists with 37, 18, and 12 years of breast imaging experience performed the breast MRI interpretation and made the reports. The ALNs were assessed by nodal morphology and size on T2-weighted and contrast-enhanced T1-weighted sequences. Morphology criteria of pathologic nodes were considered when there were one or more of the following features: cortical thickening greater than 3 mm, abnormal lymph node shape (round or not uniform), completely/partially effaced fatty hilum, or asymmetry compared with the contralateral side [31].

Statistical analyses

Data were expressed as mean ± standard deviation for continuous variables and numbers (percentage) for categorical or ordinal variables. Differences in the MRI ALN diagnostic performance regarding true-positive, false-positive, true-negative, and false-negative numbers among intrinsic subtypes were evaluated by Kruskal–Wallis test for non-normal distribution. Significant predictors in the univariate analysis were included in a multivariate logistic regression model to identify the most important predictors. The incidence of metastatic ALN, the NPV of MRI, the nodal status distribution, and the average number of false-negative ALNs were calculated to assess the MRI efficacy after subgrouping by tumor size on pathology and imaging, as well as by intrinsic subtype. These covariates were chosen based on multivariate findings. Statistical analyses were performed by statistical experts using Statistical Product and Service Solutions (SPSS) for Windows (Version 19.0, SPSS Inc, Chicago, IL).

Results

Study participants

A total of 2473 patients with primary operable invasive breast cancer who underwent pre-operative breast MRI evaluation and post-operative pathologic ALN biopsy results were enrolled in the current study (Fig. 1). Among them, 861 (34.8%) patients had pathologically confirmed metastatic ALNs, and 1612 (65.2%) had negative ALNs. According to intrinsic subtype classifications, there were 932 (39.4%) Luminal A, 760 (32.1%) Luminal B1, 307 (13%) Luminal B2, 176 (7.4%) HER-2, and 192 (8.1%) TNBC patients. The demographic data and histological results of the 2473 patients are summarized in Table 1.

Fig. 1
figure 1

Study design and patients’ allocation of current MRI axillary lymph node diagnostic performance evaluation. ALN: axillary lymph node. HER-2: human epidermal growth factor receptor 2 TNBC: triple-negative breast cancer

Table 1 Demographic data and tumor characteristics of the 2473 Patients

MRI ALN prediction in different intrinsic subtypes

The diagnostic performance of breast MRI to predict ALN metastasis was calculated: the sensitivity was 63.2% (544/861), specificity 68.5% (1104/1612), NPV 77.7% (1104/1421), PPV 51.7% (544/1052), and the overall accuracy was 66.6% (1648/2473) (Table 2).

Table 2 Diagnostic performance of MRI on axillary lymph node metastasis per molecular subtype and subgroup analysis

The accuracy of breast MRI for detecting metastatic ALNs in different intrinsic subtypes was evaluated and is summarized in Table 2. The NPV was highest in TNBC (86.4%), and lowest in Luminal B2 (71.7%). The sensitivity was highest in the TNBC group (77.8%), Luminal A was associated with the highest specificity (76.2%), and the highest PPV was in the Luminal B2 group (58.7%). The distribution of true-positive, true-negative, false-positive, and false-negative between intrinsic groups was significantly different (p = 0.007).

Axillary lymph node tumor burden and MRI predictive value

The sensitivity of MRI ALN metastasis prediction increased significantly from 57.1% in N1 (1–3 positive nodes) to 78.2% in N2 (4–9 positive nodes), and 86.7% N3(≥ 10 positive nodes). In patients with no to low axillary tumor burden (N0 + N1), the sensitivity of MRI to detect positive ALN was 57.1%, and in patients with high axillary tumor burden (N2 + N3), the sensitivity increased to 81.1% (Table 2).

Prediction of ALN metastasis by clinicopathologic biomarkers

Using univariate analysis, we found that pathologic tumor size, MRI tumor size, histologic type, histologic grade, ER, PR, and Ki-67 were statistically significant predictors of ALN metastasis. Multivariate analysis identified pathologic tumor size (Odds ratio (OR) = 1.48), MRI tumor size (OR = 1.19), histologic type (non-invasive ductal carcinoma (IDC) versus IDC, OR = 0.47), higher histologic grade (OR = 1.33), and PR (OR = 1.91) were independent predictive factors of ALN metastasis (Table 3).

Table 3 Risk factors for axillary lymph node metastasis in breast cancer patients

Prevalence of ALN metastasis and NPV of MRI by combining clinicopathologic and imaging factors

The ALN-positive rate was 29.5% (275/932) in Luminal A, 41.2% (313/760) in Luminal B1, 43.6% (134/307) in Luminal B2, 30.1% (53/176) in HER-2, and 29.7% (57/192) in TNBC. Based on the results of multivariate analysis (pathologic & MRI tumor size), four individual risk groups of nodal involvement for each intrinsic subtype were generated (Table 4). The prevalence of ALN metastases, NPV, average false-negative (FN) LN, and distributions of lymph nodes (N1, N2, and N3) were further summarized and correlated to different intrinsic subtypes and tumor size for further evaluation. Patients with HER-2 (MRI tumor ≤ 2 cm), or TNBC (MRI tumor size ≤ 2 cm, MRI tumor size ≤ 3 cm, and pathologic tumor size ≤ 3 cm) were found to have MRI-ALN-NPV higher than 90%, and these FN cases were limited to low axillary tumor burden (N1, Table 4).

Table 4 Correlation of intrinsic subtype, tumor size, lymph node metastasis, and MRI-negative predictive value

Discussion

Our current study enrolled 2473 primary operable invasive breast cancer patients with detailed pre-operative breast MRI evaluation and post-operative ALN pathologic results for diagnostic accuracy analysis. We analyzed the performance of MRI in ALN metastatic status evaluation and found a significant difference between intrinsic subtypes. Factors related to ALN metastasis were also analyzed with univariate and multivariate analyses. We found that in patients with pre-operative MRI showing negative ALN metastasis and small tumor size, some intrinsic subtypes (HER-2 and TNBC) patients were associated with high NPV, relatively low risk of FN ALN, and even in FN cases were limited to low axillary tumor burden (N1).

Owing to more effective pre-operative evaluation and locoregional adjuvant therapies, the potential risks of axillary surgery may outweigh its actual benefits, especially in early-stage breast cancer patients treated with breast-conserving surgery [6, 32]. In 1994, Giuliano reported that SLNB is a highly reliable modality in axillary staging [3]. Since then, there has been a trend towards minimizing invasive staging and treatment of the axilla in clinically node-negative breast cancer patients owing to increased arm morbidity and decreasing quality of life after ALND [8]. In our current cohort, the positive ALN rate was only about a third (34.8% (861/2473), Table 1) in patients with primary operable invasive breast cancer patients, which echoed the need for less-invasive ALN evaluation and surgical treatment policy in breast cancer of screening era.

In the past decade, many studies were designed to investigate non-invasive imaging staging of the axilla [33, 34] to decrease surgical morbidity and, thus, improve patients’ quality of life. MRI is widely used on breast cancer patients for pre-operative assessment of disease extent and ALN status, screening of the contralateral breast, and evaluation of post-neoadjuvant treatment outcome [35, 36]. In the current study, we found that breast MRI is associated with 63.2% sensitivity, 68.5% specificity, 77.7% NPV, 51.7% PPV, and 66.6% accuracy. These results were consistent with previous literature reported series summarized in Table 5, and overall, MRI is associated with an NPV of around 68.4% ~ 85%, and accuracy in a range of 66.6% ~ 90%. When patients were segregated into different axillary disease extent, the sensitivity increased from 57.1% (N1) in low tumor burden cases to 78.2% (N2) or 86.7% (N3) in high tumor burn patients (Table 2). It meant that in patients with no to low axillary tumor burden (N0 + N1), the sensitivity of MRI to detect positive ALN metastasis was 57.1% (PPV 42.1%), and in patients with high axillary tumor burden (N2 + N3), the sensitivity significantly increased to 81.1% with a 100% PPV100%.

Table 5 MRI diagnostic performance on axillary lymph node in current study combined with literature review

The incorporation of breast cancer “intrinsic subtype” information into clinical breast cancer assessment and treatment planning became an important step toward personalized medical care [30]. Hence, examining MRI results among different breast cancer intrinsic subtypes is an emerging area of research. Our study demonstrated that the diagnostic performance of ALN evaluation via MRI was significantly different among intrinsic subtypes (p = 0.007, Table 2). The luminal A (69.5%) breast cancer was associated with the highest overall accuracy while HER-2 (58.9%) was the lowest. The overall NPV of MRI was 77.7%, and this NPV could be increased to “86.4% and 79.7%” in “TNBC and Luminal A,” and dropped to “71.7% and 73.5%” in Luminal B2 & Luminal B1 type breast cancer. These variations in the diagnostic performance of MRI in ALN evaluation, which were important and rarely reported before, reminded clinical physicians the awareness of the impact of intrinsic subtypes on the accuracy of imaging interpretations.

To improve the diagnostic accuracy of pre-operative non-invasive imaging, we tried to identify clinicopathologic factors related to ALN metastasis with univariate and multivariate analysis. Pathologic tumor size (odds ratio, OR = 1.48), MRI tumor size (OR = 1.19), higher histologic grade (grade II, III versus I, OR = 1.33), and PR positivity (OR = 1.91) were significant independent risk factors (Table 3). These results were consistent with previous studies [37,38,39] showing that tumor size, either from pathology reports or from MRI reports, was an important independent predictive factor of ALN metastasis. By combining IHC biomarkers (intrinsic subtypes) and anatomical features (tumor size) that associated with ALN involvement, we further improved the NPV of ALN by MRI up to more than 90% in some groups of patients. Patients with HER-2 with pre-operative MRI tumor (≤ 2 cm) or TNBC breast cancer with pre-operative MRI tumor (≤ 2 cm) were found to have MRI-ALN-NPV of 90.1%, and 92.3% (92.5% in pathologic tumor ≤ 2 cm), separately, which apparently increased the NPV of 77.7% of MRI in the general population (Table 4).

The NPV is higher for MRI ≤ 3 cm than for pathology size ≤ 3 cm in nearly all molecular subtypes (Table 4). Onesti et al. and our previous study reported that MRI tumor size correlates with pathology size but tends to overestimate [40, 41]. The reason might be attributed to a smaller actual size in the MRI group than in the pathology group, which leads to less axillary involvement. Thus, using tumor size from MRI is a more favorable criterion than using size from a pathology report, which is only available post-operation. The ALN-positive rate was 29.7% in TNBC, 34.8% in ER( +)/HER-2(−), and 38.7% of HER-2( +)/[ER(−) or ER( +)] breast cancer in the current study (Table 4). Similar to our study, Lu et al. [42] and Houvenaeghel et al. [43] reported that TNBC had a lower probability of node metastasis. Owing to a high NPV and lower possibility of ALN metastasis in the TNBC group, ALN staging by MRI may be useful for the design of neoadjuvant chemotherapy or surgical planning.

Currently, the standard of ALN evaluation in invasive breast cancer patients is SLNB, and a meta-analysis showed that SLNB had a FN rate of around 8.61% (95% CI 8.05–9.2%) [38]. That means a cutoff point of NPV ≥ 90% of MRI-ALN-NPV should be the minimal requirement of the “threshold” of non-invasive ALN imaging modality to be considered the “alternative” choice compared to the current “standard of care”-SLNB. We found that by combining pre-operative clinicopathologic factors and intrinsic subtypes, MRI could identify some groups of patients with low risks of ALN metastasis, high NPVs, and low burdens of axillary disease even in FN cases (Table 4). Through the current study, we showed the possibility of improving the diagnostic accuracy (NPV) of non-invasive imaging modalities, like MRI, in a stepwise way. Recently, novel techniques such as radiomics [44] and deep-learning methods [45] showed promising results on ALN prediction, with accuracy up to 0.970. The performance of these models may be superior to radiologists, and the algorithms may become non-invasive biomarkers that contribute to the advancement of personalized medicine. However, the generalizability of deploying these AI models is still challenging.

Our present study was limited in its retrospective nature and in a single institution where all the studies were scanned on a 3 T MRI scanner. Second, we excluded patients who received neoadjuvant chemotherapy in which direct comparison of pretreatment image findings, and final nodal pathology was impossible. This may be the reason that the number of patients in TNBC and HER-2 groups was relatively small. Besides, since most suspicious nodal-positive cases would go on neoadjuvant chemotherapy nowadays, excluding NAC may have a gradually negative impact on specificity. However, it is important to note that these results are based on 2473 patients, which featured the largest number of cases reported from a single institute, with complete pre-operative breast MRI evaluation and detailed post-operative breast cancer and lymph node pathologic information. This enabled us to perform comprehensive intrinsic subtype analysis and showed stepwise improvement of ALN metastasis MRI NPV by combining clinicopathologic and imaging factors. The information and evidence derived from the current study may provide a potential for non-invasive imaging evaluation of ALN.

Conclusion

In the current study, we demonstrated that MRI prediction of ALN metastasis differed according to different intrinsic subtypes. By combining IHC biomarkers (intrinsic subtypes) and anatomical features (tumor size), the NPV of ALN by MRI could reach more than 90% in some groups of patients. Patients with HER-2 with pre-operative MRI tumor ≤ 2 cm or TNBC breast cancer with pre-operative MRI tumor ≤ 2 cm was found to have MRI-ALN-NPV of 90.1%, and 92.3%, respectively, which apparently increased the NPV of 77.7% of MRI in the general population. By utilizing these stepwise methods, the study demonstrated that breast MRI's NPV could be enhanced, thus, making MRI a more potent non-invasive and alternative approach for pre-operative evaluation of ALN burden.