Background

Axillary nodal status is a cornerstone for cases of recently diagnosed breast cancer which is commonly evaluated by axillary sonography (US) along with sampling when suspicious [1]. It has been shown that complete nodal axillary dissection (ALND) for cases with limited nodal axillary involvement (1–3 nodes) is not useful [2]. Rather invasive techniques (such as nodal biopsy) carry a potential risk of complications and so a need for non-invasive axillary assessment has emerged [3].

Diffusion-weighted magnetic resonance imaging (DW-MRI) utilizes water molecular mobility (Brownian motion) to identify differences in the capacity of diffusion (and consequently signal differences) between malignant and benign tissues providing both functional and morphological data, where water molecule diffusivity and apparent diffusion coefficient (ADC) are markedly affected by lesion cellularity [4].

The sensitivity of diffusion is affected by altering the b value (gradient strength for diffusion) which is directly proportional to the amplitude and duration of the gradient, as well as the interval of time between gradients (measured in s/mm2). Free water molecules (high motion degree) will have lower signal at a lower b value (b = 50–100 s/mm2) (perfusion) while slower moving water commonly need higher b values (b = 800 s/mm2) [5].

Aim of work

We aimed at evaluating the added role of DW-MRI (to that of US) for diagnosis and accurate staging of the axillary nodal status in patients with recently diagnosed breast cancer, being a preoperative non-invasive tool.

Methods

In our prospective study, we included 77 axillary nodes in 77 female patients who presented to our institution during the period from February 2017 to April 2019 (age range 20–78 years, mean 50 ± 12.6 SD) who had suspicious lesions on sonomammography (SM) (scoring BIRADS 4/5). These patients presented to the General Surgery Department at our institution with variable complaints including inflammatory breast changes, breast or axillary lumps. We excluded patients scoring BIRADS 1, 2, or 3 on SM or those with general contraindications to magnetic resonance imaging (MRI) (such as pacemakers or cochlear implants). We also excluded those who had previous ALND, or those who received any treatment that may affect the outcome of the study. All patients provided informed consent and our institutional ethical approval was obtained. The most suspicious axillary lymph node on the same side of the primary breast lesion was included in the study. Criteria for the most suspicious lymph node were focal or marked cortical thickening, hilar fat loss, or decreased longitudinal to transverse dimension ratio (L/T ratio). If no suspicious nodes were found, we included the largest one in the study.

Patients underwent detailed history taking (including name, age, marital status, residence, history of previous illness, etc.), clinical breast examination, SM, and US-guided biopsy from any suspicious lesions, as well as breast dynamic DW-MRI.

For US, the study was conducted using a General Electric Medical Systems LOGIQ S8 or LOGIQ P US device that includes a multi-frequency linear probe operating at 7.5 to 12 MHz. Patients were examined in the classic oblique supine position with their ipsilateral arm abducted and externally rotated so that their hand is behind their head. The nodes of concern were focused upon where the protocol of scanning included real-time longitudinal and transverse imaging. Each node was examined for size (long and short axis in centimeters), shape (as indicated by its L/T ratio which if more than 2, the node is considered ovoid while if less than 2, it is considered non-ovoid), cortical thickening (if present whether it is diffuse or focal), and hilar fat (preserved or not). If more than one node was found in the same axilla, the most suspicious was analyzed. The most suspicious node on the ipsilateral side of the primary lesion was included in the study. Axillary nodes chosen as most suspicious to be included in the study were those having one or more of the following criteria on US: focal or marked diffuse cortical thickening, hilar fat effacement, or decreased longitudinal to transverse ratio of the node. If no suspicious nodes were found, the largest node on the same side of the primary lesion was included in the study.

Examined nodes were classified by US into three groups blind to the final pathology results:

  • Benign/non-specific: ovoid with central fatty hilum and uniform cortical thickness ≤ 3 mm.

  • Malignant/infiltrated: either a node with lost hilum or a node with focal/diffuse cortical thickening > 3 mm with eccentric hilum (peripheral).

  • Indeterminate: node showing cortical diffuse thickening > 3 mm with central hilum.

The technique of MRI and DWI

For MRI and diffusion-weighted imaging (DWI), images were acquired using Philips Intera or Achieva machines (1.5 Tesla) followed by post-processing.

For MRI acquisition, three-plane, fast single-shot localizers were initially obtained to localize and plan the sequences. Axial T1-weighted (TR = 537 ms, TE = 10 ms) and T2-weighted (TR = 4123 ms, TE = 120 ms) fast spin-echo (FSE) sequences were taken (the number of excitations: 2; the direction of frequency encoding: right to left (R/L); section thickness: 3 mm; gap: 0.5 mm; field of view (FOV): 36–40 cm; matrix: 288 × 224). Short tau inversion recovery (STIR)-weighted images were also acquired (TR = 4007 ms, TE = 70 ms) in the axial and sagittal planes (TR/TE, 1.6 ms/20 ms; number of excitations, 3; direction of frequency encoding: R/L; section thickness, 3 mm; gap, 1 mm; FOV, 36–40 cm; matrix, 288 × 224).

DWI was acquired in a transverse plane using single-shot echo-planar imaging with suppression of fat, using four b values; low (0–50 s/mm2), intermediate (200 and 400 s/mm2), and high b value (1000 s/mm2) in an attempt to get more accurate values of ADC, using the following parameters: 5000 ms/75 ms (TR/TE), slice thickness 3–9 mm with an interslice gap of 0–1.5 mm, and the number of excitations ranging from 1 to 10, 256 × 256 (matrix), 30 × 30 cm (FOV).

Qualitative assessment

Signal intensity in the DWI and the corresponding ADC map images were visually evaluated at intermediate and high b values. Diffusion restriction was indicated by intermediate or high SI on DWI that dropped on the corresponding ADC map.

Quantitative assessment

Mean ADC values were calculated by manually drawing regions of interest (ROI) in the most hypo-intense (most restricted) portion of the selected node on the ADC map so that the ROI is slightly smaller than the real node to avoid partial volume averaging. Multiple measurements were acquired to calculate mean ADC values. ADC maps were calculated by the MRI system via linear regression analysis of the natural log of signal intensity using all measured b values.

While going through the study, we were faced with inadequate axillary visualization in some cases using the standard breast MRI coil. This may be attributed to the uncomfortable prone positioning with breast coils especially with patient obesity resulting in noticeable motion artifact.

As a result, we tried image acquisition using a chest coil in the supine position with the upper limbs abducted. This allowed a wider FOV and better spatial resolution, resulting in better nodal imaging using FSE techniques due to high contrast, excellent suppression of fat, and relative motion insensitivity during acquisition.

US and DWI-MRI images were conjointly interpreted by two consultant radiologists, having more than 10 years of breast imaging experience. Finally, results of US and DW-MRI were correlated to those of final pathology which served as a standard of reference. All cases with suspicious axillary nodes underwent image-guided sampling. All those with no suspicious axillary nodes were subjected to intraoperative sentinel nodal sampling. All image-guided sampling from suspicious nodes was from level I axillary nodes. Although some patients had additional level II (17 cases) or level III (9 cases) nodal involvement, there were no skip lymphatic lesions and therefore level I nodes were sampled.

For statistical analysis, we used the Statistical Package for the Social Sciences (SPSS, version 25) to code the data which was summarized as mean, standard deviation (SD), median, minimum, and maximum for quantitative data and as frequency (count) and relative frequency (percentage) for categorical data. Quantitative variables were compared using the Mann-Whitney non-parametric test while categorical data was compared using the Chi-square (χ2) test. However, when the expected frequency was less than 5, Exact test was used. Standard diagnostic indices such as sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and diagnostic efficacy were calculated. Receiver operating characteristic curve (ROC) was constructed with the area under curve (AUC) analysis performed to determine best cutoff values of different parameters that detect infiltration. To predict infiltration by combining US and MRI, logistic regression was done, considering P values less than 0.05 as statistically significant.

Results

This prospective study included 77 axillary nodes from the ipsilateral sides of suspicious breast lesions from 77 patients. These nodes were evaluated by both conventional US and DW-MRI in an attempt to evaluate the added role of DW-MRI in axillary evaluation. The final pathologic diagnosis, which served as our standard of reference, proved that 47 out of the examined 77 nodes (61%) were infiltrated and 30/77 (39%) were not infiltrated.

US evaluation of axillary nodes

Axillary nodes were classified by US into three groups, namely the non-specific (benign), malignant, and indeterminate groups.

US axillary nodal evaluation according to nodal shape

Considering an ovoid shape as a benign descriptor and a non-ovoid shape as a malignant one, 45 out of the examined 77 nodes (58.4%) were ovoid and 32 (41.6%) were non-ovoid resulting in a calculated sensitivity, specificity, PPV, NPV, and accuracy of 61.7%, 90%, 90.6%, 60%, and 72.7%, respectively, having a significant P value of 0.009 (Table 1).

Table 1 Correlation of nodal shape by US to the final pathology

US axillary nodal evaluation according to nodal size

Axillary nodes were classified according to their size by US in long axis, short axis, and long/short axis ratio (Table 2).

Table 2 Axillary nodal classification by size according to US

The cut-off value for the nodal short axis (as calculated by ROC curve analysis) between benign and malignant nodes was 0.95 cm. For nodal long axis, we found no cut-off value as malignant and benign results overlapped and consequently, no statistical significance was found for the long axis in differentiating malignant from benign nodes (Fig. 1). For nodal long/short axis ratio, the calculated cut-off value between benign and malignant nodes was 1.9 (Table 3, Fig. 2).

Fig. 1
figure 1

ROC curve for detection of nodal infiltration using size in long and short axes

Table 3 Diagnostic indices for nodal short and long axial measurements by US
Fig. 2
figure 2

ROC curve for detection of nodal infiltration using long/short ratio

US axillary nodal evaluation according to nodal hilum

Considering a preserved hilum as a benign descriptor and a non-preserved one as a malignant descriptor, the calculated sensitivity, specificity, PPV, NPV, and accuracy were 72.3%, 100%, 100%, 69.7%, and 83.1%, respectively (Table 4).

Table 4 Correlation of nodal hilum by US to the final pathology

US axillary nodal evaluation according to cortical thickness

Nodes were classified according to their cortical thickness, where the calculated cut-off value for cortical thickness differentiating infiltrated and non-infiltrated nodes was 5 mm (as calculated by ROC curve analysis) (Fig. 3). For the pathologically proven infiltrated axillary nodes, the mean cortical thickness was 0.95, the SD was 0.61, the median was 0.7, the minimum was 0.36, and the maximum was 2.6. On the other hand, for the pathologically proven, non-infiltrated axillary nodes, the mean cortical thickness was 0.39, the standard deviation was 0.11, the median was 0.4, the minimum was 0.2, and the maximum was 0.6. Accordingly, the calculated sensitivity, specificity, PPV, NPV, and accuracy were 76.6%, 93.3%, 94.7%, 71.7%, and 83.1%, respectively with P value < 0.001.

Fig. 3
figure 3

ROC curve for detection of nodal infiltration using cortical thickness

Axillary nodal evaluation according to the studied combined US criteria

According to the combined studied US criteria (nodal shape, size, hilum, and cortical thickness), 11 out of the examined 77 nodes (14.3%) were reported by US as benign, 35 (45.5%) as malignant, and 31 (40.3%) as indeterminate.

Indeterminate nodes were added to the considered-positive ones, and all counted up as suspicious, resulting in a total of 66 out of the examined 77 nodes being considered by US as suspicious, out of which 19 were proved benign by pathology (false positive) and 47 were proved malignant (true positive). On the other hand, 11/77 nodes were diagnosed as benign by US and confirmed truly benign by pathology (true negative). There were no false negative results concluding a calculated sensitivity, specificity, accuracy, PPV, and NPV for US of 100%, 36.6%, 75.3%, 71.2%, and 100%, respectively (Table 5).

Table 5 Correlation of overall US results to those of final pathology

As for the US-reported indeterminate nodes (31 nodes), 19 turned out to be benign by pathology (false positive) and 12 were confirmed malignant (true positive).

DW-MRI evaluation of axillary nodes

Qualitative DW-MRI evaluation of axillary nodes (restricted or not)

Classification of nodes according to being restricted or not on DW-MRI and correlation with final pathology results was done, considering DWI restriction as a malignant descriptor and non-restriction as a benign one (Table 6). Accordingly, the calculated sensitivity, specificity, PPV, NPV, and accuracy were 76.6%, 63.3%, 76.6%, 63.3%, and 71.4% respectively with P value < 0.001.

Table 6 Distribution of lymph nodes by qualitative DW-MRI

ADC evaluation of axillary nodes

The calculated cut-off value differentiating infiltrated and non-infiltrated nodes for ADC was 0.95 × 10−3 mm2/s (according to ROC curve analysis, AUC = 0.754). Accordingly, the calculated sensitivity, specificity, PPV, NPV, and accuracy were 76.6%, 63.3%, 76.6%, 63.3%, and 71.4%, respectively with P value < 0.001 (Fig. 4). For the pathology-proven, infiltrated axillary nodes, the mean ADC value was 0.84, the SD was 0.28, the median was 0.7, the minimum was 0.5, and the maximum was 1.6. For the pathology-proven non-infiltrated nodes, the mean ADC value was 1.12, the SD was 0.32, the median was 1.1, the minimum was 0.6, and the maximum was 1.6.

Fig. 4
figure 4

ROC curve for detection of nodal infiltration using ADC

Performance of DW-MRI in the evaluation of the US-diagnosed indeterminate axillary nodes

US-diagnosed indeterminate nodes were evaluated by DW-MRI and classified as either benign or malignant. Out of the 31 US-diagnosed indeterminate nodes, 15 were reported by DW-MRI as malignant, out of which 7 turned out to be benign by pathology (false positive) and 8 were proved malignant (true positive). On the other hand, 16/31 nodes were reported by DW-MRI as benign, out of which 12 were confirmed by pathology to be benign (true negative) and 4 turned out to be malignant (false negative), resulting in a calculated sensitivity of 66.6%, a specificity of 63.1%, and a total accuracy of 64.5%. The PPV and NPV were 53.3%, and 75%, respectively, for DW-MRI evaluation of US-diagnosed indeterminate nodes (Table 7) (Figs. 5, 6, 7, and 8).

Table 7 Correlation of DW-MRI classification of US diagnosed indeterminate nodes to the final pathology
Fig. 5
figure 5

A 33-year-old female with left Paget’s disease and breast lump

Fig. 6:
figure 6

A 45-year-old female with a right breast lump

Fig. 7
figure 7

A 58-year-old lady with a palpable left breast lump

Fig. 8
figure 8

A 52-year-old female with a palpable right breast lump

Performance of DW-MRI in combination with conventional US in axillary nodal evaluation

Using DW-MRI in combination with conventional US improved the overall accuracy of axillary nodal evaluation, having a sensitivity, specificity, PPV, NPV, and accuracy of 76.60%, 76.67%, 83.72%, 67.65%, and 76.62%. So, adding DW-MRI could improve specificity (36.6%) and accuracy (75.3%) of US both to 76.6%.

Discussion

Axillary nodal metastasis is a cornerstone for the prognosis of cases with breast cancer, affecting their survival rate [6]. However, a definite diagnosis of axillary nodal affection needs tissue biopsy or surgical dissection which are rather invasive procedures and carry considerable complication risks [7]. Therefore, a need has developed for accurate, non-invasive, preoperative axillary imaging in an attempt to reduce complication risks and cut off costs by avoiding unnecessary sampling [8].

In the current study, we examined 77 axillary nodes in females with recently discovered malignant breast lesions, aiming at evaluating the role of DW-MRI, in comparison to US, in the assessment of the axillary nodal status in patients with newly discovered breast cancer, as an attempt to preoperatively detect axillary nodal metastasis, and consequently, reduce unnecessary invasive sampling and avoid axillary over or under-staging. US morphologically classified axillary nodes into benign (non-specific), malignant, and indeterminate groups, whereas DW-MRI functionally classified them into benign (non-specific) and malignant groups.

We found no correlation between patient age and axillary nodal affection (P value = 0.95) which is in accordance with other studies [9].

To calculate statistical indices for the performance of US, the US-diagnosed indeterminate group of nodes was added to the pathological group, all together forming a suspicious group. This resulted in calculated sensitivity, specificity, PPV, NPV, and accuracy for the combined US criteria of 100%, 36.67%, 71.21%, 100%, and 75.3%, respectively. This may justify the relatively high figure for sensitivity and NPV for US performance where any node having a non-classic feature was added to the suspicious group. On another note, the relatively lower figure for US specificity in our study may be related to the presence of a considerable number of US-diagnosed indeterminate nodes (31/77) (40%) where, ultimately, many of which turned out to be benign by pathology (19/31) (61%). US has rather variable reported values for sensitivity (ranging from 49 to 87%) and specificity (ranging from 56 to 97%) for metastatic axillary nodal detection [10].

In the current study, cortical thickness proved to be the most sensitive US criterion (76.6% sensitivity, 93.3% specificity) for the detection of metastatic axillary nodes as pathologically proven malignant nodes showed significantly higher maximal cortical thickness compared to benign ones, with a calculated cut-off value for cortical thickness that differentiates benign from malignant nodes of 5.2 mm, which is rather comparable to the cut-off value used in previous studies [11].

The results of the current study concluded that the mean short axial diameter of malignant nodes (1.25 ± 0.61 cm) was significantly higher than that of benign ones (0.84 ± 0.17 cm) (P value = 0.001). We also found that mean cortical thickness in malignant nodes (0.95 ± 0.61 cm) was significantly higher than that of benign ones (0.39 ± 0.11 mm) (P value = 0.001) and that non-preserved hila were significantly associated with malignant nodes (P value = 0.001) which goes in agreement with the results of other previous studies [3, 12].

We also concluded that the mean (L/T) ratio in malignant nodes (1.8 ± 0.43 mm) significantly differed from that of benign ones (2.13 ± 0.34 mm), having a P value of 0.001. This agrees with the results of several other previous studies [13, 14]. Yet, other investigators, such as Zaiton and his colleagues [15], reported that they found no statistically significant difference in mean short axial diameter or L/T ratio between malignant and benign nodes (P value: 0.87 and 0.82, respectively) which also agrees with the results of some other studies [3, 12].

We also concluded the statistically significant value (P value < 0.001) of hilar fat loss in cases of malignant axillary nodal involvement, which agrees with the results of other investigators [15, 16].

DW-MRI is a functional magnetic resonance imaging technique that needs no administration of contrast [15].

The results of our current study show that diffusion restriction in nodes was significantly associated with malignancy (P value = 0.001), having 76.6% sensitivity, 63.3% specificity, 71.4% accuracy, 76.6% PPV, and 63.3% NPV. This agrees with the results of some previous researchers, such as Scaranelo [17] and Abdelatif [3], who reported that differentiation of malignant and benign axillary nodes by visually detecting signal restriction on DWI and ADC maps showed a sensitivity of 84% and 100%, respectively, a specificity of 77% and 66.7%, respectively and an accuracy of 80% and 92.7%, respectively. It also goes in agreement with the results of other previous studies [15, 18, 19]. On the other hand, other researchers did not detect a statistically significant difference comparing DWI signal intensity between inflammatory and metastatic nodes [8].

Quantitative assessment of the diffusibility of the molecules of water in tissues was done via measuring ADC values where the greater the b value, the higher the degree of attenuation of the signal from water molecular motion [20]. This concept was used in some previous studies where rather high b values (1000 and 800 s/mm2) were used, assuming that a high b value could more precisely evaluate the diffusion of water by eliminating capillary perfusion effects [8, 15].

We utilized the same principle, where the obtained ADC values were significantly lower for nodes with metastasis than those without metastasis, yet, with an overlap between both groups. Similar results were published by some previous investigators [8, 18, 19] and an explanation for this overlap was postulated by Wang and his team [8] as they found that malignant nodes were not entirely replaced by malignant cells, yielding metastatic areas with lower ADC values and other areas without metastasis, having higher ADC values where a ROI of ADC can be heterogeneous, containing both metastatic and non-metastatic parts. On the other hand, non-metastatic nodes may show limited diffusibility of molecules of water (and consequently relatively lower ADC values) due to other associated conditions, such as infiltration by inflammatory cells or reactive hyperplasia.

There is a wide variation regarding both the mean and cutoff values of ADC between various studies which may be attributed to many factors such as MRI acquisition parameters [21].

In our current study, we found that the mean ADC values were generally lower for metastatic than for non-metastatic nodes, which goes in accordance with the results of various other previous studies, all showing rather comparable statistical indices [3, 15, 22,23,24]. Nevertheless, other studies reported that values of ADC for metastatic nodes were significantly higher than those for non-metastatic ones [25].

Comparing the results of the overall performance of US and DW-MRI in the current study concerning indeterminate nodes, which are those of most clinical concern, we found that DW-MRI had higher specificity (63.1%) than US (36.6%) while US had higher sensitivity (100%).

Limitations

One of the limiting factors of our current study was only including the most suspicious node, assuming that it correlated to the final histopathological results.

Another limiting point was being confronted by the suboptimal axillary visualization on using the standard breast MRI coil, possibly due to uncomfortable positioning and limitations related to patient obesity. On the other hand, on using a chest coil, we had patients in a more comfortable supine position with abducted arms and overcame the constraints of patient built. This gave way to better axillary imaging due to better spatial resolution and wider FOV using FSE technique which overcomes the problem of motion sensitivity.

One other limitation of the current study was adding the US-diagnosed pathological and indeterminate groups together, forming a suspicious group of nodes so that any node having non-classic US features was included in the suspicious group, which may justify the relatively higher figure for US sensitivity and lower specificity.

Conclusion

Accurate axillary assessment is important for proper management of cases with recently discovered breast cancer. Although US shows rather excellent sensitivity for axillary nodal visualization, yet specificity could be ameliorated by adding DW-MRI to it. This can be used to reduce the diagnosis of indeterminate nodes which eliminates grey zones and better helps tailor management options, avoiding either over or under staging of the axillary nodal status for each patient.