Background

Evaluating the status of axillary lymph nodes (ALNs) is crucial in staging, deciding the treatment planning, and predicting the long-term survival in breast cancer [13]. Biopsy is recognized as the gold standard for assessing ALNs. However, the drawbacks of biopsy are high false negative ratio result from sample errors and its invasiveness [4]. The imaging modalities for assessing the ALNs are rapidly evolving. Ultrasound (US) is applied widely for its convincing and dynamic observation. However, the sensitivity and specificity of ultrasound for lymph node metastasis were unreliable and controversial [5, 6]. Owing to radiation and relative lower diagnostic accuracy, computer tomography (CT) is limited in clinic [7]. Positron emission tomography (PET) and positron emission tomography/computed tomography (PET/CT) can reflect metabolism of glycolytic activity. Undoubtedly, they have shown the higher diagnostic significance in assessing distant metastases and regional metastatic ALNs [8], but their high radiation and expensive fee keep the common people away. Ultrasmall super paramagnetic iron oxide (USPIO) is the same. Its satisfactory performance was extinguished by drawbacks of time-consuming, underlying risk and forbidden in clinical practice [9].

Magnetic resonance imaging (MRI) is developing with an unimaginable speed; over the past years, it has been used to evaluate ALNs [10]. MRI can detect deep and contralateral lymph nodes. Its sensitivity and specificity for metastatic ALNs were higher than US and CT [11]. Diffusion-weighted imaging (DWI) as an advanced technology of MRI with its superior ability to apparent the diffusion of water molecules freely in tissues [12] has also been useful for diagnosing lymph nodes in the axillary and other sites. Apparent diffusion coefficient (ADC) can quantify the diffusion of water molecules; ADC value and ADC ratio were applied to distinguish metastatic from nonmetastatic lymph node widespread [13].

The role of DWI in diagnosing the metastatic ALNs can be found in several published studies; however, the existing results in these studies were enormously varied, and a comprehensive systematic review would be useful to synthesize the current available information. The aim of this meta-analysis was to assess the clinical value of DWI in detecting ALN metastases in patients with breast cancer.

Methods

Literature search

We searched studies about the diffusion-weighted imaging in diagnosing lymph node metastases in the PubMed, EMBASE, and Cochrane library until May 2015 following the key words “diffusion-weighted,” “diffusion weighted imaging,” “diffusion weighted magnetic resonance imaging,” “DWI,” “lymph nodes,” “axillary,” “breast cancer,” “breast carcinoma,” “ductal cancer,” “tubular cancer,” and “medullar cancer”.

Inclusion and exclusion criteria

We reviewed studies for the following inclusion criteria: (1) studies were published in English, (2) DWI was performed in detecting ALNs with breast cancer, (3) histopathological results were used as the reference standard, and (4) the true positive (TP), false positive (FP), true negative (TN), and false negative (FN) values can be calculated with sufficient data. We excluded the studies with the following criteria: (1) lack the explanations of DWI-detected ALNs with breast cancer; (2) without histopathological reference standard; (3) insufficient data to get the TP, FP, TN, and FN values; (4) experimental subject was an animal and ex vivo; (5) the type of study was review, case report, letter to editor, and meta-analysis; and (6) unable to get the full text.

Data extraction and quality assessment

We extracted the following items in each study: author, nation, publication year, sample size, b value, field strength, the cutoff of ADC value, the mean ADC value of the metastatic and nonmetastatic ALNs, study design, and TP, FP, TN, and FN values.

Assessing studies through QUADAS [14], the total score of the included studies must be greater than 10 points or equal to 10 points.

Statistical analysis

We applied Meta-DiSc version 1.4, Stata 12.0, SPSS 19.0, to analyze data.

In order to assess the clinical significance of DWI, we calculated sensitivity, specificity, positive and negative likelihood ratios (PLR, NLR), and pretest probability and performed a summary receiver operating characteristic (SROC) curve based on the collected studies of TP, FP, TN, and FN values.

Heterogeneity can affect the accuracy of estimation, and I 2 index was used to evaluate heterogeneity. There was existing heterogeneity when P value was less than 0.05 and/or I 2 value was over 50 % [15]. In diagnostic meta-analysis, the threshold effect was considered as an important source of heterogeneity. We used Spearman correlation coefficient in Meta-DiSc version 1.4 to check it. If threshold effect existed in our analysis, we summarized receiver operating characteristic (ROC) curve and calculated the area under the curve (AUC) directly. If no threshold effect existed in our analysis, after calculating the SROC and AUC, we also used meta-regression and subgroup analysis to explore the source of heterogeneity in Meta-DiSc version 1.4. We used Deeks’ funnel plot in Stata 12.0 to assess publication bias [16].

Owing to the different mean ADC values in metastatic and nonmetastatic ALNs, we used Student’s t test in SPSS 19.0 to compare the different mean ADC values of different status ALNs.

Results

Selection of studies

We total yielded 38 primary studies after searching the PubMed, EMBASE, and Cochrane library. We excluded 17 studies after reading the title. From the remaining 21 studies, we excluded 11 studies after reading abstract or full text and included 10 studies [13, 1725]. The detail of selection can be shown in Fig. 1.

Fig. 1
figure 1

Flow chart: selection process of the studies

Data extraction and study assessment

This study included a total of 801 patients and 2305 ALNs of the included 10 studies in this meta-analysis. The detail characteristic data can be seen in Table 1. All the selected studies were conducted with the QUADAS test, and their scores were greater than 10 points or equal to 10 points (Table 2).

Table 1 Characteristics of inclusion articles
Table 2 Results of the evaluation of each study according to QUADAS-2

Heterogeneity test

We evaluated heterogeneity through I 2 index in Stata 12.0. We found P < 0.05 and I 2 = 8.7 % indicating there was existing heterogeneity in the included studies. We used Spearman correlation coefficient in Meta-DiSc version 1.4 to check threshold effect. We found that the value of Spearman correlation coefficient was −0.064 and P = 0.852. We can confirm that threshold effect was not the source of heterogeneity in this meta-analysis.

In order to assess the source of heterogeneity further, we applied meta-regression (Table 3) and subgroup analysis by adding nation, design, b value, field strength, and coil (Table 4). Meta-regression and subgroup analysis showed that the source of heterogeneity cannot be found in these factors (P > 0.05). However, among these studies, the heterogeneity was highly significant (I 2 > 50 %). Thus, we should interpret the results carefully.

Table 3 Result of meta-regression analysis

Diagnostic performance of DWI and publication bias

The sensitivity, specificity, PLR, NLR, and the AUC of DWI were 0.89 (95 % CI 0.79–0.95), 0.83 (95 % CI 0.71–0.91), 3.86 (95 % CI 2.75–5.41), 0.17 (95 % CI 0.09–0.32), and 0.93 (95 % CI 0.90–0.95), respectively (Figs. 2, 3, and 4). Figure 5 shows that the post probability positive (PPP) and post probability negative (PPN) were 86 and 13 %, respectively, when the pretest probabilities were defined as 53 and 54 %.

Fig. 2
figure 2

Forest plots of sensitivity and specificity with corresponding 95 % CIs of 18 studies. The pooled sensitivity and specificity were 0.89 (95 % CI 0.79–0.95) and 0.83 (95 % CI 0.71–0.91), respectively

Fig. 3
figure 3

Scattergram of the PLR and NLR. Pooled estimates for the DWI were as follows: PLR was 3.86 (95 % CI 2.75–5.41), and NLR was 0.17 (95 % CI 0.09–0.32). LLQ left lower quadrant, LRN negative likelihood ratio, LRP positive likelihood ratio, LUQ left upper quadrant, RLQ right lower quadrant, RUQ right upper quadrant

Fig. 4
figure 4

SROC curve for the diagnostic performance of DWI for all 10 studies combined. The pooled ROC with corresponding 95 % CI was 0.93 (95 % CI 0.90–0.95)

Fig. 5
figure 5

Posttest probabilities were calculated in Fagan’s nomograms by using different pretest probabilities of ALN metastases. The left, middle, and right parts represented the 53 and 54 % pretest probabilities

As presented in Fig. 6, Deeks’ funnel plot indicated that we found no significant publication bias among the studies (P = 0.759).

Fig. 6
figure 6

Deeks’ funnel plot indicated no publication bias (P = 0.759)

The results of ADC values

We compared the mean ADC values of metastatic and benign ALNs by Student’s t test. Figure 7 shows that the mean ADC value of metastatic ALNs was significantly lower than that of benign ALNs (P < 0.000).

Fig. 7
figure 7

Box plot showed the comparison of the mean ADC value of metastatic and nonmetastatic ALNs. The mean ADC value of metastatic ALNs was significantly lower than that of the nonmetastatic ones (P < 0.000)

Discussion

Diffusion-weighted MR is a noninvasive technique to reflect functional information of tissues and is widely used in diagnosing the malignant lesions. Recently, detecting breast cancer and metastatic ALNs by DWI is a hot spot. We desire to assess the clinical significance of DWI in diagnosing metastatic ALNs in patients with breast cancer through systemic review. We selected 10 studies following the included criteria. We got the high sensitivity, specificity, and AUC as 89, 83, and 93 %, respectively. The results demonstrated that DWI was a promising method to differentiate diagnosis metastatic from nonmetastatic ALNs.

We also calculated the PLR and NLR to assess the performance of DWI. PLR and NLR are recognized as the better parameters to test the diagnostic accuracy. They can interpret one diagnostic tool through probability. We can rule in a disease when the PLR > 10 and rule out a disease when the NLR < 0.1 [26]. The PLR and NLR were 3.86 (95 % CI 2.75–5.41) and 0.17 (95 % CI 0.09–0.32), respectively. The results indicated that DWI could not rule in or rule out metastatic ALNs. Combined with the PLR and NLR, DWI had an inferior capacity to exclude or confirm metastatic ALNs. Fagan’s nomogram was also used to estimate the probability of having a disease in patients [27]. In this meta-analysis, the pretest probabilities of a patient with suspected metastatic ALNs were 53 and 54 %, and the posttest probability positive and posttest probability negative were 68 and 13 %, respectively, after likelihood ratio. The results meant that the pretest probabilities of 53 and 54 % were the cutoff for DWI-diagnosed metastatic ALNs. We can confirm the metastatic ALNs in patient when the pretest probabilities were higher than 53 and 54 % and excluded the metastatic ALNs in patient when the pretest probabilities were lower than 53 and 54 %. Therefore, the performance of DWI in differential diagnosis between metastatic and benign ALNs needs to be lucubrated.

ADC derived from DWI can determine the malignant lesions quantitatively and excluded the effect of T2 shine through [28]. Brownian motion of water molecules in cells can be quantitatively reflected. Wang et al. [29] found that the density of ALNs had significant positive correlation with metastatic lymph nodes and had significant negative correlation with ADC value. It meant that the diffusion of metastatic ALNs was higher than that of the benign ones and the ADC value of metastatic ALNs was lower than that of the benign ones. In Fig. 7, we found that the mean ADC value of metastatic ALNs was lower than that of the benign ones (P < 0.000). However, Xue et al. [30] and Roy et al. [31] got the opposite results. Actually, owing to the different selected ROI [32], liquefied active necrosis [33], and inflammatory and fibrous connective tissue proliferation [29], we acknowledged that certain overlap existed in metastatic and benign lymph nodes. In order to avoid the overlap, applying ratio of lymph node ADC value to primary tumor ADC value [34] and relative ADC values [35] may be a better method for detecting lymph node metastasis. Therefore, the clinical significance of different kinds of ADC values has more perspective. In this meta-analysis, we also found that the cutoff of diagnosing metastatic ALNs were variable and had no statistical significance.

Interestingly, in previous animal research [29], they found that the effect of perfusion can be ignored with b value ≥1000 and the diagnostic performance was better than that of the low b value. Previous study also demonstrated that with higher b value, the more accuracy the ADC had in other tumor [36]. Combined with professional knowledge, b value may be one of the sources of heterogeneity. In selected studies, the diagnosis performance with high b value was higher than with the lower ones (Table 4). However, with a low b value, owing to the perfusion effect, water diffusion motion cannot be reflected accurately. Moreover, the risk of distortion and susceptibility artifacts existing in DWI with high b value also cannot be ignored [37]. Different b values can affect the measurement of ADC and cutoff of ADC. It is necessary to get optimal b value for us to assess the statues of ALNs. Unifying the parameters of scan, cutoff of ADC, and optimal b value and improving spatial resolution were the direction of DWI in diagnosing ALN metastases in the future.

Table 4 The results of subgroup analysis

We should acknowledge the limitations of this meta-analysis. First, with the limited number of selected studies, we cannot find the source of heterogeneity. Second, the potential publication bias could not be ignored, although our result showed no significant publication bias. Third, lacking enough high-quality prospective studies may uncover the ability of DWI detecting the metastatic ALNs.

In conclusion, DWI is a promising diagnostic method for differentiation between benign and metastatic ALNs. ADC value can quantitatively analyse ALNs. Larger number of high-quality prospective studies regarding DWI and ADC to detect ALN metastases still need to be performed.