1 Introduction

Small renal masses (SRMs), defined as renal tumors less than 4 cm in diameter, are less likely to cause clinical symptoms such as hematuria, low back pain, or palpable masses [1]. The three most common subtypes of malignant renal cell carcinoma (RCC) are clear cell RCC (ccRCC), papillary RCC (pRCC), and chromophobe RCC (chRCC) [2]. Additionally, the Fuhrman classification grading system divides RCCs into either high- or low-grade tumors based on histocytology. In clinical practice, the subtype and histopathologic grade should be taken into consideration when predicting the prognosis of RCC.

In recent years, SRMs have increasingly been incidentally discovered through imaging examinations [3]. However, certain benign tumors, such as angiomyolipoma (AML) and oncocytoma, may display some similar imaging characteristics, making them difficult to differentiate from RCC [1, 4, 5] and thus making clinicians hesitant to remove SRMs surgically or to actively surveil SRMs.

Renal tumor biopsy (RTB) can provide a pathological diagnosis for uncertain SRMs. Nevertheless, biopsy is not a reliable method for tumor grading due to the possible underestimation of a simple specimen [6]. Additionally, RTB is an invasive procedure with possible complications, such as hemorrhage, infection, pneumothorax, and hollow organ perforation. In the diagnosis or selection of treatment for these SRMs, novel imaging modalities are still useful first tools [7].

Computed tomography (CT) and magnetic resonance imaging (MRI) are the first-line noninvasive imaging methods for both the diagnosis and staging of SRMs. MRI is considered a better tool than CT because there is no radiation exposure to the patient, and diffusion images can also provide extra information in patients with poor renal function for whom contrast medium administration is unsuitable [8]. Multiple MRI parameters can show the intratumoral content of SRMs. For example, T2-weighted, T1-weighted, and chemical shift imaging can reveal manifestations such as hemorrhage, protein, macroscopic fat, or microscopic fat within the tumor; the postcontrast series of dynamic contrast-enhanced (DCE) images can demonstrate the enhancement pattern of solid tumors, while diffusion-weighted imaging (DWI) with apparent diffusion coefficient (ADC) values can illustrate the cellular microenvironment of the tumor by assessing the movement restrictions of free water molecules. These imaging features may be useful in differentiating malignant RCC from benign lesions and classifying RCC subtypes and histological grades [1].

Multiparametric MRI has been reported to be helpful in differentiating malignant RCCs from benign lesions in several research studies [9,10,11,12]. However, summaries of the diagnostic values of such parameters have seldom been reported. Therefore, we focused on four pathological subtypes (ccRCC, pRCC, AML, and other types of RCC) and analyzed the MRI sequences reported to be useful in diagnosing high- and low-grade RCCs as well as benign renal tumors in previous studies.

2 Methods

2.1 Subjects

This retrospective study was approved by our Institutional Review Board. By reviewing the medical records from a medical center in Taiwan, this retrospective study identified patients who had undergone surgical resection of a renal mass measuring less than 4 cm from 2014 to 2019 who also had preoperative multiparametric MRI data of the kidney available for analysis. Patients with renal masses larger than 4 cm who had a pathologic diagnosis of urothelial carcinoma or renal metastasis were not included. Sixty-eight patients with a total of 80 SRMs were initially selected. Six patients with benign renal cysts and two with benign inflammatory disease were excluded because our study aimed to analyze tumor masses.

Finally, 60 adult patients with a total of 72 SRMs were included: 58 (80.6%) with pathologically confirmed RCC, including 38 (52.8%) ccRCCs, 9 (12.5%) pRCCs, 3 (4.2%) chromophobe RCCs (chRCCs), and 8 (11.1%) other RCC types (4 tuberous sclerosis-associated RCCs, 1 unclassified RCC, 1 Xp11 translocation RCC, and 2 acquired cystic disease-associated RCCs). Additionally, there were 14 SRMs (19.4%) considered benign renal tumors: 12 (16.7%) AMLs (1 AML with an epithelioid component) and 2 (2.8%) oncocytomas. Due to the small number of chRCC (n = 3) and oncocytoma (n = 2) patients, we did not include these two tumor types in our tumor subtype analysis. We divided the SRMs into four subgroups for comparisons: ccRCC, pRCC, other types of RCC, and AML.

2.2 Image Acquisition

All patients with SRMs underwent MRI using both conventional MRI and DWI sequences. DCE MRI was administered if the patient’s renal function allowed. MRI was performed with a 1.5-T body scanner (MAGNETOM AERA, Siemens, Erlangen, Germany) using an 18-channel phased-array body coil with the following protocols:

  1. (1)

    Axial T2-weighted half-Fourier acquisition single-shot turbo spin–echo (HASTE): repetition time (TR) = 1200 ms, echo time (TE) = 95 ms, flip angle = 160°, field of view = 36 × 30 cm, matrix = 320 × 182, slice thickness = 5 mm, and gap = 0 mm.

  2. (2)

    Axial T1-weighted fast spoiled gradient-recalled echo dual-echo (FSPGR-DE): TR = 142 ms, TE = 2.38 ms and 4.76 ms, flip angle = 70°, field of view = 36 × 30 cm, and matrix = 256 × 166.

  3. (3)

    Axial DWI: TR = 3300 ms, TE = 65 ms, field of view = 38 × 31 cm, matrix = 192 × 125, number of excitations (NEX) = 3, bandwidth = 1736 Hz/pixel, diffusion mode = 3-scan trace, slice thickness = 5 mm, and gap = 0 mm with b values = 0 and 800 s/mm2 [13,14,15]. DWI was performed by using a single-shot spin–echo echo-planar imaging sequence with a parallel imaging technique and fat saturation.

  4. (4)

    Dynamic MRI was performed in multiple breath-holds (BHs) using axial 3D fat-suppressed T1-weighted volumetric interpolated BH (VIBE) sequences: TR = 4.5 ms, TE = 2.2 ms, flip angle = 15°, field of view = 38 × 38 cm, matrix = 320 × 192, slice thickness = 5 mm, and gap = 0 mm.

A gadolinium-based contrast medium was used for the dynamic scan at a dose of 0.1 mmol/kg of body weight as a bolus injection. Corticomedullary and late nephrographic phase images were acquired 30 and 90 s after the contrast medium injection, respectively.

These image parameters remained constant from 2014 to 2019; only the set of b values changed from four b values to seven b values (both sets including b values of 0 and 800 s/mm2), and this had no effect on the ADC calculation.

2.3 Image Interpretation and Definition of Features

The data of different MRI sequences, including the T2-weighted image signal intensity, intravoxel fat, bulk fat, DCE enhancement pattern, AWI, and ADC values, were recorded. There were no postprocessing changes to image signal intensity in this study. The definition of each MRI feature with its respective grading scale is summarized in Table 1. These images were reviewed and measured by a genitourinary radiologist with five years of experience. Qualitative results were determined by careful visual comparison by a professional genitourinary radiologist; quantitative results, such as the AWI and ADC, were calculated through ratios. A region of interest (ROI) was placed within a portion of the solid area where the minimum ADC value was identified through visual inspection of the ADC map. The necrotic and hemorrhagic regions were identified on conventional MRI scans and were avoided.

Table 1 Definition of the imaging features and their grading scales

2.4 Statistical Analysis

First, we used the Kolmogorov–Smirnov test to evaluate the normality of the continuous variables, including age, tumor size, arterial wash-in index (AWI), and ADC value. These samples were standardized and compared with a standard normal distribution to determine whether the parametric or nonparametric method should be applied for data analysis. One-way ANOVA (parametric method) and the Kruskal–Wallis test (nonparametric method) were performed to compare the parameters of age, tumor size, AWI, and ADC value among the different renal tumor subtypes. A Dunn–Bonferroni post hoc test was also applied to further select pairwise comparisons. The χ2 test was used for the comparison of categorical variables, such as conventional MRI features, sex, and enhancement patterns.

The t-test (parametric method) and Mann–Whitney U test (nonparametric method) were applied to compare the parameters of AWI and ADC values between low-grade and high-grade malignant tumors. A box-and-whisker plot was created to compare the AWI and ADC values of benign renal lesions and RCCs of different histological subtypes.

SPSS 22.0 software was used for data analysis. Both the AWI and ADC values are expressed as the mean ± SD. The results were considered statistically significant when the p value was < 0.05.

3 Results

3.1 Comparisons Among Different Tumor Subtypes

We compared the imaging and basic features of the four subtypes of renal tumors, ccRCC, AML, pRCC, and other types of RCC. There were significant differences among the four groups of tumors in terms of age, AWI, ADC, size, T2-weighted signal intensity, intravoxel fat, bulk fat, and enhancement pattern (p = 0.033, < 0.001, < 0.001, 0.012, < 0.001, < 0.001, < 0.001, and < 0.001, respectively). There were no significant differences among these four subtypes regarding sex (p = 0.056) and grade (p = 0.426). Finally, the pairwise comparisons that showed the significant differences in the above parameters among the groups of tumors are listed in Tables 2 and 3.

Table 2 Comparison of different MRI parameters between each group of tumors (continuous variables)
Table 3 Comparison of different MRI parameters between each group of tumors (categorical variables)

In the comparison of ccRCC and AML, ccRCC exhibited a higher T2-weighted signal intensity (high: 81.6% vs. 16.7%, and low: 5.3% vs. 66.7%) (p < 0.001); less intravoxel fat (present: 10.5% vs. 75.0%, and absent: 89.5% vs. 25.0%, p < 0.001); less bulk fat (present: 0.0% vs. 50.0%, and absent: 100.0% vs. 50.0%, p < 0.001); more type 2 and 3 enhancement patterns (types 2 and 3: 100.0% vs. 83.3%, p = 0.023); a higher AWI (260.9 ± 116.7 vs. 165.4 ± 56.5, p = 0.015); and a higher ADC value (1.907 ± 0.336 vs. 1.191 ± 0.180, p < 0.001).

In the comparison of ccRCC and pRCC, ccRCC exhibited a higher T2-weighted signal intensity (high: 81.6% vs. 22.2%, and low: 5.3% vs. 77.8%, p < 0.001); more type 3 enhancement patterns (type 3: 75.0% vs. 0.0%, and type 1: 0.0% vs. 60.0%, p < 0.001); a higher AWI (260.9 ± 116.7 vs. 30.5 ± 13.7, p < 0.001); and a higher ADC value (1.907 ± 0.336 vs. 0.986 ± 0.202, p < 0.001).

In the comparison of ccRCC and other types of RCC, ccRCC exhibited a higher T2-weighted signal intensity (high: 81.6% vs. 37.5%, and low: 5.3% vs. 37.5%, p = 0.026) and larger size (2.8 ± 1.0 vs. 1.5 ± 0.6, p = 0.012).

In the comparison of AML and pRCC, AML exhibited more intravoxel fat (present: 75.0% vs. 0.0%, and absent: 25.0% vs. 100.0%, p = 0.003) and more type 3 enhancement patterns (type 3: 83.3% vs. 0.0%, and type 1: 16.7% vs. 60.0%, p = 0.010).

In the comparison of AML and other types of RCC, AML exhibited more intravoxel fat (present: 75.0% vs. 0.0%, and absent: 25.0% vs. 100.0%, p = 0.003).

In the comparison of pRCC and other types of RCC, pRCC exhibited more type 1 enhancement patterns (type 3: 0.0% vs. 83.3%, and type 1: 60.0% vs. 0.0%, p = 0.023) and a lower AWI (30.5 ± 13.7 vs. 225.2 ± 49.8, p = 0.014).

Eight patients (12%) with poor renal function did not undergo contrast-enhanced imaging, limiting the data collected for determination of the AWI and DCE values. However, since these MRI parameters are independent factors, these missing values did not affect the remaining parameters.

Box-and-whisker plots of the AWI and ADC values of ccRCC, pRCC, other types of RCC, and AML are shown in Fig. 1.

Fig. 1
figure 1

a Box-and-whisker plot of the AWIs of SRMs, and b box-and-whisker plot of the ADC values of SRMs [the boxes indicate the interquartile range, the whiskers indicate the range of all values, and the horizontal line within the box indicates the median value, confidence interval (CI)—95%]

Figures 2, 3 and 4 demonstrate the typical image features of the major types of SRMs. Figure 2 illustrates the MRI images of a Fuhrman grade II ccRCC. The images show that the T2-weighted signal intensity of the tumor is higher than that of the normal renal cortex. There is no obvious signal dropout on the fat-suppressed images compared with nonfat-suppressed T2-weighted images, which indicates that there is no detectable bulk fat. Additionally, there is no obvious signal dropout on the opposed-phase images compared with in-phase images, which indicates that there is no detectable intravoxel fat; the AWI is 244.16, with the presence of a plateau pattern and an ADC value = 1.911 × 10−3 mm2/s.

Fig. 2
figure 2

MRI scans of ccRCC. Note T2W T2-weighted signal intensity, FS fat-suppressed, IP in-phase images, OP opposed-phase images, ADC apparent diffusion coefficient

Fig. 3
figure 3

MRI scans of AML. Note T2W T2-weighted signal intensity, FS fat-suppressed, IP in-phase images, OP opposed-phase images, ADC apparent diffusion coefficient

Fig. 4
figure 4

MRI scans of pRCC. Note T2W T2-weighted signal intensity, FS fat-suppressed, IP in-phase images, OP opposed-phase images, ADC apparent diffusion coefficient

Figure 3 illustrates the MRI images of a benign AML. The images show that the T2-weighted signal intensity of the tumor is lower than that of the normal renal cortex. There is a detectable signal dropout on the opposed-phase images compared with that on the in-phase images, which indicates the presence of intravoxel fat. The AWI was 150.00, with the presence of a washout enhancement pattern and an ADC value = 1.201 × 10−3 mm2/s.

Figure 4 illustrates the MRI images of a Fuhrman grade III pRCC. The images show that the T2-weighted signal intensity of the tumor is lower than that of the normal renal cortex. There was no bulk or intravoxel fat component detected in fat-suppressed images or opposed-phase sequences. The AWI was 28.10, with the presence of a progressive enhancement pattern and an ADC value = 0.896 × 10−3 mm2/s.

3.2 Tumor Grade Comparison

The study also analyzed differences in imaging and basic features between low-grade and high-grade RCCs, and these results are shown in Table 4. Compared with patients with low-grade ccRCCs, those with high-grade ccRCCs were significantly older (p = 0.024), and the ADC value was lower for high-grade ccRCCs (p < 0.001). Compared with patients with any subtype of RCC (ccRCC, pRCC, chRCC, and other type RCCs) considered low grade, those with high-grade RCCs were also older (p = 0.015), and the ADC value was lower for high-grade RCCs (p = 0.001). There were no significant differences between low-grade and high-grade ccRCCs or among all subtypes of RCCs in the parameters of sex, AWI, size, T2-weighted signal intensity, intravoxel fat or enhancement pattern.

Table 4 Comparison of imaging and basic features between low-grade and high-grade ccRCCs and among all subtypes of RCCs

The box-and-whisker plot of ADC values between low-grade and high-grade ccRCCs and the corresponding data for all subtypes of RCC are shown in Fig. 5.

Fig. 5
figure 5

a Box-and-whisker plot of the ADC values of low-grade and high-grade ccRCCs, and b box-and-whisker plot of the ADC values of low-grade and high-grade RCCs of any subtype [the boxes indicate the interquartile range, the whiskers indicate the range of all values, and the horizontal line within the box indicates the median value, confidence interval (CI)—95%]

4 Discussion

In our study, we found that ccRCC showed a higher T2-weighted signal intensity than AML, pRCC, and other types of RCC. This result is in agreement with those from both Kay et al. [3] and Roy et al. [16], suggesting that pRCC typically has a low signal intensity in T2-weighted images and that ccRCC mostly displays a higher signal intensity on T2-weighted images than pRCC, chRCC, and lipid-poor AML.

In essence, AMLs are benign tumors, and regular follow-up imaging is recommended over surgical resection. However, all of the AMLs in our study were pathologically diagnosed through surgery because these were all lipid-poor AMLs that were difficult to distinguish from malignant tumors simply through imaging findings. In our study, bulk fat was found only in AMLs (6 in 12 AMLs), while intravoxel fat was found in both AMLs (9 in 12) and ccRCCs (4 in 34). These findings indicate that the presence of microscopic or macroscopic fat is still the key factor in differentiating AMLs from RCCs. A previous study found that the presence of intravoxel fat may indicate that the tumor is ccRCC rather than another subtype of RCC [17]. Consistently, none of the other subtypes of RCC were found to have an intratumoral microscopic fat component in our study.

On contrast-enhanced images, the AWI is a quantitative parameter, and the DCE pattern is a qualitative parameter. In our research, ccRCC exhibited a higher AWI than both AML and pRCC, which is in agreement with the findings of Sun et al. [2] and Kay et al. [3] in that there is higher contrast enhancement in ccRCCs than in pRCCs. chRCC had an intermediate signal intensity between that of ccRCC and pRCC. Cornelis et al. [18] also showed that the AWI tended to be lower for pRCCs than for lipid-poor AMLs; however, our study did not observe statistically significant differences between these two tumors. All ccRCCs in our study exhibited a plateau- or washout-type enhancement pattern. These results indicate that the enhancement pattern can play some type of role in differentiating among these tumors. However, this is in contrast to Kay et al. [3] who found nonsignificant differences in enhancement pattern among these tumors. Therefore, further studies on the diagnostic value of the DCE pattern may be needed.

Sandrasegaran et al. [15], Zhang et al. [19] and Lassel et al. [20] found that the ADC levels were significantly different between benign and malignant renal tumors. However, simply comparing the differences between benign and malignant renal tumors can easily lead to statistical errors due to the distribution of the common types of tumors in the population. Therefore, we classified each major tumor subtype separately using different characteristics so that the results will reflect a more accurate comparison. In our study, ccRCC exhibited a higher ADC level than both AML and pRCC. This result contradicted the conclusions of some previous articles, which suggested that benign renal tumors, including cysts, had a higher ADC value than RCCs [13,14,15, 19, 20]. In contrast, similar to in our study, Hötker et al. [21] separately measured the ADC values of AML, oncocytoma, ccRCC, pRCC, chRCC, and unclassified RCC and found that ccRCC had the highest ADC level in comparison to the other types of renal tumors. Nonetheless, larger and ideally prospective studies of different benign and malignant subtypes of SRMs are still needed to determine the exact ADC value threshold for differential diagnosis.

In our study, the ADC values of high-grade ccRCC and any high-grade RCC were significantly lower than those of low-grade ccRCC and any low-grade RCC, indicating that the ADC value is an independent imaging factor associated with tumor grade. Mytsyk et al. [13] observed a difference in the ADC values of different Fuhrman grades of ccRCC, with high-grade tumors presenting lower ADC values and low-grade lesions presenting higher ADC values, which is consistent with our study. High-grade malignant tumors have a higher cell density and stronger angiogenic activity than low-grade malignancies, so water diffusion is more restricted. This phenomenon can be seen not only in ccRCC but also in all RCCs.

In our study, the patients with high-grade ccRCC or any high-grade RCC were significantly older than those with low-grade ccRCC or any low-grade RCC. Mukhopadhyay et al. [22] concluded that patient age is an independent prognostic factor of RCCs. Further clarification of the relationships among patient age, Fuhrman nuclear grade, and tumor stage and their association with prognostic factors may be indicated in the future.

MRI has some disadvantages, such as limited accessibility, slow scanning speed, high cost, and unsuitability for claustrophobic patients. However, overall, MRI is considered a better tool than CT because its multiple parameters can provide more information regarding tumors than simple CT findings. The traditional MRI, DCE, and DWI sequences provide a wide range of opportunities to deeply explore the intrinsic characteristics of tumors.

There are several limitations in our study. First, CT is still the major presurgical imaging tool for SRMs surveyed in our hospital, and only patients considered difficult to diagnose undergo MRI, which resulted in an insufficient number of MRI cases. Second, most of the RCCs were ccRCC in our study, and the number of other subtypes of RCC was relatively small. This may make it difficult to statistically analyze different subtypes. Third, some patients with poor renal function were not imaged with a contrast agent, limiting the data collection of AWI and DCE values.

In conclusion, multiparametric MRI is useful in differentiating among common pathological types of SRMs. Additionally, ADC values may be helpful in evaluating the histological grade of RCC.