Introduction

The clear cell likelihood score (ccLS) is a five-category Likert scale (from 1 point = very unlikely to 5 points = very likely) designed to enhance the prediction of histopathological characteristics of small renal masses (SRMs) in multiparametric magnetic resonance imaging (MRI) studies (Fig. 1) [1,2,3]. The ccLS assessment is based on three major imaging features (T2 signal intensity, level of post-contrast enhancement in corticomedullary phase compared to renal cortex, and the presence of microscopic fat) and several ancillary parameters [1, 4]. The algorithm is designed to provide measures for noninvasive detection or exclusion of the most aggressive form of renal carcinoma (RCC) – clear cell subtype (ccRCC) [5]. Precise differentiation between histopathologic subtypes is especially consequential for management planning since many postcontrast-enhancing renal tumors, including some papillary and chromophobe RCCs, exhibit an indolent behavior, with slow growth and only minimal risk of metastases [6, 7].

Fig. 1
figure 1

The ccLS algorithm. Adapted from Pedrosa I, Cadeddu JA. How We Do It: Managing the Indeterminate Renal Mass with the MRI Clear Cell Likelihood Score. Radiology. 2022 Feb;302(2):256–269; with permission. The ccLS assessment is based on three major imaging features (T2 signal intensity, level of post-contrast enhancement in corticomedullary phase compared to renal cortex, and the presence of microscopic fat) and several ancillary parameters

The ccLS is currently an interesting topic that has been studied and commented on extensively in the literature recently [1,2,3,4,5, 7,8,9,10,11,12,13,14,15]. Canvasser et al. revealed moderate to good inter-reader reliability of the ccLS algorithm (κ = 0.53) [16]. Schieda et al. recently published a large cohort, cross-sectional study validating diagnostic performance of the ccLS algorithm showing moderate sensitivity and specificity for diagnosis of ccRCC with a threshold of ccLS ≥ 4 and high negative predictive value of ccLS ≤ 2 to exclude ccRCC [17]. Tian et al. noted that state-of-the-art studies demonstrated moderate sensitivity and specificity for the ccLS algorithm; nevertheless, the algorithm was effective in the context of clinical application [18]. Available data in this area predominantly is from retrospective, single-center studies which were conducted on a limited population (ranged from 57 to 241 patients). The existing literature has also highlighted certain limitations of the algorithm within specific diagnostic domains, e.g. the ccLS does not consider the papillary and chromophobe subtypes of RCC [18]. This suggests that some improvements should be considered in the future version of ccLS, e.g. adding parameters, which may indicate significant post-contrast enhancement, such as low T1 ratio or high arterial to delayed enhancement (ADER) [19, 20].

Therefore, we attempted to further validate the ccLS algorithm in a new patient cohort and investigated the value of other selected MRI features, especially related with post-contrast enhancement in the prediction of ccRCC occurrence.

Methodology

Studied population

In this retrospective, observational, single-center study, we included N = 50 patients with indeterminate SRMs discovered on MRI between 2012 and 2023. Final diagnoses were made using histopathologic results as standard reference (n = 38). In 12 patients, the final diagnosis was established based on clinical and imaging features (diagnosis of lipid-poor angiomyolipoma or non-ccRCC masses without exact subtype information) if a regression or complete lack of progression was observed during follow-up (minimum 36 months, mean 45.58 ± 9 months) [6, 21,22,23,24]. Exclusion criteria were as follows: diffuse infiltrative renal disease (i.e. lymphoma), masses containing macroscopic fat, predominantly cystic mass, MRI artifacts that could compromise renal lesion segmentation, acute intralesional complications (e.g. hemorrhage), incomplete clinical data, and inability to make a definitive diagnosis due to ambiguous radiological image and lack of histopathology. The study was approved by the Ethics Committee.

Magnetic resonance imaging studies

Magnetic resonance imaging studies were performed in the Department of Diagnostic Imaging in the University Hospital of Cracow, both on 1.5T and 3.0T scanners, using protocols meeting the requirements of ccLS algorithm application and according to the Society of Abdominal Radiology Disease expert consensus. All MRIs included conventional sequences: axial and coronal 2D T2w single shot acquisitions, axial 2D T1w gradient echo in and out of phase for chemical shift imaging, pre- and (dynamic) post-contrast 3D T1w SPGR with fat saturation including delayed scans and diffusion-weighted imaging. All MRI examinations were retrospectively reviewed by two radiologists (T.B. and J.D.J) with at least 3 years experience and were blinded to the final diagnosis. The inter-reader reliability of the ccLS algorithm in our study was good (κ = 0.63). We used the ccLS system to provide a standardized framework for categorizing small renal masses [9], as well as conducted separate evaluation of selected imaging parameters, both those present in the ccLS algorithm and potentially useful new parameters. The following MRI parameters were used for our analysis: T1 and T2 signal intensity (SI) ratio (measured as the ratio of tumor SI to renal cortex SI), intensity of multiphasic contrast enhancement, chemical shift imaging for the presence of microscopic fat, diffusion restriction, ADER (obtained as the ratio of difference between signal intensity in the corticomedullary phase and pre-contrast images to the difference in signal intensity in delayed phase and pre-contrast images), and segmental enhancement inversion (defined as the presence of relative inversion of post-contrast enhancement degree in corticomedullary and later phases).

Statistical analysis

Scoring each renal mass for ccRCC diagnosis was compared to the reference standard, which was determined by histology or imaging (ccRCC vs. ‘all other’ masses). Receiver operating characteristic (ROC) analysis was performed to calculate the area under the curve (AUC) and was used to determine which of the selected MRI features are discriminators between ccRCC and other types of SRMs. A logistic regression was performed in order to determine which of the MRI parameters are significant predictors of ccRCC. The Student’s T-test or Mann-Whitney U test were used to assess differences in continuous variables between ccRCC and non-ccRCC, when appropriate. Continuous variables were expressed as means ± standard deviations or medians and interquartile ranges (IQR). The Pearson χ2 test was used for investigation of associations between categorical variables. Categorical variables were given as numbers and percentages. A p-value of 0.05 or less was considered statistically significant. Specificity, sensitivity, positive predictive value, negative predictive value, accuracy, and likelihood ratio were calculated for each ccLS. Statistical analyses were performed using IBM SPSS Statistics (version 24, IBM Corp., Armonk, NY, USA). Confidence intervals (CI) were calculated using MEDCALC (free statistical calculators).

Results

Patient characteristics

A total of 50 patients with indeterminate renal masses (27 females, 23 males; median age 61.5 [IQR: 52.3–68.3] years) were retrospectively enrolled in the study. In 70.0% of patients, hypertension was observed, and 44.0% had a documented history of cigarette smoking. Symptomatic presentations were noted in 20.0% of the cases: flank pain (14.0%), microhematuria (8.0%), macrohematuria (6.0%), polycythemia (4.0%); no reports of palpable flank mass or paraneoplastic syndromes. A positive family history of cancer was observed in 20% of the patients. In the ccRCC group, there were significantly more cigarette smokers than in the non-ccRCC group (14 [60.9%] vs. 8 [29.6%], p = 0.03). Apart from this, the ccRCC and non-ccRCC groups did not differ in main risk factors related with ccRCC development (Table 1).

Table 1 Prevalence of renal carcinoma risk factors in patients with confirmed ccRCC and group with other etiologies of renal masses

MRI features in ccRCC and non-ccRCC groups

When comparing MRI features, only median values of the T1 ratio and ADER differ significantly between ccRCC and non-ccRCC groups. Median (IQR) of the T1 ratio was significantly lower in the ccRCC group than in the non-ccRCC group (0.8 [0.7–0.9] vs. 0.9 [0.9-1.0], p = 0.01). On the other hand, median (IQR) of the ADER was significantly greater in the ccRCC group when compared to other renal mass etiologies (1.1 [1.0-1.4] vs. 0.9 [0.6–1.1], p = 0.04). All evaluated MRI features in ccRCC and non-ccRCC groups are presented in Table 2.

Table 2 Differences in MRI features in ccRCC group and patients with other etiology of renal masses

Diagnostic accuracy of ccLS scores in the prediction of ccRCC and other masses

Patients with ccRCC differ significantly from patients with non-ccRCC only regarding ccLS = 1/2 and ccLS = 5 (1 [4.3%] vs. 8 [29.6%], p = 0.02; 6 [26.1%] vs. 1 [3.7%], p = 0.02; respectively, Table 3). Sensitivity, specificity, and other parameters for particular ccLS scores in the prediction of ccRCC are described in Table 4. The best performance parameter was ccLS = 4/5 with an accuracy of 60.0%, sensitivity of 59.3%, and specificity of 60.9%. The ccLS = 5 had a satisfactory accuracy of 64.0% and very high specificity of 96.3%; however, its sensitivity of 26.1% was too low.

Table 3 Differences in ccLS scores between patients with ccRCC and other renal masses
Table 4 Diagnostic performance for particular ccLS scores

Selected MRI features in the prediction of ccRCC

Analysis of ROC revealed that from the selected MRI features only the T1 ratio and ADER were discriminators between ccRCC and other types of renal masses (Fig. 2; Table 5). The cut-off points determined in our ROC analysis using the Youden index were 0.73 (p = 0.01) for the T1 ratio, and 0.99 for ADER (p = 0.03).

Fig. 2
figure 2

ROC analysis of the T1 ratio (A) and the ADER (B). T1 ratio - the ratio of tumor SI to renal cortex SI; ADER - arterial to delayed ratio, obtained as the ratio of difference between signal intensity in the corticomedullary phase and pre-contrast images to the difference in signal intensity in delayed phase and pre-contrast images

Table 5 Area under the curve analyses according to selected MRI parameters

The logistic regression model demonstrated that ccLS = 5 (OR = 15.5, CI = 1.1-218.72, p = 0.04) was a significant predictor of ccRCC (Table 6). Interestingly, increasing values of the T1 ratio (OR = 0.002, CI:0.00-0.81, p = 0.04) significantly correspond to decreasing odds of the ccRCC occurrence.

Table 6 Multivariate logistic regression model predicting an occurrence of ccRCC

Discussion

The main goal of our study was validation of the ccLS algorithm. We demonstrated that only ccLS = 5 was a significant predictor of ccRCC occurrence, with a specificity of 96.3% and sensitivity of 26.1%. Unfortunately, in our study, 73.9% of ccRCC cases were classified outside the ccLS = 5 group (reflecting low sensitivity of ccLS = 5), most of them in ccLS = 3 and ccLS = 4 groups, where the ccLS algorithm has lower diagnostic confidence. Previous studies have shown that the ccLS had a moderate to high accuracy for differentiating ccRCC from other small renal masses [13, 16, 18, 25]. A recently published meta-analysis has demonstrated that the sensitivity and specificity of ccLS ≥ 4 for risk stratification of ccRCC were 0.75–0.89 and 0.58–0.82 for individual studies, respectively [18]. These data regarding the specificity are consistent with our results. The difference in observed sensitivity may result from the adopted cut-off point of ccLS (ccLS ≥ 4 in meta-analysis and ccLS = 5 in our study). For ccLS = 4/5 we observed an accuracy of 60.0%, sensitivity of 59.3%, and specificity of 60.9%. However, this cut-off point did not reach the level of statistical significance, probably due to the small sample size included in our study.

Historically, the assessment of SRMs was based mainly on computer tomography imaging and was binary, confined to confirming post-contrast enhancement and excluding the presence of macroscopic fat within the lesion, and with limited differentiation of the histopathological nature of the tumor [26]. Various methods have been proposed for augmenting this fundamental assessment by differentiating tumor types [27], including radiomics-based models for differentiating ccRCC from other SRMs [28, 29]. Compared to computer tomography, MRI provides superior tissue resolution and intravenous contrast response, as well as the simultaneous capability of detecting microscopic fat, potentially rendering better results in prediction of the histopathologic type of lesion. Apart from the ccLS, various MRI algorithms for ccRCC differentiation were created; however, to the best of our knowledge, validation of these algorithms has not been performed [30, 31]. In light of our findings, further improvements and increased utilization of the ccLS algorithm may enhance the role of active surveillance as initial management in selected SRMs as opposed to invasive management; thus, reducing the burden of surgical complications and negative effects of nephron loss.

Interestingly, our study showed that values of the T1 ratio were significantly lower in the ccRCC group than in the non-ccRCC group (0.8 [0.7–0.9] vs. 0.9 [0.9-1.0]), and that a low T1 SI ratio (< 0.73) was a good discriminator between ccRCC and other types of renal masses in ROC analysis. Moreover, logistic regression analysis revealed that increasing values of the T1 ratio (OR = 0.002, CI:0.00-0.81, p = 0.04) correspond to decreasing odds of ccRCC occurrence. However, confirmation of this data requires further research on a larger group of patients. Currently, T1-weighted acquisitions are used in the ccLS algorithm only in chemical shift imaging and multiphasic contrast-enhanced imaging [9]. In light of our results, employment of direct values of the T1 SI ratio may optimize diagnostic confidence of this algorithm; however, further studies in this area are needed. Specifically, papillary RCC tumors require testing, since it has been previously demonstrated that mean tumor T1 SI ratios of papillary RCCs (reader 1, 0.86 ± 0.23; reader 2, 0.82 ± 0.30) were not significantly different from those of ccRCCs (reader 1, 0.81 ± 0.15; reader 2, 0.84 ± 0.27) [28]. Papillary RCC, the second most common RCC variant, usually presents with both low T1 SI and T2 SI (with a tumor T2 SI ratio of ≤ 0.66 having a specificity of 100% and sensitivity of 54% for papillary RCC) [32, 33]. Therefore, in cases of papillary RCC, a low T1 SI may be an especially useful parameter in the assessment of ccRCC risk in lesions with a moderate to high T2 SI.

Our analysis also confirmed the importance of the ADER parameter, showing a statistically significant high specificity of intermediate values of ADER in the prediction of ccRCC, with a mean ADER = 1.1 [1.0-1.4] for ccRCC and a mean ADER = 0.9 [0.6–1.1] for non-ccRCC. During ccLS assessment, ADER > 1.5 is mainly used for differentiation of ccRCC from fat-poor angiomyolipoma, which is a lesion with a high “washout” [34]. Our results correspond to observations that ccRCC tends to show moderate “washout” and demonstrate that ccRCC may be differentiated from other forms of renal masses with a lower cut-off point of ADER = 0.99. However, this cut-off point should be validated in larger populations.

Our observations regarding T1 and ADER parameters will undoubtedly require validation in studies with a larger cohort and under multi-center conditions. If subsequent research confirms their effectiveness, the above parameters may aid in clinical decision-making regarding surgical procedures or biopsy verification, or they might even contribute to the construction of a new, modified scale. Figure 3 demonstrates a clinical case showing the implementation of T1 and ADER parameters and utilization of the ccLS score.

Fig. 3
figure 3

A clinical case showing the implementation of T1 and ADER parameters and utilization of the ccLS score. Axial images of the right kidney show a posteriorly localized, well-defined, oval mass in the right kidney of a 78-year-old female patient. The lesion was subsequently resected and diagnosed as clear cell renal cell carcinoma using histopathological verification. (A) T2-weighted acquisition shows a heterogeneous, mostly T2-hyperintense mass with areas of isointense signal (note that a T2-isointense lesion with similar characteristics would also be classified as ccLS = 5) (B) Two T1-weighted Dixon acquisitions at the same level show a significant decrease in signal intensity on opposed-phase image (right) compared with in-phase images (left) (C) T1-weighted image showing a heterogeneous, slightly hypointense lesion (ROI with *) compared to normal renal cortex (ROI with arrow) with T1 SI ratio = 0.87 (D) Two fat-saturated T1-weighted postcontrast images at the same level in corticomedullary (left) and delayed (right) phases show significant early enhancement (ROI with *) and signal drop in delayed phase (ROI with arrow) with ADER = 1.2 (E) Two DWI (left) and ADC map (right) images at the same level show diffusion restriction in the tumor compared to renal cortex (note that restriction at DWI is considered only an ancillary feature in the ccLS algorithm and in many cases it is more suggestive of indolent lesions rather than ccRCC. In this case, the tumor would be considered ccLS = 5, regardless of its DWI signal)

Our study has several limitations. First, it is retrospective, which may lead to a high risk of bias in terms of the patient selection domain. Additionally, it is only a single-institution series and not all patients who had MRI underwent surgery or biopsy. Currently, there is a lack of prospective data comparing available surveillance strategies. This results in essential variability in the approach and in varied frequency and duration of follow-up imaging. However, there are studies regarding active surveillance for patients with SRMs with a follow-up time of 3 years [6, 21,22,23,24]. Therefore, the minimum follow-up time of patients without surgery or biopsy in our study was stated as 36 months [6, 21,22,23,24]. Diagnostic accuracy of MRI studies could vary based on the radiologist’s experience (given the somewhat subjective nature of assigning the ccLS score). Without randomization of management (surveillance, biopsy, and surgery), there is an inherent selection bias in which patients had histopathology available. Lastly, our study has a small cohort of only 50 renal masses, of which 23 were ccRCC.

Conclusions

The ccLS algorithm has moderate sensitivity and specificity for the evaluation of ccRCC. These findings suggest that the ccLS algorithm should be still validated and tested before it can be incorporated as a diagnostic algorithm, and will guide future iterations of clinical guidelines. Employment of direct values of the T1 ratio parameter may strengthen the diagnostic accuracy of ccRCC diagnosis. However, further studies in this area performed on large and multi-center cohorts are required. The continued development and widespread acceptance of the ccLS algorithm may in the near future reduce the reliance on biopsies in the diagnosis of renal tumors, enhance the early detection of the clear cell variant of RCC, and decrease the burden of avoidable surgical procedures in the management of SRMs by replacing them with an active surveillance strategy.