Introduction

Over the last few years, evidence has established magnetic resonance imaging (MRI) as the most accurate and cost-effective diagnostic imaging modality for the detection and staging of clinically significant disease (csPCa) [1,2,3,4]. Recent trials validated the so-called "MRI pathway" as a mean to trigger MRI-informed biopsy strategies in biopsy-naïve patients [5,6,7,8,9], increasing the detection of csPCa, linked to worst outcomes, while allowing a reduction of overdiagnosis of clinically insignificant prostate cancer (ciPCa), which do not cause symptoms nor death [10]. However, the positive predictive value (PPV) of MRI for csPCa detection is still low and widely variable (35% for PI-RADS score ≥ 3 and 49% when ≥ 4) [11], meaning that many suspicious MRI findings are in fact false-positives on targeted biopsy.

Currently, international guidelines recommend performing MRI, acquired and reported using the Prostate Imaging—Reporting and Data System (PI-RADS) v2.1 [12,13,14]. PI-RADS recommendations suggest performing MRI-directed biopsies (MRDB) on lesions classified as PI-RADS scores 4 and 5, while the management of PI-RADS 3 findings is still currently highly debated. Indeed, this category is one of the main sources of false-positive biopsy results [15]. The cancer yield of findings classified as PI-RADS 3 is still uncertain, with a prevalence of csPCa in biopsied cases spanning from 3 to 50% in the literature [15, 16]. As per PI-RADS v2.1 recommendations, a lesion without positive dynamic contrast-enhanced (DCE) MRI should be classified as PI-RADS 3 when showing an equivocal appearance on diffusion-weighted imaging (DWI) and apparent diffusion coefficient (ADC) map (DWI score 3) (Supplementary Table 1). In case of lesions with equivocal DWI/ADC (DWI score 3), but positive DCE, the overall PI-RADS score should be upgraded to 4 [17]. This upgrade must be considered only in case of peripheral zone (PZ) lesions, which represent the vast majority of prostate lesions and are associated with poorer clinical outcomes [17,18,19].

On this basis, there are two possible types of PI-RADS 4 assignments: those with clearly restricted diffusion (DWI score 4) and size lower than 1.5 cm, and those deriving from the upgrading of a peripheral zone PI-RADS 3 lesion showing positive DCE. As far as we know, no previous prospective works investigated the prevalence of csPCa in PI-RADS 4 lesions deriving from PI-RADS 3 upgrading. Based on our empirical experience in a tertiary referral high-volume center, many of those lesions result in negative or ciPCa on targeted biopsy.

Hence, the primary aim of the study was to evaluate the diagnostic performance of MRI and PI-RADS score in detecting csPCa in PI-RADS 4 categories with clearly restricted diffusion (DWI score 4) or after upgrading from PI-RADS 3 category because of positive DCE and assess the impact of a revised PI-RADS categorization on biopsy decisions. As a secondary objective, we aimed at determining the clinical and radiological features associated with csPCa in those subcategories.

Materials and methods

Patient population and study design

This observational prospective single-center cohort study received formal Institutional Review Board and Ethical Committee approval. Written informed consent was obtained from each participant. A total cohort of 468 patients was consecutively enrolled from February 2020 to February 2022. According to the study design, men with clinical suspicion of PCa (total PSA > 3.0 ng/mL, or > 2.5 in patients with family history, and/or positive digital rectal examination [DRE], and/or suspicious findings at ultrasound) were referred to MRI and, if positive to MRDB (see below for the definition of “positive”). Inclusion criteria also included an acceptable MRI quality to rule in and/or rule out csPCa; only MR images with a Prostate Imaging Quality score [20] (PI-QUAL) ≥ 3 were included. Exclusion criteria were a prior diagnosis of PCa, previous prostate biopsy, dementia or altered mental status, and any contraindication to MRI or biopsy procedures (Fig. 1).

Fig. 1
figure 1

Study flowchart showing the outline of its different phases: enrollment, intervention, and outcomes. PCa, prostate cancer; MRI, magnetic resonance imaging; MRDB, MRI-directed biopsy

MRI protocol, image interpretation, and MRDB

All exams were conducted on two 3 Tesla MRI Scanners (GE Discovery 750 and MAGNETOM Vida Siemens), using a 32-channel surface phased-array body coil. According to PI-RADS v2.1 recommendations, all the exams were acquired using a multiparametric protocol, which included high-resolution T2-weighted imaging (T2WI) on the axial and coronal planes, DWI at b values of 100, 800, 1000, and 2000s/mm2 (the last as part of a separate acquisition) with ADC map reconstruction (based on 100, 800 and 1000 s/mm2), and perfusion (DCE) at a temporal resolution of 6 s following an intravenous bolus of gadolinium-based contrast media. MRI acquisition parameters are listed in Supplementary Table 2.

Images were reported independently by two readers both qualifiable as experts according to the European Society of Urogenital Radiology (ESUR) / EAU Section of Urologic Imaging (ESUI) consensus criteria (15 and 5 years of experience in prostate imaging, who are referred to respectively as “more” and “less experienced reader” throughout the text) [21, 22]. For image interpretation, readers used adjusted PI-RADS v2.1 rules for the PZ as follows: (i) when a lesion showed equivocal diffusion pattern (DWI score 3) and negative DCE, the lesion was categorized as PI-RADS 3; (ii) when a PI-RADS 3 lesion (DWI score 3) was associated to positive DCE, that lesion was categorized as “PI-RADS 3up” rather than as PI-RADS 4. Therefore, PI-RADS 4 categorization included only those lesions showing clearly restricted diffusion (DWI score 4). Readers also attributed the PI-QUAL score to each MRI examination. For images acquired before the introduction of the quality scoring system in 2020, readers retrospectively excluded PI-QUAL < 3 MR images.

MRDBs were performed by the same team of radiologists, with a targeted technique. Cores were obtained with a transrectal approach using a dedicated system (Urostation Koelis) [23]. The selected MRI threshold for setting indication to biopsy was PI-RADS 3 associated with PSA density ≥ 0.15 ng/mL/mL; therefore, all PI-RADS 3up lesions were directed to MRDB.

A dedicated pathologist with 15 years of experience in genitourinary pathology reported the biopsy cores according to the International Society of Urological Pathology (ISUP) grading groups, after they were stored in formalin-filled containers.

Standard of reference and statistical analysis

For the purpose of the study, analysis was performed on per-index lesion basis. Index lesion was defined as the MRI finding showing the higher PI-RADS category or, being equal to the category, the one showing the largest size or extra-prostatic extension. After matching MRI images with biopsy results, MRI findings were assessed as true-positive if matching with the presence of csPCa, and false-positive if not (no cancer or ciPCa). True-negative findings correspond either to a negative histopathologic finding at biopsy or to a nonprogressive, negative follow-up MRI after at least 12 months, without an increase of serum PSA value. Only patients with at least one proper follow-up examination were included (Fig. 1).

Data analysis focused on two different outcomes, namely “outcome 1”, standing for the identification of overall PCa (both ciPCa and csPCa), and “outcome 2”, standing for the detection of csPCa only. The latter was defined as ISUP grading group ≥ 2 [12]. The prevalence of csPCa and ciPCa was calculated for all the different PI-RADS categories, including for PI-RADS 3up. The performance of MRI was assessed by means of receiver operating characteristic (ROC) analysis, according to both readers’ assessments for both outcomes, deriving sensitivity, specificity, PPV, NPV, and accuracy according to the Youden index. For both outcomes, analysis was performed using two different thresholds to define MRI positive, determining biopsy indication: (i) PI-RADS 3 category with increased PSAD, and (ii) an adjusted threshold using PI-RADS 3 or 3up category with increased PSAD.

Inter-reader agreement analysis was performed with Cohen’s kappa (k) statistics for overall PI-RADS assessment and on a per-sequence basis. Differences between patient demographics, clinical characteristics, and PI-RADS score were analyzed using the analysis of variance (ANOVA). Univariable and multivariable analysis with logistic regression was performed to assess which of the following variables was independently predictive of csPCa: patient age (< 70 vs ≥ 70), family history (positive vs negative), PSA value (< 10.0 vs > 10.0 and < 20.0 vs > 20.0 ng/mL), PSA density value (< 0.15 vs ≥ 0.15 ng/mL/mL), the total number of MRI suspicious foci [1 vs > 1], best MRI sequence on which the finding is identifiable (DWI/ADC vs DCE) and the PI-RADS score.

Statistical analysis was performed using Statistical Package for the Social Sciences (SPSS) v28.0.1.1. Tests were two-sided, and statistical significance was set at p < 0.05.

Results

Patient characteristics

The 389 patients included in the final analysis presented a median age of 67.2 years (interquartile range [IQR] 43–85), median total PSA value of 8.42 ng/mL (IQR 0.08–86.20), and median PSA density of 0.19 ng/mL/mL (IQR 0.002–1.60), with a statistically significant difference in PSA density between patients without PCa or having ciPCa and csPCa (0.13 vs 0.16 vs 0.3, p < 0.001). The clinical, radiologic, and pathologic characteristics of the final cohort are summarized in Table 1.

Table 1 Summary of cohort population’s clinical, radiological, and pathological data

A total amount of 284 MRDB was performed, with a mean per-patient number of 3.6 cores taken from each MRI target. One-hundred-ninety-two/389 men (49.4%) were diagnosed with PCa, including 132/389 cases of csPCa (33.9%) and 60/389 cases of ciPCa (15.4%). The remaining 197/389 men (50.6%) were found negative for PCa, with most of them showing inflammatory patterns at histological examination.

Cancer prevalence according to PI-RADS categories

The prevalence of csPCa, ciPCa, and inflammatory patterns in each category of PI-RADS scores is summarized in Table 2. Of note, csPCa prevalence was low in category 3 (2.5%) and 3up (14.5%) (Fig. 2), while we witnessed a drastic increase in PI-RADS 4 (DWI score 4) lesions (53.3%) (Fig. 3), with a statistically significant difference between PI-RADS 3up and PI-RADS 4 both considering PCa prevalence (p < 0.001) and csPCa prevalence (p < 0.001).

Table 2 Prevalence of histologic results among patients with lesions classified with different PI-RADS scores
Fig. 2
figure 2

A 73-year-old man with clinical suspicion of prostate cancer (PSA total value of 6.3 ng/mL, PSA density of 0.09 ng/mL2, positive family history of prostate cancer). a T2WI acquired on the axial plane showing a hypointense nodular lesion on a wider wedge-shaped alteration on the apical-right posterior peripheral zone, with post-contrast enhancement on early DCE images (b), with restricted diffusion at b-value 2000 (c) and normal ADC value (d), classified as PI-RADS 3up. e, f The lesion was biopsied using MRI-TRUS TBx. Histopathology proved the absence of neoplastic disease, with an inflammatory pattern. PSA, prostate-specific antigen; T2WI, T2-weighted imaging; DCE, dynamic contrast-enhanced; TBx, targeted biopsy

Fig. 3
figure 3

A 62-year-old man with clinical suspicion of prostate cancer (PSA total value of 12.9 ng/mL, PSA density of 0.18 ng/mL2). a T2WI acquired on the axial plane showing hypointense nodular lesion on the mid-left posterior peripheral zone, with mild post-contrast enhancement on DCE images (b), with marked restriction diffusion at b-value 2000 (c) and low ADC value (d), classified as PI-RADS 4. e, f The lesions was biopsied using MRI-TRUS TBx. Histopathology confirmed the presence of clinically significant prostate cancer, ISUP 3 (GS 4 + 3). PSA, Prostate-specific antigen; T2WI, T2-weighted imaging; DCE, dynamic contrast-enhanced; TBx, targeted biopsy; ISUP, International Society of Urogenital Pathology; GS, Gleason score

Diagnostic performance of MRI and PI-RADS assessment

Inter-reader agreement considering the assignment of PI-RADS categories was substantial (k = 0.829).

MRI showed a sensitivity, specificity, PPV, NPV, and accuracy of 99.0%, 52.6%, 67.2%, 98.1%, and 75.6% versus 100.0%, 40.9%, 46.5%, 100.0%, and 60.9% for outcome 1 versus outcome 2, respectively. The most experienced reader and less experienced reader achieved an area under the curve (AUC) for outcome 1 and outcome 2 of 0.84 (95% CI: 0.80–0.88) and 0.83 (95% CI: 0.79–0.87) versus 0.82 (95% CI: 0.77–0.86), and 0.82 (95% CI: 0.78–0.86), respectively (Table 3; Fig. 4).

Table 3 Diagnostic performance and area under the curve of MRI and PI-RADS score with a conventional and adjusted threshold for prostate cancer (outcome 1) and clinically significant prostate cancer (outcome 2) detection
Fig. 4
figure 4

ROC analysis for the performance of MRI and PI-RADS score, in detecting prostate cancer (outcome 1) and clinically significant prostate cancer (outcome 2), for both more and less experienced readers. ROC, receiver operating curve; PI-RADS, Prostate Imaging—Reporting and Data System

Moreover, considering MRI positive in cases of lesions classified as PI-RADS 5, 4, and both 3 and 3up only when associated with elevated PSA density, MRI showed a sensitivity, specificity, PPV, NPV, and accuracy of 92.2%, 71.4%, 76.1%, 90.3% and 81.7% versus 93.9%, 57.2%, 53.0%, 94.8%, and 69.7% for outcome 1 versus outcome 2, respectively (Table 3).

Features associated with prostate cancer

Results of the univariable and multivariable analysis are shown in Table 4 and Table 5. The multivariable logistic regression model, for what concerns both outcomes 1 and 2, showed that the variables independently correlating with PCa were age (both p < 0.001), PSA density (p = 0.03 and < 0.001 respectively), DWI as best sequence (both p < 0.001), conventional PI-RADS score (p = 0.007 and = 0.04 respectively) and adjusted PI-RADS score (p < 0.001 and 0.03).

Table 4 Univariable and multivariable regression analysis assessing the correlation among clinical and radiological factors with prostate cancer
Table 5 Univariable and multivariable regression analysis assessing the correlation among clinical and radiological factors with clinically significant prostate cancer

Discussion

In this study, we applied an adjusted version of the PI-RADS scoring system, considering prostate peripheral zone lesions. We differentiated lesions classified as PI-RADS 4 (with DWI score 4) from those upgraded from PI-RADS 3 (DWI score 3) because of positive DCE and identified as PI-RADS 3up. Moreover, we considered a different threshold for defining MRI positive and setting indication to prostate biopsy: both PI-RADS 3 and PI-RADS 3up are associated with increased PSA density. Our analysis resulted in a reduction of MRI false positives and a significant increase in csPCa prevalence among patients with PI-RADS 4 lesions. Furthermore, in our analysis, we witnessed a marked improvement of PPV when PI-RADS 3up lesions are directed to MRDB only in case of association with elevated PSA density. PPV is currently still low and high-variable among centers [11]; this was probably because PI-RADS 4, including lesions upgraded due to a positive DCE, is the second most frequent source of false positive assignments after PI-RADS 3 [24].

These results strengthened the diagnostic power of PI-RADS 4 categorization in the detection of csPCa, with the potential effect of reducing the excess of false positives, already widely described in the literature [15]. This analysis could have a potentially positive effect on biopsy planning, still maintaining the high sensitivity and NPV of MRI.

The secondary objective was to define the clinical and radiological features independently correlating with the presence of PCa and of csPCa, to profile patients at higher risk. The logistic regression models demonstrated how age, PSA Density ≥ 0.15, a clearer definition of the lesion on DWI (compared to DCE), and the PI-RADS score show independent correlation with csPCa. In this sense, when considering PI-RADS 3up as PI-RADS 3 and no longer as PI-RADS 4, 2.6% of csPCa would have been missed if not considering additional data. These results may suggest that clinical-radiological data, if included in nomograms, could support a risk-based stratification of PI-RADS 3 and 3up lesions.

Considering what we found, a new subcategorization of PI-RADS 3 lesions could be proposed: (i) PI-RADS 3B consisting of PI-RADS 3 and 3up findings requiring to be directed to biopsy according to clinical data; and (ii) PI-RADS 3FU, consisting of PI-RADS 3 and 3up findings that should be directed to a follow-up MRI.

Expectedly, DCE did not independently correlate with csPCa detection, which reflects its impact on the number of false positive MRI findings. Indeed, as already proved by other studies, DWI plays a key role in the diagnostic workup of PCa and the applicability of non-contrast MRI in the clinical practice is becoming more and more appealing and necessary, given the requested high burden of MR [25], and also given its comparable diagnostic accuracy in the detection of csPCa [26,27,28]. However, as properly assessed by Belue et al. [29], this is still a controversial topic, since data are limited; studies found in literature often lack the stratification of findings according to prostate cancer grade group per PI-RADS score, the image quality assessment, and readers’ experience [30]. The present study considered all these aspects, proving the relevance of DWI and the secondary role of DCE, as an additional sequence to upgrade equivocal lesions. Our results suggest limiting the DCE’s implication in the scoring assessment and support the use of non-contrast MRI in selected cases, especially in high-volume referral centers, where expert radiologists interpret the images. Moreover, these results are in line with the results of the meta-analysis by Zeng et al. [31], who found that DCE’s role is limited in case of equivocal lesions and expressed concerns on the need of a more suitable method to improve the detection of csPCa. In this sense, the adjusted PI-RADS threshold to direct patients to MRDB proposed in this study could represent the answer to their conclusions. Nonetheless, DCE could still be valuable in cases where DWI and/or T2WI do not reach a sufficiently good quality level [29, 32]; however, we did not investigate it in our study.

Limitations need to be acknowledged: first, the single-center study was conducted in a high-volume referral center with both readers being experienced; second, low-quality exams were excluded from the analysis. These might hamper the reproducibility of the results. Finally, we acknowledge that proposing in a retrospective fashion a new threshold for defining an MRI finding as positive could cause a selection bias in future investigations, but we believe that this study could represent a first experience in this scenario, setting the base for future research and randomized trials.

In conclusion, we can affirm that the upgrade of lesions with DWI Score 3 to PI-RADS 4 given a positive DCE MRI does not provide a positive impact on the overall diagnostic performance of MRI for the detection of clinically significant prostate cancer, leading to a reduced cancer prevalence yield. Moreover, these results might boost the applicability of non-contrast MRI for selected populations and centers. Finally, a new subcategorization of PI-RADS 3 scoring could be proposed, consisting of PI-RADS 3 and 3up findings, and divided into: PI-RADS 3B requiring to be directed to biopsy according to clinical data, and PI-RADS 3FU, to be followed-up with MRI.