Introduction

In the recent years, Magnetic Resonance Imaging (MRI) is increasingly used in the diagnostic pathways for prostate cancer (PCa). Simplified MRI protocols with shorter acquisition time and with no use of contrast agents (biparametric MRI) hold the promise to further reduce costs of prostate imaging and promote its use in any patient at risk of PCa [1, 2]. Furthermore, compared to other tools such as biomarkers, prostate MRI allows to characterize and localize PCa lesions and to eventually perform targeted biopsy of any suspicious lesions. Given the reported superiority of the MRI-targeted biopsy (TBx) compared to systematic biopsy (SBx) in recent prospective clinical trials (PROMIS [3], PRECISION [4], and MRI-FIRST [5]), there is a growing interest to evaluate if SBx is needed in addition to TBx in men with a clinical suspicion of PCa. Avoiding SBx would reduce the risks of infection, bleeding and pain associated with additional cores sampling [6, 7]. Additionally, the incidental detection of clinically insignificant PCa (iPCa) is more frequent in SBx than in TBx [8]. Finally, the workload for pathologist evaluating biopsy cores would be reduced from reviewing 12–18 cores to 2–6 cores. However, the risk of missing clinically significant PCa (csPCa) must be addressed. A recent Cochrane meta-analysis of the results of studies evaluating MRI and MRI-targeted biopsy suggests that omitting systematic biopsy would miss approximately 16% of csPCa in biopsy-naive patients and 10% in the repeat-biopsy setting.

Several multivariable models to predict negative SBx have been developed to avoid systematic cores at the time of target biopsy [10, 11]. None of these models showed excellent accuracy, and their clinical benefit is questionable. There are at least three reasons behind TBx failure: (1) Misdiagnosis of the lesion by the radiologist either due to the low quality of prostate MRI or misinterpretation; (2) Presence of csPCa that is not visible on MRI; (3) Targeting error.

Since smaller lesions in big prostates are more likely to be missed by both the reader or by the person performing the TBx, we aimed to develop and externally validate a nomogram based on MRI volumetric parameters and clinical information for deciding when SBx should be performed in addition to TBx in man with suspicious prostate MRI.

Materials and methods

Study design and population

We retrospectively analyzed data from two prospective clinical trials involving a total of 499 patients (Supporting Material Figure S1). All patients underwent biparametric MRI according to IMPROD protocol (IMPROD bpMRI). Patients with negative MRI (IMPROD bpMRI Likert score/PIRADsv2.1 score of 1–2) were excluded from the analysis (n = 112). Additionally, four men were excluded: three having a PIRADsv2.1 score > 2 while IMPROD bpMRI was Likert 2 and no TBx was performed; one having PIRADsv2.1 score of 1 while IMPROD bpMRI was Likert 3 (Supporting Material Figure S1). Imaging findings for these four men are presented in the Supporting Material (Figure S2–S4) and the following 15 tweet series on twitter https://twitter.com/jambor_ivan/status/1185272940671180800. The final population consisted of two cohort: a single institution development cohort, IMPROD trial (NCT01864135) n = 122 and a multi-institution validation cohort, MULTI-IMPROD trial (NCT02241122) n = 262. All the study biopsies were taken during 4/2013—05/2017. The patient study flow is shown in Figure S1.

IMPROD bpMRI protocol and MRI reporting

As described before, the MRI examination was performed using surface array body coils at 3 T in Turku (Verio, Siemens), Helsinki (Skyra, Siemens), and Tampere (Skyra, Siemens), while 1.5 T (Aera, Siemens) MRI scanner was used in Pori [12]. All imaging data sets were prospectively reported by a local radiologist (at least one year of prostate MRI experience at the beginning of the MULTI-IMPROD trial) and confirmed or re-reported centrally by one designated central reader (IJ, 5 years of prostate MRI experience at the beginning of the MULTI-IMPROD trial) to guarantee reporting integrity using a dedicated IMPROD bpMRI reporting system developed before initialization of the trial (See details at https://mrc.utu.fi/mri/improd). Following the completion of the trial, Prostate Imaging Reporting and Data System version 2.1 (PI-RADSv2.1) scores [12] were assigned by the central reader (IJ). Volumetric analysis was performed using manually delineating whole prostate volumes and all MRI suspicious lesions on axial T2-weigthed images.

Volumes were calculated as the sum of prostate/lesion area on each slice multiplied by slice thicknesses (3 mm). Cancer volume on MRI was calculated as the sum of all the lesion volumes in each patient. Total percentage of cancer on MRI was calculated as the ratio of cancer volume on MRI divided the whole prostate volume. The MRI protocols, all MRI data and MRI reports are freely available at the following address: https://petiv.utu.fi/improd/ and https://petiv.utu.fi/multiimprod/.

Biopsy procedure and histopathological analysis

In IMPROD trial, only the dominant lesion was targeted (two targeted biopsy cores), whereas in MULTI-IMPROD trial, up to two lesions were targeted (two targeted biopsy cores per each lesion). A cognitive targeting method was used with the exception of one center (Helsinki) where a MRI-TRUS fusion platform was used (n = 45). Every patient in the present study underwent the first TBx of every MRI suspicious lesion. The same surgeon subsequently performed 12 cores SBx using a sextant template. The location of each SBx core was random and thus may overlap in some degree with MRI lesions. Histopathology of all biopsies was reported separately at each center by a uropathologist, each with at least 5 years of experience in genitourinary pathology at the beginning of the trial, using the 2014 International Society of Urological Pathology modified Gleason grading system [13]. The Gleason score for each patient was assigned using the overall Gleason score from all SBx and TBx cores. Findings from MRI and clinical data were not available for the pathologists.

Outcome measurements and statistical analysis

Outcomes of this study were the rates of csPCa, Gleason Grade Group (GGG) > 1 and iPCa, GGG = 1, in SBx, TBx and overall (SBx + TBx).

Descriptive characteristics of the development and validation cohorts were calculated. Continuous variables are reported as medians ± standard deviation (when appropriate) and compared by the Mann Whitney test for independent groups. Differences in rates were tested by the chi-square test.

To graphically assess how prostate and lesion dimensions impact the diagnostic accuracy of TBx and SBx, we plotted the actual probability of diagnosing csPCa and iPCa in SBx and TBx according to the percentage of prostate involved by tumor.

Univariable and Multivariable analyses were performed in the development cohort to evaluate predictors of csPCa in SBx, TBx and overall (SBx + TBx).

Two different logistic regression models were developed for each of the three outcomes (csPCa in SBx, TBx, SBx + TBx) including either PI-RADSv2.1 or IMPROD bpMRI Likert score in addition to the other variables. The models with the best area under the curve (AUC) in the development cohort were selected and the linear prediction of the logistic function was used to perform the decision curve analysis in the validation cohort.

Finally, to provide physicians and patients with an outline for deciding when to perform SBx in addition to TBx, we performed the systematic analysis of model-derived cutoffs, considering the predicted probability of the models in the validation cohort. Statistical analyses were performed using Stata 14 (StataCorp LP, College Station, TX, USA). All tests were 2-sided with a significance level set at p < 0.05.

Results

Patients’ characteristic for the included 122 and 262 men in the developmental (IMPROD trial) and validation (MULTI-IMPROD) cohorts, respectively, are presented in Table 1. PSA density (0.20 vs 0.16, p 0.004), index lesion volume (0.98 vs 0.73, p 0.015) and csPCa detection rate in TBx (60% vs 47%, p 0.01) were slightly higher in the developmental cohort but no difference were found in overall csPCa detection rate ( csPCa detection rate in TBx + SBx, p 0.051). CsPCa detection rate according to Likert 1–2, 3 and 4–5 lesion were respectively 4.5%, 47.6% and 79.7% in the development cohort, 10.8%, 30.5% and 70.3% in the validation cohort. No difference was found in csPCa detection rate according to PIRADsv2.1 score.

Table 1 Descriptive characteristics of the development and validation cohort

Univariate analysis in the developmental cohort

The results of univariate analysis evaluating predictors of csPCa in SBx, TBx and all cores (SBx + TBx) are shown in Table 2. In contrast to TBx, MRI suspicious lesion location and prostate volume were significant predictors of csPCa in SBx with p values of 0.039 and 0.02, respectively. The remaining significant predictors of csPCa in SBx were similar to TBx. The total volume of the suspicious lesion on MRI, index lesion volume and the percentage of prostate involved by cancer emerged as important predictors csPCa in SBx, TBx and all cores (SBx + TBx).

Table 2 Univariable analysis evaluating predictors of csPCa in systematic, target, and all cores in the development cohort

Figure 1 graphically shows that the actual probability of diagnosing csPCa in SBx and TBx increase with the increase of the percentage of prostate involved by a suspicious lesion on MRI. However, in patients with low percentages, the probability of finding csPCa in SBx only is very low and actually lower than the probability of finding iPCa.

Fig. 1
figure 1

Actual Probability of Prostate cancer diagnosis in systematic and targeted biopsy cores as a function of MRI-suspicion lesion volume percentage

Multivariable analysis in the developmental cohort

The results of multivariable analysis evaluating predictors of csPCa in SBx using IMPROD bpMRI Likert score and PI-RADsv2.1 are shown in Table 3 and Supporting Material Table S1, respectively. The corresponding results for predictors of csPCa in TBx are shown in Table 4 and Supporting Material Table S2, respectively. Finally, the results of multivariable analysis evaluating predictors of csPCa in SBx + TBx using IMPROD bpMRI Likert score and PI-RADsv2.1 are shown in Supporting Material Table S3 and S4, respectively. In contrast to predictors of csPCa in TBx and SBx + TBx, PSA and MRI-suspicion lesion location were a significant predictor of csPCa only in SBx (Table 3 and Supporting Material Table S1). Decision curve analysis for SBx, TBx, and SBx + TBx are shown in Supporting Material Figure S6, S7, and S8, respectively.

Table 3 Multivariable analysis to predict clinically significant prostate cancer (Gleason score > 3 + 3) in systematic cores using IMPROD bpMRI Likert scoring system
Table 4 Multivariable analysis to predict clinically significant prostate cancer (Gleason score > 3 + 3) in target biopsy cores using IMPROD bpMRI Likert scoring system

Risk calculator

A risk calculator for deciding when SBx should be performed in addition to TBx in men with suspicious MRI findings was developed using the developmental cohort (IMPROD, n = 122) while the results using the validation cohort (MULTI-IMPROD, n = 262) are shown in Table 5. The developed nomogram included age, PSA, prostate volume, MRI suspicion score (IMPROD bpMRI Likert or PIRADsv2.1 score), MRI-suspicion lesion volume percentage, and lesion location. Beta coefficients of the logit function are shown in Supporting Material Table S5-S6.

Table 5 Systematic analysis of the nomogram-derived cutoffs to help identify patients who are going to benefit from systematic cores in addition to target cores in the validation cohort

The external validity of the risk calculator was performed in the validation cohort (n = 262) using DCA and evaluating the clinical utility of the risk calculator at different cut-offs (Table 5). DCA showed a net clinical benefit for every value of predicted probability above 10%.

The systematic analysis of the model derived cutoffs revealed that 19–43% of men would have avoided SBx while missing 1–4% of csPCa and avoiding detection of 9–20% of iPCa (Table 5). Similar performance was found for nomograms using IMPROD bpMRI Likert score or PIRADsv2.1.

Discussion

In this study, using data of two prospective trials, we developed a novel instrument based onto readily-available clinical parameters (age, PSA, prostate volume, IMPROD bpMRI Likert or PIRADsv2.1 score, total percentage of cancer volume on MRI and lesion location) that can help the physician to understand the added value of SBx in patients who underwent TBx of MRI suspicious lesion.

Avoiding SBx in patients who have a suspicious MRI lesion represents a great challenge for a urologist.

In an ideal world, where MRI and targeting systems show perfect performance, avoiding SBx would result in less complications and a lower rate of diagnosis of iPCa.

Recently Wegelin et al., in a post hoc analysis of a randomized controlled trial, compared adverse events among three mpMRI-based TBx techniques of the prostate in men with prior negative SBx and a persisting suspicion of PCa [6, 14]. No difference were found in PCa detection rates between in-bore MRI TBx, MRI-TRUS fusion TBx, and cognitive registration TRUS TBx [14], however, the number of biopsy cores taken was different between the various techniques and was significantly associated with the occurrence of adverse events (OR 1.11 [95% CI 1.06–1.17, p < 0.001]) [6].

To the best of our knowledge no level 1 evidence is available in the biopsy naïve setting. Indeed, when mpMRI is positive (i.e. PIRADsv2.1 > 3), EAU guidelines suggest performing 12-cores SBx in addition to TBx of every suspicious lesion in biopsy naïve patients (Level of evidence: 2A; strength rating: strong) and TBx only in patients with prior negative biopsy (Level of evidence: 2A; strength rating: weak) [15].

Although this might not be feasible in every center at the moment, the aim of this study was to find a subset of biopsy naïve men with a clinical suspicion of PCa that can safely avoid 12-core SBx without compromising the detection of csPCa.

Unlike prior studies, we considered MRI volumetric parameters and we found that adding the prostate volume and percentage of prostate involved by tumor to a model based on age, PSA, lesion location and lesion suspicious score, increased the model predictive accuracy for csPCa on SBx from 0.82 to 0.86 for the model using IMPROD bpMRI Likert and from 0.80 to 0.85 for the model using PIRADsv2.1. Our rationale for this decision was that smaller lesions in big prostates are more likely to be missed by a reader interpreted prostate MRI or by the person performing TBx.

Turning findings into clinical practice, in a 56 years old man with PSA 4 ng/ml and one PIRADsv2.1 4 lesion of 1 cm (0.52 mL), the estimated probability of finding csPCa in SBx is 10.4% in case of prostate volume 60 mL (0.87% of prostate involved by tumor), as opposed to 5.7% in case of prostate volume 80 mL (0.65% of prostate involved by tumor).

Among the models for prediction of sPC in SBx, the nomogram developed by Sathianathen et al. considered the clinical setting (biopsy naive, previous negative biopsy and Active surveillance patients), number of MRI lesions and the highest PIRADS score [11]. Such model showed clinical benefit in a range of probability between 10 and 30%. The lack of external validation limits the generalization of these findings. Moreover, the model was developed to predict csPCa in systematic cores only (with negative TBx). Indeed, our outcome was intentionally csPCa in SBx irrespective of the results of TBx. We believe that information on every positive core (from TBx or SBx) are still necessary when planning focal therapies and to assess the oncologic outcome of the patient since the current risk stratification tools take into account the number of positive cores (e.g. NCCN risk categories that stratifies patients into favorable or unfavorable intermediate-risk based on the number of cores with GGG 2 [16]).

Similar to our risk calculator, the model developed by Stabile et al. predicts csPCa in SBx. Their model ultimately included age, PSA, Prostate volume, number of MRI lesions, PIRADS score and the biopsy setting (biopsy naive vs previous negative biopsy patients) [10]. However, the DCA showed modest clinical benefit in the external validation cohort and the authors concluded that the number of systematic biopsies spared compared with the number of aggressive PCa missed is negligible. Moreover, no useful clinical model can be developed to safely identify those patients who could avoid SBx in addition to TBx. Conversely, our risk calculator showed a clinical benefit in the external validation cohort for the low threshold of probabilities. The addition of volumetric parameters derived from MRI seems to be the key to develop nomograms that help in this scenario.

Biomarkers such us a 4Kscore, selectMDx or the Stockholm3 test could also help to predict the risk of csPCa in SBx in patients with a positive MRI. However, the evidence in this regard is very limited and while MRI is becoming widely available, molecular tests are not yet used in routine clinical practice [17,18,19,20].

The present study is not devoid of limitations. First, we acknowledge that we used a cohort of patients enrolled in a prospective clinical trial that received MRI and biopsies according to the highest standard of care with a central prospective review of MRI images for prostate cancer detection. Although, this is a reflective of the daily clinical practice in referral academic centers it may be limit the validity of our findings to nonacademic centers where radiologists and urologists are still in the initial phase of their learning curve. Moreover, we used the combination of SBx + TBx rather than the Radical prostatectomy specimen or template transperineal saturation biopsy as a reference standard. Even if the previous studies showed a certain rate of discordance between Bx GGG and final pathology GGG [21,22,23], this still represents the standard of care and none of the two other options would have been feasible.

Conclusions

In conclusion, patients with a small lesion on MRI in big prostate benefit less from standard cores at the time of MRI-targeted biopsy. We developed and externally validated in a multi-institutional cohort a risk calculator to predict the added values of SBx to TBx that could help urologist avoid unnecessary biopsy sampling, reduce detection of iPCa while maintaining detection of csPCa.