Automated deep-learning system in the assessment of MRI-visible prostate cancer: comparison of advanced zoomed diffusion-weighted imaging and conventional technique

Hu, Lei; Fu, Caixia; Song, Xinyang; Grimm, Robert; von Busch, Heinrich; Benkert, Thomas; Kamen, Ali; Lou, Bin; Huisman, Henkjan; Tong, Angela; Penzkofer, Tobias; Choi, Moon Hyung; Shabunin, Ivan; Winkel, David; Xing, Pengyi; Szolar, Dieter; Coakley, Fergus; Shea, Steven; Szurowska, Edyta; Guo, Jing-yi; Li, Liang; Li, Yue-hua; Zhao, Jun-gong

doi:10.1186/s40644-023-00527-0

Automated deep-learning system in the assessment of MRI-visible prostate cancer: comparison of advanced zoomed diffusion-weighted imaging and conventional technique

Research article
Open access
Published: 17 January 2023

Volume 23, article number 6, (2023)
Cite this article

Download PDF

You have full access to this open access article

Cancer Imaging Aims and scope Submit manuscript

Automated deep-learning system in the assessment of MRI-visible prostate cancer: comparison of advanced zoomed diffusion-weighted imaging and conventional technique

Download PDF

Lei Hu¹,
Caixia Fu²,
Xinyang Song³,
Robert Grimm⁴,
Heinrich von Busch⁵,
Thomas Benkert⁴,
Ali Kamen⁶,
Bin Lou⁶,
Henkjan Huisman⁷,
Angela Tong⁸,
Tobias Penzkofer⁹,
Moon Hyung Choi¹⁰,
Ivan Shabunin¹¹,
David Winkel¹²,
Pengyi Xing¹³,
Dieter Szolar¹⁴,
Fergus Coakley¹⁵,
Steven Shea¹⁶,
Edyta Szurowska¹⁷,
Jing-yi Guo¹⁸,
Liang Li¹⁹,
Yue-hua Li¹ &
…
Jun-gong Zhao¹

2934 Accesses
2 Citations
1 Altmetric
Explore all metrics

Abstract

Background

Deep-learning-based computer-aided diagnosis (DL-CAD) systems using MRI for prostate cancer (PCa) detection have demonstrated good performance. Nevertheless, DL-CAD systems are vulnerable to high heterogeneities in DWI, which can interfere with DL-CAD assessments and impair performance. This study aims to compare PCa detection of DL-CAD between zoomed-field-of-view echo-planar DWI (z-DWI) and full-field-of-view DWI (f-DWI) and find the risk factors affecting DL-CAD diagnostic efficiency.

Methods

This retrospective study enrolled 354 consecutive participants who underwent MRI including T2WI, f-DWI, and z-DWI because of clinically suspected PCa. A DL-CAD was used to compare the performance of f-DWI and z-DWI both on a patient level and lesion level. We used the area under the curve (AUC) of receiver operating characteristics analysis and alternative free-response receiver operating characteristics analysis to compare the performances of DL-CAD using f- DWI and z-DWI. The risk factors affecting the DL-CAD were analyzed using logistic regression analyses. P values less than 0.05 were considered statistically significant.

Results

DL-CAD with z-DWI had a significantly better overall accuracy than that with f-DWI both on patient level and lesion level (AUC_patient: 0.89 vs. 0.86; AUC_lesion: 0.86 vs. 0.76; P < .001). The contrast-to-noise ratio (CNR) of lesions in DWI was an independent risk factor of false positives (odds ratio [OR] = 1.12; P < .001). Rectal susceptibility artifacts, lesion diameter, and apparent diffusion coefficients (ADC) were independent risk factors of both false positives (OR_{rectal susceptibility artifact} = 5.46; OR_diameter, = 1.12; OR_ADC = 0.998; all P < .001) and false negatives (OR_{rectal susceptibility artifact} = 3.31; OR_diameter = 0.82; OR_ADC = 1.007; all P ≤ .03) of DL-CAD.

Conclusions

Z-DWI has potential to improve the detection performance of a prostate MRI based DL-CAD.

Trial registration

ChiCTR, NO. ChiCTR2100041834. Registered 7 January 2021.

Performance of an ultra-fast deep-learning accelerated MRI screening protocol for prostate cancer compared to a standard multiparametric protocol

Article Open access 23 May 2024

Does deep learning software improve the consistency and performance of radiologists with various levels of experience in assessing bi-parametric prostate MRI?

Article Open access 20 March 2023

Predicting clinically significant prostate cancer with a deep learning approach: a multicentre retrospective study

Article Open access 21 November 2022

Background

Diffusion-weighted imaging (DWI) is an indispensable technique in prostate magnetic resonance imaging (MRI), providing both qualitative and quantitative functional information of prostate tissue and lesions [1,2,3,4,5]. Relying on the Prostate Imaging Reporting and Data System (PI-RADS) [2], DWI combined with T2-weighted imaging (T2WI) has shown significantly improved accuracy of detection and characterization of prostate cancer (PCa) lesions [6] and plays an important role in the clinical management strategy of patients with suspected PCa. However, due to differences in hardware, software, and technical experience [7, 8], there is a high variation of diagnostic accuracy and inter-observer agreement in the interpretation of prostate MRI across medical centers [1, 2]. These factors limit the clinic application of prostate MRI.

Various deep-learning-based computer-aided diagnosis (DL-CAD) systems using prostate MRI for PCa detection have demonstrated comparable or improved performance and reproducibility with less time and labor compared to experienced radiologists [7, 9,10,11,12]. Nevertheless, DL-CAD systems are vulnerable to high heterogeneities in DWI [13], which can interfere with DL-CAD assessments and impair performance. Developing a DWI sequence that can improve the accuracy and reliability of DL-CAD would improve PCa diagnosis. DWI acquisition performed using full-field-of-view (FOV) ssEPI-DWI (f-DWI) is prone to distortions, susceptibility artifacts, and limited spatial resolution. By contrast, zoomed-field-of-view echo-planar DWI (z-DWI) using a small FOV that only covers a specific region-of-interest (ROI) results in fewer geometric distortions, susceptibility artifacts, and higher spatial resolution [2, 10, 14,15,16,17,18]. Previous studies indicated that variation of noise, deformation and changes of resolution are important factors interfering with the judgement of DL-CAD [13, 19,20,21,22], therefore, we hypothesized that z-DWI might be helpful to improve the performance of DL-CAD for PCa diagnosis.

In this study, we assessed the use of DL-CAD with f-DWI and with z-DWI and compared each format in diagnosing MRI-visible PCa. The risk factors of patient condition, image quality, and lesion characteristics that could affect the DL-CAD diagnostic efficiency were analyzed.

Methods

This retrospective study was approved by the local ethics committee at our institution [Approve No: 2022-KY-073(K)]. As part of a prospective study aiming to build a robust AI system for PCa diagnosis, all the enrolled subjects signed the informed consent before they underwent the MRI examination and allowed us to use their data for a series of follow-up studies about AI system building for PCa diagnosis. All procedures performed in studies involving human participants were according to the 1964 Helsinki Declaration and its later amendments.

Participant selection

Between January 2021 and January 2022, participants with clinically suspected PCa undergone MRI examinations and subsequent MRI fusion ultrasound-guided targeted biopsies (2–4 cores) of MRI-suspicious lesions (PI-RADS score ≥ 3) followed by systematic biopsies (10–12 cores) were consequently enrolled from Shanghai Sixth People’s Hospital Affiliated to Shanghai Jiao Tong University School of Medicine. All MRI scans were interpreted by two senior radiologists with more than 15 years of experience in prostate MRI interpretation using the PI-RADS version 2.1.

The inclusion criteria included the following: (a) prostate lesions with definite boundaries on three types of MR images according to according to PI-RADS version 2.1 (T2WI, DWI, and ADC); (b) complete clinical information and entire MRI reports including the number, PI-RADS score, and location of suspected PCa lesion; (c) complete biopsy records and results, including the number, location, Gleason score (GS) of lesions. Exclusion criteria were (a) a prior history of PCa treatment; (b) biopsy within 6 months prior to the MRI examination; (c) an interval of more than 2 weeks between MRI and the biopsy procedure; (d) unavailability of the final PCa diagnosis.

MRI examination

Patients were advised to empty their bowel prior to the examination. All patients underwent both T2-weighted imaging (T2WI), f-DWI, z-DWI with b-values of 50, 1000, and 1500 s/mm² on a 3 T MRI scanner (MAGNETOM Skyra, Siemens Healthcare, Erlangen, Germany), and a phased-array 18-channel body coil in combination with an integrated 32-channel spine coil was used for signal reception. Z-DWI was performed with a slight rotation of the field-of-excitation [16], motion registration [23], and complex averaging [24]. More detailed parameters of each MRI sequence are shown in Table 1.

Table 1 MR sequence parameters

Full size table

Histopathology matching and annotation

The ground truth of this study was lesion confirmation on histopathology after biopsy. At least one GU radiologist and one GU pathologist retrospective reviewed MRI and histopathology examinations together at a multidisciplinary meeting scheduled monthly. Each lesion in MRI was matched to the corresponding location on the specimen through visual co-registration. According to the Ginsburg Study group method, a sextant scheme of the prostate has been used for analysis of the correct identification of the lesions’ localization [25]. Prostate contours were segmented on T2WI images and partitioned into sextants using the midsagittal plane and four additional angulated planes according to the biopsy protocol. Sextant-specific systematic biopsy histopathology was assigned to all MR sextants and augmented by calculating the maximum GS between systematic histopathology and histopathology from targeted fusion biopsy to sextants intersecting with MR lesions to create a sextant map of histopathology ground truth [26]. MRI reports and biopsy results, including the number, GS, and prostate location by MRI of lesions of all selected participants, were recorded in the regular retrospective review.

Deep learning-based computer-aided diagnosis

A prototype DL-CAD system (MR Prostate AI v1.2.5; October 2020; Siemens Healthcare GmbH, Erlangen, Germany) was used to test the performances of z-DWI and f-DWI in PCa detection. This DL-CAD system was trained using 2170 bp-MRI prostate examinations from 7 institutions, consisting of 944 lesion-free cases and 1226 cases with at least 1 clinically significant lesion that is deemed at least equivocal as designated by PI-RADS score 3 and higher; none of the cases included in the testing data set was part of the training [9]. The architecture and processing steps of the DL-CAD system have been described in detail in previous studies [9, 12, 27]. In brief, the system computes apparent diffusion coefficient (ADC) maps and calculates b-value images at b = 2000 s/mm² using the input DWI and then segments whole-gland volumes using T2WI. After that, the DWI and ADC are aligned to the T2WI. Finally, the system identifies the clinically relevant lesions based on T2WI, calculated b-value images, ADC maps, and prostate segmentations, and provides detected lesion localization, a PI-RADS category, and case-based level of suspicion (LoS). The LoS represents the confidence of the software that a lesion with a PI-RADS score of 3 or above is present in that patient, and ranges from 3.0 to 5.0 in steps of 0.1.

Evaluation of detection performance

The patient-based diagnostic performance of DL-CAD was evaluated by computing ROC curves for each case-based LoS. We evaluated two clinically relevant tasks: (a) differentiating between benign lesion and PCa (Gleason Grade Group (GGG) ≥ 1), and (b) differentiating between benign lesion or low-grade cancer of GS 3 + 3 and clinically significant PCa (csPCa) (GGG ≥ 2) [11].

The DL-CAD system can automatically detect clinically relevant PCa lesions which were defined on pathology/histology as Gleason score ≥ 7, and/or volume ≥ 0.5 cc, and/or extra-prostatic extension according to PI-RADS v2.1. The clinically relevant PCa lesions-based detection performances of DL-CAD were evaluated using free-response receiver operating characteristics (FROC) analysis due to PCa’s multi-focality [6, 28, 29]. In addition, considering the FROC curve has an infinite area under the curve (AUC), we also used alternative free-response receiver operating characteristics (AFROC) analysis with a finite AUC ranging from 0 to 1 to evaluate the lesion-based detection performance of the DL-CAD [30].

DWI image analysis

Two radiologists with approximately 3 years of experience in prostate MRI reporting and blinded to all clinical details and biopsy results twice evaluated the DWI sets including the image quality scoring, DWI signal intensities measurements and ADC measurements. The independent readings of the first time were used for assessing inter-reader agreement on quality scores, and DWI signal intensities and ADC measurements. The final image quality score of each DWI set was determined by the two radiologists by consensus. The final image quality score and the mean results of DWI signal intensities measurements and ADC measurements of the two radiologists were used for the evaluation of risk factors affecting the DL-CAD diagnostic efficiency.

After four weeks, the second time image analysis was performed by the two readers in a different order to test for reproducibility. Image evaluation was performed using the Image J Software (National Institutes of Health, Bethesda, MD, USA).

For each time, the DWI sets in two different sessions at intervals of at least two weeks to minimize recognition bias. In each session, only 1 of the 2 DWI sets for each patient was evaluated. Specifically, the DWI sets were reviewed in a random order and were rated in terms of overall quality and anatomic distortion using a 5-point Likert scale with 5 indicating the highest quality [18, 31]. Axial T2WI images were used as a reference for guiding anatomical localization of findings on the DWI sets [32]. In addition, the presence of artifacts including rectal susceptibility artifacts, phase wrap-around, artifacts from artificial joint replacements, other artifacts from outside the body (only for f-DWI) was noted, and the grade of artifact influence on image quality was scored as: 1, excellent image quality; 2, mild artifact, not impacting diagnosis; 3, moderate artifact, mildly impacting diagnosis; 4, pronounced artifact, moderately impacting diagnosis; 5, pronounced artifact, non-diagnostic. Artifacts scored ≥3 were considered to have an influence on the diagnosis [10].

To evaluate noise, lesion conspicuity, and ADC values of each DWI set, the radiologists were also asked to draw ROIs on the ADC map, which were then copied to the DWI (b = 1500 s/mm²) image. One ROI was placed in the center of the lesion in the slice with the largest extent of the lesion. The second ROI was placed in the corresponding contralateral normal tissue as a reference ROI. If the contralateral tissue was also abnormal, the reference ROI was placed in healthy appearing tissue of the same anatomical zone as the lesion. To calculate the standard deviation of the noise (SD_noise) in a noise-only area, a third ROI was placed in the center of the bladder with the DWI at b = 1500 s/mm². The ADC values, mean signal intensities in the lesion ROI (S_lesion), the reference ROI (S_normal), and SD_noise were recorded for further analyses.

The difference in the noise between z-DWI and f-DWI was calculated using the estimated signal-to-noise ratio (eSNR) [33]:

$$eSNR={S}_{lesion}/S{D}_{noise}$$

Lesion conspicuity was determined by the contrast-to-noise ratio (CNR):

$$CNR=\left({S}_{lesion}-{S}_{normal}\right)/S{D}_{noise}$$

We compared DWI quality and the characteristics of benign lesions and malignant lesions to determine risk factors affecting the DL-CAD diagnostic efficiency. The relationships between DL-CAD diagnostic performance and image quality as well as lesion characteristics were also evaluated.

Statistical analyses

The one-sample Kolmogorov-Smirnov test was used to check the assumption of a normal distribution of the data. The independent t test or paired t test was used for normally distributed data. The Mann-Whitney U test was used to assess non-normally distributed continuous variables. Categorical variables were reported as percentages and compared by χ² test. Comparisons of sensitivity and specificity were performed using the McNemar test. Comparisons of AUCs were performed using the Delong test. Inter- and intra-observer agreement of overall quality, anatomic distortion, and artifact evaluation were tested with a weighted κ coefficient. ADC values and SI_lesion and SI_normal measurements were assessed with the intraclass correlation coefficient.

Univariable logistic regression analyses and multivariable logistic regression analyses with stepwise approaches were applied to assess the relationship between false positives and potential risk factors and between false negatives and potential risk factors. The multicollinearity of variables in the multivariable analysis was determined using a variance inflation factor (VIF) of greater than 10.

Statistical evaluations were performed using R v4.10 (R Foundation for Statistical Computing, Vienna, Austria; https://www.R-project.org/). The VIFs were calculated using the “car” package. Comparison of Binary Diagnostic Tests in a Paired Study Design was performed using “DTComPair” package. The ROC curves were plotted using the “pROC” package. The FROC curves and AFROC curves were plotted using the “BayesianFROC” package. Forest plots of the logistic regression analyses were performed using “forestmodel” package. P values less than 0.05 were considered statistically significant.

Results

Participant and lesion baseline characteristics

Initially, a total of 389 participants were enrolled. Of these, 35 were excluded according to the inclusion and exclusion criteria. The detailed reasons for exclusion are listed in Fig. 1. A total of 354 patients (median age, 65 years; interquartile range [IQR], 71–77 years) with 486 lesions (250 cancer lesions and 236 benign lesions) were included in the final study. Baseline epidemiologic and clinical characteristics of the participants are shown in Table 2.

Table 2 Demographic and Clinical Characteristics of Included Patients

Full size table

There were no significant differences in the mean patient age between subjects with and without PCa (P = 0.23). Participants with PCa had higher levels of total PSA and free PSA and lower free PSA ratio than those without PCa (All P < .001). Detailed information about lesions, including lesion location, pathologic findings, and clinical assessment, is shown in Table 3.

Table 3 Lesion characteristics

Full size table

Patient-based performance

Figure 2 shows that, compared with f-DWI, DL-CAD based on z-DWI had better performance in differentiating between benign lesion and PCa (Sensitivity: 0.79 [95% CI: 0.73-0.85] vs. 0.78 [95% CI: 0.71-0.84]; Specificity: 0.89 [95% CI: 0.83-0.93] vs. 0.83[95% CI: 0.77-0.89]; AUC: 0.89 [95% CI:0.85-0.92] vs. 0.86 [95% CI: 0.81-0.89], P = 0.007) and in differentiating between benign tissue or PCa of GS 3 + 3 and csPCa (Sensitivity: 0.81 [95% CI: 0.74 - 0.87] vs. 0.78 [95% CI: 0.70-0.84]; Specificity: 0.85 [95% CI: 0.79-0.80] vs. 0.82 [95% CI:0.76-0.87]; AUC: 0.88 [95% CI:0.84-0.91] vs. 0.85 [95% CI: 0.81-0.88], P = 0.024).

Lesion-based detection performance

Lesion-based detection performance of the DL-CAD system is shown in Table 4 and Fig. 3.

Table 4 Prostate cancer lesion detection performance of DL-CAD using f-DWI and z-DWI

Full size table

Compared with f-DWI, z-DWI had significantly higher sensitivity but lower specificity for lesion detection at PI-RADS category greater than or equal to 3 (Sensitivity: 0.93 [95% CI: 0.90-0.96] vs. 0.73 [95% CI:0.68-0.79]; Specificity: 0.61 [95% CI: 0.55-0.67] vs. 0.76 [95% CI:0.70-0.81]; P < .001 for all comparisons).

At a detection sensitivity > 0.1, DL-CAD using z-DWI provided lower False Positive Fractions per patient than DL-CAD using f-DWI (Fig. 4a). DL-CAD using z-DWI had better performance for PCa lesion detection with less false-positive detections per patient than DL-CAD f-DWI (AUC, 0.855 [95% CI:0.825-0.883] vs. 0.760 [95% CI: 0.714-0.799]; P < .001) (Fig. 4b).

DWI image analysis

The inter- and intra-observer agreement for image quality scores, DWI signal intensities, and ADC measurements were concordant (Suppl. material).

As shown in Table 5, DL-CAD using z-DWI has significantly higher scores for overall image quality and distortion of the prostate, and lower scores of the severity of artifacts compared with DL-CAD using f-DWI (P ≤ .035). For both benign lesions and malignant lesions, DL-CAD using z-DWI had lower ADC values and higher CNR and eSNR than those in DL-CAD f-DWI (P ≤ .011).

Table 5 Comparison of subjective image quality and the main lesion between different diffusion-weighted imaging sequences of the prostate

Full size table

Risk factors evaluation

Examples of DL-CAD diagnosis using f-DWI and z-DWI are shown in Fig. 5. Based on subjective visual evaluation, prostate deformation, artifacts, and lesion signal intensity on DWI and ADC values are possible reasons resulting in DL-CAD misdiagnosis.

As shown in Fig. 6, CNR was an independent risk factor for false positives of DL-CAD (odds ratio [OR], 1.12; 95% CI, 1.05-1.21; P < .001) and rectal susceptibility artifacts, diameter, and ADC were independent risk factors associated with both false positive detections (OR_{rectal susceptibility artifact,} 5.46; 95% CI, 2.77-10.96; OR_diameter, 1.12; 95% CI, 1.07-1.17; OR_ADC, 0.998; 95% CI, 0.997-0.999; all P < .001) and false negative detections (OR_{rectal susceptibility artifacts}, 3.31; 95% CI, 1.15-9.68; OR_diameter, 0.82; 95% CI, 0.75-0.89; OR_ADC, 1.007; 95% CI, 1.004-1.009; all P ≤ .03) of DL-CAD.

Discussion

Our study has two main contributions. First, we compared PCa detection performance of the DL-CAD system with the use of z-DWI and f-DWI, finding that the DL-CAD system exhibited significantly better PCa detection performance based on z-DWI than using f-DWI. It indicates that z-DWI may be a way towards more consistent and better image quality. Second, risk factors that affected the diagnostic performance of the DL-CAD system in the assessment of PCa were identified. As these types of image artifacts are common in prostate MRI, the risk factors that interfere with one DL-CAD system may have similar effects in other DL-CAD systems with different network structures and parameter settings. Understanding these risk factors might be helpful for standardizing prostate MRI scanning guidelines for DL-CAD analysis, customizing the corresponding DL-CAD training strategies, and improving the diagnostic accuracy and generalization of DL-CAD.

DWI acquired by zoomed-FOV technology has provided better image quality than that obtained with other technologies [10, 15, 18, 32, 34]. However, due in part to the physiological limitations of visually identifying subtle differences among lesions, the improvement in image quality did not significantly improve the subjective evaluation performance of radiologists for PCa detection in many previous studies [15, 34]. Given the ability of DL-CAD to mine the sub-pixel level, previous radiomic study found that a radiomics model based on z-DWI had a higher diagnostic accuracy, sensitivity, and specificity than a model based on f-DWI [17]. Partly differing from previous result, the improvement of PCa detection performance of the DL-CAD using z-DWI primarily comes from the improvement in sensitivity. We found that the sensitivity of DL-CAD using z-DWI for detecting lesions improved by 20%. However, its specificity for detecting lesions was reduced by 15%. Our results indicate that the observed operating point of DL-CAD using z-DWI was shifted in favor of higher sensitivity, but considering the superior ROC curves, z-DWI achieved superior specificity at given sensitivity levels.

In contrast to the radiomic model which was constructed with explicable texture feature information, the training and diagnosis process of DL-CAD is much more complex. We used common clinical research methods to find risk factors contributing to diagnostic errors from the macroscopic level of DL-CAD. We found that CNR was positively associated with false positives of DL-CAD, whilst ADC was negatively associated with false positives of DL-CAD. It indicates that parameter settings producing on average lower ADC values and higher CNR in PCa lesions in DWI might be helpful to improve the performance of DL-CAD using DWI sets.

Because the lesions in z-DWI have lower ADC values but higher CNR in both benign and malignant lesions than those in f-DWI, it is not surprising that DL-CAD using z-DWI detected more true positives but had increased false positives. Therefore, strategies to overcome these problems will need to be determined before applying z-DWI to existing f-DWI based DL-CAD systems, e.g., by further training the DL-CAD with z-DWI data in the future.

Consistent with our clinical observations, we found that the severity of rectal susceptibility artifacts is an independent high-risk factor for both false positives and false negatives of DL-CAD. Severe artifacts in the rectal region led to signal gain or loss in adjacent prostate tissue and to local gland deformation of the prostate, which impairs the performance of DL-CAD which relies on an accurate co-registration of T2WI and DWI. Therefore, we hold that the reduction of rectal susceptibility artifacts was one of the main reasons why the DL-CAD diagnostic accuracy using z-DWI was improved. It is also indicated that good bowel preparation before prostate MRI examination may help to maintain the accuracy and stability of DL-CAD.

Another interesting finding of our study which is inconsistent with our initial expectations is that noise and phase wrap-around were not found to be independent risk factors that interfered with the diagnostic efficiency of DL-CAD. These aspects may not be important considerations for developing improved DWI scanning strategies of DL-CAD for PCa diagnosis. According to the results, artifacts caused by artificial joint replacements and other artifacts out-of-body in DWI also were not key factors resulting in the misdiagnoses of DL-CAD. However, considering few patients suffering these artifacts, this result still needs to be further verified by larger samples.

Our study had limitations. First, only one trained DL-CAD based on full-FOV DWI was used for the comparison of DWI sets. Although the effects of reduced image quality are typically not limited to a single model, whether the risk factors we have identified affect the performance of other models still needs further verification. Second, only DWI images obtained from a single manufacturer were included for comparison. Results from multi-vendor datasets obtained from multiple imaging centers are needed to verify our results. Third, because in our clinic routine, active surveillance instead of biopsy is typically selected for patients with elevated PSA but negative MRIs, it is hard for us to evaluate the performances of DL-CAD on PCa patients with negative MRIs. Therefore, only patients with prostate lesions with definite boundaries on all MR images were included, there might be a potential source of selection bias. Fourth, a reduced FOV may prevent the visualization of lymph nodes, a full-FOV DWI is still needed to study lymph nodes. Finally, in our study, targeted biopsies were used as reference standards. Whole-mount histopathology may have improved the accuracy of the agreement between the MR images and the histopathology.

Conclusions

In conclusion, z-DWI has the potential to improve the detection performance of a prostate MRI based DL-CAD system.

Availability of data and materials

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Abbreviations

DL-CAD:: Deep learning-based computer aided diagnosis
PCa:: Prostate cancer
ssEPI:: Single-shot echo-planar imaging
f-DWI:: Full-field-of-view DWI
z-DWI:: Zoomed-field-of-view echo-planar DWI
AUC:: Area under the curve
CNR:: Contrast-to-noise ratio
OR:: Odds ratio
ADC:: Apparent diffusion coefficient
FOV:: Field-of-view
PI-RADS:: Prostate Imaging Reporting and Data System
GGG:: Gleason Grade Group
LoS:: Case-based level of suspicion
ROI:: Region-of-interest
FROC:: Free-response receiver operating characteristics
VIF:: Variance inflation factor
GS:: Gleason score
SD:: Standard deviation
eSNR:: Estimated signal-to-noise ratio

References

Giganti F, Rosenkrantz AB, Villeirs G, Panebianco V, Stabile A, Emberton M, et al. The evolution of MRI of the prostate: the past, the present, and the future. AJR Am J Roentgenol. 2019;213(2):384–96.
Article Google Scholar
Turkbey B, Rosenkrantz AB, Haider MA, Padhani AR, Villeirs G, Macura KJ, et al. Prostate imaging reporting and data system version 2.1: 2019 update of prostate imaging reporting and data system version 2. Eur Urol. 2019;76(3):340–51.
Article Google Scholar
Donati OF, Mazaheri Y, Afaq A, Vargas HA, Zheng J, Moskowitz CS, et al. Prostate cancer aggressiveness: assessment with whole-lesion histogram analysis of the apparent diffusion coefficient. Radiology. 2014;271(1):143–52.
Article Google Scholar
Hambrock T, Somford DM, Huisman HJ, van Oort IM, Witjes JA, Hulsbergen-van de Kaa CA, et al. Relationship between apparent diffusion coefficients at 3.0-T MR imaging and Gleason grade in peripheral zone prostate cancer. Radiology. 2011;259(2):453–61.
Article Google Scholar
Vargas HA, Akin O, Franiel T, Mazaheri Y, Zheng J, Moskowitz C, et al. Diffusion-weighted endorectal MR imaging at 3 T for prostate cancer: tumor detection and assessment of aggressiveness. Radiology. 2011;259(3):775–84.
Article Google Scholar
Cao R, Zhong X, Afshari S, Felker E, Suvannarerg V, Tubtawee T, et al. Performance of deep learning and genitourinary radiologists in detection of prostate Cancer using 3-T multiparametric magnetic resonance imaging. J Magn Reson Imaging. 2021;54(2):474–83.
Article Google Scholar
Hiremath A, Shiradkar R, Merisaari H, Prasanna P, Ettala O, Taimen P, et al. Test-retest repeatability of a deep learning architecture in detecting and segmenting clinically significant prostate cancer on apparent diffusion coefficient (ADC) maps. Eur Radiol. 2021;31(1):379–91.
Article Google Scholar
Yang F, Dogan N, Stoyanova R, Ford JC. Evaluation of radiomic texture feature error due to MRI acquisition and reconstruction: a simulation study utilizing ground truth. Physica Medica. 2018;50:26–36.
Article Google Scholar
Winkel DJ, Tong A, Lou B, Kamen A, Comaniciu D, Disselhorst JA, et al. A novel deep learning based computer-aided diagnosis system improves the accuracy and efficiency of radiologists in Reading Biparametric magnetic resonance images of the prostate: results of a multireader. Multicase Study Invest Radiol. 2021;56(10):605–13.
Article CAS Google Scholar
Hu L, Wei L, Wang S, Fu C, Benker T, Zhao J. Better lesion conspicuity translates into improved prostate cancer detection: comparison of non-parallel-transmission-zoomed-DWI with conventional-DWI. Abdom Radiol (NY). 2021;46(12):5659–68.
Article Google Scholar
Bulten W, Pinckaers H, van Boven H, Vink R, de Bel T, van Ginneken B, et al. Automated deep-learning system for Gleason grading of prostate cancer using biopsies: a diagnostic study. Lancet Oncol. 2020;21(2):233–41.
Article Google Scholar
Yu X, Lou B, Shi B, Winkel D, Szolar D. False Positive Reduction Using Multiscale Contextual Features for Prostate Cancer Detection in Multi-Parametric MRI Scans. In: In: 2020 IEEE 17th International Symposium on Biomedical Imaging (ISBI). Iowa City: IEEE; 2020. p. 1355–9.
Google Scholar
Hirano H, Minagi A, Takemoto K. Universal adversarial attacks on deep neural networks for medical image classification. BMC Med Imaging. 2021;21(1):9.
Article Google Scholar
Feuerlein S, Davenport MS, Krishnaraj A, Merkle EM, Gupta RT. Computed high b-value diffusion-weighted imaging improves lesion contrast and conspicuity in prostate cancer. Prostate Cancer Prostatic Dis. 2015;18(2):155–60.
Article CAS Google Scholar
Brendle C, Martirosian P, Schwenzer NF, Kaufmann S, Kruck S, Kramer U, et al. Diffusion-weighted imaging in the assessment of prostate cancer: comparison of zoomed imaging and conventional technique. Eur J Radiol. 2016;85(5):893–900.
Article Google Scholar
Finsterbusch J. Improving the performance of diffusion-weighted inner field-of-view echo-planar imaging based on 2D-selective radiofrequency excitations by tilting the excitation plane. J Magn Reson Imaging. 2012;35(4):984–92.
Article Google Scholar
Hu L, Zhou DW, Fu CX, Benkert T, Jiang CY, Li RT, et al. Advanced zoomed diffusion-weighted imaging vs. full-field-of-view diffusion-weighted imaging in prostate cancer detection: a radiomic features study. Eur Radiol. 2021;31(3):1760–9.
Article Google Scholar
Rosenkrantz AB, Chandarana H, Pfeuffer J, Triolo MJ, Shaikh MB, Mossa DJ, et al. Zoomed echo-planar imaging using parallel transmission: impact on image quality of diffusion-weighted imaging of the prostate at 3T. Abdom Imaging. 2015;40(1):120–6.
Article Google Scholar
Xu M, Zhang T, Li Z, Liu M, Zhang D. Towards evaluating the robustness of deep diagnostic models by adversarial attack. Med Image Anal. 2021;69:101977.
Article Google Scholar
Allyn J, Allou N, Vidal C, Renou A, Ferdynus C. Adversarial attack on deep learning-based dermatoscopic image recognition systems: risk of misdiagnosis due to undetectable image perturbations. Medicine (Baltimore). 2020;99(50):e23568.
Article Google Scholar
Akhtar N, Mian A. Threat of adversarial attacks on deep learning in computer vision: a survey. Ieee Access. 2018;6:14410–30.
Article Google Scholar
Vidnerova P, Neruda R. Vulnerability of classifiers to evolutionary generated adversarial examples. Neural Netw. 2020;127:168–81.
Article Google Scholar
Jolly MPD, Guetter C, Guehring J. Cardiac segmentation in MR cine data using inverse consistent deformable registration. In: In: Proceedings of the 2010 IEEE International Symposium on Biomedical Imaging: From Nano to Macro. Rotterdam: IEEE; 2010. p. 484–7.
Chapter Google Scholar
Kordbacheh H, Seethamraju RT, Weiland E, Kiefer B, Nickel MD, Chulroek T, et al. Image quality and diagnostic accuracy of complex-averaged high b value images in diffusion-weighted MRI of prostate cancer. Abdom Radiol (NY). 2019;44(6):2244–53.
Article Google Scholar
Kuru TH, Wadhwa K, Chang RT, Echeverria LM, Roethke M, Polson A, et al. Definitions of terms, processes and a minimum dataset for transperineal prostate biopsies: a standardization approach of the Ginsburg study Group for Enhanced Prostate Diagnostics. BJU Int. 2013;112(5):568–77.
Article Google Scholar
Bonekamp D, Schelb P, Wiesenfarth M, Kuder TA, Deister F, Stenzinger A, et al. Histopathological to multiparametric MRI spatial mapping of extended systematic sextant and MR/TRUS-fusion-targeted biopsy of the prostate. Eur Radiol. 2019;29(4):1820–30.
Article Google Scholar
Yang D, Xu D, Zhou SK, Georgescu B, Chen M, Grbic S, et al. Automatic Liver Segmentation Using an Adversarial Image-to-Image Network. Cham: Springer International Publishing; 2017. p. 507–15.
Google Scholar
Ehteshami Bejnordi B, Veta M, Johannes van Diest P, van Ginneken B, Karssemeijer N, Litjens G, et al. Diagnostic assessment of deep learning algorithms for detection of lymph node metastases in women with breast Cancer. JAMA. 2017;318(22):2199–210.
Article Google Scholar
Cao R, Mohammadian Bajgiran A, Afshari Mirak S, Shakeri S, Zhong X, Enzmann D, et al. Joint prostate Cancer detection and Gleason score prediction in mp-MRI via FocalNet. IEEE Trans Med Imaging. 2019;38(11):2496–506.
Article Google Scholar
Nandram B, Peiris T. Bayesian analysis of a ROC curve for categorical data using a skew-binormal model. Statistics and Its Interface. 2018;11(2):369–84.
Article Google Scholar
Nketiah G, Selnaes KM, Sandsmark E, Teruel JR, Kruger-Stokke B, Bertilsson H, et al. Geometric distortion correction in prostate diffusion-weighted MRI and its effect on quantitative apparent diffusion coefficient analysis. Magn Reson Med. 2018;79(5):2524–32.
Article Google Scholar
Rosenkrantz AB, Chandarana H, Hindman N, Deng FM, Babb JS, Taneja SS, et al. Computed diffusion-weighted imaging of the prostate at 3 T: impact on image quality and tumour detection. Eur Radiol. 2013;23(11):3170–7.
Article Google Scholar
Klingebiel M, Ullrich T, Quentin M, Bonekamp D, Aissa J, Mally D, et al. Advanced diffusion weighted imaging of the prostate: comparison of readout-segmented multi-shot, parallel-transmit and single-shot echo-planar imaging. Eur J Radiol. 2020;130:109161.
Article CAS Google Scholar
Tamada T, Prabhu V, Li J, Babb JS, Taneja SS, Rosenkrantz AB. Assessment of prostate cancer aggressiveness using apparent diffusion coefficient values: impact of patient race and age. Abdom Radiol (NY). 2017;42(6):1744–51.
Article Google Scholar

Download references

Acknowledgements

None.

Funding

This study received funding by the National Natural Science Foundation of China (Nos. 81901845, 81671791), Science Foundation of Shanghai Jiaotong University Affiliated Sixth People’s Hospital (No. 201818), and Shanghai key discipline of medical imaging (No: 2017ZZ02005).

Author information

Authors and Affiliations

Department of Diagnostic and Interventional Radiology, Shanghai Sixth People’s Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, No. 600, Yi Shan Road, Shanghai, 200233, China
Lei Hu, Yue-hua Li & Jun-gong Zhao
MR Application Development, Siemens Shenzhen magnetic Resonance Ltd., Shenzhen, China
Caixia Fu
Department of Radiology, Xiangyang No.1 People’s Hospital, Hubei University of Medicine, Xiangyang, 441000, China
Xinyang Song
MR Application Predevelopment, Siemens Healthcare GmbH, Erlangen, Germany
Robert Grimm & Thomas Benkert
Innovation Owner Artificial Intelligence for Oncology, Siemens Healthcare GmbH, Erlangen, Germany
Heinrich von Busch
Digital Technology and Innovation, Siemens Healthineers, Princeton, NJ, USA
Ali Kamen & Bin Lou
Radboud University Medical Center, Nijmegen, Netherlands
Henkjan Huisman
New York University, New York City, NY, USA
Angela Tong
Charité, Universitätsmedizin Berlin, Berlin, Germany
Tobias Penzkofer
Eunpyeong St. Mary’s Hospital, Catholic University of Korea, Seoul, Republic of Korea
Moon Hyung Choi
Patero Clinic, Moscow, Russia
Ivan Shabunin
Universitätsspital Basel, Basel, Switzerland
David Winkel
Changhai Hospital, Shanghai, China
Pengyi Xing
Diagnostikum Graz Süd-West, Graz, Austria
Dieter Szolar
Oregon Health and Science University, Portland, OR, USA
Fergus Coakley
Loyola University Medical Center, Maywood, IL, USA
Steven Shea
Medical University of Gdansk, Gdansk, Poland
Edyta Szurowska
Clinical Research Center, Shanghai Sixth People’s Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, 200233, China
Jing-yi Guo
Department of Radiology, Renmin Hospital of Wuhan University, Wuhan, 430060, China
Liang Li

Authors

Lei Hu
View author publications
You can also search for this author in PubMed Google Scholar
Caixia Fu
View author publications
You can also search for this author in PubMed Google Scholar
Xinyang Song
View author publications
You can also search for this author in PubMed Google Scholar
Robert Grimm
View author publications
You can also search for this author in PubMed Google Scholar
Heinrich von Busch
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Benkert
View author publications
You can also search for this author in PubMed Google Scholar
Ali Kamen
View author publications
You can also search for this author in PubMed Google Scholar
Bin Lou
View author publications
You can also search for this author in PubMed Google Scholar
Henkjan Huisman
View author publications
You can also search for this author in PubMed Google Scholar
Angela Tong
View author publications
You can also search for this author in PubMed Google Scholar
Tobias Penzkofer
View author publications
You can also search for this author in PubMed Google Scholar
Moon Hyung Choi
View author publications
You can also search for this author in PubMed Google Scholar
Ivan Shabunin
View author publications
You can also search for this author in PubMed Google Scholar
David Winkel
View author publications
You can also search for this author in PubMed Google Scholar
Pengyi Xing
View author publications
You can also search for this author in PubMed Google Scholar
Dieter Szolar
View author publications
You can also search for this author in PubMed Google Scholar
Fergus Coakley
View author publications
You can also search for this author in PubMed Google Scholar
Steven Shea
View author publications
You can also search for this author in PubMed Google Scholar
Edyta Szurowska
View author publications
You can also search for this author in PubMed Google Scholar
Jing-yi Guo
View author publications
You can also search for this author in PubMed Google Scholar
Liang Li
View author publications
You can also search for this author in PubMed Google Scholar
Yue-hua Li
View author publications
You can also search for this author in PubMed Google Scholar
Jun-gong Zhao
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

LH, CF: Conceptualization, Methodology, Investigation, Visualization, Writing − original draft. XS: Methodology. AK, BL, HH, AT, TP, MC, IS, DW, PX, DS, FC, SS, ES: Methodology, Software, Formal analysis. JZ, LL: Resources, Supervision. RG, HB, TB: Resources, Writing − review & editing. JG, YL: Supervision, Formal analysis. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Jun-gong Zhao.

Ethics declarations

Ethics approval and consent to participate

This retrospective study was approved by the local ethics committee at our institution [Approve No: 2022-KY-073(K)]. As part of a prospective study aiming to build a robust AI system for PCa diagnosis, all the enrolled subjects signed the informed consent before they underwent the MRI examination and allowed us to use their data for a series of follow-up studies about AI system building for PCa diagnosis.

Consent for publication

Consent for publication have been obtained from that person whose images were contained in this article.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Hu, L., Fu, C., Song, X. et al. Automated deep-learning system in the assessment of MRI-visible prostate cancer: comparison of advanced zoomed diffusion-weighted imaging and conventional technique. Cancer Imaging 23, 6 (2023). https://doi.org/10.1186/s40644-023-00527-0

Download citation

Received: 16 November 2022
Accepted: 11 January 2023
Published: 17 January 2023
DOI: https://doi.org/10.1186/s40644-023-00527-0

Automated deep-learning system in the assessment of MRI-visible prostate cancer: comparison of advanced zoomed diffusion-weighted imaging and conventional technique

Abstract

Background

Methods

Results

Conclusions

Trial registration

Similar content being viewed by others

Performance of an ultra-fast deep-learning accelerated MRI screening protocol for prostate cancer compared to a standard multiparametric protocol

Does deep learning software improve the consistency and performance of radiologists with various levels of experience in assessing bi-parametric prostate MRI?

Predicting clinically significant prostate cancer with a deep learning approach: a multicentre retrospective study

Background

Methods

Participant selection

MRI examination

Histopathology matching and annotation

Deep learning-based computer-aided diagnosis

Evaluation of detection performance

DWI image analysis

Statistical analyses

Results

Participant and lesion baseline characteristics

Patient-based performance

Lesion-based detection performance

DWI image analysis

Risk factors evaluation

Discussion

Conclusions

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation