Introduction

Dynamic contrast-enhanced MRI (DCE-MRI) of the breast has been widely used as a supplementary screening tool for breast cancer. Breast MRI can not only detect more breast cancer cases than mammography but also detect cancers at an earlier stage [1]. Especially for women with extremely dense breasts, screening with supplemental MRI has the potential to reduce interval cancers [2]. These advantages have led to a renewed interest in using breast MRI to screen a larger population [3]. However, cost-effectiveness is still the most substantial obstacle for the wider application of this sensitive modality [4].

The most promising approaches to reducing the cost of breast MRI are to improve the throughput of the MRI scanner by shortening the acquisition time [5,6,7,8] and to reduce radiologists’ workload by shortening the interpretation time [9]. Current diagnostic breast MRI protocols require up to 20 min. Several abbreviated protocols have been proposed to replace the standard protocol for screening [10, 11]. A recent multicenter, multireader study [12] found that time-resolved angiography with stochastic trajectories (TWIST) [13] alone can achieve comparable sensitivity (84% vs. 86%) and higher specificity (82% vs. 76%) compared with the full diagnostic protocol when interpreted by radiologists. This TWIST-alone protocol, requiring less than 2 min of magnet time, can thus minimize the time needed for the scanning process.

Image interpretation is another bottleneck in breast cancer screening with MRI. The average interpretation time in different studies varied from 25 to 178 s [11]. It is worth noting that the cancer rate in a screening study may be only 15.5 per 1000 [14], which suggests that radiologists spend most of their time reading normal scans without suspicious lesions. On the other hand, reading quality is also related to the total number of examinations and the position of the examination in the queue [15]. Short reading batches and risk-based reading queues may help further improve radiologists’ performance.

The combination of artificial intelligence (AI) and ultrafast MRI could help improve the efficiency of breast MRI screening by automatically excluding scans without lesions. Identifying suspicious lesions from numerous screening scans and prioritizing a scan according to risk could help reduce the workload and improve efficiency. In addition, an early stop strategy could also be applied to scans without suspicious lesions. Since malignant lesions are more likely to enhance rapidly at the early stage of DCE-MRI [16, 17], cancellation or adjustment of further sequences based on the output of ultrafast MRI could help reduce scanning time and thus improve the throughput. Moreover, based on the real-time analysis of the ultrafast sequences, additional scanning (e.g., T2, DWI) or even a full diagnostic protocol could still be performed if any abnormalities were detected.

We hypothesized that a deep learning model, with only TWIST sequences as input, might be able to identify normal MRI exams without human intervention. Integrating this deep learning system in the screening workflow could improve the throughput and reduce the radiologist’s workload. Therefore, the aim of this study was to develop and evaluate a deep learning model for automated abnormality prediction with only TWIST sequences as input.

Materials and methods

The institutional review board approved the study and waived the requirement to obtain informed consent for our retrospective study, which used fully anonymized reports and MRI examinations.

Study population

The initial population included 1447 breast MRI examinations from 809 consecutive patients who underwent breast MRI between April 2016 and October 2019 at our institution. Of the 1447 examinations, the following were excluded: 287 due to inconsistent protocols, 156 due to incomplete data, and 159 due to another indication for scanning (34 to assess response to chemotherapy, 94 for follow-up after surgery, and 31 to evaluate prosthesis rupture). Furthermore, 8 examinations were excluded due to failed scans. The final dataset for deep learning model development and evaluation consisted of 837 examinations from 488 patients. Of these 837 examinations, 178 examinations from 149 patients were obtained after deep learning model development and were used as an independent test set, since they were not involved in model development. The remaining 659 examinations from 339 patients were randomly divided into a training set (494 examinations from 214 patients) and a validation set (165 examinations from 125 patients). The data were divided at the patient level; thus, there was no overlap of patients between the training and test sets. Figure 1 summarizes this process.

Fig. 1
figure 1

Flowchart of the data collection and selection procedure. BI-RADS, Breast Imaging Reporting and Data System

MRI scanner and imaging technique

Examinations were performed with a full diagnostic protocol (Fig. 2) on either a 3.0-T or 1.5-T scanner (MAGNETOM Skyra or MAGNETOM Avantofit, Siemens Healthineers) in the prone position. For 3.0-T and 1.5-T scanners, the full protocol requires 17.95 and 19.61 min, respectively, while the 15 TWIST acquisitions require 1.3 and 1.46 min. The acquisition parameters for ultrafast breast MRI are summarized in Table 1.

Fig. 2
figure 2

Schematic diagrams of the timing of the dynamic contrast-enhanced protocols used in this study. The full diagnostic protocol consists of a fat-saturated T2-weighted sequence, 2 diffusion-weighted imaging sequences with b-values of 0 and 1000 s/mm², 5 dynamic fat-saturated gradient echo T1-weighted sequences, and a time-resolved angiography with stochastic trajectories (TWIST) sequence

Table 1 Acquisition parameters for ultrafast MRI

Reference standard

Classification of the MRI examinations was based on the assessments and conclusions in the radiology reports, supplemented with pathology reports, biopsy results, and ultrasound findings. For each patient, the left and right breasts were evaluated independently. Breasts with one or more visible enhancing lesions were classified as abnormal, while breasts with non-enhancing lesions or without suspicious lesions were classified as normal. All labels were then checked by a senior radiologist to ensure that they were consistent with lesion visibility on TWIST. Examples of classified breasts are shown in Electronic supplementary material Fig. S1.

Development of the MIP-based deep learning system

The proposed deep learning system had three main stages: breast region segmentation, MIP generation, and abnormality prediction (Fig. 3).

Fig. 3
figure 3

Schematic flowchart of the proposed breast DCE-MRI classification system. TWISTPre, precontrast TWIST sequence; TWISTN, Nth postcontrast sequence; MIP, maximum intensity projection; T1-wPre, precontrast T1-weighted sequence

For breast segmentation, a previously reported 3D U-Net [18] was used to generate a mask of the breast region. The segmentation was performed on a T1-weighted fat-suppressed sequence acquired before contrast agent injection. The obtained masks were then mapped onto the TWIST sequences by shape resizing and field of view (FOV) alignment, and the breast region was divided into left and right segments at the midline of the mask.
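The mask transfer and midline split can be illustrated with the following minimal sketch, which assumes that the T1-weighted and TWIST volumes share the same FOV and that the mask is available as a NumPy array; the nearest-neighbour resampling and function names are illustrative assumptions rather than the exact implementation used in this study.

```python
# Illustrative sketch: resample a breast mask to the TWIST grid and split it
# at the midline (assumes identical FOV, so a pure shape resize suffices).
import numpy as np
from scipy.ndimage import zoom

def map_mask_to_twist(mask_t1: np.ndarray, twist_shape: tuple) -> np.ndarray:
    """Resample a binary breast mask to the TWIST matrix size.

    Nearest-neighbour interpolation (order=0) keeps the mask binary.
    """
    factors = [t / m for t, m in zip(twist_shape, mask_t1.shape)]
    return zoom(mask_t1.astype(np.uint8), factors, order=0)

def split_left_right(mask: np.ndarray, lr_axis: int = -1):
    """Split the breast mask into left and right halves at the midline."""
    mid = mask.shape[lr_axis] // 2
    left, right = np.zeros_like(mask), np.zeros_like(mask)
    idx = [slice(None)] * mask.ndim
    idx[lr_axis] = slice(0, mid)
    left[tuple(idx)] = mask[tuple(idx)]
    idx[lr_axis] = slice(mid, None)
    right[tuple(idx)] = mask[tuple(idx)]
    return left, right
```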

At the MIP generation stage, only the last four of the fourteen postcontrast TWIST acquisitions were used. Previous research has shown that the contrast arrival time of benign lesions may be much longer than that of malignant lesions [19, 20]; consequently, most early-phase MIPs contain no enhancing lesions. Therefore, to capture as many enhancing lesions as possible while limiting the computational burden, only the MIP images generated from these last four acquisitions were used to train the deep learning model.
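As an illustration, the per-breast MIPs could be generated along the following lines, assuming the single-breast TWIST series is available as a 4D NumPy array (phases × slices × rows × columns) together with its binary mask; the projection axis and masking details are assumptions for demonstration purposes.

```python
# Illustrative sketch: one axial MIP per late postcontrast TWIST phase,
# restricted to the voxels of a single breast.
import numpy as np

def late_phase_mips(twist_phases: np.ndarray, breast_mask: np.ndarray,
                    n_last: int = 4) -> np.ndarray:
    """twist_phases: array of shape (n_phases, z, y, x), postcontrast phases
    breast_mask:  binary array of shape (z, y, x) for one breast
    Returns an array of shape (n_last, y, x): one 2-D MIP per late phase."""
    masked = twist_phases[-n_last:] * breast_mask[None]  # zero out non-breast voxels
    return masked.max(axis=1)                            # project along the slice axis
```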

A ResNet-34 model [21], pretrained on the ImageNet dataset, was modified and retrained for abnormality prediction. The output dimension of the last fully connected layer was changed to 2 to fit the binary classification task. The training data were used for transfer learning, and the validation data were used for hyperparameter tuning. The training target was the presence or absence of visible lesions in the MIP image. During training, image augmentation was applied with random horizontal flipping, random rotation within 10°, and random scaling within 10%. The batch size was set to 4, and the Adam optimizer was used. The final model was obtained after 60 epochs of training with an initial learning rate of 10⁻⁴. During inference, each of the 4 MIP images from a single breast was input into the deep learning model; if any of these images was predicted to be positive, the breast was categorized as abnormal. The breast was categorized as lesion free only when all 4 MIP images were predicted to be negative.
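As an illustration only, the model configuration, the stated augmentations and optimizer settings, and the breast-level decision rule described above could be implemented in PyTorch/torchvision roughly as sketched below; the framework choice and all details not listed above are assumptions.

```python
# Illustrative PyTorch/torchvision sketch of the described classifier and the
# breast-level "any MIP positive" decision rule (not the authors' exact code).
import torch
import torch.nn as nn
from torchvision import models, transforms

# ImageNet-pretrained ResNet-34 with a 2-class output layer
model = models.resnet34(weights=models.ResNet34_Weights.DEFAULT)
model.fc = nn.Linear(model.fc.in_features, 2)

# Augmentation as described: random horizontal flip, rotation within 10 degrees,
# scaling within 10%
train_transform = transforms.Compose([
    transforms.RandomHorizontalFlip(),
    transforms.RandomAffine(degrees=10, scale=(0.9, 1.1)),
    transforms.ToTensor(),
])

optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)  # initial learning rate 1e-4
criterion = nn.CrossEntropyLoss()
# ... standard training loop (60 epochs, batch size 4) omitted ...

@torch.no_grad()
def classify_breast(mip_batch: torch.Tensor, threshold: float) -> bool:
    """Breast-level rule: abnormal if ANY of the 4 MIPs exceeds the threshold.

    mip_batch: tensor of shape (4, 3, H, W) holding the four late-phase MIPs.
    """
    model.eval()
    probs = torch.softmax(model(mip_batch), dim=1)[:, 1]  # abnormal-class probability
    return bool((probs >= threshold).any())
```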

Model calibration and evaluation

To use the trained model to identify as many abnormal MRI examinations as possible, a probability threshold that yields a low false negative rate (FNR) is preferable. On the other hand, the effect of the false positive rate (FPR) on the workload in the screening workflow should also be considered. To illustrate the trade-off between FNR and FPR, a detection error trade-off (DET) curve was generated for the validation set. Thresholds corresponding to a sensitivity of 100% or 95% and a negative predictive value (NPV) above 98% on the validation set were then selected as high-sensitivity thresholds.
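For illustration, this calibration step could be implemented with scikit-learn’s det_curve as sketched below; the specific selection rule shown (the highest threshold that still meets the sensitivity and NPV constraints) is an assumption for demonstration purposes.

```python
# Illustrative threshold calibration on the validation set using the DET curve.
import numpy as np
from sklearn.metrics import det_curve

def pick_high_sensitivity_threshold(y_true, y_score,
                                    min_sens=0.95, min_npv=0.98):
    """Return the highest score threshold whose validation sensitivity and NPV
    both meet the given constraints (returns None if no threshold qualifies)."""
    y_true, y_score = np.asarray(y_true), np.asarray(y_score)
    fpr, fnr, thresholds = det_curve(y_true, y_score)
    best = None
    for t, fn_rate in zip(thresholds, fnr):
        sensitivity = 1.0 - fn_rate
        predicted_negative = y_score < t
        n_neg = predicted_negative.sum()
        npv = (y_true[predicted_negative] == 0).mean() if n_neg else 0.0
        if sensitivity >= min_sens and npv >= min_npv:
            best = t if best is None else max(best, t)
    return best
```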

To evaluate the prediction performance of the proposed deep learning system, receiver operating characteristic (ROC) curves on the independent test set were generated and the area under the ROC curve (AUC) was calculated. Sensitivity, specificity, positive predictive value (PPV), and NPV were also calculated for the default and high-sensitivity thresholds, respectively. Furthermore, to help explain the decision-making of the classification model, Gradient-weighted Class Activation Mapping (Grad-CAM) was used to produce a coarse localization map highlighting class-discriminative regions in each MIP image.
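A minimal sketch of the threshold-dependent test-set metrics, computed with scikit-learn, is given below (the Grad-CAM visualization is omitted here); the helper function is illustrative only.

```python
# Illustrative computation of AUC and the threshold-dependent metrics
# (sensitivity, specificity, PPV, NPV) on the independent test set.
import numpy as np
from sklearn.metrics import roc_auc_score, confusion_matrix

def evaluate(y_true, y_score, threshold):
    y_true, y_score = np.asarray(y_true), np.asarray(y_score)
    y_pred = (y_score >= threshold).astype(int)
    tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
    return {
        "AUC": roc_auc_score(y_true, y_score),
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "PPV": tp / (tp + fp),
        "NPV": tn / (tn + fn),
    }
```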

Strong background parenchymal enhancement (BPE) has been reported to be associated with higher abnormal interpretation rates and lead to higher rates of unnecessary biopsies [22]. The percentage of each category of BPE in false positive and false negative predictions was examined to illustrate the effect of BPE on the model output.

To evaluate the effect of the deep learning system on the clinical workflow, we simulated a scenario in which examinations with negative TWIST results required neither further scanning nor radiologist interpretation. The reduction in acquisition time and the percentage of excluded MRI examinations were calculated for this scenario.
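The simulated savings can be tallied along the following lines, assuming an examination is skipped only when both breasts are predicted negative; the per-examination times used below are placeholders, not the exact protocol times.

```python
# Illustrative accounting for the simulated triage scenario.
def triage_savings(breast_preds, exam_of_breast, full_min=18.3, twist_min=1.4):
    """breast_preds:   dict breast_id -> 0 (predicted normal) or 1 (abnormal)
    exam_of_breast: dict breast_id -> examination id
    full_min, twist_min: per-examination acquisition times in minutes (placeholders)."""
    exams = {}
    for breast, exam in exam_of_breast.items():
        exams.setdefault(exam, []).append(breast_preds[breast])

    n_breasts, n_exams = len(breast_preds), len(exams)
    neg_breasts = sum(1 for p in breast_preds.values() if p == 0)
    neg_exams = sum(1 for preds in exams.values() if not any(preds))  # both breasts negative

    return {
        "breast_workload_reduction": neg_breasts / n_breasts,
        "exam_workload_reduction": neg_exams / n_exams,
        "scan_time_saved_min": neg_exams * (full_min - twist_min),
        "scan_time_saved_frac": neg_exams * (full_min - twist_min) / (n_exams * full_min),
    }
```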

Statistical analysis

MedCalc (version 19.6.1, MedCalc Software Ltd) and scikit-learn (version 0.24.1; https://scikit-learn.org) were used for statistical analyses. The 95% confidence intervals (CIs) for the AUCs were computed with DeLong’s method [23]; 95% Clopper-Pearson CIs were reported for sensitivity and specificity, and 95% standard logit CIs [24] for PPV and NPV.
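As an example of the interval estimation, an exact Clopper-Pearson CI for a proportion such as sensitivity can be reproduced with open-source tooling (statsmodels is shown below as a stand-in for MedCalc; the counts are taken from the test-set results reported later).

```python
# Illustrative Clopper-Pearson (exact) 95% CI for a proportion,
# e.g. a sensitivity of 52/55 (~95%).
from statsmodels.stats.proportion import proportion_confint

low, high = proportion_confint(count=52, nobs=55, alpha=0.05, method="beta")
print(f"95% CI: {low:.2f} to {high:.2f}")  # roughly 0.85 to 0.99
```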

Results

Patients and lesions

The training and validation sets consisted of 339 patients (median age ± standard deviation, 44 ± 11 years; range, 22–80 years) who underwent 659 breast screening MRI examinations. Among these, 494 examinations were used for model training, and 165 were used for validation. The left and right breasts in each examination were classified separately, which resulted in the identification of 118 abnormal breasts (lesion size ± standard deviation, 17.9 ± 17.4 mm; range, 5.0–110.0 mm) and 1200 normal breasts. Eighty-four of the abnormal breasts contained benign lesions (lesion size ± standard deviation, 13.7 ± 12.8 mm; range, 5.0–81.0 mm), while the other 34 contained malignant lesions (lesion size ± standard deviation, 25.1 ± 19.8 mm; range, 6.0–110.0 mm).

The independent test set consisted of 149 patients (median age ± standard deviation, 44 ± 15 years; range, 24–76 years) who underwent 178 breast screening MRI examinations. Fifty-five breasts were classified as abnormal (lesion size ± standard deviation, 24.0 ± 19.8 mm; range, 5.0–76.0 mm), and 301 were classified as normal. Twenty-five of the 55 abnormal breasts contained benign lesions (lesion size ± standard deviation, 15.0 ± 16.1 mm; range, 5.0–75.0 mm), and 30 contained malignant lesions (lesion size ± standard deviation, 31.0 ± 19.4 mm; range, 6.0–76.0 mm). Detailed patient and lesion characteristics are provided in Tables 2 and 3.

Table 2 Patient and examination characteristics
Table 3 Description of lesions in the abnormal breasts

Model calibration

The DET curve for the validation set, which illustrates the trade-off between FPR and FNR as the threshold ranges from 0 to 1, is shown in Fig. 4. Two cutoff thresholds were selected based on the DET curve. With a threshold of 0.37, a sensitivity of 97% (30 of 31, 95% CI: 83%; 100%) and an NPV of 98% (123 of 124, 95% CI: 95%; 99%) were achieved. At this threshold, one breast with a benign lesion (chronic active inflammation with fat necrosis, 38 mm) was misclassified in the validation set, and no malignant lesions were missed. With a threshold of 0.25, a sensitivity of 100% (31 of 31, 95% CI: 89%; 100%) and an NPV of 100% (74 of 74) were achieved, with no lesions missed.

Fig. 4
figure 4

Detection error trade-off curve on the validation set. FPR, false-positive rate; FNR, false negative rate; DET, detection error trade-off

Independent test

On the independent test set, the model achieved an AUC of 0.81 (95% CI: 0.75; 0.88) (Fig. 5). With the threshold of 0.37, a sensitivity of 95% (52 of 55, 95% CI: 85%; 99%) and NPV of 97% (106 of 109, 95% CI: 92%; 99%) were achieved, while with the threshold of 0.25, a sensitivity of 98% (54 of 55, 95% CI: 90%; 100%) and NPV of 98% (55 of 56, 95% CI: 89%; 100%) were achieved. The classification performance with each threshold is summarized in Table 4.

Fig. 5
figure 5

Receiver operating characteristic curves on the validation and independent test sets. The area under the receiver operator characteristic curve on the validation set and independent test set was 0.82 (95% confidence interval: 0.74; 0.88) and 0.81 (95% confidence interval: 0.75; 0.88), respectively

Table 4 Performance of the model on the validation and independent test sets for different threshold settings

Heatmaps generated with Grad-CAM indicate that, for positive predictions, the model based its decision mainly on the enhanced regions in the breast parenchyma, whereas for negative predictions, the model’s focus was outside the breast parenchyma. Examples are shown in Fig. 6.

Fig. 6
figure 6

True positive, true negative, false positive, and false negative examples from the independent test set and corresponding heatmaps generated with gradient-weighted class activation mapping (Grad-CAM)

The percentage of each BPE level among the false predictions on the independent test set was also investigated. Of the false negative predictions, 1 had minimal BPE and 2 had moderate BPE; of the false positive predictions, 35.9% had minimal BPE, 30.7% had mild BPE, 25.6% had moderate BPE, and 5.1% had marked BPE.

Standard workflow vs. triage

When applying a threshold of 0.37 to the independent test set, 3 breasts with lesions were misclassified as normal by the model; one contained a malignant lesion (mucinous carcinoma, 8 mm, BI-RADS 6), while the other two contained benign lesions (one a fibroadenoma, 9 mm, BI-RADS 4; one not biopsied, 6 mm, BI-RADS 2). With the threshold of 0.25, only the breast with the fibroadenoma was misclassified as normal; no breasts with malignant lesions were missed.

Despite the possible risk of misclassifying breast lesions, with a threshold of 0.37, 109 breasts were triaged as normal and 247 as abnormal, resulting in a workload reduction of 30.6% (109 of 356) at the breast level and 15.7% (28 of 178) at the examination level. If the threshold was lowered to 0.25, 56 breasts were triaged as normal and 300 as abnormal, resulting in a workload reduction of 15.7% (56 of 356) at the breast level and 6.2% (11 of 178) at the examination level. Furthermore, 30.2% (982.2 of 3253.8 min) or 16.6% (538.8 of 3253.8 min) of scanner time could have been saved over the 178 examinations under the respective threshold settings if scanning were continued only when an abnormality was detected on ultrafast MRI.

Discussion

In this study, we combined clinical experience with artificial intelligence for the purpose of improving the efficiency and accessibility of breast MRI screening. A deep learning model was developed to identify normal ultrafast breast MRI examinations.

The model achieved an AUC of 0.81 (95% CI: 0.75; 0.88) on an independent test set. High sensitivity (95% and 98%) and high negative predictive values (97% and 98%) were obtained by applying different thresholds (0.37 and 0.25). When integrated into the workflow, the model has the potential to reduce radiologists’ workload by excluding normal scans and to improve throughput by reducing scanning time. Moreover, the heatmaps generated with Grad-CAM could also support radiologists’ image interpretation by indicating possible lesions in the MIP image.

Although a conservative strategy was adopted, there were still false negative predictions. All missed lesions were smaller than 10 mm, and their relatively small size may be the main reason the deep learning model did not detect them. One malignant lesion (a mucinous carcinoma) was missed when using the threshold of 0.37. However, it should be noted that there was only one mucinous carcinoma in the training dataset, and the scarcity of this rare cancer type might have left the model insufficiently trained to identify it. For false positive predictions, the percentages of minimal, mild, moderate, and marked BPE were 35.9%, 30.7%, 25.6%, and 5.1%, respectively. Compared with the BPE distribution in Table 2 (37.1% minimal, 30.9% mild, 27.0% moderate, and 5% marked), it is difficult to conclude that BPE had a negative impact on the classification of MIPs in TWIST. Meanwhile, 134 of the 195 false positive predictions were BI-RADS 2, and 113 were assessed in breasts with heterogeneous or extreme fibroglandular tissue (FGT). This finding indicates that proper handling of dense and BI-RADS 2 breasts may be key to reducing false positives in the future.

Similar models have been developed or evaluated in other studies on screening [25, 26]. Verburg et al [27] developed a classification model with 4581 MRI examinations of extremely dense breasts; the model could help exclude 39.7% of the MRI examinations without lesions and preserve 90.7% with lesions for radiologic review. Rodriguez-Ruiz et al [28] and Yala et al [9] showed that AI could help reduce mammogram screening workload by 17% or 19.3% with a sensitivity of 90.6% or 90.1%, respectively. Raya-Povedano et al [29] also reported a 29.7% workload reduction for tomosynthesis screening with a sensitivity of 84.1%. Even though the modality is different, the challenge of using AI in triaging is the same: a lower threshold is safer but less efficient, and the trade-off between the risk of missing breast cancer and the reduction of workload makes the threshold difficult to determine.

One of the limitations of our study is that the model was developed with a high-risk population dataset collected from a single institution, which may affect the generalizability of the findings. External validation with diverse populations is necessary before clinical implementation. Another limitation is that the cancer rates in the independent test set and in the training and validation sets were not equal. These two subsets were downloaded separately from the same picture archiving and communication system through a time-consuming acquisition process; this ensured independence but may have introduced discrepancies in the reported results. In addition, this study did not explore the real-world effect of the deep learning model in a triage workflow; a double-blind, randomized clinical trial may be necessary to further evaluate the performance of the model. Moreover, the proposed method used the 3D mask derived from T1-weighted fat-suppressed sequences, which may introduce systematic error; developing a TWIST-based segmentation method might further improve performance. Furthermore, the MIP images used in this study were generated only in the axial plane, and potential masking effects may prevent the deep learning model from achieving better performance. Evaluation of multiplanar MIPs may be a potential solution to address such masking effects.

In conclusion, the classification of ultrafast breast MRI examinations with a deep learning model in the workflow may be a promising method to improve the efficiency and accessibility of breast MRI screening. Reduced scanning and interpretation time could result in significantly lower breast MRI screening costs, making it possible to provide MRI screening for a wider population.