Checklist for Reproducibility of Deep Learning in Medical Imaging

Moassefi, Mana; Singh, Yashbir; Conte, Gian Marco; Khosravi, Bardia; Rouzrokh, Pouria; Vahdati, Sanaz; Safdar, Nabile; Moy, Linda; Kitamura, Felipe; Gentili, Amilcare; Lakhani, Paras; Kottler, Nina; Halabi, Safwan S.; Yacoub, Joseph H.; Hou, Yuankai; Younis, Khaled; Erickson, Bradley J.; Krupinski, Elizabeth; Faghani, Shahriar

doi:10.1007/s10278-024-01065-2

Original Paper
Published: 14 March 2024

Journal of Imaging Informatics in Medicine (2024)

Mana Moassefi¹,
Yashbir Singh¹,
Gian Marco Conte¹,
Bardia Khosravi¹,²,
Pouria Rouzrokh¹,²,
Sanaz Vahdati¹,
Nabile Safdar³,
Linda Moy⁴,
Felipe Kitamura⁵,
Amilcare Gentili⁶,
Paras Lakhani⁷,
Nina Kottler⁸,
Safwan S. Halabi⁹,
Joseph H. Yacoub¹⁰,
Yuankai Hou¹¹,
Khaled Younis¹²,
Bradley J. Erickson¹,
Elizabeth Krupinski¹³ &
Shahriar Faghani¹ (ORCID: orcid.org/0000-0003-3275-2971)

Abstract

The application of deep learning (DL) in medicine introduces transformative tools with the potential to enhance prognosis, diagnosis, and treatment planning. However, ensuring transparent documentation is essential for researchers to enhance reproducibility and refine techniques. Our study addresses the unique challenges presented by DL in medical imaging by developing a comprehensive checklist using the Delphi method to enhance reproducibility and reliability in this dynamic field. We compiled a preliminary checklist based on a comprehensive review of existing checklists and relevant literature. A panel of 11 experts in medical imaging and DL assessed these items using Likert scales, with two survey rounds to refine responses and gauge consensus. We also employed the content validity ratio with a cutoff of 0.59 to determine item face and content validity. Round 1 included a 27-item questionnaire, with 12 items demonstrating high consensus for face and content validity that were then left out of round 2. Round 2 involved refining the checklist, resulting in an additional 17 items. In the last round, 3 items were deemed non-essential or infeasible, while 2 newly suggested items received unanimous agreement for inclusion, resulting in a final 26-item DL model reporting checklist derived from the Delphi process. The 26-item checklist facilitates the reproducible reporting of DL tools and enables scientists to replicate the study’s results.
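
The content validity ratio mentioned above can be illustrated with a short sketch. It assumes the standard Lawshe definition, CVR = (n_e - N/2) / (N/2), where N is the number of panelists and n_e is the number who rate an item as essential; how the Likert responses were mapped to "essential" is not stated in this abstract, so that mapping is left out. The function names are hypothetical and chosen only for illustration.

```python
def content_validity_ratio(n_essential: int, n_panelists: int) -> float:
    """Lawshe's CVR: (n_e - N/2) / (N/2), ranging from -1 to 1."""
    if n_panelists <= 0:
        raise ValueError("n_panelists must be positive")
    if not 0 <= n_essential <= n_panelists:
        raise ValueError("n_essential must be between 0 and n_panelists")
    half = n_panelists / 2
    return (n_essential - half) / half


def min_essential_votes(n_panelists: int, cutoff: float) -> int:
    """Smallest number of 'essential' ratings whose CVR meets the cutoff."""
    for n_e in range(n_panelists + 1):
        if content_validity_ratio(n_e, n_panelists) >= cutoff:
            return n_e
    raise ValueError("cutoff cannot be reached with this panel size")


if __name__ == "__main__":
    panel_size = 11   # panel size reported in the abstract
    cutoff = 0.59     # CVR cutoff reported in the abstract
    for votes in (8, 9, 11):
        cvr = content_validity_ratio(votes, panel_size)
        print(f"{votes}/{panel_size} essential -> CVR = {cvr:.3f}")
    print("minimum votes to pass:", min_essential_votes(panel_size, cutoff))
```

Under these assumptions, an item rated essential by 8 of the 11 panelists has a CVR of about 0.45 and falls below the 0.59 cutoff, while 9 of 11 gives about 0.64 and passes.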

Abbreviations

DL: Deep learning
CVR: Content validity ratio

Acknowledgements

The study was organized through the Society for Imaging Informatics in Medicine (SIIM) Machine Learning Tools/Research and Education subcommittees.

Funding

This study received no funding.

Author information

Authors and Affiliations

Mayo Clinic Artificial Intelligence Laboratory, Department of Radiology, Mayo Clinic, 200 1st St SW, Rochester, MN, 55905, USA
Mana Moassefi, Yashbir Singh, Gian Marco Conte, Bardia Khosravi, Pouria Rouzrokh, Sanaz Vahdati, Bradley J. Erickson & Shahriar Faghani
Department of Orthopedic Surgery, Orthopedic Surgery Artificial Intelligence Laboratory, Mayo Clinic, Rochester, MN, USA
Bardia Khosravi & Pouria Rouzrokh
Department of Radiology and Imaging Sciences, Emory Healthcare, Emory University, Atlanta, GA, USA
Nabile Safdar
Department of Radiology, NYU Langone Health, New York, NY, USA
Linda Moy
DasaInova, Dasa, Universidade Federal de São Paulo, São Paulo, Brazil
Felipe Kitamura
San Diego VA Health Care System, San Diego, CA, USA
Amilcare Gentili
Department of Radiology, Thomas Jefferson University Hospital, Philadelphia, PA, USA
Paras Lakhani
Radiology Partners Research Institute, El Segundo, CA, USA
Nina Kottler
Department of Medical Imaging, Ann & Robert H. Lurie Children’s Hospital of Chicago, Chicago, IL, USA
Safwan S. Halabi
Department of Radiology, MedStar Georgetown University Hospital, Washington, DC, USA
Joseph H. Yacoub
Department of Electrical Engineering and Computer Science, Vanderbilt University, Nashville, TN, USA
Yuankai Hou
Philips Research North America, Cambridge, MA, USA
Khaled Younis
Department of Radiology and Imaging Science, Emory University School of Medicine, Atlanta, GA, USA
Elizabeth Krupinski

Contributions

Mana Moassefi and Shahriar Faghani were instrumental in the development and design of the study, wrote the initial draft, incorporated critical revisions of the manuscript, and organized the Delphi study. Yashbir Singh and Gian Marco Conte critically reviewed the draft of the manuscript. Pouria Rouzrokh, Bardia Khosravi, Sanaz Vahdati, Mana Moassefi, and Shahriar Faghani prepared the preliminary checklist. The remaining authors served as expert panelists who took part in the two rounds of the Delphi process and reviewed and commented on the manuscript.

Corresponding author

Correspondence to Shahriar Faghani.

Ethics declarations

Competing Interests

The authors declare no competing interests.

About this article

Cite this article

Moassefi, M., Singh, Y., Conte, G.M. et al. Checklist for Reproducibility of Deep Learning in Medical Imaging. J Imaging Inform Med (2024). https://doi.org/10.1007/s10278-024-01065-2

Received: 11 December 2023
Revised: 26 February 2024
Accepted: 28 February 2024
Published: 14 March 2024
DOI: https://doi.org/10.1007/s10278-024-01065-2
