During the last decade, data science technologies such as artificial intelligence (AI) and radiomics have emerged strongly in radiologic research. Radiomics refers to the (automated) extraction of a large number of quantitative features from medical images [1]. A typical radiomics workflow involves image acquisition and segmentation as well as feature extraction and prioritization/reduction, all in preparation for its ultimate goal: predictive modeling [2]. This final step is where radiomics and AI typically intertwine, forming a gainful symbiosis.
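
For illustration, a minimal sketch of such a workflow, assuming the open-source pyradiomics and scikit-learn libraries, might look as follows; the cohort table, file paths, and number of selected features are hypothetical placeholders, not a prescription:

```python
# Sketch of a radiomics pipeline: feature extraction -> reduction -> modeling.
# "cohort.csv" is a hypothetical table listing, for each case, an image path,
# a segmentation path, and a binary outcome label.
import pandas as pd
from radiomics import featureextractor
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

cohort = pd.read_csv("cohort.csv")  # columns: image, mask, outcome
extractor = featureextractor.RadiomicsFeatureExtractor()  # default settings

rows = []
for _, case in cohort.iterrows():
    result = extractor.execute(case["image"], case["mask"])
    # keep the quantitative features, drop the "diagnostics_*" metadata
    rows.append({k: float(v) for k, v in result.items()
                 if k.startswith("original_")})
features = pd.DataFrame(rows)

# Feature prioritization/reduction followed by predictive modeling
model = make_pipeline(StandardScaler(),
                      SelectKBest(f_classif, k=10),
                      LogisticRegression(max_iter=1000))
model.fit(features, cohort["outcome"])
```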

In recent years, the field of medical imaging has seen a rising number of publications on radiomics and AI applications with increasingly refined methodologies [3, 4]. The formulation of best-practice white papers and quality criteria for publications on predictive modeling, such as the TRIPOD [5] and CLAIM [6] criteria, has substantially promoted this qualitative gain. Consequently, methodological approaches that advance the generalizability of predictive models are increasingly evident in recent publications, e.g., the careful composition of representative and unbiased datasets, the avoidance of data leakage, the use of (nested) cross-validation approaches for model development (particularly on small datasets), and the use of independent, external test samples. In this regard, the work of Song et al [7] on a clinical-radiomics nomogram for the prediction of functional outcome in intracranial hemorrhage, published in the current issue of European Radiology, is just one example of this general trend.
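
As an illustration of the nested cross-validation mentioned above, the following scikit-learn sketch separates hyperparameter tuning (inner loop) from performance estimation (outer loop), so that no test fold ever influences model selection; the data are synthetic stand-ins for a real imaging cohort:

```python
# Nested cross-validation: the inner loop tunes hyperparameters, the outer
# loop estimates generalization performance without data leakage.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, StratifiedKFold, cross_val_score

X, y = make_classification(n_samples=120, n_features=30, random_state=0)

inner_cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
outer_cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=1)

# Inner loop: hyperparameter search (here, regularization strength C)
clf = GridSearchCV(LogisticRegression(max_iter=1000),
                   param_grid={"C": [0.01, 0.1, 1, 10]},
                   cv=inner_cv, scoring="roc_auc")

# Outer loop: unbiased performance estimate of the whole tuning procedure
scores = cross_val_score(clf, X, y, cv=outer_cv, scoring="roc_auc")
print(f"nested CV AUC: {scores.mean():.2f} +/- {scores.std():.2f}")
```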

However, in contrast to the rising utilization and importance of predictive modeling in medical imaging research, these technologies have not yet been widely adopted in clinical routine. Besides regulatory, medicolegal, and ethical issues, one of the major hurdles to the broad use of AI and predictive models is a lack of trust in these technologies among medical practitioners, healthcare stakeholders, and patients. After more than a decade of scientific progress on AI and predictive modeling in medical imaging, we should now take the opportunity to focus our research on the trustworthiness of AI and predictive modeling in order to blaze a trail for their translation into clinical practice.

Several measures could enhance the trustworthiness of predictive models for clinical use. One of the main factors will be transparency about their reliability in real-world applications. Large multicentric prospective trials will be paramount to assess and validate the performance and, especially, the generalizability of predictive models in a robust and minimally biased fashion. Additionally, the benchmarking of AI tools by independent institutions on external, heterogeneous real-world data would provide transparency about model performance and enhance trust.

In general, trust in new technologies is strongly influenced by how comprehensible these techniques are to their users. In the field of predictive modeling, this topic is often discussed under the term "explainable AI," which is receiving increasing attention in current research [8]. Explainable AI seeks to unravel the "black-box" nature of many predictive models, including artificial neural networks, by making decision processes comprehensible, e.g., by revealing the features that drive their decisions. Trust in predictive models will therefore increase substantially when models are developed transparently and AI systems are made comprehensible.

Another issue with current AI tools is that they mainly constitute narrow AI, i.e., they address only one very specific task. We are currently miles, if not light-years, away from building true strong AI, that is, artificial intelligence with the capacity to learn any intellectual task that a human being can. However, building more comprehensive AI systems that solve multiple predictive tasks might enhance their trustworthiness for users. For example, a user might be inclined to reason along the lines of: "This system has a good track record in predicting the outcome of disease X, so it will likely also perform well in predicting the outcome of diseases Y and Z."

A further point that could increase the trustworthiness of AI systems is transparency about their level of confidence or uncertainty in a specific prediction. Currently, many predictive models in the recent literature yield hard classifications, i.e., they assign a case exclusively to one of two or more classes, for example, diseased vs. not diseased or good vs. unfavorable outcome. If the results of predictive models also included an indication of the certainty of a classification, model-based decisions would potentially be perceived as more genuine or human-like, which could increase their trustworthiness and also their applicability in a clinical setting [9]. Such probabilistic classification approaches can be realized, for example, with methods like probability calibration or fuzzy classifiers.

Additionally, the adjustment of pretrained models to local conditions should be considered more strongly in AI research. Individual fine-tuning of models, e.g., by applying techniques from domain adaptation and transfer learning [10], would allow for harmonization across different scanners, imaging protocols, and patient populations and would avoid biases between the data used to train a model and the data encountered at its site of use. If predictive models were tailored to the local domain of application in this way, their reliability and trustworthiness would clearly improve.

Last but not least, the seamless integration of AI into radiologic workflows will be vital for its wide utilization. Close-knit cooperation between researchers, developers, and vendors promoting the direct inclusion of predictive models into PACS and image-generating systems, as well as upcoming AI marketplaces, may strongly facilitate AI adoption.
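
To make the explainability point above more concrete, the following minimal sketch uses permutation importance, one simple model-agnostic way to reveal which features drive a model's decisions; the synthetic data stand in for a real radiomics feature table:

```python
# Permutation importance: shuffle each feature on held-out data and measure
# how much performance drops; large drops mark decision-driving features.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.inspection import permutation_importance
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=300, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

clf = RandomForestClassifier(random_state=0).fit(X_train, y_train)

result = permutation_importance(clf, X_test, y_test,
                                n_repeats=10, random_state=0)
for i in result.importances_mean.argsort()[::-1][:3]:
    print(f"feature {i}: importance {result.importances_mean[i]:.3f}")
```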
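The probability calibration mentioned above can likewise be sketched in a few lines with scikit-learn: here, sigmoid (Platt) scaling wraps an uncalibrated classifier so that it reports a confidence estimate instead of a hard label; again, synthetic data stand in for a real cohort:

```python
# Probability calibration: the model outputs a calibrated probability that
# can be surfaced to the reader as a level of confidence, not a hard label.
from sklearn.calibration import CalibratedClassifierCV
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.svm import LinearSVC

X, y = make_classification(n_samples=500, n_features=20, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Sigmoid (Platt) scaling maps the classifier's raw decision scores to
# probabilities via internal cross-validation.
calibrated = CalibratedClassifierCV(LinearSVC(), method="sigmoid", cv=5)
calibrated.fit(X_train, y_train)

proba = calibrated.predict_proba(X_test)[:, 1]
print(f"case 0: unfavorable outcome with probability {proba[0]:.0%}")
```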
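Finally, the site-specific fine-tuning discussed above can be outlined briefly as well: the sketch below freezes a pretrained backbone (torchvision's ResNet-18 serves merely as a stand-in for any pretrained imaging model) and retrains only the classification head on local data, here replaced by dummy tensors:

```python
# Transfer learning for local adaptation: freeze the pretrained backbone,
# retrain only a new classification head on data from the local site.
import torch
import torch.nn as nn
from torchvision import models

model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)

for param in model.parameters():        # freeze the pretrained backbone
    param.requires_grad = False

# New head, e.g., good vs. unfavorable outcome at the local institution
model.fc = nn.Linear(model.fc.in_features, 2)

optimizer = torch.optim.Adam(model.fc.parameters(), lr=1e-4)
criterion = nn.CrossEntropyLoss()

# One illustrative training step on a dummy batch standing in for local data
images = torch.randn(8, 3, 224, 224)    # placeholder local images
labels = torch.randint(0, 2, (8,))      # placeholder local labels
optimizer.zero_grad()
loss = criterion(model(images), labels)
loss.backward()
optimizer.step()
```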

In conclusion, the time is ripe to focus research on the translation of predictive modeling into clinical practice and on approaches to enhance its trustworthiness in a clinical context. The prophecy of AI as a game-changer for radiology is already ubiquitous; it is now up to us to make it happen.