Introduction

In recent years, artificial intelligence (AI) has rapidly entered diagnostic imaging, demonstrating considerable potential both as a catalyst of the workflow and as an aid to image interpretation, and becoming a promising engine of decision support systems in radiology [1]. The steady growth of AI in medical imaging is driven not only by the widespread availability of large data sets and advances in both hardware and software, but also by the need for greater efficiency in clinical care and management. By combining the quantitative image data provided by radiomics with AI tools, radiology gains diagnostic, predictive, and prognostic applications [2]. The key AI technologies shaping the future of radiologists include image processing, computer vision, and natural language processing, among others [3]. Moreover, growing evidence indicates that AI algorithms provide support at all levels of radiology workflow management for a variety of non-diagnostic applications, such as quality, safety, and operational efficiency [1]. The integration of AI into the imaging workflow has the potential to enhance efficiency, minimize errors, and meet specific goals with minimal human intervention [4]. However, owing to the “black-box” nature of AI models, physicians often perceive them as less trustworthy, which has limited their implementation in real-world clinical settings [5]. To address this issue, the field of Explainable Artificial Intelligence (xAI) has been developed, with the goal of improving the interpretability of AI decisions. The focus of xAI is to create new techniques and algorithms that increase the transparency of the decisions made by algorithms and predictive models, clarifying their reliability and the impact of each feature on the outcome [6].

This white paper of the Italian Society of Medical and Interventional Radiology (SIRM) is intended to help radiologists, medical practitioners, and scientists understand the emerging field of xAI: to raise awareness of the black-box problem behind the success of AI, to increase knowledge of the xAI methods that can turn the black box into a glass box, and to clarify the role and responsibilities of radiologists in the appropriate use of AI technology.

The clinical use of AI and the problem of the black-box

Currently, two primary AI methods are commonly employed in radiology. The first relies on handcrafted, engineered features, such as radiomics features, which are used as inputs to machine learning models trained to perform various clinical decision-making tasks [7]. The second, based on deep neural networks or deep learning (DL), has gained significant attention in the last decade [8, 9].

There are three primary types of machine learning algorithms: supervised learning, unsupervised learning, and reinforcement learning. In supervised learning algorithms, such as linear and multivariate regression, logistic regression, Naive Bayes, decision trees, k-nearest neighbors, and linear discriminant analysis, the input data are labeled. In contrast to supervised learning, unsupervised learning does not require labeled data; clustering analysis, anomaly detection, hierarchical clustering, and principal component analysis are representative unsupervised learning algorithms. Reinforcement learning is a more advanced approach in which an agent learns to solve multi-step problems through trial and error [7]. DL is a relatively new area of study. While traditional machine learning techniques rely on statistical methods to recognize patterns, DL is inspired by the human brain and is best known for its neural network models. A deep neural network typically consists of three types of layers: the Input Layer, the Hidden Layer, and the Output Layer (Fig. 1).

Fig. 1
figure 1

The architecture of the deep neural network consisting of the input layer, the hidden layer and the output layer

The Input Layer receives the input data, the Hidden Layer performs various computations on those data, and the Output Layer produces the final result. It is important to note that a neural network can have multiple hidden layers, allowing for more complex computations and predictions. One of the advantages of DL algorithms is their ability to learn characteristic attributes from data automatically, with no requirement for human experts to define them beforehand. With sufficient amounts of example data, DL models can identify abnormalities in tissue without the need for human-defined segmentations, which allows for more abstract feature definitions and improves generalizability. DL's ability to learn complex data representations often makes it robust against unwanted variations, such as inter-reader variability, and further enables it to be applied to a wide range of clinical conditions and settings [7]. Table 1 summarizes the main advantages and disadvantages of machine learning and DL methods.

Table 1 Advantages and disadvantages of machine learning and deep learning
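To make the layered structure described above concrete, the following minimal sketch (assuming the PyTorch library; the layer sizes, data, and class labels are arbitrary placeholders, not a specific clinical model) defines a small network with an input layer, two hidden layers, and an output layer.

```python
# A minimal sketch of a deep neural network, assuming PyTorch;
# layer sizes and data are illustrative placeholders only.
import torch
import torch.nn as nn

model = nn.Sequential(
    nn.Linear(64, 32),  # input layer (64 features, e.g., radiomics) -> first hidden layer
    nn.ReLU(),
    nn.Linear(32, 16),  # first hidden layer -> second hidden layer
    nn.ReLU(),
    nn.Linear(16, 2),   # second hidden layer -> output layer (e.g., benign vs. malignant)
)

x = torch.randn(8, 64)   # a batch of 8 example feature vectors
logits = model(x)        # forward pass through all layers
print(logits.shape)      # torch.Size([8, 2])
```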

DL tools can generate highly reliable outcomes, yet they have an intrinsic “opacity”: although not entirely opaque, their behavior can be difficult to comprehend. Even experts at the highest level may struggle to fully understand these so-called “black-box” models, and the reasoning by which the models reach their predictions remains difficult to trace in areas that are critical and relevant to our society, including healthcare information technology and medical imaging [10]. This opacity, or inexplicability, of AI is the main source of distrust among medical professionals and patients towards the new technology [11]. It creates an obstacle to practical application, particularly in sensitive fields where automation affects human life and well-being, as is especially the case in healthcare. Applying AI to the field of medicine poses significant challenges: medical decision-making typically involves uncertainty, incomplete and noisy data sets, and a high level of complexity [12]. As a result, transparency of AI models is particularly crucial in medical care, precisely because of this inherent uncertainty. While humans may not always be able to explain their reasoning, understanding how an AI model makes decisions can provide confidence in human–machine interactions [13]. With an increasing focus on incorporating ethical standards into AI technology design and implementation, there is a growing demand for “Trustable AI,” a term that, with slight conceptual modification, may encompass Valid AI, Responsible AI, Privacy-Preserving AI, and Explainable AI (xAI). In this context, xAI aims to shed light on the cardinal aspects of the decision-making process for both humans and machines [10].

What does explainable AI mean?

xAI is an emerging field with several new strategies and multiple ongoing studies that have a significant impact on the development of AI in many different areas. Van Lent et al. first introduced the concept of xAI to describe their system's ability to explain AI-based predictions [14]. Although the term has been applied inconsistently, it generally refers to a class of systems that can shed light on how an AI system arrives at its decisions [15]. xAI investigates the reasoning behind the decision-making process, outlines the system's strengths and weaknesses, and anticipates the future behavior of the model [10].

Thus far, xAI may be considered an umbrella term covering several related concepts [10, 16], including:

  • Interpretability: the extent to which the end user can understand the output of the algorithm.

  • Explainability: clarifying how a decision was reached so that a broader range of users can understand it.

  • Transparency: the degree to which the model is comprehensible.

  • Justifiability: providing an in-depth case to support a given conclusion.

  • Contestability: the ability of users to challenge a particular decision.

In AI, there is often a negative association between the complexity or depth of a system and its interpretability. This inherent tension between predictive accuracy and explainability frequently results in the most accurate methods (such as DL) being the least transparent, while the most interpretable methods (such as decision trees) are less accurate [17]. It is essential to strike a balance between the performance of a model and its interpretability, as the former will markedly improve patient care, while the latter will enhance the adoption of, and trust in, AI in radiological practice [16].

Ethical, legal, and social issues (ELSI) of xAI

The pursuit of transparent and explainable AI in recent years has not only sparked significant research efforts in the field but has also become a central focus of many ethical and responsible design proposals [5, 11, 18]. Additionally, people often express concerns about privacy and security when it comes to AI technologies [19]. The need for greater clarity and transparency has been recognized by various institutions. The European Commission has produced a white paper aimed at creating a regulatory framework for a digital ecosystem of trust in reliable AI, in which transparency and explainability are identified as fundamental ethical requirements. In the Ethics Guidelines for Trustworthy AI, drawn up by the High-Level Expert Group on AI of the European Union, the right to “require an adequate explanation of the decision-making process” is stated whenever AI “significantly affects people's lives” [20].

When human intelligence fails with significant consequences, the appropriate best practice is to find the root causes, make improvements, and learn from the mistakes. When AI fails, it is equally important to acknowledge the failure and, increasingly, there is a demand for an explanation of what went wrong in the AI decision-making algorithm [21]. The practical outcome is to establish accountability in both the legal and the social sense. Without a clear assignment of liability, it is unlikely that AI can be widely implemented in real-world situations, and unforeseen legal challenges with significant implications may arise [21]. However, addressing only the ethical or legal concerns surrounding AI may not be sufficient. All Ethical, Legal, and Social Issues (ELSI) of AI deserve equal attention and should certainly be addressed ahead of AI and xAI implementation in healthcare, as the aim of an ELSI reflection is to provide decision-makers and stakeholders with a comprehensive understanding of the ethical, legal, and social issues associated with a particular technology or practice [22].

In recent times, explaining the output of AI systems has become a crucial issue, not just technically but also legally and politically. There is a general belief that explainable AI systems are ethically desirable and possibly even legally necessary, which has driven much research in this area [23]. The question of transparency has been given significant attention in regulatory proposals at the EU level, particularly in the proposed Artificial Intelligence Act (AIA). However, discussions and consultations around regulating AI systems are ongoing, and the obligations for explainability under existing regulations and future policies are still being debated [18].

Solutions to the black box?—explainable AI models

Explainability methods are being recommended, both in the research setting and in legal communities, as a practical means to increase transparency and to address discrimination in AI models [24].

Several proposals to classify xAI techniques have been advanced so far, based on three fundamental dimensions [25]:

  • the implementation stage of the technique (ante-hoc or post-hoc)

  • the scope of the explanation (a global explanation of the model or a local explanation of a single prediction)

  • whether the technique is model-specific or model-agnostic

Figure 2 summarizes this simplified classification of xAI techniques in a diagrammatic view.

Fig. 2
figure 2

The diagrammatic view of the classification of xAI techniques

Broadly speaking, two types of explainable AI models can be distinguished: post-hoc explainability, applied after the model has been trained, and ante-hoc (or inherent) explainability, built in before training [12]. Post-hoc xAI involves the use of external explainers to interpret a trained model's behavior during testing. In contrast, ante-hoc xAI incorporates explainability into the AI model's structure from the outset, prioritizing natural understandability while still striving for optimal accuracy during training. Essentially, ante-hoc approaches consider a model's explainability throughout its development, whereas post-hoc approaches merely explain the model's behavior after it has been trained [12, 26].

The explainability of machine learning models is generally feasible when models rely on input data that are easily quantifiable and interpretable. Some algorithms, such as decision trees, sparse linear and additive models, or Bayesian classifiers, are designed with a limited number of internal components, thus allowing the inspection of the model's prediction and/or classification operations. These models provide traceability and transparency in their decision-making [25]. However, in modern AI algorithms, models and data are often complex and high-dimensional, making them difficult to explain through a simple relationship between inputs and outputs. DL models, for example, are a category of machine learning algorithms that sacrifice understandability for prediction and/or classification accuracy [25]. DL frameworks are used in applications such as speech and image recognition, natural language processing, and the analysis of complex image and sound data. Explainability techniques for these “black-box” models are therefore post-hoc: they approximate the DL black box with simpler interpretable models and, in doing so, allow the black box to be explored and explained [12]. The main aim of these xAI techniques is to turn “black-box” models into more transparent and interpretable, “glass-box” models [25]. The scope of an explanation can be either global or local, with global explanations aiming to make the whole inferential process transparent and intelligible, while local methods aim to explain individual predictions in terms of feature attributions [26].
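As a concrete illustration of such an inherently traceable model, the following sketch (assuming scikit-learn; the synthetic data and feature names are hypothetical placeholders) trains a shallow decision tree and prints its decision rules, which can be inspected step by step.

```python
# A minimal sketch of an inherently interpretable model, assuming scikit-learn;
# the synthetic data and feature names are hypothetical placeholders.
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier, export_text

X, y = make_classification(n_samples=200, n_features=4, random_state=0)
tree = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X, y)

# The full decision path can be printed and inspected rule by rule,
# which is exactly the traceability that black-box models lack.
print(export_text(tree, feature_names=[f"feature_{i}" for i in range(4)]))
```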

The most common form of post-hoc explainability in medical imaging is the heat map or saliency map, which highlights the contribution of each image region to the decision [27]. Despite the relative immaturity of the field, these maps are not the only instruments available to xAI users: in medical imaging, other approaches have already been adopted successfully, including feature visualization and prototype-based comparison methods. More general post-hoc explanation methods suitable for complex medical imaging data include Local Interpretable Model-agnostic Explanations (LIME) and SHapley Additive exPlanations (SHAP). LIME explains individual decisions by perturbing the input sample and fitting a simple local surrogate model, while SHAP generates explanations by measuring the contribution of each feature to a specific prediction. LIME and SHAP are generic and applicable to various types of healthcare data, not only medical imaging, and are commonly used to provide explanations for complex models in the healthcare domain [27, 28].
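As an illustration of how such a saliency map can be produced, the following sketch computes a simple gradient-based saliency map, assuming a generic pre-trained PyTorch classifier; the names model and image are hypothetical placeholders, and real applications typically use more refined variants (e.g., Grad-CAM).

```python
# A minimal sketch of a gradient-based saliency map, assuming PyTorch.
# `model` is a hypothetical pre-trained classifier and `image` a pre-processed
# slice of shape (1, 1, H, W); both are placeholders, not a specific product.
import torch

def saliency_map(model: torch.nn.Module, image: torch.Tensor) -> torch.Tensor:
    model.eval()
    image = image.detach().clone().requires_grad_(True)  # track gradients w.r.t. the pixels
    scores = model(image)                                # forward pass: class scores
    top_class = scores.argmax(dim=1).item()
    scores[0, top_class].backward()                      # back-propagate the winning class score
    # The absolute input gradient indicates how strongly each pixel influenced the decision.
    return image.grad.abs().squeeze()
```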

Post-hoc xAI methods can be model-specific or model-agnostic (Fig. 2). Model-specific methods reshape DL models so as to incorporate interpretability into the structure and learning mechanisms of the model itself, whereas model-agnostic methods operate only on the inputs and outputs of the black-box model to handle explainability and derive explanations. Ante-hoc methods, by contrast, focus on making the model transparent from the start, which is why they are intrinsically model-specific. Model-agnostic methods can ignore the internal elements of a model and can therefore be applied to any learning approach, while model-specific methods are limited to a particular subset of models [26]. Figure 3 shows a possible workflow of the different AI models, black box, post-hoc xAI, and ante-hoc xAI, applied to a specific input (a solid renal mass); a minimal model-agnostic sketch follows the figure.

Fig. 3
figure 3

A possible workflow of different Artificial Intelligence (AI) models, black box, post-hoc Explainable AI (xAI), and ante-hoc xAI, applied to a specific input (a solid renal mass). The output (renal tumor) is obtained in all three models, but with no explanation in the black-box model and with different approaches to explainability in the post-hoc and ante-hoc xAI models
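As referenced above, the following sketch illustrates a model-agnostic, post-hoc explanation: permutation importance queries a trained model only through its inputs and outputs, so it can be applied to any classifier. It assumes scikit-learn, and the synthetic data stand in for radiomics features; all names are illustrative.

```python
# A minimal sketch of a model-agnostic post-hoc explanation (permutation importance),
# assuming scikit-learn; the synthetic data stand in for radiomics features.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.inspection import permutation_importance

X, y = make_classification(n_samples=200, n_features=6, random_state=0)
model = RandomForestClassifier(random_state=0).fit(X, y)

# Shuffle each feature in turn and measure how much the accuracy drops:
# a large drop means the model relies heavily on that feature.
result = permutation_importance(model, X, y, n_repeats=20, random_state=0)
for idx in result.importances_mean.argsort()[::-1]:
    print(f"feature_{idx}: {result.importances_mean[idx]:.3f}")
```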

Explainable AI in radiology

Where and how much is explainable AI relevant in radiology

In 2021, van Leeuwen K.G. and colleagues published an interesting article analyzing 100 commercially available AI radiological products and the related scientific evidence [29]. The paper underlines that, although all the analyzed products were CE-marked, only a few had related studies on clinical impact (18/100), and only 36/100 had peer-reviewed efficacy papers published. One aspect of the paper worth commenting on is its emphasis on transparency, even though a dedicated xAI section is lacking. The authors, however, provide an up-to-date online tool for exploring approved AI products (www.aiforradiology.com), where it is possible to search for a specific AI tool and find related information on how it works, its trustworthiness, and its clinical efficacy. This example highlights one of the main problems concerning AI in radiology and its explainability: although a large amount of CE-marked software has already been released and is in use, only a few products have been analyzed with regard to their explainability. The xAI problem may be marginal for AI software that performs individual tasks, such as segmentation or lesion detection, where radiologists can check and modify the AI output before signing a report. However, for more complex tasks that combine different medical areas and yield results in terms of prognosis or therapeutic strategies, based on different AI approaches, will radiologists be able to interpret the output critically? In this context, the black-box approach lacks trustworthiness, and xAI is needed to give radiologists, other specialists, and patients the essential tools to integrate AI software into real life. Of course, this process is not easy, and it will take time to be assimilated and integrated into the clinical use of AI.

In addition, it is unrealistic to expect radiologists to have all the knowledge needed to understand xAI fully; nevertheless, some effort is necessary to acquire the fundamental principles of xAI, to improve the transparency and explainability of AI software with decision-making potential in the medical field. Moving to a different radiological topic makes this concept explicit. For instance, not all radiologists know every component or working principle of Magnetic Resonance Imaging (MRI), even though they interpret MRI examinations for clinical purposes. However, radiologists who use MRI as a diagnostic tool are aware of the ghosting artifact mechanism, which prevents them from misinterpreting an aortic pulsation projected onto the liver parenchyma as a lesion. By analogy, it seems important for AI applications in radiology that radiologists can understand how the outputs are generated, to reduce the risk of a “dogmatic medicine” far removed from the “evidence-based approach” that now drives progress in science [5, 30].

For this reason, all the AI tools available in the radiology field, such as segmentation, detection, lesion characterization, and prognosis, require attention from end users regarding access to the neural network's data processing, even after the output has been obtained. Of course, this extremely challenging task also deserves attention and collaboration from the other actors in the process (data scientists, developers, engineers, product companies, etc.) to improve the integration of AI tools into clinical practice through xAI, avoiding conscious or unconscious errors that would damage patients' health or their trust in AI. Nevertheless, pushing xAI towards extreme transparency and explainability has an intrinsic limit: with increasing transparency, interpretability, and explainability comes the risk of reducing the performance of algorithms based on true deep learning. Therefore, once the benefits and limitations of xAI in radiology are clear, we need to start implementing this process on a large scale of users, to test the benefits of AI in clinical practice and to adapt the process itself to reality.

An interesting approach to the evaluation of the explainability of an AI system is the so-called Z-Inspection, an initiative to assess trustworthy AI in practice [31, 32]. The Z-Inspection procedure has three main phases: (1) the Set-Up phase, during which the necessary preconditions are clarified, the team of investigators is defined, the boundaries of the assessment are delineated, and a protocol is created; (2) the Assess phase, during which the use of the AI system is inspected and the potential ethical, technical, and legal issues are identified and mapped onto the ethical values and requirements of trustworthy AI; (3) the Resolve phase, which addresses the issues raised, in the sense of possible ethical tensions, and recommends appropriate procedures.

Adopting the Z-Inspection process is important when settling on an AI system for clinical practice, since it follows the Assessment List for Trustworthy Artificial Intelligence (ALTAI) outlined in the Ethics Guidelines for Trustworthy AI [33].

Implications of xAI for the radiological profession

The large-scale benefits potentially derivable from AI in medical care are enormous, in terms of process optimization, personalized treatment, and technical advances, but these scenarios are far from being realized if possible drawbacks are not recognized and corrected properly [5]. Being aware of these aspects and of the topicality of the theme is central to preserving the rigor of the medical process as we have built it up to now. Indeed, AI is gaining ground not only in research but also in clinical practice, as mentioned above. In this context, xAI plays a bridging role between the rapid development of AI and its use in practice, particularly in the radiological profession. Being conscious of xAI therefore implies some changes in the profession itself, to avoid a catastrophic epilogue such as the one hypothesized by the AI pioneer Geoffrey Hinton in 2016: “People should stop training radiologists now. It's just completely obvious that within 5 years deep learning will do better than radiologists.” Fortunately, the process of AI implementation has proved more complex than expected and the role of radiologists remains fundamental; on the other hand, the profession will need adaptation and modification to be part of the paradigm shift. An important role of radiologists will be, as has happened in the past, to expand their knowledge and merge it with prior expertise. Since its beginnings, radiology has faced a multitude of technological changes and consequent adaptations in rapid succession; an emblematic example is the short interval between Roentgen's description of X-rays and their clinical application [34]. Therefore, one of the main implications of xAI for radiologists is to keep expanding their knowledge in this field, to become confident with a new topic strictly related to medicine and in particular to radiology, and to improve trustworthiness for themselves and for patients. A translational approach is more than ever required in the medical disciplines to enhance the benefits of progress and minimize potential drawbacks. Within these considerations, two further aspects need to be highlighted. First, how should the time freed by the automation of processes currently carried out by the radiologist be used? Second, how should knowledge of xAI be implemented in radiological practice and during radiology training?

Radiologists' working schedules will probably evolve towards solving more complex cases, where uncertainties or atypical situations make AI applications less reliable, and towards more multidisciplinary meetings in which all the information derived from different AI tools is merged. Indeed, as the professional figures most attuned to technology, radiologists should play a central role in explainability and transparency over the next few years.

In addition, it is emerging that radiologists in training suffer from a lack of adequate education regarding AI [35]. An interesting reflection is provided by Forney et al. [36] on the minimum knowledge acceptable for radiologists in training to give them the tools necessary to interpret AI in terms of the inputs and outputs produced. In this way, new generations of radiologists will be able to assess AI tools critically and be aware of the many biases present in this new entity (e.g., prevalence bias, automation bias, detection bias, negative set bias). It will soon be desirable to ensure a basic, standardized, comprehensive education on AI and xAI during radiology training, so that the new generation of radiologists does not get lost on the path of integrating AI into their discipline but, on the contrary, becomes conscious and critical users of it [37].

The responsibility of the radiologist

Together with the great hype around the blooming of AI, the role of the radiologist carries additional responsibilities concerning the various steps of the AI workflow. One of the prerequisites for training AI systems is access to a huge amount of data, in this case imaging data serving as ground truth. The first concern regarding access to these medical data relates to data ownership and informed consent. It is critical to establish, according to each country's laws, who is the final owner of these data, and dedicated community laws are necessary to support physicians in this direction [38]. This aspect is also very sensitive, especially since private companies might use such data to develop AI tools that will soon afterwards generate profit [39]. Strictly linked to data ownership is another important aspect that radiologists need to know and consider: patients' privacy and informed consent. For privacy protection, it is essential that the data fed into the system are anonymized or pseudo-anonymized so that they cannot be traced back to individual patients. This aspect is deeply connected with the role of radiologists. Before sharing data, radiologists need to know which data can be traced back to a single person and which data must be retained to preserve the efficiency of the AI system: examples of data protection are using the pseudo-anonymized patient's age instead of the more traceable date of birth, or preferring systems that prevent the facial recognition obtainable from a volumetric reconstruction of the head and neck [40].
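As a purely illustrative example of this kind of pseudo-anonymization, the following sketch (assuming the pydicom package; the file paths and the set of cleared tags are hypothetical and not a legally sufficient de-identification profile) keeps coarse clinical information such as age while clearing direct identifiers.

```python
# A minimal sketch of DICOM pseudo-anonymization, assuming pydicom;
# paths and the list of tags are illustrative, not an exhaustive profile.
import pydicom

ds = pydicom.dcmread("study/slice_001.dcm")   # hypothetical input file

age = ds.get("PatientAge", "")                 # keep coarse, clinically useful information

# Clear direct identifiers that could trace the image back to the patient.
for keyword in ("PatientName", "PatientID", "PatientBirthDate",
                "InstitutionName", "ReferringPhysicianName"):
    if keyword in ds:
        ds.data_element(keyword).value = ""

ds.remove_private_tags()                       # drop vendor-specific identifying tags
ds.PatientAge = age                            # retain the pseudo-anonymized attribute
ds.save_as("study_deid/slice_001.dcm")         # hypothetical output path
```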

Once these aspects have been managed, another fundamental step needs the radiologists' attention. An essential step in ensuring the sound development of AI tools concerns the clinical question and, consequently, the type of data that will be used to train the systems. This helps to reduce potential pitfalls that might affect the development and subsequent use of AI tools, even when they are built with an xAI approach. To reduce biases that might affect the outputs, xAI will help radiologists and AI developers choose which data are useful for training the software to answer the clinical question.

In parallel, radiologists also need to be aware of data labeling. Careful annotation of the imaging data used for training, validation, and testing plays a central role in AI tool development. The definition of the ground truth also deserves careful consideration: for some pathologies a single radiological modality is sufficient to establish the diagnosis (e.g., chest CT for pneumothorax), while other abnormalities require a different imaging modality, imaging follow-up, or support from other specialties (e.g., atypical pneumonia to be confirmed with a second CT after medical therapy). This issue intrinsically carries the risk of weakening AI training, because in real life the imaging findings alone are not sufficient for the final diagnosis, so the risk of higher uncertainty or error for the algorithm is high [39].

Another important responsibility of radiologists is transparency towards both AI developers and patients: xAI will support these aspects, improving trustworthiness and the use of AI in the clinical setting. Alongside transparency, another crucial aspect needs consideration: responsibility for the medical diagnosis. There is a wide debate about the final responsibility for AI tools, but the prevailing view is to consider the radiologist who uses the AI support as ultimately responsible [11, 39]. Of course, this position is more acceptable with xAI than with black-box models, and many steps still need to be taken to consolidate it and to ensure protection for both radiologists and patients.

All the above considerations are important to reduce the possibility of pitfalls in the use of AI tools in radiology, even when xAI is applied. Another aspect to consider is the current limit of xAI: it cannot yet be applied to every AI tool, and explainability does not coincide with high-level decision-making in every approach to medical practice. Radiologists need to be aware of these limitations to avoid potential biases in the use of xAI [27].

Finally, the most important responsibility for radiologists, encompassing all the others, is to remain critical of the software itself, of the AI developers, and of all the users. Only through constructive critical collaboration among all the professional figures involved, and with patients, will it be possible to improve the understanding of the benefits and limits of specific AI tools [41].

Recommendations to adopt explainable AI in radiology

  • The incorporation of xAI algorithms and the inclusion of explanatory components should be carefully considered in the development of any high-risk AI system to be used in healthcare and particularly in radiological practice.

  • Patients' informed consent should be drafted in the clearest and most intelligible manner, considering as many xAI concepts as possible, including Interpretability, Explainability, Transparency, Justifiability, and Contestability.

  • Anonymization or pseudo-anonymization of patient data is of utmost importance to comply with current EU regulations. The radiologist can play a key role in making all parties aware of both the importance of sharing data for building biobanks to train artificial intelligence models and the protection of sensitive information.

  • Data ownership should be carefully considered before sharing data. This aspect includes important ethical considerations if a profit corporation is involved in the process. Transparency of the entire process is expected to improve the trustworthiness of both medical end-users and patients.

  • The radiologist, as an end user of xAI, should be aware of the current limitations of xAI in relation to individual decisions, where xAI provides little illumination compared with explanations applied to global AI processes, such as model development and knowledge discovery.

  • Radiologists should be aware of the advent of xAI and the radiological academic community should also take care of the dissemination of the basic concepts of xAI among radiologists, residents, and medical students.

  • Constructive critical communication of all xAI processes should be encouraged among all the professional figures involved and, when necessary, with patients.

Conclusions

AI has already entered both the scientific reality and the daily life of the radiologist with great success. However, there is still a lack of understanding of xAI and of its incorporation into the real world of radiologists, despite the increasing focus on embedding ethical standards in AI technology design and implementation. With this white paper, the SIRM intends to aid radiologists, medical practitioners, and scientists. We provided an overview of the emerging field of xAI, of the black-box problem behind the success of AI, and of the xAI methods that can turn the black box into a glass box. We outlined the role and responsibilities of radiologists for the appropriate use of AI technology and its relevance in the radiological field, and finally we provided some recommendations for adopting explainable AI in radiological practice.