1 Introduction

According to the World Health Organization, in 2019 an estimated 17.9 million people died from cardiovascular diseases, representing 32% of all global deaths. Statistics published by the American Heart Association in 2023 state that from 2017-2020, an estimated 20.5 million Americans had coronary heart disease (CHD) [1]. Specifically, coronary artery disease (CAD) accounts for approximately 610,000 deaths annually in the United States and is the third leading cause of death worldwide, with 17.8 million deaths annually [2].

A patient’s symptoms of CAD are neither sensitive nor specific, making it difficult for clinicians or cardiologists to rely on them alone. The reference standard for CAD detection is coronary angiography, an invasive diagnostic imaging procedure performed via cardiac catheterization [3]. This method is expensive and carries potential risks. Other methods include cardiac imaging techniques, which are safe, non-invasive, and less expensive, and can help doctors detect CAD early and provide timely interventions to treat CAD patients. These techniques include X-rays, Computed Tomography (CT), echocardiography, and Magnetic Resonance Imaging (MRI) or Cardiac Magnetic Resonance (CMR) imaging [4].

X-ray and CT imaging techniques use ionizing radiation, which is considered harmful if a patient is over-exposed to it [5]. Echocardiograms are limited by cost, time, and acoustic window access [6]. MRI or CMR imaging uses magnetic fields and is considered a viable alternative for non-invasive assessment of CAD [7]. MRI/CMR images provide precise measurements of heart structure and function, as well as myocardial perfusion and parametric quantification. MRI/CMR can be 2D or 3D, but 3D imaging suffers from excessive artifacts and has thus not been used clinically for the diagnosis of CAD [8]. Manual interpretation of 2D scans is also time-consuming and requires experience. Thus, artificial intelligence methods are being exploited to automate CAD diagnosis, reducing analysis time with potentially improved accuracy. This plays a critical role in connected healthcare settings, where healthcare services are delivered remotely, moving from hospitals to the patient's side or home-based care.

However, there are several challenges in implementing such AI models on computational tools such as Field Programmable Gate Arrays (FPGAs), Raspberry Pi and central processing unit (CPU)/graphics processing unit (GPU) based systems. These challenges arise due to the limited processing power, memory, and energy efficiency of these devices. It is essential to engage in a multidisciplinary approach that involves collaboration between domain experts, data scientists and hardware engineers to overcome these challenges.

Convolutional Neural Network (CNN) models have yielded unprecedented achievements in addressing computer vision challenges, including but not limited to image classification, object detection, and tracking. Nonetheless, their integration into embedded applications has been impeded by substantial computational and memory requirements, giving rise to a research domain known as model compression, which includes bit reduction, knowledge distillation, tensor decomposition, network pruning, and microarchitecture design [9]. Interested readers are referred to [10] for detailed insights into the advantages and limitations of each of these methods. While these strategies have demonstrated notable achievements, they are not without inherent constraints.

This paper introduces a lightweight Convolutional Neural Network (CNN) model designed specifically for real-time implementation as a classifier. In connected healthcare settings, where low latency and efficient processing are crucial, this lightweight CNN offers a promising solution. By optimizing the model’s architecture and parameters, we aim to strike a balance between computational efficiency and classification accuracy, enabling real-time CAD detection. This approach has great potential to improve the deployment of AI systems in resource-constrained environments, ultimately benefiting overall healthcare systems.

The remainder of the paper is organised as follows: Sect. 2 summarises the available literature on real-time CAD classification networks, Sect. 3 presents the proposed work and the dataset description, Sect. 4 provides the complexity calculations and experimental results, Sect. 5 presents the explainability analysis, and the conclusion and future work are given in Sect. 6.

2 Background

Coronary artery disease (CAD) primarily originates from the accumulation of atherosclerotic plaque within the epicardial arteries, leading to an imbalance in the supply and demand of oxygen to the myocardium, often resulting in ischemia [11]. Chest pain is the predominant symptom, typically occurring during physical or emotional stress. Lifestyle modifications, pharmacological therapies, and invasive interventions are available strategies to modify this disease process, with the goal of stabilization or regression of the disease [12]. Despite the development of innovative imaging methods, such as MRI and/or coronary CT angiography, invasive coronary angiography remains the preferred diagnostic tool for assessing the severity of complex CAD, as endorsed by the 2019 guidelines of the European Society of Cardiology [13]. The process of interpreting complex coronary vascular structures is time-intensive and presents challenges to the clinician [14]. The implementation of real-time automatic CAD detection and labelling offers promise in overcoming these challenges by providing valuable support in the decision-making process.

Numerous methodologies for the automatic or semi-automatic assessment of coronary artery disease have been proposed by various research groups [15]. These methodologies adhere to a common framework comprising three fundamental steps: (1) extraction of the coronary artery tree, (2) computation of geometric parameters, and (3) analysis of stenotic segments. The pivotal phase, which largely determines the efficiency and precision of these algorithms, is the extraction of the coronary artery tree. This task is accomplished through diverse techniques, including centerline extraction [16], graph-based methods [17], superpixel mapping [18], and machine/deep learning [19]. Among these, machine and deep learning methods have exhibited substantial potential in CAD detection owing to their commendable performance, adaptability to tuning, and optimization capabilities [20]. The overarching objective pursued by developers and users of CNNs is to strike an optimal equilibrium between accuracy and speed, a concept often referred to as the “speed/accuracy trade-off” [21]. This trade-off reflects the endeavour to achieve high levels of CAD detection accuracy while simultaneously ensuring swift processing and analysis, a critical consideration in clinical practice.

Although several CNN-based approaches have reportedly achieved optimal accuracy in CAD detection, with Dice Similarity Coefficients surpassing 0.75 [22] and sensitivity metrics exceeding 0.70 [23], their processing speed is often neglected. The time required for image processing represents a critical performance indicator for the practical application of these methods. In the literature, studies have reported processing times ranging from 1.1 to 11.87 seconds [17, 22], 20 seconds [18] and, in some instances, even exceeding 60 seconds per frame [16]. Such durations are unacceptable for real-time CAD detection, where the required processing time is 0.07 to 0.13 seconds per frame [24]. Thus, our study presents a detailed analysis of lightweight neural network architectures along with their potential, in terms of accuracy and performance, to classify healthy and CAD images.

3 Proposed work

In the proposed work, we implemented a lightweight neural network, that is, an adapted version of the LeNet-5 model [25], on the CAD Cardiac Magnetic Resonance Imaging dataset (proposed by Khozeimeh et al. [26]) for a comprehensive comparison. The results of the CNN-RF model [26] were considered as the ground truth/reference. The input to the model is 2D CMR images. Figure 1 depicts example images of both categories. Pre-processing steps included resizing the images to 100x100 pixels and normalizing intensities between 0 and 1. The main contribution of the proposed work is three-fold, as follows:

1. Time Complexity: We propose a lightweight deep network model for CAD classification in MRI by carefully selecting the network architecture and optimizing model parameters to reduce inference time while maintaining accuracy, enabling real-time or near-real-time CAD diagnosis.

2. Hyper-parameter Tuning: We optimized the deep model by exploring different hyper-parameter settings, activation functions, optimizers, and architectural changes to identify configurations that maximize the model’s performance.

3. eXplainable Artificial Intelligence (XAI): We integrated two XAI techniques, GradCAM [27] and Layer-Wise Relevance Propagation (LRP) [28], to provide interpretable insights into the model’s decision-making process, generating heatmaps that highlight the regions of MRI/CMR images that govern CAD classification.

Fig. 1

Example of 2D MRI/CMR images from CAD patients (a–c) and healthy subjects (d–f). The yellow circle highlights the region indicative of CAD in sub-images (a–c). Figure reproduced with permission from [26]

3.1 Dataset description

The dataset consists of 63,151 multiparametric CMR images, comprising 37,290 images from healthy subjects and 25,861 images from CAD patients. CAD diagnosis was confirmed by invasive coronary angiography. Four MRI/CMR sequences (that is, Late Gadolinium Enhancement (LGE), Perfusion, T2-weighted, and Steady-State Free Precession (SSFP)) were used, capturing short- and long-axis planes of the heart. A total of 13 slices per patient were collected across the four sequence types.

During the pre-processing stage, a manual inspection was conducted on images from both subsets, and any images with poor MRI/CMR quality were excluded from further analysis. Following this stage, the dataset consisted of 34,216 images from healthy subjects and 17,438 images from patients with CAD.
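As noted in Sect. 3, each retained frame is resized to 100x100 pixels and its intensities are normalised to the range [0, 1] before being fed to the network. A minimal per-frame sketch of this step is shown below; the file format, loading library, and greyscale conversion are assumptions, since they are not specified above.

```python
import numpy as np
from PIL import Image

def load_and_preprocess(path, target_size=(100, 100)):
    """Resize one CMR frame to 100x100 pixels and scale intensities to [0, 1]."""
    img = Image.open(path).convert("L")            # single-channel greyscale (assumption)
    img = img.resize(target_size)                  # 100x100, as described in Sect. 3
    arr = np.asarray(img, dtype=np.float32) / 255.0
    return arr[..., np.newaxis]                    # shape (100, 100, 1) for the CNN
```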

3.2 Performance assessment metrics

The performance of the classifier is assessed using Positive Predictive Value (PPV), Recall (Sensitivity or True Positive Rate), Specificity (True Negative Rate), F1-score, Area Under the Curve (AUC), Accuracy and Balanced Accuracy. Mathematically, each metric is defined as:

$$\text{PPV} = \frac{\text{TP}}{\text{TP} + \text{FP}}$$
(1)
$$\text{Recall} = \frac{\text{TP}}{\text{TP} + \text{FN}}$$
(2)
$$\text{Specificity} = \frac{\text{TN}}{\text{TN} + \text{FP}}$$
(3)
$$\text{F1-score} = 2 \cdot \frac{\text{Precision} \cdot \text{Recall}}{\text{Precision} + \text{Recall}}$$
(4)
$$\text{AUC} = \int_{0}^{1} \text{ROC-curve} \, d\text{FPR}$$
(5)
$$\text{Accuracy} = \frac{\text{TP} + \text{TN}}{\text{TP} + \text{TN} + \text{FP} + \text{FN}}$$
(6)
$$\text{Balanced Accuracy} = \frac{\text{Sensitivity} + \text{Specificity}}{2}$$
(7)

Where TP: True Positives, TN: True Negatives, FP: False Positives, FN: False Negatives, ROC: Receiver Operating Characteristic and FPR: False Positive Rate.
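For reference, the sketch below computes Eqs. (1)-(7) from predicted probabilities and binary labels using scikit-learn; the 0.5 decision threshold is an assumption.

```python
import numpy as np
from sklearn.metrics import confusion_matrix, roc_auc_score

def binary_classification_metrics(y_true, y_prob, threshold=0.5):
    """Compute the metrics of Eqs. (1)-(7) for a binary CAD classifier."""
    y_pred = (np.asarray(y_prob) >= threshold).astype(int)
    tn, fp, fn, tp = confusion_matrix(y_true, y_pred, labels=[0, 1]).ravel()
    ppv         = tp / (tp + fp)                      # Eq. (1)
    recall      = tp / (tp + fn)                      # Eq. (2), sensitivity
    specificity = tn / (tn + fp)                      # Eq. (3)
    f1          = 2 * ppv * recall / (ppv + recall)   # Eq. (4)
    auc         = roc_auc_score(y_true, y_prob)       # Eq. (5)
    accuracy    = (tp + tn) / (tp + tn + fp + fn)     # Eq. (6)
    balanced    = (recall + specificity) / 2          # Eq. (7)
    return dict(PPV=ppv, Recall=recall, Specificity=specificity, F1=f1,
                AUC=auc, Accuracy=accuracy, BalancedAccuracy=balanced)
```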

4 Results

Figure 2 illustrates the implemented model architecture. All experiments were implemented in Python using the Keras library. The models were trained on an Apple M2 Pro with 16 GB RAM. The following subsections discuss the time complexity calculations, the effect of hyper-parameter tuning, and feature explanations using XAI results.

Fig. 2

Implemented model architecture

4.1 Time complexity calculation and comparison

The time complexity of a model is determined by the number of layers and the operations performed in each layer. The proposed model architecture comprises seven layers, excluding the input layer, as shown in Fig. 2. These layers consist of C1 (convolutional), S2 (subsampling), C3 (convolutional), S4 (subsampling), FC5 (fully connected), FC6 (fully connected) and the output layer. The time complexity of each layer is as follows:

1. C1 (convolutional layer): The time complexity of this layer depends on the filter size, the shape of the input image, and the number of filters applied. Assuming the input shape is (H, W, C), where H is the height, W is the width, and C is the number of channels, and the filter size is (FH, FW), the time complexity is approximately O(H * W * C * FH * FW * F * \(S^{2}\)), where F is the number of filters and S is the stride value.

2. S2 (subsampling/average pooling layer): The time complexity of this layer depends on the pool size and strides. Assuming the pool size is (PH, PW) and the strides are (SH, SW), the time complexity is approximately O((H/PH) * (W/PW) * C).

3. C3 (convolutional layer): Similar to C1, the time complexity of this layer is approximately O(H * W * C * FH * FW * F * \(S^{2}\)).

4. S4 (subsampling/average pooling layer): Similar to S2, the time complexity of this layer is approximately O((H/PH) * (W/PW) * C).

5. FC5 (fully connected layer): The time complexity of a fully connected layer with U units is O(U * F), where F is the size of the input feature vector.

6. FC6 (fully connected layer): The time complexity of this layer is similar to the previous one, that is, O(U * F).

7. Output layer: The time complexity of the output layer is O(U), where U is the number of output units.

Summing the time complexities of all layers, the overall time complexity of the proposed model can be approximated as: \(O\Big (2*(H * W * C * FH * FW * F * S^{2}) + 2*(\frac{H}{PH} * \frac{W}{PW} * C) + (ep * ts * tf * C * fs * fs) + 2*(U * F) + U\Big ).\)

In our case, the input image shape is (100,100,1), the number of filters is varied between (C1 = 6, C3 = 6) and (C1 = 12, C3 = 6), the kernel size is (5,5), the pooling size is (2,2), the strides are (2,2), and the numbers of units in fully connected layers 1 and 2 are 128 and 84, respectively, while the output layer has only 1 unit, as the model performs binary classification.
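A minimal Keras sketch of this architecture is given below. The layer sizes follow the description above and the best-performing setting reported later in Sect. 4.2 (PReLU activations, RMSprop optimizer, sigmoid output); the padding, the exact position of the dropout layer, and other unstated details are assumptions, so the resulting parameter count may differ slightly from the implemented model.

```python
from tensorflow import keras
from tensorflow.keras import layers

def build_lightweight_cad_cnn(c1_filters=6, c3_filters=6, dropout=0.5):
    """Adapted LeNet-5-style classifier for 100x100x1 CMR frames (a sketch)."""
    model = keras.Sequential([
        keras.Input(shape=(100, 100, 1)),
        layers.Conv2D(c1_filters, (5, 5)), layers.PReLU(),           # C1
        layers.AveragePooling2D(pool_size=(2, 2), strides=(2, 2)),   # S2
        layers.Conv2D(c3_filters, (5, 5)), layers.PReLU(),           # C3
        layers.AveragePooling2D(pool_size=(2, 2), strides=(2, 2)),   # S4
        layers.Flatten(),
        layers.Dense(128), layers.PReLU(),                           # FC5
        layers.Dropout(dropout),                                     # omitted in the no-dropout settings
        layers.Dense(84), layers.PReLU(),                            # FC6
        layers.Dense(1, activation="sigmoid"),                       # output (binary CAD)
    ])
    model.compile(optimizer="rmsprop", loss="binary_crossentropy",
                  metrics=["accuracy"])
    return model

# model = build_lightweight_cad_cnn()
# model.summary()   # reports per-layer output shapes and trainable parameter counts
```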

As the results are to be compared with CNN-RF models proposed by [26], the time complexity of their model is calculated to be: \(O\big (n_{\text {e}} \log (n_{\text {e}}) n_{\text {f}} n_{\text {s}} \log (n_{\text {s}}) + 2 \big (O(n_{\text {s}}) + O(n_{\text {cnn}} n_{\text {ep}} n_{\text {ts}} n_{\text {tf}} n_{\text {tc}} \text {fs} \text {fs}) + O(n_{\text {ts}} n_{\text {cnn}}) + O(n_{\text {ts}} n_{\text {cnn}} \log (n_{\text {cnn}})) + O(n_{\text {vs}} n_{\text {cnn}})\big )\).

In both the time complexity equations, e is estimators, f is features, s is samples, ep is epochs, ts is train samples, tf is train features, tc is train channels, fs is filter size, and vs is validation samples.

A comparison of the time complexities of the proposed model and the CNN-RF model reveals that our model entails significantly lower computational overhead. Our inference time on a MacBook laptop for 323 test images of size 100x100 is only 2.6 seconds, that is, approximately 8 milliseconds per image. Additionally, it provides better or equal classification accuracy. Our model's lower computational complexity enables faster image analysis and diagnosis, improving efficiency and facilitating deployment on resource-constrained systems such as a Raspberry Pi, an FPGA, or any other edge device for real-time classification and diagnosis in connected healthcare settings.
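A simple way to reproduce this kind of per-image latency measurement is sketched below; `model` and `x_test` (the 323 held-out frames) are placeholders for the objects used in our experiments.

```python
import time

# x_test: array of shape (323, 100, 100, 1); model: the trained classifier.
start = time.perf_counter()
probabilities = model.predict(x_test, batch_size=32, verbose=0)
elapsed = time.perf_counter() - start
print(f"total inference: {elapsed:.2f} s "
      f"({1000 * elapsed / len(x_test):.1f} ms per image)")
```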

4.2 Hyper-parameter tuning and classification results

Various hyper-parameter configurations were explored to attain optimal model performance for CAD image classification. Tables 1, 2, 3 and 4 present the performance of the models obtained with the different settings.

Table 1 Model parameters settings: model with filter size \(C_{1}\) and \(C_{3}\) = 6,6; Batch Size = 32; epochs = 20; loss function = binary cross-entropy; final layer activation function = sigmoid; dropout = 0.5
Table 2 Model parameters settings: model with filter size \(C_{1}\) and \(C_{3}\) = 12,6; Batch Size = 32; epochs = 20; loss function = binary cross-entropy; final layer activation function = sigmoid; dropout = 0.5
Table 3 Model parameters settings: model with filter size \(C_{1}\) and \(C_{3}\) = 6,6; Batch Size = 32; epochs = 20; loss function = binary cross-entropy; final layer activation function = sigmoid; No dropout
Table 4 Model parameters settings: model with filter size \(C_{1}\) and \(C_{3}\) = 12,6; batch size = 32; epochs = 20; loss function = binary cross-entropy; final layer activation function = sigmoid; No dropout

The Parametric Rectified Linear Unit (PReLU) activation function combined with the Root Mean Squared propagation (RMSprop) optimizer resulted in the highest classification accuracy, achieving an overall accuracy of 99.35% and a balanced accuracy of 99.13%. This surpasses the previously reported highest accuracy of 99.18% obtained by the reference CNN-RF model. To test the generalizability of our model, a stratified cross-validation (CV) analysis was performed using 10 folds. The model showed similar performance to the non-CV setting, achieving a classification accuracy of 99.22% (with a balanced accuracy of 99.10%), as depicted in Table 5.
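The stratified 10-fold protocol can be sketched as follows; the random seed and the dummy feature matrix passed to scikit-learn's splitter are assumptions, and `build_lightweight_cad_cnn` refers to the architecture sketch in Sect. 4.1.

```python
import numpy as np
from sklearn.model_selection import StratifiedKFold

# X: (N, 100, 100, 1) pre-processed frames, y: binary labels (0 = healthy, 1 = CAD).
skf = StratifiedKFold(n_splits=10, shuffle=True, random_state=42)
fold_accuracies = []
for train_idx, val_idx in skf.split(np.zeros((len(y), 1)), y):
    model = build_lightweight_cad_cnn(c1_filters=6, c3_filters=6)
    model.fit(X[train_idx], y[train_idx], epochs=20, batch_size=32, verbose=0)
    _, acc = model.evaluate(X[val_idx], y[val_idx], verbose=0)
    fold_accuracies.append(acc)
print(f"mean accuracy: {np.mean(fold_accuracies):.4f} "
      f"(+/- {np.std(fold_accuracies):.4f})")
```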

Table 5 Model’s best performances achieved with different settings: comparison

The residual misclassifications of the proposed classifiers can be attributed to their reliance on frame-based analysis. MRI sequences often produce a multitude of frames, some of which lack noticeable regions of interest (ROIs), as depicted in Fig. 3 (all three viewing angles of an MRI scan). The figure illustrates frames with no ROIs (no visible coronary artery in the frame). The proposed model treats all frames uniformly, irrespective of their diagnostic value. Thus, frames without ROIs introduce noise into the analysis, impairing the classifier's ability to differentiate between images of patients with CAD and those of healthy individuals. This limitation underscores the need for more sophisticated methodologies that account for the inherent variability of MRI frames, enabling classifiers to consider frames based on the presence or absence of ROIs.

Fig. 3

Original images from the CAD (sick) subset. (a) Axial view, (b) sagittal view, and (c) coronal view of a chest MRI scan (one frame)

5 eXplainable AI

Explainable Artificial Intelligence (XAI) is a field in machine learning and artificial intelligence that focuses on developing models that can provide transparent and interpretable explanations for their decisions or predictions. In the context of connected healthcare settings, XAI not only helps ensure the quality and safety of care but also fosters trust among patients and healthcare providers. Several notable XAI techniques include:

- SHAP (SHapley Additive exPlanations) values provide a unified framework for explaining the output of any machine learning model by attributing contributions of each input feature to the model's prediction [29, 30].
- LIME (Local Interpretable Model-Agnostic Explanations) generates local explanations by approximating complex model behaviour with simpler, interpretable models on a subset of data points [31].
- Saliency Maps highlight regions in input data (e.g., medical images) that are most influential in a model's prediction, aiding clinicians in understanding what the model is focusing on [32].
- Accumulated Local Effects (ALE) helps visualize how the relationship between a single feature and the model's prediction changes across different feature values [33].
- Contrastive Explanation Method (CEM) generates contrastive explanations, highlighting the minimal changes needed in input features to alter a model's prediction, which can be invaluable in understanding model behaviour [34].
- Global Interpretation via Recursive Partitioning (GIRP) uses recursive partitioning techniques to create a global interpretable model that approximates the original complex model [35].
- CAM (Class Activation Maps) highlights important regions in images that contribute to a specific class prediction, making it useful for image classification tasks [36].
- GradCAM (Gradient-weighted Class Activation Mapping) combines gradient information with CAM to provide more precise visualizations of feature importance in convolutional neural networks [37].
- LRP (Layer-wise Relevance Propagation) assigns relevance scores to each input feature, explaining how each feature contributes to the model's output [38].

In this paper, we choose GradCAM and LRP due to their ability to provide precise, visual, and deep-level explanations, their compatibility with CNN-based models, and their established utility in the medical imaging domain. These methods collectively offer a comprehensive solution for improving the interpretability of AI models in a clinical context, ultimately leading to more informed and confident clinical decision-making. The results of each technique are explained as follows:

5.1 GradCAM heatmaps

Gradient-weighted Class Activation Mapping (GradCAM) is a computer vision technique used to generate a heatmap of the important regions in an image that contribute most to the prediction of a deep learning model [39]. Figure 4 illustrates examples of GradCAM heatmaps that highlight the focused regions (regions of interest) for the prediction of CAD in the test images. In the GradCAM visualization, the intensity of the heatmap represents the importance of each pixel in the input image: higher intensity (e.g. brighter colours) with strong contrast against the background indicates a region that contributed more significantly to the model's prediction.
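A commonly used way to compute such heatmaps for a Keras CNN with a sigmoid output is sketched below; the name of the last convolutional layer and the input shape are assumptions tied to the architecture sketch in Sect. 4.1. The resulting map can be upsampled to 100x100 and overlaid on the original frame to produce visualizations such as those in Fig. 4.

```python
import numpy as np
import tensorflow as tf

def grad_cam_heatmap(model, image, conv_layer_name="conv2d_1"):
    """Grad-CAM heatmap for one (100, 100, 1) frame; the layer name is a placeholder
    for the last convolutional layer of the trained network."""
    grad_model = tf.keras.Model(
        inputs=model.inputs,
        outputs=[model.get_layer(conv_layer_name).output, model.output])
    with tf.GradientTape() as tape:
        conv_maps, prediction = grad_model(image[np.newaxis, ...].astype("float32"))
        score = prediction[:, 0]                       # sigmoid score for the CAD class
    grads = tape.gradient(score, conv_maps)            # d(score) / d(feature maps)
    weights = tf.reduce_mean(grads, axis=(1, 2))       # global-average-pooled gradients
    cam = tf.reduce_sum(weights[:, tf.newaxis, tf.newaxis, :] * conv_maps, axis=-1)
    cam = tf.nn.relu(cam)[0]                           # keep only positive evidence
    cam = cam / (tf.reduce_max(cam) + 1e-8)            # normalise to [0, 1]
    return cam.numpy()
```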

Fig. 4

Heatmaps generated by GradCAM on test images. The most important features contributing to the classification of an image into a certain class are shown in darker colours. For each example, the three images are the original, the heatmap, and the superimposed image

5.2 Layer-wise relevance propagation (LRP)

Layer-wise Relevance Propagation (LRP) is an XAI technique used to understand the predictions made by deep learning models. The primary objective of LRP is to ascribe the model’s predictions to specific regions or features within the input image [40]. This helps explain why a particular classification decision was made, which is crucial in medical applications for trust and accountability. The core principle shared among the various versions of the LRP algorithm is the conservation of the activation strength of an output node for a specific class as it is propagated back through each layer of the neural network. This ensures that the total relevance associated with a particular class remains constant as it traverses the network layers during the explanation process [41]. This study investigated two versions of the LRP algorithm, i.e., LRP0 and LRP_epsilon. LRP0 is the basic rule that conserves relevance strictly but can produce noisy or unstable relevance maps when relevance is divided by pre-activations close to zero. LRP_epsilon addresses this by introducing a small stabilizing factor (epsilon) in the denominator, improving the stability and interpretability of the relevance heatmaps.
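To illustrate the propagation rule itself, the sketch below applies the LRP-epsilon redistribution to a stack of dense layers in NumPy; it is a simplified illustration of the rule, not the full implementation applied to our CNN, and the treatment of convolutional and pooling layers (handled analogously in practice) is omitted.

```python
import numpy as np

def lrp_epsilon_dense(activations, weights, biases, relevance_out, eps=1e-6):
    """Back-propagate relevance through dense layers with the LRP-epsilon rule:
    R_j = sum_k [a_j * w_jk / (z_k + eps * sign(z_k))] * R_k, z_k = sum_j a_j w_jk + b_k.
    activations[l] is the input to layer l; weights[l] has shape (in_l, out_l)."""
    R = relevance_out
    for a, W, b in zip(reversed(activations), reversed(weights), reversed(biases)):
        z = a @ W + b                      # pre-activations of this layer
        z = z + eps * np.sign(z)           # epsilon stabiliser avoids tiny denominators
        s = R / z                          # per-neuron relevance-to-activation ratio
        R = a * (s @ W.T)                  # redistribute relevance to the layer inputs
    return R
```

Setting eps to zero recovers the LRP0 rule, which is where the numerical instability described above originates.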

Figure 5 displays the heatmaps produced by both algorithms along with the original images. The significance of features is visually represented using colours, with red indicating more critical features contributing to the classification of an image into a specific category.

Fig. 5

Heatmaps generated by LRP on test images. The left column shows the original images, the middle column shows the output heatmaps of LRP0, and the right column shows the output heatmaps of the LRP_epsilon technique

5.3 Failure cases

A lack of contrast in the region of interest (ROI), or overly bright regions that contain no relevant information, presents a significant challenge. While the model appropriately emphasizes brighter regions, it struggles when the input image does not have enough contrast. Thus, the performance of the proposed model depends significantly on the quality of the input image. To address this issue, a potential solution is to implement a preprocessing step focused on enhancing image contrast. Furthermore, an iterative refinement process and parameter tuning may be employed to optimize the preprocessing step, ensuring adaptability to varying degrees of contrast in input images. However, it is crucial to acknowledge that these approaches incur computational expense due to the additional processing requirements. Therefore, a balance between computational resources and enhanced model performance must be struck.
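One lightweight option for such a contrast-enhancement step is contrast-limited adaptive histogram equalisation (CLAHE); a sketch using OpenCV is shown below, where the clip limit and tile size are illustrative values rather than tuned settings from this study.

```python
import cv2
import numpy as np

def enhance_contrast(frame_01, clip_limit=2.0, tile_grid=(8, 8)):
    """Possible CLAHE pre-processing step for low-contrast frames (Sect. 5.3)."""
    img_u8 = (np.clip(frame_01, 0.0, 1.0) * 255).astype(np.uint8)   # CLAHE expects uint8
    clahe = cv2.createCLAHE(clipLimit=clip_limit, tileGridSize=tile_grid)
    return clahe.apply(img_u8).astype(np.float32) / 255.0           # back to [0, 1]
```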

Table 6 Comparison of the proposed model with AlexNet in terms of classification accuracy, model complexity and run-time complexity on 323 test images

6 Conclusion

In conclusion, this research study proposed a lightweight Convolutional Neural Network (CNN) model tailored for real-time CAD image classification tasks in connected healthcare environments. The study placed a strong emphasis on optimizing hyper-parameter configurations to enhance the efficiency and accuracy of AI models in healthcare-related classification. Moreover, to provide interpretability of the model’s predictions, we incorporated the GradCAM and LRP algorithms, which highlight the significant features within the input images that influence classification decisions.

The achieved results are compared with the state-of-the-art algorithm in the literature (an ensemble of 10 CNN-RF networks). The CNN-RF model is more computationally expensive, as it extracts classification features using a CNN and then feeds them to a Random Forest (RF) classifier. In addition, majority voting is performed to predict the final class (normal or sick patient image). In contrast, the proposed model is a single seven-layer CNN that outperforms the CNN-RF in terms of classification performance as well as time complexity, making it more suitable for implementation on edge devices and in connected healthcare settings.

The classification performance of both models (i.e., the baseline and the proposed model) was measured using the PPV, recall, specificity, F1-score, AUC, and accuracy metrics. As the dataset has a class imbalance, an additional performance metric, balanced accuracy, was also calculated during the analysis. The combinations of different hyper-parameters yielded different classification accuracies, as tabulated in Tables 1 to 5. Among all the settings, the proposed model achieved the highest test accuracy of 99.35% (with a balanced accuracy of 99.13%) with PReLU as the inter-layer activation function, the RMSprop optimizer, a batch size of 32, and the binary cross-entropy loss function.

The proposed model is also compared with a relatively more complex AlexNet in terms of classification accuracy, model complexity, and run-time complexity. With AlexNet achieving an accuracy of 98.89%, the proposed model demonstrates superior performance, as shown in Table 6. In addition to accuracy, the proposed model exhibits substantially reduced training and inference times (556.4 seconds and 2.6 seconds, respectively) compared to AlexNet. Moreover, the architecture of the proposed model has significantly fewer trainable parameters (507,299) as well as a smaller model size (1.94 MB), demonstrating its enhanced practicality and resource efficiency.

The achievement of such a high classification accuracy on the CAD test dataset with downsized images (100x100 pixels) using the proposed light-weighted model can be attributed to two main factors. Firstly, the representational efficiency of the model architecture is a key contributor. The proposed model demonstrates the capacity to learn the crucial features even in low-resolution images, enabling accurate predictions. Secondly, the downsampling of images does not severely compromise the model’s proficiency in recognizing spatial hierarchies and patterns.

This research highlights the critical role of optimizing time complexity and hyperparameters in the development of sustainable healthcare AI models. By doing so, we can ensure the resource efficiency and real-time applicability of these models, while concurrently upholding their reliability. Furthermore, the incorporation of eXplainable AI (XAI) techniques provides essential interpretability, aligning AI-generated recommendations with the interpretations of clinical experts and safeguarding patient safety.

Future directions: The proposed investigative work aimed to provide insight into the optimization of healthcare AI models, ensuring accurate and reliable results while prioritizing patient safety and resource efficiency, and advancing the acceptance and understanding of AI in connected healthcare settings. While the results on 2D CMR images are promising, in the future, 3D-CNN-based models will be explored on other healthcare images, such as Computed Tomography (CT), X-ray and/or echocardiogram (Echo) images, to determine the model's comprehensive diagnostic capabilities, cross-domain scalability, and performance on multi-modal data. Moreover, we propose the integration of two techniques to further improve the designed classification models' performance: majority voting for frame-based analysis and the implementation of a video-based classifier. Combining these techniques offers a promising path towards a more accurate and reliable classification model to distinguish between patient and healthy images in MRI scans.