1 Introduction

Alzheimer’s disease (AD) is a progressive, degenerative brain disease and the leading cause of dementia in older people. It causes cognitive decline that eventually leads to the inability to carry out daily activities [1]. AD not only reduces the quality of life of patients, but it also places additional stress on caregivers. The accumulation of the amyloid peptide is linked to AD, and the symptoms often begin with minor memory loss before progressing to other brain dysfunctions [2]. Since there is no cure for AD, early detection in the prodromal stage, i.e., mild cognitive impairment (MCI), is vital. Early MCI (EMCI) is the initial stage of cognitive impairment within MCI [3]. Early diagnosis of EMCI may prevent it from progressing to AD [3]. Studies have stressed the relevance of diagnosing MCI patients by identifying the differences between EMCI and late MCI (LMCI) groups [4,5,6]. Neuroimaging has become a crucial diagnostic tool for AD due to the rapid advancement of neuroimaging technologies [7, 8]. Non-invasive techniques such as MRI and PET are routinely employed to record brain tissue features [9, 10]. By evaluating brain images captured with PET and MRI, volumetric reduction (atrophy) in parts of the brain can be used as an essential biomarker for AD [11, 12].

PET imaging is an important functional technology that allows clinicians to study activity in the human brain swiftly and precisely, with the potential for the early detection of AD [13, 14]. PET images obtained using radioactive 18-fluorodeoxyglucose (FDG) provide sensitive estimates of the glucose metabolic rate in the brain [15], which can be used to trace the progression of the disease from normal cognition (NC) to AD. FDG-PET is particularly useful when it is difficult to distinguish physiological from pathological alterations in the anatomy. The volume of brain structures diminishes with age (particularly in the elderly) [16], making it difficult to identify whether a person’s brain is in a normal or diseased state using MRI alone. PET can detect the AD status of people more effectively in these cases. For example, Ozsahin et al. utilized PET data for the automated classification of AD groups [17]. The authors of [18] predicted the risk of AD with a deep learning model by extracting FDG-PET image features. Jo et al. established a deep learning-based system for the categorization of AD that recognizes the morphological phenotypes of tau deposition in tau PET images [19]. Liu et al. used a multiscale deep neural network to learn the patterns of metabolic changes due to AD pathology by analyzing PET images [20].

Because of its capacity to show distinct atrophy patterns in the brain and its high resolution for soft tissue, structural MRI (sMRI) is useful across the spectrum of AD [21]. For many years, structural information about the brain has been widely employed for the early detection and diagnosis of AD [22, 23], due to its universality in clinical practice and convenience of examination [24]. MRI has confirmed the pattern of AD progression seen in postmortem brain tissue research [25]. The temporal and parietal lobes, as well as sections of the frontal cortex and cingulate gyrus, degenerate as AD progresses, resulting in extensive atrophy of the affected regions [26]. Structural alterations in the patient’s brain can be observed with MRI. Taheri et al. used gray matter (GM) images extracted from sMRI with a CNN architecture for the diagnosis and classification of the CN, EMCI, and LMCI groups [27]. Mehmood et al. applied tissue segmentation to sMRI to extract the GM tissue, and VGG layer-wise transfer learning was used to distinguish between EMCI and LMCI patients [28]. Yue et al. employed a Deep Convolutional Neural Network (DCNN) on sMRI to extract the most useful spatial features of GM, further segmented into ninety regions, for LMCI vs. EMCI classification [29]. Liu et al. extracted structural and functional features for distinguishing EMCI subjects from LMCI subjects [30]. Wee et al. used a spectral graph-CNN-based system for the early detection of AD that used sMRI cortical thickness and its underlying geometric information [31]. Sheng et al. combined sMRI features and genetic features for six binary classifications (HC vs. AD, HC vs. EMCI, HC vs. LMCI, EMCI vs. LMCI, EMCI vs. AD, and LMCI vs. AD) [32]. Jiang et al. utilized the volumetric features of sMRI data to train a VGG16 CNN with transfer learning for the classification of EMCI vs. NC [24].

PET imaging can capture brain metabolism characteristics to aid in the detection of lesions, whereas structural MRI can reflect changes in brain structure [33]. Iaccarino et al. assessed gray matter reduction in the early MCI stage as well as FDG-PET metabolic connectivity; the results showed that multimodal data provide a clinically important analysis [34]. Researchers have therefore proposed multimodal input based on MRI and PET images to improve classification accuracy. Forouzannezhad et al. developed a Deep Neural Network (DNN) model with three hidden layers to obtain the relevant information from MRI and PET data for the classification of the AD groups [35]. The model classified six binary groups; their findings revealed that the sensitivity of the EMCI vs. AD classification is higher than the specificity for the combined MRI + PET modality. Hao et al. extracted MRI and PET features with consistent metric constraints by computing pairwise similarity measures for the PET and MRI modalities, and the extracted features were used as input to an SVM for classification [36]. The model could successfully retain the features’ structural information, with higher sensitivity than specificity in the LMCI vs. EMCI task.

The idea of multimodal data fusion for medical diagnostics is not new [37, 38]. Khan et al. used it to fuse various modalities of brain MRI images (T1, T2, T1CE, and FLAIR) for brain tumor recognition [39], and Muzammil et al. used it to fuse computed tomography (CT) and MRI of the brain [40]. Maqsood et al. proposed a multimodal image fusion framework based on multiscale image matting and evaluated it on brain MRI and CT images [41]. Guo et al. proposed to fuse structural images, such as CT and MRI, with functional images, such as PET and single-photon emission computed tomography (SPECT) [42]. Zhang et al. extracted characteristics from MRI and PET data using a deep multimodal fusion network based on the attention mechanism, in which irrelevant information was suppressed [43]. At inference time, complementary information from MRI and PET features can be learned; even if a specific modality is absent, the single input and the related complementary information obtained from the pretrained model can still be used to forecast AD. Shao et al. proposed a feature correlation and feature structure fusion approach with a Support Vector Machine (SVM) [44]. The classification results showed that the model improved greatly in LMCI vs. EMCI classification when compared with other state-of-the-art methods. The authors suggested the need to further improve the binary classification of their model.

Recently, hybrid methods based on the combination of deep learning and heuristic or nature-inspired optimization methods have been proposed to enhance brain MRI image classification for AD diagnostics [45]. Pradhan et al. proposed a hybridized Salp Swarm Algorithm-based Extreme Learning Machine (ELM), in which the algorithm is used to optimize the ELM model for MRI classification [46]. Raghavaiah et al. used an Enhanced Squirrel Search Algorithm to select the optimal weight parameters of a deep neural network (DNN) architecture for AD stage classification [47]. In our previous work, the ResNet18 pretrained model was utilized for binary classification of AD using MRI from ADNI, proving its effectiveness in EMCI vs. AD and LMCI vs. AD with validation accuracies (VA) of 99.99% and 99.95%, respectively [48], while it was able to achieve 98.86% accuracy, 98.94% precision, and 98.89% recall in multiclass classification [49]. Odusami et al. utilized a ResD hybrid technique based on ResNet18 and DenseNet121, in which the data from the two pretrained models are mixed for classification [50]. Experiments reveal that the suggested hybrid ResD model achieved 99.61% (macro) precision. This inspired us to design an Agitated ResNet18 model using multimodal input images for the early detection of AD. The first convolution layer of ResNet18 is changed into an agitated layer, which is added to the main residual layer. This model takes advantage of the data extracted in the channel dimension and combines it with the original features at multiple scales.

In summary, there are two approaches for fusing PET and MRI images. The first is to use a multimodal deep neural network (DNN) that takes both modalities as input and outputs a diagnosis. The DNN can be trained using a dataset of subjects with AD and healthy controls; the network learns to extract features from both modalities and uses them to distinguish between the two groups. The second approach is to use a DNN to extract features from each modality separately, and then fuse the features using a fusion layer. This approach can be useful when the two modalities provide complementary information and the features from each modality are not directly comparable (a conceptual sketch of this variant is given below). This paper offers three significant contributions: the concatenation-based fusion of MRI and PET images, the in-3-channel ResNet18 model for the AD classification task, and experimental validation of the proposed methodology on images from the ADNI database. The experiments demonstrate that the use of multimodal features extracted from the channel dimension and deep supervision can improve the performance of the AD classification model.
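To make the distinction concrete, the sketch below illustrates the second, feature-level approach in PyTorch: each modality passes through its own encoder and the resulting feature vectors are joined by a fusion layer before classification. This is only a conceptual sketch for contrast with the input-level concatenation adopted in this paper; the class name, encoder structure, and layer sizes are illustrative assumptions, not part of the proposed model.

```python
import torch
import torch.nn as nn

class LateFusionNet(nn.Module):
    """Hypothetical feature-level fusion: separate encoders, then a fusion layer."""

    def __init__(self, feat_dim=128, num_classes=2):
        super().__init__()
        self.mri_encoder = nn.Sequential(nn.Flatten(), nn.LazyLinear(feat_dim), nn.ReLU())
        self.pet_encoder = nn.Sequential(nn.Flatten(), nn.LazyLinear(feat_dim), nn.ReLU())
        self.fusion = nn.Linear(2 * feat_dim, num_classes)  # fuses the two feature vectors

    def forward(self, mri, pet):
        fused = torch.cat([self.mri_encoder(mri), self.pet_encoder(pet)], dim=1)
        return self.fusion(fused)
```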

The novelty of this research paper further lies in its contribution to the field of Alzheimer’s disease diagnosis using a combination of MRI and PET images. The paper presents a novel early feature fusion framework that concatenates PET and MRI images and trains a modified ResNet18 deep learning architecture on the combined dataset. The combination of MRI and PET images has been widely studied for the diagnosis of Alzheimer’s disease. However, early fusion, which combines the images at an earlier stage in the analysis process, remains relevant and important to explore further, even if it is not entirely new, in order to advance our understanding of AD and develop more effective diagnostic and treatment strategies. By combining the anatomical information from MRI with the functional information from PET, early fusion can provide a more accurate and reliable diagnosis of AD. The 3-in-channel approach is used to learn the most descriptive features of the fused images, leading to an improved binary classification of Alzheimer’s disease. Additionally, the paper provides an XAI model to explain the results, adding interpretability to the deep learning-based diagnosis. The experimental results on the ADNI database show promising accuracy and demonstrate the effectiveness of the proposed approach. To guide our research, we formulate the following Research Questions (RQ):

RQ1

How can the combination of MRI and PET images be used to improve the diagnosis of Alzheimer’s disease?

RQ2

What is the effectiveness of the proposed concatenation-based feature fusion framework for fusing MRI and PET images in the diagnosis of Alzheimer’s disease?

RQ3

How does the modified Resnet18 deep learning architecture perform in the classification of Alzheimer’s disease using fused MRI and PET images?

RQ4

Can the results of the deep learning-based diagnosis of Alzheimer’s disease be explained using the proposed Explainable Artificial Intelligence (XAI) model?

RQ5

How does the proposed approach compare with existing methods for diagnosing Alzheimer’s disease using MRI and PET images?

The remainder of the paper is organized as follows. Section 2 describes the dataset and the steps of our methodology, including data preprocessing, image denoising, intensity normalization, and the proposed modification of the ResNet18 neural architecture. Section 3 presents the results of the experiments. Section 4 discusses the results, while Sect. 5 compares the proposed model with previous studies. Finally, Sect. 6 presents the conclusions.

2 Materials and Methods

The overall architecture of our proposed model consists of two steps, namely, data preprocessing and classification with the in-3-channel ResNet18 model.

2.1 Materials

The data used in this study were collected from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database. We obtained spatially normalized MRI images and whole-brain PET images processed with the ADNI “Coreg, Avg, Standardized Image and Voxel Size” pipeline. Spatial normalization of MRI images involves aligning different brain images to a common reference space, which allows for meaningful comparisons between groups. Co-registration of PET images with MRI images is important because it allows for accurate localization of PET signals within specific brain regions. A total of 412 MRI and 412 PET scans were included in this study; all subjects received both imaging examinations, and each modality contains EMCI and LMCI groups. The middle slices of both MRI and PET, ranging from slice number 144 to slice number 153, were extracted for this study, and the Clinical Dementia Rating (CDR) was used to determine the cognitive status of each patient. The data distribution is provided in Table 1.
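For illustration, the snippet below sketches how the middle axial slices (144–153) could be extracted from a preprocessed NIfTI volume using nibabel; the file name and slice axis are assumptions for the example, not ADNI specifics.

```python
import nibabel as nib
import numpy as np

def extract_middle_slices(nifti_path, start=144, stop=153, axis=2):
    """Return a stack of 2D slices taken along the given axis of a 3D volume."""
    volume = nib.load(nifti_path).get_fdata()           # 3D array (H, W, D)
    index = [slice(None)] * volume.ndim
    slices = []
    for k in range(start, stop + 1):
        index[axis] = k
        slices.append(np.asarray(volume[tuple(index)], dtype=np.float32))
    return np.stack(slices)                              # shape: (10, H, W)

# Example (the path is a placeholder):
# mri_slices = extract_middle_slices("subject_001_mri.nii.gz")
```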

Table 1 Statistical data of MRI and PET from ADNI

2.2 Preprocessing Steps

To reduce the learning difficulty and enhance the proposed model performance on multimodal data, we utilized data processing steps consisting of noise removal and intensity normalization. Preprocessing is necessary to further improve the image quality.

2.2.1 Removal of Noise

Most MRI and PET images are noisy and typically include regions of low contrast. The original images are rotated by 90 degrees, and a mask of voxels with intensity values greater than ten is generated from the original images to form both the background mask and the brain mask. The generated mask is used to perform segmentation. Morphological dilation is further applied to the segmented images to perform non-linear operations related to the morphology of features in the images, such as boundaries and skeletons; dilation enlarges bright regions and shrinks dark regions. Figure 1 depicts the generated brain mask and the clean MRI and PET images.
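A minimal sketch of this masking step, assuming a 2D slice stored as a NumPy array, is given below; the threshold of ten follows the text, while the number of dilation iterations is an illustrative assumption.

```python
import numpy as np
from scipy import ndimage

def clean_slice(image, threshold=10, dilation_iterations=2):
    """Build brain/background masks from an intensity threshold and clean the slice."""
    brain_mask = image > threshold                       # voxels brighter than the threshold
    background_mask = ~brain_mask
    # Dilation enlarges bright regions and shrinks dark regions around the boundary.
    brain_mask = ndimage.binary_dilation(brain_mask, iterations=dilation_iterations)
    cleaned = np.where(brain_mask, image, 0.0)           # zero out the background
    return cleaned, brain_mask, background_mask

# rotated = np.rot90(raw_slice)                          # 90-degree rotation (see text)
# clean, brain_mask, bg_mask = clean_slice(rotated)
```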

Fig. 1

Data preprocessing steps: a original image, b background mask, c generated brain mask, and d clean MRI image after noise removal

2.2.2 Intensity Normalization

In image processing applications concerning MR images, intensity normalization is a crucial preprocessing step. Because of the use of diverse equipment, MR images have an inconsistent intensity scale across (and within) facilities and scanners, pulse sequences, scan settings, and the environments in which the machines are located. Fuzzy C-means is used to find a mask for the white matter on the original MRI; the corresponding brain mask is shown in Fig. 1. A white matter mask for the image is created from the brain mask and the segmentation obtained from the morphological dilation. The white matter mask then serves as an input again: it is used to find an approximate mean of the white matter intensity in the target contrast and move it to a standard value. Figure 2 shows the white matter mask and the fuzzy C-means normalized MRI.
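The following is a hedged sketch of the white matter mask step, assuming the scikit-fuzzy package for the three-class fuzzy C-means clustering; selecting the brightest cluster as white matter reflects typical T1-weighted contrast and is our assumption, as the paper does not state how the class is chosen.

```python
import numpy as np
import skfuzzy as fuzz

def fuzzy_wm_mask(image, brain_mask, n_classes=3):
    """Cluster brain intensities with fuzzy C-means and return a white matter mask."""
    intensities = image[brain_mask].reshape(1, -1)        # shape (1, N) as cmeans expects
    cntr, u, *_ = fuzz.cluster.cmeans(intensities, c=n_classes, m=2.0,
                                      error=1e-5, maxiter=100, seed=0)
    wm_class = int(np.argmax(cntr[:, 0]))                  # brightest class ~ WM on T1 (assumption)
    labels = np.argmax(u, axis=0)                          # hard labels from fuzzy memberships
    wm_mask = np.zeros(image.shape, dtype=bool)
    wm_mask[brain_mask] = labels == wm_class
    return wm_mask

# wm_mask = fuzzy_wm_mask(clean, brain_mask)
```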

Fig. 2

Comparison of MRI image masks: a matrix mask, b white matter mask, and c fuzzy c-means normalized image

The overall process of the preprocessing technique is shown in Fig. 3.

Fig. 3

Workflow diagram of PET/MRI image preprocessing technique

2.3 Proposed in-3-Channel Resnet18 Model

After the completion of noise removal, fuzzy C-means normalization is applied to all the clean data for the segmentation of gray matter, so that the clean image is normalized to the mean of the tissue, as demonstrated in Fig. 4. The step is described as follows:

Fig. 4

Framework of the proposed model

Let \(T \subset B\) be the tissue mask for the image \(I\), where \(T\) is the set of indices corresponding to the locations of tissue in the image \(I\). Then the tissue mean is given in Eq. (1), and the segmentation-based normalized image is given in Eq. (2):

$$\mu = \frac{1}{\left| T \right|}\sum_{t \in T} I\left(t\right)$$
(1)
$${I}_{seg}\left(x\right)= \frac{c \cdot I\left(x\right)}{\mu }$$
(2)

where \(c \in {\mathbb{R}}_{>0}\) is a constant.
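Given a tissue mask, Eqs. (1) and (2) reduce to a few lines of code; the sketch below assumes the mask is a Boolean NumPy array, such as the white matter mask from the previous step.

```python
import numpy as np

def segmentation_normalize(image, tissue_mask, c=1.0):
    """Eq. (1): mean intensity over tissue voxels; Eq. (2): rescale the whole image."""
    mu = image[tissue_mask].mean()
    return c * image / mu

# normalized = segmentation_normalize(clean, wm_mask, c=1.0)
```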

In this study, three-class fuzzy C-means is used to obtain a segmentation of the tissue over the brain mask \(B\) for the T1-MRI or PET, and we arbitrarily set c = 1. Early fusion is performed on the normalized MRI and PET data by simple concatenation. The first convolution layer of ResNet18 is changed using in_channel = 3, out_channel = 64, kernel_size = (3, 3), stride = (1, 1), padding = (1, 1), and bias = True. The classification method is then designed using the extracted features to distinguish the EMCI subjects from the LMCI subjects.
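A minimal PyTorch sketch of these two steps follows: the normalized MRI and PET slices are concatenated into a three-channel input, and ResNet18’s first convolution is replaced with the stated 3 × 3 configuration. How the third channel is filled is not specified in the text, so duplicating the MRI slice here is purely an assumption, as is reusing ImageNet-pretrained weights for the remaining layers.

```python
import torch
import torch.nn as nn
from torchvision import models

def fuse_mri_pet(mri, pet):
    """mri, pet: tensors of shape (H, W); returns a (3, H, W) fused input (channel order illustrative)."""
    return torch.stack([mri, pet, mri], dim=0)

def build_in3_resnet18(num_classes=2):
    model = models.resnet18(pretrained=True)  # on newer torchvision: weights="IMAGENET1K_V1"
    # First convolution changed as described: in=3, out=64, 3x3 kernel, stride 1, padding 1, bias.
    model.conv1 = nn.Conv2d(3, 64, kernel_size=(3, 3), stride=(1, 1),
                            padding=(1, 1), bias=True)
    model.fc = nn.Linear(model.fc.in_features, num_classes)  # EMCI vs. LMCI head
    return model

# fused = fuse_mri_pet(mri_slice, pet_slice).unsqueeze(0)    # add a batch dimension
# logits = build_in3_resnet18()(fused)
```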

The classification workflow for the concatenated MRI and PET data is as follows. First, the model for the classification of AD classes is imported, and the MRI and PET data are input. Early fusion of the two neuroimaging modalities is performed by direct concatenation. Using a holdout of 80%, the fused data are divided into training and validation sets for model training and validation. If the optimal result is achieved, the model is further tested on new data to obtain the classification result; otherwise, the hyperparameters of the model are updated to reach an optimal result, as shown in Fig. 5.

Fig. 5

Workflow diagram of the classification process

To extract meaningful and key information from the multimodal data, we introduce the training algorithm, which minimizes the cross-entropy loss and updates the hyperparameters. Stochastic gradient descent is utilized to optimize the parameters of the proposed multimodal model. The pseudocode of the learning algorithm for AD classification is shown in Algorithm 1. The parameters of ResNet18 are initialized and the learning rate is set to η. A mini-batch of fused input data is sampled from the training set for network model training.

Assuming there are Q classes, the cross-entropy loss for a batch of R samples can be represented as follows:

$${Y}_{c}= -\frac{1}{R}\sum _{i=1}^{R}\sum _{j=1}^{Q} {v}_{j}^{i}\,\log\left({S}_{j}^{i}\right)$$
(3)

\({v}_{j}^{i}\) is the label of the ith sample for class j, and \({S}_{j}^{i}\) is the corresponding softmax probability. Applying gradient descent to the loss function gradually updates the parameters, and the multimodal network is evaluated on the validation set at a frequency of F. The best model is obtained after training for Fmax iterations with optimal hyperparameters.

Algorithm 1 Learning algorithm for AD classification
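Since the pseudocode is given in Algorithm 1, the following is only a schematic PyTorch rendering of the same procedure, assuming DataLoaders of fused images and labels; the hyperparameters mirror those reported in Sect. 2.4, and the evaluation frequency F is taken to be one epoch for simplicity.

```python
import torch
import torch.nn as nn

def train(model, train_loader, val_loader, epochs=10, lr=1e-4, device="cuda"):
    model = model.to(device)
    criterion = nn.CrossEntropyLoss()                   # Eq. (3) averaged over the batch
    optimizer = torch.optim.SGD(model.parameters(), lr=lr,
                                momentum=0.9, weight_decay=0.1)
    for epoch in range(epochs):
        model.train()
        for x, y in train_loader:                       # mini-batch of fused data
            x, y = x.to(device), y.to(device)
            optimizer.zero_grad()
            loss = criterion(model(x), y)               # softmax + cross-entropy
            loss.backward()                             # gradient of Eq. (3)
            optimizer.step()                            # SGD parameter update
        model.eval()
        correct = total = 0
        with torch.no_grad():
            for x, y in val_loader:                     # validation at frequency F = 1 epoch
                pred = model(x.to(device)).argmax(dim=1).cpu()
                correct += (pred == y).sum().item()
                total += y.numel()
        print(f"epoch {epoch + 1}: validation accuracy {correct / total:.4f}")
```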

2.4 Experimental Setup

In this study, we designed the EMCI vs. LMCI binary classification task. In the first stage, noise removal and normalization were applied to each image. A train-validation split was used to assess the trained model’s generalization: the ADNI data were divided in the ratio 70%:30%, with 70% for training and the remaining 30% for validation. The effect of fuzzy C-means normalization and of the modification of the first convolutional layer was ascertained by training the MRI and PET data separately on an unmodified ResNet18, and then the fused data on ResNet18. Test samples were extracted from separate subjects outside of the training and validation sets. To reduce overfitting, we applied data augmentation with rotations of up to 15 degrees. The proposed model was implemented using the open-source library PyTorch and trained on an Nvidia TU116 (GeForce GTX 1660) GPU for ten epochs. The GPU architecture is highly efficient for training and deploying deep CNNs [51]. The optimizer used is stochastic gradient descent (SGD) with a learning rate of 0.0001, momentum of 0.9, and weight decay of 0.1, and the loss function used is cross-entropy. If the accuracy on the validation dataset did not improve after 5 epochs, or the loss on the validation dataset did not decrease within ten epochs, the learning rate was changed. To further reduce overfitting, ResNet18’s last layer was modified with a dropout of 0.5 and the number of epochs was increased. The standard accuracy measure was used to assess the proposed model’s performance.
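A hedged configuration sketch for this setup is shown below: the 15-degree rotation augmentation, the dropout of 0.5 before the final layer, and a plateau-based learning-rate change. ReduceLROnPlateau is our assumption for how “the learning rate was changed”; the paper does not name a specific scheduler.

```python
import torch
import torch.nn as nn
from torchvision import transforms

# Rotation augmentation of up to 15 degrees, as described in the text.
augment = transforms.Compose([
    transforms.RandomRotation(degrees=15),
    transforms.ToTensor(),
])

def add_dropout_head(model, num_classes=2, p=0.5):
    """Replace ResNet18's last layer with dropout (0.5) followed by a linear classifier."""
    model.fc = nn.Sequential(nn.Dropout(p=p),
                             nn.Linear(model.fc.in_features, num_classes))
    return model

# Optimizer as in the training-loop sketch; learning-rate change on a validation plateau:
# scheduler = torch.optim.lr_scheduler.ReduceLROnPlateau(optimizer, mode="max", patience=5)
# scheduler.step(validation_accuracy)   # called once per epoch
```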

3 Results

We evaluated the performance of the proposed in-3-channel ResNet18 model on the binary AD diagnostic task by examining the impact of intensity normalization, as well as the effect of altering ResNet18’s first convolutional layer. Furthermore, we compared the results of our proposed model with existing techniques. Table 2 reports the training and validation accuracies of unmodified ResNet18 on original and intensity-normalized MRI data at the epochs yielding the best results, along with the corresponding training and validation accuracies of the proposed model on original and normalized MRI data.

Table 2 Training accuracy (TA) and Validation accuracy (VA) of the proposed model with or without intensity normalization, with or without change in the first convolution layer on MRI Data

Figure 6 shows the confusion matrix of the in-3-channel model for EMCI vs. LMCI classification with normalized data at 10 epochs, where label zero represents EMCI and label one represents LMCI.
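For reference, the confusion matrix in Fig. 6 can be produced from the model’s test-set predictions as sketched below, assuming scikit-learn is available; label 0 denotes EMCI and label 1 denotes LMCI.

```python
import torch
from sklearn.metrics import confusion_matrix

def test_confusion_matrix(model, test_loader, device="cuda"):
    """Collect predictions on the test set and return the 2x2 confusion matrix."""
    model.eval()
    y_true, y_pred = [], []
    with torch.no_grad():
        for x, y in test_loader:
            y_pred.extend(model(x.to(device)).argmax(dim=1).cpu().tolist())
            y_true.extend(y.tolist())
    return confusion_matrix(y_true, y_pred, labels=[0, 1])   # rows: true, cols: predicted
```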

Fig. 6

The proposed model’s confusion matrix on test data (normalized data)

4 Discussion

The early diagnosis of Alzheimer’s disease (AD) is crucial for the effective management of the condition. Magnetic Resonance Imaging (MRI) and Positron Emission Tomography (PET) are two imaging modalities that have been widely used in AD research. While MRI provides detailed structural information about the brain, PET allows for the assessment of metabolic and functional changes associated with AD. Deep learning models can fuse information from both modalities, learn complex relationships between the imaging data, and improve diagnostic accuracy compared to using either modality alone.

The results of the proposed model are presented in Tables 2, 3 and 4. The VA of the in-3-channel model on MRI data increased by 0.32% as the number of epochs increased from 5 to 10, as depicted in Table 2. For PET data, there was little improvement with the increase in the number of epochs used in the training and validation phases, as shown in Table 3. An appreciable increase is seen in the training accuracy and VA of the proposed model on the fused MRI and PET data, as shown in Table 4. Figure 6 represents the total number of correct and wrong classifications for both classes.

Unlike the existing approaches for extracting discriminative features from multimodal MRI and PET data, we propose a novel in-3-channel model by modifying the first convolution layer of the ResNet18 architecture to have three input channels. Each channel learns the representation of the combination of different modalities by utilizing the greatest number of available samples. The key benefit of this fusion is that it allows us to train our model with more samples, which improves the classification performance. The combination of MRI and PET images improves the diagnosis of Alzheimer’s disease by providing a more comprehensive view of the brain and its functioning. MRI measures the decrease in brain volume and can identify abnormalities in the mesial temporal cortex and other regions of the brain, while PET measures the decrease in glucose concentration in the temporoparietal association cortex. By combining these data, a more accurate diagnosis of Alzheimer’s disease can be made, because the two modalities provide complementary information about the brain, and their combination can lead to a more robust and reliable diagnosis. While several previous methods in the literature used separate feature selection methods based on features such as cortical thickness, shape, and regional volume [21, 30], our proposed model automatically learns discriminative features from multimodality data in an end-to-end way. The best-performing algorithm in [27] gave a sensitivity of 81.2%, a specificity of 66.9%, and an accuracy of 72.5%.

The results depicted in Table 3 show that the VA decreased from 96.70 to 94.10% when the number of epochs was increased while analyzing the original (non-normalized) PET images with the proposed method. One possible reason for the decrease in VA is that the learning rate may be too high or too low for PET data, as this decrease did not occur for the MRI data or the fused data. Our proposed model achieved an accuracy of 73.90% on the test data from the ADNI database. As a result, the use of early fusion improved diagnostic accuracy by considering the complex relationships between the imaging data. The proposed concatenation-based feature fusion framework is effective in the diagnosis of Alzheimer’s disease using fused MRI and PET images. The framework concatenates the two modalities and trains a deep learning architecture on the combined dataset. The 3-in-channel approach is used to learn the most descriptive features of the fused images, leading to improved accuracy in the binary classification of Alzheimer’s disease. The experimental results on the ADNI database show that the proposed framework achieves a classification accuracy of 73.90%, demonstrating its effectiveness in the diagnosis of Alzheimer’s disease.

Table 3 Accuracy with or without intensity normalization, with or without change in first convolution layer on PET Data
Table 4 Accuracy with or without intensity normalization, on concatenated MRI and PET Data

5 Comparison of Proposed Model with Existing Studies

This section addresses RQ3, RQ4, and RQ5. The modified ResNet18 deep learning architecture performs well in the classification of Alzheimer’s disease using fused MRI and PET images. The 3-in-channel approach allows the architecture to learn the most descriptive features of the fused images, leading to improved accuracy in the binary classification task. The experimental results on the ADNI database show that the modified ResNet18 architecture achieved a classification accuracy of 73.90%, as shown in Table 5, demonstrating its effectiveness in the diagnosis of Alzheimer’s disease using fused MRI and PET images. Three previous studies [34, 35, 44] used Deep Neural Network (DNN) or Support Vector Machine (SVM) models to classify EMCI vs. LMCI and reported varying levels of accuracy, specificity, and sensitivity. The proposed model uses ResNet18 (3-in-channel) and achieves a higher level of specificity compared to the previous models, but lower levels of accuracy and sensitivity. The proposed model’s novelty lies in the use of ResNet18 (3-in-channel) and its ability to achieve high specificity, which may have implications for the EMCI classification of individuals.

The results of the deep learning-based diagnosis of Alzheimer’s disease can be explained using the proposed Explainable Artificial Intelligence (XAI) model. The XAI model allows for the interpretation of the results of the deep learning-based diagnosis, making the results more transparent and understandable. This can be particularly useful for clinicians who may not have experience with deep learning models and want to understand why a certain diagnosis was made. The proposed approach compares favorably with existing methods for diagnosing Alzheimer’s disease using MRI and PET images. The concatenation-based feature fusion framework and the modified ResNet18 deep learning architecture provide a more comprehensive view of the brain and its functioning by combining MRI and PET images. The experimental results on the ADNI database show that the proposed approach achieved higher specificity than existing methods, demonstrating its effectiveness in the diagnosis of Alzheimer’s disease. Additionally, the proposed XAI model provides interpretability to the deep learning-based diagnosis, making the results more transparent and understandable. Table 5 compares the proposed model with other existing works in the classification of AD.

Table 5 Comparison of Proposed Model with Existing Methods

6 Conclusion

This paper proposed a methodology for combining neuroimaging data from PET and MRI images to make an early diagnosis of AD. We introduced a novel 3-channel phase feature learning model for early fusion that concatenates and integrates MRI and PET neuroimaging data simultaneously for the early diagnosis of AD. Our proposed model can learn latent representations of the multimodal data even in the presence of heterogeneous data; hence, the proposed model partially solves the issue of heterogeneity between the MRI and PET data. This 3-channel phase feature learning makes the maximum number of samples available during training based on multimodality data; thus, more imaging modalities could be added to the model. We achieved improved classification performance over existing techniques. Fusing MRI and PET images using deep learning models with additional preprocessing of the data is an important and relevant approach for the early diagnosis of AD. These models have the potential to improve diagnostic accuracy and can help to identify imaging biomarkers that are associated with the disease. The results showed that the use of intensity normalization and early fusion techniques significantly improved the classification accuracy of AD. The accuracy improvement is attributed to the better alignment of the image intensities and the integration of complementary information from both modalities. Further research is needed to optimize the use of these models in clinical practice by effectively fine-tuning the hyperparameters. Furthermore, the use of data-fused deep learning models can help to identify imaging biomarkers that are associated with AD, which could aid in the development of new therapies for the disease.