Alzheimer’s Disease Prediction Using Attention Mechanism with Dual-Phase 18F-Florbetaben Images

Kang, Hyeon; Kang, Do-Young

doi:10.1007/s13139-022-00767-1

Alzheimer’s Disease Prediction Using Attention Mechanism with Dual-Phase ¹⁸F-Florbetaben Images

Original Article
Open access
Published: 12 August 2022

Volume 57, pages 61–72, (2023)
Cite this article

Download PDF

You have full access to this open access article

Nuclear Medicine and Molecular Imaging Aims and scope Submit manuscript

Alzheimer’s Disease Prediction Using Attention Mechanism with Dual-Phase ¹⁸F-Florbetaben Images

Download PDF

2364 Accesses
1 Citation
7 Altmetric
1 Mention
Explore all metrics

Abstract

Introduction

Amyloid-beta (Aβ) imaging test plays an important role in the early diagnosis and research of biomarkers of Alzheimer’s disease (AD) but a single test may produce Aβ-negative AD or Aβ-positive cognitively normal (CN). In this study, we aimed to distinguish AD from CN with dual-phase ¹⁸F-Florbetaben (FBB) via a deep learning–based attention method and evaluate the AD positivity scores compared to late-phase FBB which is currently adopted for AD diagnosis.

Materials and Methods

A total of 264 patients (74 CN and 190 AD), who underwent FBB imaging test and neuropsychological tests, were retrospectively analyzed. Early- and delay-phase FBB images were spatially normalized with an in-house FBB template. The regional standard uptake value ratios were calculated with the cerebellar region as a reference region and used as independent variables that predict the diagnostic label assigned to the raw image.

Results

AD positivity scores estimated from dual-phase FBB showed better accuracy (ACC) and area under the receiver operating characteristic curve (AUROC) for AD detection (ACC: 0.858, AUROC: 0.831) than those from delay phase FBB imaging (ACC: 0.821, AUROC: 0.794). AD positivity score estimated by dual-phase FBB (R: −0.5412) shows a higher correlation with psychological test compared to only dFBB (R: −0.2975). In the relevance analysis, we observed that LSTM uses different time and regions of early-phase FBB for each disease group for AD detection.

Conclusions

These results show that the aggregated model with dual-phase FBB with long short-term memory and attention mechanism can be used to provide a more accurate AD positivity score, which shows a closer association with AD, than the prediction with only a single phase FBB.

Deep learning application for the classification of Alzheimer’s disease using 18F-flortaucipir (AV-1451) tau positron emission tomography

Article Open access 19 May 2023

Novel Iterative Attention Focusing Strategy for Joint Pathology Localization and Prediction of MCI Progression

Early Diagnosis of Alzheimer's Disease Using 3D Residual Attention Network Based on Hippocampal Multi-indices Feature Fusion

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Approximately 50 million people worldwide suffer from dementia, and nearly 10 million new cases occur every year. The total population with such dementia is expected to be 82 million by 2030 and 152 million by 2050 [1]. Alzheimer’s disease (AD), the most common cause of dementia, is complex and multi-factorial in elucidating the continuum of conditions leading to asymptomatic, mild cognitive impairment, and dementia. Amyloid-β (Aβ), which can be measured through positron emission tomography (PET) scan or cerebrospinal fluid analysis, is one of those defining the pathology of AD and is known as the earliest sign among AD biomarkers. Therefore, Aβ-related biomarkers have been studied for a clinical diagnostic index as well as for early diagnosis or prediction [2,3,4]. However, as AD is known to be affected by neurofibrillary tangles aggregated by hyperphosphorylated tau protein, genetics, and environmental influences as well [5], both Aβ-negative AD and Aβ-positive CN inevitably exist [6]. In addition, it is difficult to monitor the patient’s condition because Aβ plaques are already saturated by the time cognitive function clinically declines [7]. These facts remind us how additional AD biomarkers are required to understand and respond to AD. 18F-Fluorodeoxyglucose (FDG), which is a radiopharmaceutical that enables imaging of changes in glucose metabolism in brain tissue, is another one of representative AD biomarker. Hypometabolism, which is measured using FDG-PET, is known to be associated with neurodegeneration and cognitive decline [8]. However, such a series of PET imaging tests have drawbacks that make patients who need a diagnosis or longitudinal studies for AD undergo relatively frequent radiation exposure and high financial expenditure.

Aβ uptake in early-phase Aβ-PET is known to be a potential perfusion imaging modality that reflects cerebral blood flow [9,10,11]. Reference [4] reviewed the coupled relationship between hypoperfusion which causes deleterious changes in neurons and cerebral hypometabolism which underlies neuronal/synaptic dysfunction with the respective associations with cognitive impairment. Given an adequate evaluation of neuronal function and Aβ load from dual-phase Aβ-PET imaging, we may be able to provide patients with a more accurate AD diagnosis and prognostic evaluation without compromising patient convenience. Compared to late-phase Aβ-PET, however, there is no consensus or a well-established guide regarding how to interpret and evaluate the potential perfusion imaging for AD.

In the field of imaging biomarkers, various efforts have been made to provide an improved quality of medical services continuously. In particular, the latest technologies incorporating artificial intelligence have been reported to show a consistent inference and classification performance comparable to a human doctor. Such technologies are excellent at not only reducing a portion of manual labor of human doctors but also addressing inter-observation problems [12, 13]. In addition, machine learning–based studies on imaging for AD biomarkers are also actively reported [14]. Existing machine learning–based studies for AD have commonly suggested some predictive models that learn single or more than two kinds of imaging data such as magnetic resonance imaging, FDG, or Aβ-PET. Those attempts using a variety of information for AD detection could be appropriate solutions that address the complex and heterogeneous characteristics of AD.

In this study, we aimed to develop and evaluate an improved AD prediction model in the machine learning algorithm by engaging with dynamic early-phase Aβ-PET as well as single late-phase Aβ-PET conventionally used for AD diagnosis. The method included (1) extracting the mean of the standard uptake value ratio (SUVr) with a consistent area from individual dual-phase Aβ-PET imaging; (2) selecting a machine learning–based predictive model, which estimates the AD positivity score; and (3) comparing the classification performance among models and evaluating the association between predicted AD positivity scores and cognitive function or occurrence of AD.

Materials and Methods

Participants

We adopted FBB PET as an imaging biomarker to evaluate Aβ and retrospectively recruited subjects who visited the Department of Neurology and Nuclear Medicine of the Dong-A University Hospital (DAUH) and underwent dual-phase FBB from November 2015 to June 2020. The total number of subjects was 264, consisting of 74 cognitive normal (CN) and 190 AD. Detailed demographic data of the participants are presented in Table 1. All CN cases had normal age-, gender-, and education-adjusted performance on standardized cognitive tests. The AD participants met the following inclusion criteria: (1) criteria for dementia according to the Diagnostic and Statistical Manual of Mental Disorders 4th Edition (DSM-IV-TR) [15] and (2) the criteria for probable AD according to the NIA-AA core clinical criteria [16]. The individual FBB PET imaging for Aβ load was visually evaluated by the brain Aβ plaque load (BAPL) scoring system, which defines a BAPL score of 1 (no Aβ load), 2 (minor Aβ load), and 3 (significant Aβ load) [17]. Dong-A University Hospital Institutional Review Board (DAUHIRB) reviewed this study with the member who participated in Institutional Review Board Membership List III and finally approved this study protocol (DAUHIRB-17-108). All procedures for data acquisition were by the ethical standards of DAUHIRB with the 1964 Helsinki Declaration and its later amendments or comparable ethical standards. We guarantee that informed consent was obtained from all participants for this study.

Table 1 Demographics of experimental data with dual-phase F18-Florbetaben imaging

Full size table

PET Acquisition

All FBB PET imaging was performed using a Biograph 40mCT Flow PET/CT scanner (Siemens Healthcare, Knoxville, TN, USA) and reconstructed through UltraHD-PET (TrueX-TOF). A dose of 300 MBq FBB was injected intravenously in resting conditions. Dynamic frames were acquired from 0 to 20 min and from 90 to 120 min post-injection after helical CT with a 0.5-s rotation time at 100 kVp and 228 mAs. The image acquisition time for dual-phase FBB PET was determined by related studies to sufficiently include the peak of Aβ uptake for early-phase FBB PET (eFBB) and the manufacturer’s recommendations for delay-phase FBB PET (dFBB) [10, 17, 18]. The acquired dynamic eFBB and static dFBB were 27 frames of 128 × 128 × 110 (3.19 mm × 3.19 mm × 1.5 mm) resliced from a field of view of 408 mm × 408 mm × 165 mm, and one frame of 400 × 400 × 110 (1.02 mm × 1.02 mm × 1.5 mm) resliced from a field of view of 408 mm × 408 mm × 165 mm, respectively. Static eFBB to evaluate potential perfusion was made by averaging the frames corresponding to 2–7 mins from dynamic eFBB. The optimal time period required to obtain static eFBB was internally determined using the approach in Reference [10].

Data Pre-processing

We adopted a series of pre-processing procedures to extract regional mean SUVr for dynamic eFBB or static eFBB/dFBB, respectively, and each step was as follows. For the spatial normalization of all PET images, we used an in-house eFBB PET template [19], which averaged 8 CN and 8 AD randomly selected from the spatially normalized FBB data pool in Montreal Neurological Institute (MNI) space [20]. Each static eFBB was spatially non-linearly registered to the template space. For the dynamic eFBB of a case, we created a deformation field to represent the transformation from the mean of the total number of frames to the template space and applied it to each frame. The deformation field for a dFBB was identical to that derived from spatial normalization of the matched static eFBB [21]. As a result, the spatially registered imaging was in a voxel space of 95 × 79 × 68 (height × width × depth). We merged the Hammers atlas [22] into 7 representative regions (frontal lobe, temporal lobe, parietal lobe, anterior cingulate cortex, posterior cingulate cortex, and cerebellum) for the reference region for count normalization and volume of interest for estimating the mean SUVr. After spatial normalization, the intensities of each image were normalized with respect to the mean uptake of the whole cerebellar region as a reference region. Finally, for static eFBB/dFBB and dynamic eFBB, regional mean SUVr of 6 × 1 and regional time-activity curve (TAC) data of 6 × 27 (number of target regions × temporal length) were obtained, respectively.

Calculation of AD Positivity Score Based on Brain Blood Perfusion and Amyloid-β Plaque

To calculate the AD positivity score from regional SUVr, we build a neural network (NN)–based classification model to predict the probability of whether the given regional TAC or mean SUVr data belong to the CN or AD distribution. Figure 1 shows the structure of our proposed framework that predicts AD using dual-phase FBB. The whole aggregated NN (NN_aggregated) in Fig. 1 consists of three modular networks: long short-term memory (LSTM_eFBB) model to extract temporal features from dynamic eFBB, feedforward neural network (NN_dFBB) for dFBB, and following NN (NN_Dx) to make a final diagnosis decision from the phase-specific features for each phase of FBB delivered from the preceding layers. In particular, we adopted an attention mechanism [23] to adaptively select the phase-specific features for AD detection under biomarkers’ disagreement. We describe the details of each modular networks and the attention mechanism layer connecting them in the following section.

Context vector encoded by attention mechanism layer and 1st aggregated features are pooled and used to infer AD positivity score. NN_Dx has an output layer with two nodes leading to the softmax function to interpret the model output as the probability for diagnostic labels, and their model parameters were trained to minimize the cross-entropy loss between the predicted probability and one-hot encoded actual label. To evaluate the efficacy and feasibility of the proposed model, we compared it against representative methods such as support vector machine (SVM) [24], and random forest (RF) [25] as a baseline.

Three Modular Networks for Independent Feature Extraction and Aggregation

We built the whole network into a combination of individual modules that are responsible for the independent task of performing AD classification. Long short-term memory (LSTM) is well known for handling long-term dependencies of temporal features using three types of gates (input, forget, and output gates) and memory cells [26, 27]. LSTM_eFBB produces phase-specific features for AD classification from regional TAC data. We first applied this LSTM layer on regional TAC data (6 × 27) to produce the temporal feature. Then, we applied layer normalization to reduce training time and stabilizing the hidden state dynamics in the previous recurrent neural network layer [28]. All of LSTM layers in NN_aggregated were followed by individual layer normalization. After two layers of LSTM, we applied feed forward layer (FC) on the output (6 × 1) at the last time step to encode high-level phase-specific feature. All of FC in NN_aggregated were followed by the pre-defined layer block, which are batch normalization, ReLU activation [29], and dropout layer [30]. To encode phase-specific feature for dFBB, we used a 4-layer FC followed by the pre-defined layer block which was explained above. Finally, we produced comprehensive functional features from two types of phase-specific features and phase attention which we present in the following section and AD positivity score by applying single-layer FC (NN_Dx).

Attention Mechanism for Adaptive Phase-Specific Feature Selection

In this work, we focus on adaptive phase-specific feature selection to address biomarkers’ disagreement. We adopt an attention method proposed by Luong et al. [23] to adaptively select proper evidences to predict AD positivity. Assume that a subject has N phase-specific hidden features h_i with i ∈ [1, P], H ∈ R^D and h^′ with H^′ ∈ R^D as a 1st aggregated hidden feature by concatenating N phase-specific hidden features and applying single-layer FC (Fig. 1). To highlight more informative phase to form the 1st aggregated feature for AD detection, we introduce a phase context vector C created from h_i, h^′ as the input of this mechanism as follows:

$${e}_i=f\left({h}_i,\kern0.5em {h}^{\prime}\right),$$

(1)

$${a}_i=\frac{\exp \left({e}_i\right)}{\sum_{k=1}^N\exp \left({e}_k\right)},$$

(2)

$$C=\sum\nolimits_{i=1}^N{a}_i{h}_i$$

(3)

where f is simple neural network that aggregates all of phase-specific hidden features h_i and reference feature h^′. The simple network can be written as follows:

$$A= softmax\left(\mathit{\tanh}\left( XW+b\right)\right)$$

(4)

Here, X is concatenated feature according to each phase between h_i and h^′ as X ∈ R^N × 2D. W and b are model parameters which will be learned to make attention score A with W ∈ R^2D × 1, b ∈ R^N × 1, and A ∈ R^N × 1. Finally, phase context vector C is the weighted sum of H with A as (3). And the context vector C and 1st aggregated feature will be used to encode 2nd aggregated feature in Fig. 1.

Detailed Parameters for Model Selection and Model Evaluation

For our experiment, we focused on showing that the model with dual-phase FBB is more useful for estimating AD positivity than a model with only dFBB. Therefore, we tried to simplify and unify the model structure and detailed parameters of each model as much as possible. NN-based models, including LSTM, have two hidden layers, with six nodes of each hidden layer. To prevent neural networks from overfitting, we apply L2 regularization with a weight of 0.01 and dropout layer with dropout rate of 0.2. The learning curves of all models were set to be trained up to 10,000 epochs but were stopped if the validation loss was not updated more than 200 times. The learning rate was 0.00001, and the Adam optimizer [31] was used for each setting. If the validation loss was not updated more than 100 times at a point, 0.001 of the decay rate was applied to the learning rate of the point.

SVM used in the experiment used a linear kernel as a kernel function. A radial basis function or polynomial kernel was also tested in an internal experiment but no meaningful difference was observed, and a simpler model was finally adopted to prevent overfitting. RF was trained with a max depth of 2 and a number of estimators of 1000, and gini inpurity [32] was used to measure the quality of a split. The hyperparameters of both comparative models were heuristically determined.

For model selection and evaluation, our dataset was split into training, validation, and testing with ratios of 0.6, 0.1, and 0.3, respectively. We use stratified sampling so that the ratio of diagnostic labels according to Aβ load in each data was same. The data split was the same for each phase of the dataset and all experiments. The previously preprocessed TAC and SUVr datasets were last subjected to min-max normalization before being input to a predictive model after the split.

The software used in this experiment was the SPM12 library and MATLAB R2020a for the data pre-processing, including spatial normalization, and count normalization, for evaluating the pre-processed image with t-contrast, and for calculating regional mean SUVr based on the Hammers atlas [22]. Keras 2.2.4 library and Python 3.6.9 were used to select and evaluate a model for estimating AD positivity. The experimental tool was implemented and tested on Linux Ubuntu 16.04 LTS with an Intel Core i7-6800K CPU and two GPUs (NVIDIA GeForce GTX 1080).

Statistical Analysis

We used independent-sample t-tests for numerical variables such as age and education and Pearson’s Chi-square test for categorical variables such as sex, FBB reading, and K-MMSE to determine whether the characteristics of subjects in our experimental dataset are biased according to the diagnostic label. For the demographic analysis, we used IBM SPSS statistics version 23. To evaluate the classification performance of trained models, we calculated the accuracy (ACC) and area under the receiver operating characteristic curve (AUROC) for AD detection using DeLong’s method [33] and Spearman correlation between predicted AD positivity scores and neuropsychological tests/actual diagnostic label. For these processes, we used MedCalc version 18.9.1 (MedCalc Software). In all tests, the statistical significance level was set at p < 0.001 with a two-sided test.

Results

Data Demographics

As Table 1 shows, there was no statistically significant difference between the CN and AD groups in age, sex, and education variables. The results of K-MMSE (which is the dominant variable in the diagnosis of AD and reflects cognitive function) and dFBB readings (which reflect a state of Aβ plaque load) showed statistically significant differences between groups. Therefore, the retrospective data used in the experiment differed only in the cognitive function and hallmark pathology that directly affect the diagnostic label, but no bias was observed in other factors. Our experimental data included 20.83% of Aβ-positive CN and 16.84% of Aβ-negative AD.

Pre-processed Imaging Data for TAC and SUVr

For the result of spatial registration, Fig. 2a shows static eFBB and dFBB registered in MNI space, which is randomly selected from each diagnostic label, compared with raw images of those in native space. As a result of pre-processing, it was confirmed that the spatial characteristics of individual imaging disappeared after they were transformed into MNI space but functional characteristics remained according to the diagnostic label.

In Fig. 2b, to check whether the functional information of eFBB on our pre-processing method and selected time period is feasible, eFBB (2–7 min) was observed by t-contrast according to the diagnostic label. The functional information of dFBB was omitted because the results have already been verified through previous studies [21]. In this study, t-contrast was applied to the eFBB images, and the voxel-wise difference between the two group (CN vs. AD) was calculated and visualized. As a result of t-contrast, the relative contrast of AD group is dominantly lower than CN, except for the cerebellar area in all of the 4 comparisons regardless of Aβ distribution.

AD Classification Performance

Table 2 shows the AD classification performance of ML-based predictive models. LSTM (ACC: 0.792, AUROC: 0.775, F1: 0.849, G-mean: 0.773) was the best model for eFBB. RF (ACC: 0.736, AUROC: 0.584, F1: 0.835, G-mean: 0.467), NN (ACC: 0.726, AUROC: 0.648, F1: 0.813, G-mean: 0.467), and SVM (ACC: 0.708, AUROC: 0.746, F1: 0.763, G-mean: 0.740) followed. For the classifier of static dFBB, which is used for conventional FBB reading, NN (ACC: 0.821, AUROC: 0.794, F1: 0.872, G-mean: 0.792) was the best model for AD detection with dFBB (NN_dFBB) and RF (ACC: 0.802, AUROC: 0.721, F1: 0.868, G-mean: 0.696) and SVM (ACC: 0.755, AUROC: 0.799, F1: 0.803, G-mean: 0.792) followed. In comparison among all kinds of FBB, the NN_aggregated was the best model (ACC: 0.858, AUROC: 0.831, F1: 0.901, G-mean: 0.828), which trained dual-phase FBB, followed by NN_dFBB that learned dFBB.

Table 2 Comparison of predictive performance for Alzheimer’s disease classification

Full size table

AD positivity scores measured by three models (NN_aggregated, LSTM_eFBB, and NN_dFBB) with each phase of FBB (dual-phase FBB, dynamic eFBB, and static dFBB) in the test data are presented in Table 3. NN_aggregated (AUROC: 0.854) trained dual-phase FBB was able to detect AD better than LSTM_eFBB (AUROC: 0.841) and NN_dFBB (AUROC: 0.851). In comparison of AUROC in Aβ-negative distribution (Aβ (−) CN vs. Aβ (−) AD), the NN_aggregated (AUROC: 0.837) was the best, followed by LSTM_eFBB (AUROC: 0.792) and NN_dFBB (AUROC: 0.731). In Aβ-positive distribution (Aβ (+) CN vs. Aβ (+) AD), the NN_aggregated (AUROC: 0.901) was the best as well, followed by LSTM_eFBB (AUROC: 0.812) and NN_dFBB (AUROC: 0.706). Figure 3 shows the distribution of AD positivity scores predicted by the trained models on the test set. Figure 3(b) shows that there are many misclassifications in the distribution of Aβ (+) CN and Aβ (−) AD because NN_dFBB is only referring to the Aβ pathology. On the other hand, Fig. 3a and c demonstrate that LSTM_eFBB and NN_aggregated can relatively correctly predict the AD positivity score in the distribution of Aβ (+) CN and Aβ (−) AD. In particular, NN_aggregated using two features shows a remarkably correct classification of Amyloid negative AD than LSTM_eFBB.

Table 3 Comparison of AUROC of AD positivity scores according to specific distribution

Full size table

Input and Feature Distribution According to the Visual Reading of dFBB and Diagnostic Label

Feature visualization provides a useful means of guessing how well a deep learning model understands the input data to achieve its learning goals. This can be addressed using t-distributed stochastic neighbor embedding (t-SNE), which is a kind of dimensionality reduction method designed to visualize high-dimensional data in a two- or three-dimensional map [34]. t-SNE prepares a neural network to understand target data distribution and is iteratively trained by gradient descent method so that the distance between data points low-dimensional data representation is similar to that in high-dimensional space. In Fig. 4, the distributions of inputs and features in the last hidden layer of the NN-based model according to the phase of FBB are shown in a two-dimensional space using t-SNE. Figure 4c and e show the distribution of mean SUVr and features extracted from dFBB, and those do not seem to fully explain Aβ-positive CN and Aβ-negative AD. The distribution of mean SUVr and features extracted from eFBB shown in Fig. 4a, b, and d appears to be that Aβ-negative AD distribution is closer to Aβ-positive AD distribution compared to those extracted from dFBB. However, it is observed that the Aβ-positive CN distribution is still close to that of Aβ-positive AD. On the other hand, in Fig. 4f, the feature distribution extracted from dual-phase FBB showed the separated representation rather than entangled for Aβ-negative CN and Aβ-positive AD.

Association Between AD Positivity Score and Neuropsychological Test

Figure 5 shows the AD positivity score distribution of each phase of FBB according to neuropsychological test results. For AD cases with a low score of MMSE, NN_dFBB hardly shows a high AD positivity score. On the other hand, NN_aggregated and LSTM_eFBB suggested high AD positivity scores for cases with decreased cognitive function. In the correlation analysis, AD positivity score from NN_aggregated is best correlated with neuropsychological test results (R: −0.5412, p < 0.0001). The correlation of LSTM_eFBB (R: −0.4613, p < 0.0001) and NN_dFBB (R: −0.2975, p < 0.0022) followed.

Observation of the Overall Behavior of the LSTM on Early-Phase FBB

Explaining a model prediction helps to understand the distribution of training data or the behaviors taken by the model to solve a given problem [35,36,37]. One approach for explaining deep NN decisions is by multiplying the partial derivative of the model prediction and the actual input feature, also referred to as simple Taylor decomposition [38], and this method also serves as a baseline for many related studies [39, 40]. The resulting relevance map can provide a feature-wise heatmap same as the input size and be understood as the product of sensitivity of how much the feature contributes to the model prediction and saliency of how much the feature is presented in the sample [40]. Figure 6 shows which part of the data the LSTM trained on eFBB observes for AD detection. In the comparison of the mean composite relevance in Fig. 6(b), CN shows a markedly high relevance in the 2nd to 5th frames and a remarkably low relevance in the 9th to 15th frames. On the other hand, AD shows a rather high relevance in the 4th to 7th frames and was generally maintained until the last frame. In the comparison of the mean regional relevance maps shown in € and (f), CN shows a remarkably high relevance in the anterior cingulate in the 15th to 25th frames. AD shows a higher overall relevance than CN, including the anterior cingulate and occipital lobe regions.

Discussion

We designed a predictive model to successfully improve the conventional imaging biomarkers with only static dFBB by engaging in dynamic eFBB based on the following two assumptions: (1) The potential blood flow information included in eFBB is sufficiently distinguished from dFBB and they provide complementary information with respect to AD diagnosis. (2) The temporal information included in dynamic eFBB can be represented as an embedding vector representing blood flow information by the LSTM model. In the remaining paragraphs, we will elucidate the experimental results or related problems concerning the hypotheses above.

Compared with the use of only dFBB in the conventional context, to improve the accuracy of AD detection by engaging dual-phase FBB, eFBB and dFBB must contain sufficient complementary information regarding AD, that is, eFBB should be able to sufficiently explain AD in different aspects from dFBB. As shown in Fig. 2 and Table. 3, we tried to confirm whether the potential perfusion information of eFBB is suitable for this experiment. Even though the deformation field used for registration in eFBB was applied to dFBB, both eFBB and dFBB were located in the MNI space in our visual observation, and the Aβ load pattern in a region of gray matter was still observed in each preprocessed dFBB. In voxel-based analysis, hypo-perfusion was observed in the AD group regardless of the Aβ distribution (Fig. 2). From the comparison of characteristics between the same Aβ distributions in Table 3, it was observed that the AD positivity score from eFBB explained AD distribution better than from dFBB, which meant that it was difficult to discriminate the diagnostic label with dFBB in the same Aβ distribution. Therefore, the dynamic or static eFBB acquired from our experimental protocol is meant to be complementary to the uptake of dFBB for AD detection, and the improved classification performance of the NN_aggregated could be based on the additional potential blood flow data.

LSTM is a representative NN for time series data that ultimately understands the long-term contextual information by managing the cell state necessary to determine the output from the input over time through input, output, and forget gates [41]. In terms of research on medical data, LSTM has been frequently used in EEG/ECG [42], imaging reports, electronic health records, and static or dynamic imaging data [43, 44], which include temporal information. A common delay-phase static PET image is acquired at the acquisition time determined by investigating the pseudo-equilibrium interval in which specific binding remains stable through TAC data and considering other parameters such as image quality and diagnostic accuracy. In the case of the FBB radiotracer, the manufacturer provides acquisition time for the delay phase, not for the early phase. In eFBB, the optimal acquisition time interval closest to potential perfusion cannot be found in the stable state owing to the curve that changes rapidly around the peak; therefore, the interval must be determined exploratory. Even the interval for ideal potential perfusion imaging is not deterministic and may vary from case to case. As a related work, it was mainly considered in studies that explored a specific acquisition time based on the similarity between eFBB and FDG images. They randomly selected an interval, including the peak uptake[11], or searched for a combination of the start time and time window to determine the acquisition time with the correlation most similar to FDG [10]. Figure 6 shows that the temporal and spatial features observed by LSTM trained on the eFBB differ according to the diagnostic label. These results suggest the presence of the temporal features of the eFBB for AD detection and non-determinism of the acquisition time interval of the ideal potential perfusion image. In Table 2, the LSTM model showed better performance than the NN model trained static eFBB at 2–7 min, which had a good correlation with FDG in our prior study [18]. These experimental results may indicate that the LSTM could understand the temporal features required for AD classification from potential perfusion information in dynamic eFBB and the calculation of optimal acquisition time could be omitted.

Figure 3a and b show that eFBB and dFBB discriminate AD from CN using different features of each image. In Fig. 3b, most misclassifications occurred in the Aβ-positive CN and Aβ-negative AD populations, whereas, in Fig. 6a, the eFBB classifier consistently scores a proper AD positivity for CN or AD regardless of Aβ distribution. Therefore, it could be considered that the performance of the dual-phase FBB classifier originates from the state of neuronal injury by comprehensively evaluating the degree of hypo-perfusion from eFBB and Aβ plaque deposition from dFBB, respectively (Fig. 3c). In Table 3, AD positivity scores calculated by dual-phase FBB for the entire population showed the AUROC, which had no statistically significant difference compared to the MMSE, and better classification performance than those calculated using only dFBB regardless of Aβ distribution. These results may indicate that it is possible to improve the evaluation of the degree of neuronal damage in research or clinically when the AD positivity score of dual-phase FBB is provided. In addition, it could provide a quantitative index to nuclear medicine physicians to explain false negative/positive cases in FBB imaging tests. This quantitative method could be considered for application to other types of tracers or PET imaging where early-phase PET reflects potential perfusion information.

This study proposes a quantitative method for the interpretation of dual-phase FBB at this point when the evaluation criteria for potential perfusion information of eFBB have not yet been established. Ultimately, it could help to reduce the radiation exposure and costs for patients with AD, and for a nuclear medicine physician, it could be a helpful tool in visual assessment for dual-phase FBB. On the other hand, as a limitation of this study, the predictive model analyzing dual-phase FBB needs to be evaluated in terms of external validation or clinical validity in the future. As mentioned earlier, AD is associated with neurofibrillary tangles aggregated by phosphorylated tau, CSF biomarker, genetics, and environmental factors, in addition to Aβ plaque accumulation. Given additional clinical and laboratory data in the future, it would be possible to develop a predictive model that aggregates various predictive factors for AD in addition to improving the performance of the quantitative model in this study. Furthermore, if a suitable amount of data is collected for the study, the application of the CNN algorithm, which is recently playing an important role as an image processing method, is left for our future work.

Conclusion

In this paper, we report on how to interpret dual-phase FBB using ML-based models and their evaluation results. In comparison with the AD classification, the model trained on mean SUVr extracted from dual-phase FBB imaging (ACC: 0.858, AUROC: 0.831) showed better AD classification than single-phase FBB, eFBB (ACC: 0.792, AUROC: 0.775), or dFBB (ACC: 0.821, AUROC: 0.794). In addition, the AD positivity score estimated by dual-phase FBB (R_MMSE: −0.5412) shows a higher correlation with psychological test result compared to only dFBB (R_MMSE: −0.2975). These experimental results show that the proposed method could be used to interpret eFBB in dual-phase FBB and that by reflecting eFBB into the current reading system, Aβ-PET reading, AD diagnosis, or the monitoring system could be improved.

Data Availability

The data used for the study are available on request from the corresponding author.

References

World Health Organization, Risk reduction of cognitive decline and dementia. 1st ed. World Health Organization; 2019.
Villemagne VL, Rowe CC, Macfarlane S, Novakovic K, Masters CL. Imaginem oblivionis: the prospects of neuroimaging for early detection of Alzheimer’s disease. J Clin Neurosci. 2005;12:221–30.
Article PubMed Google Scholar
Villemagne VL. Amyloid imaging: past, present and future perspectives. Ageing Res Rev. 2016;30:95–106.
Article CAS PubMed Google Scholar
Daulatzai MA. Cerebral hypoperfusion and glucose hypometabolism: key pathophysiological modulators promote neurodegeneration, cognitive impairment, and Alzheimer’s disease. J Neurosci Res. 2017;95:943–72.
Article CAS PubMed Google Scholar
Chételat G. Aβ-independent processes—rethinking preclinical AD. Nat Rev Neurol. 2013;9:123–4.
Article PubMed PubMed Central Google Scholar
Landau SM, Mintun MA, Joshi AD, Koeppe RA, Petersen RC, Aisen PS, et al. Amyloid deposition, hypometabolism, and longitudinal cognitive decline. Ann Neurol. 2012;72:578–86.
Article CAS PubMed PubMed Central Google Scholar
Jack CR Jr, Knopman DS, Weigand SD, Wiste HJ, Vemuri P, Lowe V, et al. An operational approach to National Institute on Aging–Alzheimer’s Association criteria for preclinical Alzheimer disease. Ann Neurol. 2012;71:765–75.
Article PubMed PubMed Central Google Scholar
Mosconi L, Mistur R, Switalski R, Tsui WH, Glodzik L, Li Y, et al. FDG-PET changes in brain glucose metabolism from normal cognition to pathologically verified Alzheimer’s disease. Eur J Nucl Med Mol Imaging. 2009;36:811–22.
Article CAS PubMed PubMed Central Google Scholar
Rostomian AH, Madison C, Rabinovici GD, Jagust WJ. Early 11C-PIB frames and 18F-FDG PET measures are comparable: a study validated in a cohort of AD and FTLD patients. J Nucl Med. 2011;52:173–9.
Article PubMed Google Scholar
Tiepolt S, Hesse S, Patt M, Luthardt J, Schroeter ML, Hoffmann K-T, et al. Early [18F] florbetaben and [11C] PiB PET images are a surrogate biomarker of neuronal injury in Alzheimer’s disease. Eur J Nucl Med Mol Imaging. 2016;43:1700–9.
Article CAS PubMed Google Scholar
Daerr S, Brendel M, Zach C, Mille E, Schilling D, Zacherl MJ, et al. Evaluation of early-phase [18F]-florbetaben PET acquisition in clinical routine cases. NeuroImage: Clinical. 2017;14:77–86.
Article PubMed Google Scholar
Gulshan V, Peng L, Coram M, Stumpe MC, Wu D, Narayanaswamy A, et al. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. Jama. 2016;316:2402–10.
Article PubMed Google Scholar
Lakhani P, Sundaram B. Deep learning at chest radiography: automated classification of pulmonary tuberculosis by using convolutional neural networks. Radiology. 2017;284:574–82.
Article PubMed Google Scholar
Jo T, Nho K, Saykin AJ. Deep learning in Alzheimer’s disease: diagnostic classification and prognostic prediction using neuroimaging data. Front Aging Neurosci. 2019;220.
Bell CC. DSM-IV: diagnostic and statistical manual of mental disorders. Jama. 1994;272:828–9.
Article Google Scholar
McKhann GM, Knopman DS, Chertkow H, Hyman BT, Jack CR Jr, Kawas CH, et al. The diagnosis of dementia due to Alzheimer’s disease: recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer's disease. Alzheimers Dement. 2011;7:263–9.
Article PubMed PubMed Central Google Scholar
Barthel H, Gertz H-J, Dresel S, Peters O, Bartenstein P, Buerger K, et al. Cerebral amyloid-β PET with florbetaben (18F) in patients with Alzheimer’s disease and healthy controls: a multicentre phase 2 diagnostic study. Lancet Neurol. 2011;10:424–35.
Article CAS PubMed Google Scholar
Shin H, Yoon H-J, Kang H, Lee S, Jeung Y, Kang D-Y. Optimal time frame for early-phase F-18-FBB brain PET compared to static F-18-FDG brain PET. Korean Soc Nucl Med. Online. 30-31st October. 2020;54:–98.
Kang H, Kang D-Y. Prediction of Alzheimer’s disease from early phase 18F-Florbetaben PET via LSTM. Korean Soc Nucl Med. Online. 30-31st October. 2020, 105;54.
Hutton C, Declerck J, Mintun MA, Pontecorvo MJ, Devous MD, Joshi AD, et al. Quantification of 18 F-florbetapir PET: comparison of two analysis methods. Eur J Nucl Med Mol Imaging. 2015;42:725–32.
Article CAS PubMed Google Scholar
Bae S, Choi H, Whi W, Paeng JC, Cheon GJ, Kang KW, et al. Spatial normalization using early-phase [18F] FP-CIT PET for quantification of striatal dopamine transporter binding. Nucl Med Mol Imaging. 2020;54:305–14.
Article CAS PubMed PubMed Central Google Scholar
Hammers A, Allom R, Koepp MJ, Free SL, Myers R, Lemieux L, et al. Three-dimensional maximum probability atlas of the human brain, with particular reference to the temporal lobe. Hum Brain Mapp. 2003;19:224–47.
Article PubMed PubMed Central Google Scholar
Luong MT, Pham H, Manning CD. Effective approaches to attention-based neural machine translation. arXiv. 2015;1508.04025.
Vapnik VN. Support vector machine: statistical learning theory. Hoboken: Wiley-Interscience; 1998.
Google Scholar
Breiman L. Random forests. Mach Learn. 2001;45:5–32.
Article Google Scholar
Hochreiter S, Schmidhuber J. Long short-term memory. Neural Comput. 1997;9:1735–80.
Article CAS PubMed Google Scholar
Van Houdt G, Mosquera C, Nápoles G. A review on the long short-term memory model. Artif Intell Rev. 2020;53:5929–55.
Article Google Scholar
Ba JL, Kiros JR, Hinton GE. Layer normalization. arXiv 2016;1607.06450.
Agarap AF. Deep learning using rectified linear units (relu). arXiv 2018;1803.08375.
Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R. Dropout: a simple way to prevent neural networks from overfitting. J Mach Learn Res. 2014;15:1929–58.
Google Scholar
Kingma DP, Ba J. Adam: a method for stochastic optimization. arXiv 2014;1412.6980.
Biau G, Scornet E. A random forest guided tour. Test. 2016;25:197–227.
Article Google Scholar
DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988;44:837–45.
Article CAS PubMed Google Scholar
Maaten LVD, Hinton G. Visualizing data using t-SNE. J Mach Learn Res. 2008;9:2579–605.
Google Scholar
Ribeiro MT, Singh S, Guestrin C. “Why should i trust you?” Explaining the predictions of any classifier. Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. 2016;1135–44.
Lapuschkin S, Wäldchen S, Binder A, Montavon G, Samek W, Müller K-R. Unmasking Clever Hans predictors and assessing what machines really learn. Nat Commun. 2019;10:1–8.
Article CAS Google Scholar
Anders CJ, Weber L, Neumann D, Samek W, Müller K-R, Lapuschkin S. Finding and removing clever hans: using explanation methods to debug and improve deep models. Inform Fusion. 2022;77:261–95.
Article Google Scholar
Montavon G, Samek W, Müller K-R. Methods for interpreting and understanding deep neural networks. Digit Signal Process. 2018;73:1–15.
Article Google Scholar
Smilkov D, Thorat N, Kim B, Viégas F, Wattenberg M. Smoothgrad: removing noise by adding noise. arXiv 2017;1706.03825.
Shrikumar A, Greenside P, Kundaje A. Learning important features through propagating activation differences. International conference on machine learning. 2017;3145–53.
Hochreiter S, Schmidhuber J. Long short-term memory. Neural Comput. 1997;9:37–45.
Article Google Scholar
Weng W-H, Wagholikar KB, McCray AT, Szolovits P, Chueh HC. Medical subdomain classification of clinical notes using a machine learning-based natural language processing approach. BMC Med Inform Decision Making. 2017;17:1–13.
Article Google Scholar
Liang D, Lin L, Hu H, Zhang Q, Chen Q, Han X, et al. Combining convolutional and recurrent neural networks for classification of focal liver lesions in multi-phase CT images. International Conference on Medical Image Computing and Computer-Assisted Intervention. 2018;666–75.
Yao H, Zhang X, Zhou X, Liu S. Parallel structure deep neural network using CNN and RNN with an attention mechanism for breast cancer histology image classification. Cancers. 2019:11, 1901.

Download references

Acknowledgements

We would like to thank Editage (www.editage.co.kr) for English language editing.

Funding

This research was supported by the National Research Foundation (NRF) of Korea funded by the Ministry of Science, ICT & Future Planning (NRF-2018 R1A2B2008178).

Author information

Authors and Affiliations

Institute of Convergence BioHealth, Dong-A University, Busan, Republic of Korea
Hyeon Kang & Do-Young Kang
Department of Nuclear Medicine, Institute of Convergence Bio-Health, Dong-A University College of Medicine, 32, Daesingongwon-ro, Seo-gu, Busan, Republic of Korea
Do-Young Kang
Department of Translational Biomedical Sciences, Dong-A University, Busan, Republic of Korea
Do-Young Kang

Authors

Hyeon Kang
View author publications
You can also search for this author in PubMed Google Scholar
Do-Young Kang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Do-Young Kang.

Ethics declarations

Competing Interests

Hyeon Kang and Do-Young Kang declare that they have no competing interests.

Ethical Approval

This study was performed in accordance with the ethical standards laid down in the Helsinki Declaration of 1964 and its later amendments or comparable ethical standards.

Consent to Participate

Informed consent was obtained from all individual participants included in this prospective study.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kang, H., Kang, DY. Alzheimer’s Disease Prediction Using Attention Mechanism with Dual-Phase ¹⁸F-Florbetaben Images. Nucl Med Mol Imaging 57, 61–72 (2023). https://doi.org/10.1007/s13139-022-00767-1

Download citation

Received: 04 April 2022
Revised: 04 July 2022
Accepted: 02 August 2022
Published: 12 August 2022
Issue Date: April 2023
DOI: https://doi.org/10.1007/s13139-022-00767-1

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Alzheimer’s Disease Prediction Using Attention Mechanism with Dual-Phase 18F-Florbetaben Images

Abstract

Introduction

Materials and Methods

Results

Conclusions

Similar content being viewed by others

Deep learning application for the classification of Alzheimer’s disease using 18F-flortaucipir (AV-1451) tau positron emission tomography

Novel Iterative Attention Focusing Strategy for Joint Pathology Localization and Prediction of MCI Progression

Early Diagnosis of Alzheimer's Disease Using 3D Residual Attention Network Based on Hippocampal Multi-indices Feature Fusion

Introduction

Materials and Methods

Participants

PET Acquisition

Data Pre-processing

Calculation of AD Positivity Score Based on Brain Blood Perfusion and Amyloid-β Plaque

Three Modular Networks for Independent Feature Extraction and Aggregation

Attention Mechanism for Adaptive Phase-Specific Feature Selection

Detailed Parameters for Model Selection and Model Evaluation

Statistical Analysis

Results

Data Demographics

Pre-processed Imaging Data for TAC and SUVr

AD Classification Performance

Input and Feature Distribution According to the Visual Reading of dFBB and Diagnostic Label

Association Between AD Positivity Score and Neuropsychological Test

Observation of the Overall Behavior of the LSTM on Early-Phase FBB

Discussion

Conclusion

Data Availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Competing Interests

Ethical Approval

Consent to Participate

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation

Alzheimer’s Disease Prediction Using Attention Mechanism with Dual-Phase ¹⁸F-Florbetaben Images