
1 Introduction

In recent years, amyloid positron emission tomography (PET) imaging has been applied to medical imaging problems such as Alzheimer’s disease classification and the detection of amyloid plaques [1, 2]. The first PET tracer used to image β-amyloid plaques was \( ^{11} C \)-Pittsburgh-Compound-B (PiB) [3]. Because the short half-life of \( ^{11} C \) limits the availability of \( ^{11} C \)-PiB, \( ^{18} F \)-labelled alternatives have been developed, which allow off-site production and regional distribution. \( ^{18} F \)-flutemetamol, \( ^{18} F \)-florbetapir and \( ^{18} F \)-florbetaben have recently been approved by the US Food and Drug Administration (FDA) for clinical use. Abnormal tracer uptake in grey matter disrupts the characteristic white matter pattern that arises from non-specific white matter binding [4]. These scans are generally interpreted visually.

Distinct from healthy volunteers (HV) and patients with probable Alzheimer’s disease (pAD), mild cognitive impairment (MCI) is an intermediate cognitive state between normal aging and dementia. Subjects with MCI, especially MCI involving memory problems, are more likely to develop AD and other dementias [5]. Based on this progression, MCI subjects can subsequently be classified as progressive MCI (pMCI) or stable MCI (sMCI) [6].

Many deep learning methods have been proposed to classify different AD stages based on high-dimensional features extracted from various neuroimaging biomarkers. Meanwhile, the focus of AD classification has gradually shifted from distinguishing healthy controls from disease patients to distinguishing pMCI from sMCI. In a recent paper on MCI classification, Kim et al. developed a deep learning-based method for classifying tau-PET imaging patterns. MCI subjects were split into three subgroups with the Louvain method. This method discriminated subgroups 1 and 2 with an accuracy of 90.91%, and subgroups 2 and 3 with an accuracy of 80.49% [7].

A major challenge in the medical imaging field is how to cope with small datasets and a limited number of annotated samples [8]. One promising solution for image synthesis, inspired by game theory, is the Generative Adversarial Network (GAN) [9]. The method is based on the idea of training two networks, a generator and a discriminator, simultaneously with competing losses. In the past few years, different variations of GANs have been applied to generate realistic natural images, and recently the popularity of using GANs to generate medical images has also increased [10]. For example, Frid et al. [11] proposed a CNN-based classification framework to classify different CT images, where a GAN was used to generate high-quality 2D liver lesion ROIs from a vector of 100 random numbers. The classification performance using only traditional data augmentation yielded 78.6% sensitivity and 88.4% specificity. By adding the synthetic data augmentation, the results increased to 85.7% sensitivity and 92.4% specificity. Recently, Madani et al. [12] used a GAN to generate 2D chest X-ray images from random noise, and the generated data were subsequently used to train a CNN to classify images for cardiovascular abnormalities.

In general, the availability of MRI is much higher than that of PET for a number of reasons. PET scanners are expensive to buy and operate, and are thus less common. PET scans also expose subjects to ionising radiation. More importantly, the number of available datasets is very limited when newly developed PET tracers are being tested. In this work, we aim to compensate for this imbalance between available MRI and PET images by generating PET images from MRI using a limited dataset.

To the best of our knowledge, direct generation of 3D amyloid PET images from structural MRI has not yet been attempted. In this study, we focus on the application of conditional GANs to generate high-quality volumetric florbetapir PET images from corresponding MRI images. In this way, we expect the natural variability in MRI scans and the image characteristics of PET to be combined. We also build a 152-layer ResNet classification model to distinguish pMCI and sMCI subjects, and quantify the difference in performance caused by adding this synthetic dataset to the training data. A summary of the data generation model in this work is shown in Fig. 1.

Fig. 1. Summary of the data generation procedure using conditional GANs

2 Materials and Methods

2.1 Data and Pre-processing

All image data were acquired from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). ADNI aims to improve clinical trials for the prevention and treatment of Alzheimer’s disease. To date, over 1000 scientific publications have used ADNI data. ADNI has been running since 2004 and is currently funded until 2021.

In this study, 50 sMCI and 29 pMCI florbetapir images with corresponding T1 MRI were obtained from the ADNI database (set A). A second group of 29 T1 MRI images (21 of them with corresponding PET) from a different pMCI group were also downloaded (set B) and used independently. More details about the use of these datasets in training/validating/testing the cGANs and ResNet are provided in the relevant sections below. All the florbetapir images were pre-processed: MRI and PET scans from each subject were co-registered, and the PET scan was then reoriented into a standard 160 × 160 × 96 voxel image grid, comprising 1.5 mm cubic voxels. This image grid was oriented such that the anterior-posterior axis of the subject is parallel to the AC-PC line. The MRI images have dimensions 256 × 256 × 196 with a voxel size of 1 mm × 1 mm × 1.2 mm.
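As a rough illustration of the re-gridding step, the sketch below resamples a co-registered PET volume onto the 160 × 160 × 96 grid of 1.5 mm cubic voxels using nibabel and SciPy; the file names, the use of simple trilinear interpolation and the affine handling are our assumptions and do not reproduce the exact ADNI pipeline.

```python
# Minimal sketch of the PET re-gridding step (illustrative only; the
# MRI-PET co-registration itself is assumed to be done elsewhere).
import nibabel as nib
import numpy as np
from scipy.ndimage import zoom

pet_img = nib.load("pet_coregistered.nii.gz")   # hypothetical input file
pet_data = pet_img.get_fdata()

# Target grid: 160 x 160 x 96 voxels of 1.5 mm isotropic size.
target_shape = (160, 160, 96)
factors = [t / s for t, s in zip(target_shape, pet_data.shape)]
pet_resampled = zoom(pet_data, factors, order=1)  # trilinear interpolation

new_affine = np.diag([1.5, 1.5, 1.5, 1.0])
nib.save(nib.Nifti1Image(pet_resampled.astype(np.float32), new_affine),
         "pet_160x160x96.nii.gz")
```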

2.2 Amyloid PET Generation with Conditional Adversarial Training

GANs are generative networks that learn a mapping from random noise to an output image. They are composed of two networks, a generator and a discriminator, trained in an adversarial way. The goal of the generator is to generate synthetic images, while the discriminator evaluates them for authenticity. In conditional GANs, the generator learns a mapping between an input and an output image [13]. In this study, the generator is a U-Net based convolutional neural network with skip connections [14]. The discriminator is a convolutional Markovian discriminator (PatchGAN), which only penalizes structure at the scale of image patches. During the GAN training process, each generated PET image was paired with the corresponding MRI and passed to the discriminator. The loss function of the conditional GAN is:

$$ L_{cGAN}(G,D) = E_{x,y}\left[\log D(x,y)\right] + E_{x}\left[\log\left(1 - D(x, G(x))\right)\right] $$
(1)

where x denotes MRI images and y denotes PET images. The first term is maximized when \( D(x,y) = 1 \), and the second is maximized when \( D(x, G(x)) = 0 \); it is minimized when \( D(x, G(x)) = 1 \), i.e. when the discriminator is not able to distinguish generated images from real images. The generator G tries to minimize this objective against an adversarial discriminator D that tries to maximize it. In addition, conditional GANs also add an L1 loss term:

$$ L_{L1}(G) = E_{x,y}\left[\left\| y - G(x) \right\|_{1}\right] $$
(2)

Therefore the complete form of loss function is:

$$ L_{total}(G,D) = L_{cGAN}(G,D) + \varepsilon L_{L1}(G) $$
(3)

where \( \varepsilon \) adjusts the contribution of the L1 loss; it is set to 100 in the experiments reported here.
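For concreteness, a minimal sketch of these objectives in TensorFlow/Keras is given below; the function names and the use of binary cross-entropy for the adversarial terms are our assumptions, not a description of the exact implementation.

```python
# Sketch of the losses in Eqs. (1)-(3): adversarial terms implemented as
# binary cross-entropy, plus the L1 term weighted by epsilon = 100.
import tensorflow as tf

bce = tf.keras.losses.BinaryCrossentropy()
EPSILON = 100.0  # weight of the L1 loss, as in Eq. (3)

def generator_loss(disc_fake_output, real_pet, generated_pet):
    # The generator wants D(x, G(x)) -> 1 and G(x) close to y in the L1 sense.
    adv = bce(tf.ones_like(disc_fake_output), disc_fake_output)
    l1 = tf.reduce_mean(tf.abs(real_pet - generated_pet))
    return adv + EPSILON * l1

def discriminator_loss(disc_real_output, disc_fake_output):
    # The discriminator wants D(x, y) -> 1 on real pairs and D(x, G(x)) -> 0.
    real = bce(tf.ones_like(disc_real_output), disc_real_output)
    fake = bce(tf.zeros_like(disc_fake_output), disc_fake_output)
    return real + fake
```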

In order to measure the similarity between generated PET and real PET, we used the structural similarity index (SSIM), since it combines errors in image contrast and overall structure [15, 16]. The SSIM was calculated as:

$$ SSIM(x,y) = \frac{(2\mu_{x}\mu_{y} + C_{1})(2\sigma_{xy} + C_{2})}{(\mu_{x}^{2} + \mu_{y}^{2} + C_{1})(\sigma_{x}^{2} + \sigma_{y}^{2} + C_{2})} $$
(4)

where \( \mu_{x} \), \( \mu_{y} \), \( \sigma_{x} \), \( \sigma_{y} \) and \( \sigma_{xy} \) are the local means, standard deviations and cross-covariance for images x and y, and \( C_{1} \) and \( C_{2} \) are regularization constants determined by the pixel value range. An SSIM of 1 means that the two images are identical.
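As an illustration, the SSIM between a real and a generated volume can be computed with scikit-image as in the sketch below; the array names and data-range handling are illustrative assumptions.

```python
# Computing the SSIM of Eq. (4) between a real and a generated PET volume.
import numpy as np
from skimage.metrics import structural_similarity

real_pet = np.load("real_pet.npy")            # hypothetical 160 x 160 x 96 array
generated_pet = np.load("generated_pet.npy")  # matching generated volume

data_range = float(real_pet.max() - real_pet.min())
ssim_value = structural_similarity(real_pet, generated_pet, data_range=data_range)
print(f"SSIM = {ssim_value:.3f}")             # 1.0 would indicate identical volumes
```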

In this work, we used the 29 paired PET and MRI images from the pMCI group in set A to train the conditional GAN, and subsequently applied the learned mapping to the 29 unseen MRI images in set B to generate 29 synthetic PET images, thus doubling the size of the pMCI dataset. For the 21 subjects in set B with available PET scans, we tested the cGAN by calculating SSIM values between real and generated PET. The cGAN architecture was implemented using the Keras framework. The experiment was conducted on a computer cluster equipped with an NVIDIA GeForce GTX 1080 Ti GPU.
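To illustrate the conditional, patch-based discriminator described above, the following is a minimal 3D PatchGAN-style sketch in Keras; the filter counts, number of down-sampling stages and the assumption that the MRI has been resampled to the PET grid are ours, since the paper does not list these details.

```python
# Illustrative 3D PatchGAN-style conditional discriminator (assumed configuration).
from tensorflow.keras import layers, Model

def build_patch_discriminator(vol_shape=(160, 160, 96, 1)):
    mri_in = layers.Input(shape=vol_shape)
    pet_in = layers.Input(shape=vol_shape)
    x = layers.Concatenate()([mri_in, pet_in])   # condition the discriminator on the MRI
    for filters in (32, 64, 128):
        x = layers.Conv3D(filters, 4, strides=2, padding="same")(x)
        x = layers.LeakyReLU(0.2)(x)
    # One score per output voxel, i.e. per receptive-field patch of the input.
    patch_scores = layers.Conv3D(1, 4, padding="same", activation="sigmoid")(x)
    return Model([mri_in, pet_in], patch_scores)
```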

2.3 MCI Progression Classification Architecture Using 3D ResNet

The Deep Residual Network (ResNet) [17] is arguably one of the most important developments in deep learning in the last few years. ResNet makes it possible to train networks with up to thousands of layers that still achieve competitive performance with fast convergence. The core concept of ResNet is the identity shortcut connection that skips one or more layers.

ResNets have been used successfully for 3D image segmentation, as in VoxResNet, where the authors use identity mappings as skip connections [18]. In our work, the ResNet architecture was modified based on the identity-mappings version [19], which refines the residual block with a pre-activation variant.

The main difference between our network and the identity-mappings version is the dimensionality of the convolutional kernels and pooling. Our ResNet architecture has 152 layers containing 50 three-layer blocks. The three layers are 1×1×1, 3×3×3 and 1×1×1 convolutions, where the 1×1×1 layers are responsible for reducing and then increasing dimensions, leaving the 3×3×3 layer as a bottleneck with smaller input and output dimensions, as detailed in Table 1. Down-sampling is performed by conv3_1, conv4_1 and conv5_1 with a stride of 2.

Table 1. 152-ResNet architecture for pMCI and sMCI classification
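A minimal sketch of one such three-layer bottleneck block with 3D convolutions is given below; the pre-activation ordering follows [19], while the filter counts and the helper name are our own illustrative choices.

```python
# Illustrative pre-activation bottleneck block with 3D convolutions,
# following the 1x1x1 / 3x3x3 / 1x1x1 structure described above.
from tensorflow.keras import layers

def bottleneck_block_3d(x, filters, stride=1):
    shortcut = x
    out = layers.BatchNormalization()(x)
    out = layers.Activation("relu")(out)
    # Down-sampling blocks (conv3_1, conv4_1, conv5_1) would use stride 2 here.
    out = layers.Conv3D(filters, 1, strides=stride, padding="same")(out)

    out = layers.BatchNormalization()(out)
    out = layers.Activation("relu")(out)
    out = layers.Conv3D(filters, 3, padding="same")(out)

    out = layers.BatchNormalization()(out)
    out = layers.Activation("relu")(out)
    out = layers.Conv3D(4 * filters, 1, padding="same")(out)

    # Match the shortcut when the spatial size or channel count changes.
    if stride != 1 or shortcut.shape[-1] != 4 * filters:
        shortcut = layers.Conv3D(4 * filters, 1, strides=stride, padding="same")(x)
    return layers.Add()([out, shortcut])
```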

We trained our classification model using the 50 sMCI and 29 pMCI real florbetapir images from set A, with 10-fold cross-validation applied to the whole dataset. We tested three scenarios. In the first, only real images were used with no augmentation. The second included traditional augmentation, performed at each epoch: the random rotation range was set to 20°, and images were randomly flipped horizontally and vertically. These two experiments used 65 samples for training, 6 for validation and 8 for testing. The third experiment used our cGAN-augmented dataset, including the additional 29 PET images generated from the MRI scans in set B, resulting in 89 samples for training, 8 for validation and 11 for testing. For training we used a batch size of 1 with a learning rate of 0.0001 for 100 epochs. We used Keras to implement our MCI classification framework. The experiment was performed on a computer cluster with an NVIDIA GeForce GTX 1080 Ti GPU.
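A sketch of the traditional augmentation used in the second scenario is shown below; the rotation plane, flip axes and interpolation order are our assumptions, as the text only specifies the 20° rotation range and random horizontal/vertical flips.

```python
# Per-epoch augmentation of a 3D volume: random rotation up to +/- 20 degrees
# and random flips (axis conventions are illustrative assumptions).
import numpy as np
from scipy.ndimage import rotate

def augment_volume(volume, max_angle=20.0):
    angle = np.random.uniform(-max_angle, max_angle)
    out = rotate(volume, angle, axes=(0, 1), reshape=False, order=1)
    if np.random.rand() < 0.5:
        out = np.flip(out, axis=0)   # "horizontal" flip
    if np.random.rand() < 0.5:
        out = np.flip(out, axis=1)   # "vertical" flip
    return out
```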

3 Results and Discussion

3.1 Data Generation

In this study, we used volumetric MRI images to generate 3D PET images to enlarge the pMCI group. Examples of real and generated PET images, with their corresponding SSIM values, are shown in Fig. 2. As can be seen from Fig. 2, the generated and real PET images contain similar signal patterns. The mean SSIM obtained was 0.95 \( \pm \) 0.05. Figure 3 shows generated PET images obtained from MRI scans for which the corresponding real PET was not available.

Fig. 2. Examples of real PET and generated PET images presented as different axial slices, with SSIM scores

Fig. 3. Examples of unseen MRI and corresponding generated PET; corresponding real PET is not available

3.2 Classification Results

Classification results for pMCI against sMCI using the 152-ResNet are shown in Table 2. We computed both the area under the receiver operating characteristic curve (AUC) and the accuracy (ACC). Three cases are compared: a network trained using only the real images with no augmentation (top row), a network trained using traditional augmentation (middle row), and a network trained using our synthetic-image augmentation method (bottom row).

Table 2. Classification ROC AUC and accuracy (mean ± std) with 152-ResNet

As can be seen from Table 2, classification of sMCI against pMCI using real PET images only achieved an accuracy of 0.63, and with the aid of traditional data augmentation the accuracy rose to 0.75. As expected, the highest accuracy was obtained with our proposed synthetic augmentation method, achieving an improvement of 7% over traditional augmentation.

4 Conclusion

We developed a model for generating florbetapir PET from structural MRI using deep generative networks, with the generated data showing high similarity to the corresponding real PET. The generated data were then used for data augmentation in MCI classification on a limited dataset. We compared the synthetic augmentation method with a traditional augmentation method, and the synthetic augmentation outperformed the traditional one. Future work will focus on using multi-modality imaging biomarkers for CNN classification.