Mapping the association between tau-PET and Aβ-amyloid-PET using deep learning

Ruwanpathirana, Gihan P.; Williams, Robert C.; Masters, Colin L.; Rowe, Christopher C.; Johnston, Leigh A.; Davey, Catherine E.

doi:10.1038/s41598-022-18963-6

Mapping the association between tau-PET and Aβ-amyloid-PET using deep learning

Article
Open access
Published: 30 August 2022

Volume 12, article number 14797, (2022)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Mapping the association between tau-PET and Aβ-amyloid-PET using deep learning

Download PDF

Gihan P. Ruwanpathirana^1,2,
Robert C. Williams²,
Colin L. Masters^4,5,
Christopher C. Rowe^3,4,5,
Leigh A. Johnston^1,2 &
…
Catherine E. Davey^1,2

2067 Accesses
1 Citation
14 Altmetric
1 Mention
Explore all metrics

Abstract

In Alzheimer’s disease, the molecular pathogenesis of the extracellular Aβ-amyloid (Aβ) instigation of intracellular tau accumulation is poorly understood. We employed a high-resolution PET scanner, with low detection thresholds, to examine the Aβ-tau association using a convolutional neural network (CNN), and compared results to a standard voxel-wise linear analysis. The full range of Aβ Centiloid values was highly predicted by the tau topography using the CNN (training R² = 0.86, validation R² = 0.75, testing R² = 0.72). Linear models based on tau-SUVR identified widespread positive correlations between tau accumulation and Aβ burden throughout the brain. In contrast, CNN analysis identified focal clusters in the bilateral medial temporal lobes, frontal lobes, precuneus, postcentral gyrus and middle cingulate. At low Aβ levels, information from the middle cingulate, frontal lobe and precuneus regions was more predictive of Aβ burden, while at high Aβ levels, the medial temporal regions were more predictive of Aβ burden. The data-driven CNN approach revealed new associations between tau topography and Aβ burden.

Deep learning detection of informative features in tau PET for Alzheimer’s disease classification

Article Open access 28 December 2020

A deep learning MRI approach outperforms other biomarkers of prodromal Alzheimer’s disease

Article Open access 29 March 2022

Dissociation of tau pathology and neuronal hypometabolism within the ATN framework of Alzheimer’s disease

Article Open access 21 March 2022

Introduction

Alzheimer’s disease (AD) is characterized by the accumulation of two proteins in the brain that pre-date the clinical onset of symptoms by several decades^1,2,3,4,5: extracellular Aβ-amyloid (Aβ) plaques that initiate in the neocortex and gradually spread through the brain, and intracellular tau neurofibrillary tangles, which are most evident in the entorhinal cortex and the limbic system^6,7,8.

Positron emission tomography (PET) tracers developed for imaging of aggregated Aβ have more recently been complimented by the development of tau radiotracers, enabling comprehensive studies that investigate topographical changes of both tau and Aβ across different stages of the natural progression of AD^3,5. The pathogenic mechanisms by which Aβ and tau accumulate remain largely unknown¹, and the development of Aβ-PET and tau-PET tracers have the potential to provide critical insights into their interaction over time. Earlier studies concluded that typically Aβ is deposited in cortical regions, followed by significant tau accumulation^9,10,11,12, initially in the medial temporal lobe and latterly spreading throughout the neocortex.

Aβ-PET images are typically mapped to a scalar Centiloid (CL) value to quantify neocortical Aβ burden, which may be indicative of downstream pathological changes^13,14,15. However, a comparable transformation for tau-PET images to a scalar value is not yet available.

Convolutional neural networks (CNNs) are deep learning networks designed for the analysis of image data, inspired by the concept of receptive fields in the visual cortex. The high accuracy of CNNs in mapping input images to a specified output, in conjunction with an abundance of medical imaging data, have promoted their broad application from diagnostics to image reconstruction^16,17,18 and, more recently, to AD^{19,20,21,22,23} research. However, the complex structure of CNNs, with many layers of thousands of learned parameters, render the interpretation of this mapping difficult²⁴. Recent research has employed attribution techniques to aid interpretation, which identify regions of the input image that are most responsible for estimating the output^25,26,27,28. Saliency maps are an attribution technique, employing a gradient-based method of generating weights for each input voxel that denote its contribution to a specific CNN-generated output. Unlike other interpretation techniques, saliency maps are not dominated by input values, but rather depend primarily on the learned network parameters²⁹.

Previous cross-sectional studies examining the interaction between Aβ and tau were principally based on linear, voxel-wise or region-of-interest (ROI), techniques. Such models did not seek to capture spatial dependencies between voxels or brain regions. Notable exceptions are studies that employ independent component analysis to examine how spatial patterns of tau change with Aβ burden^11,30. Given that tau accumulation has strong spatial dependencies^30,31,32,33, it is important to employ models that can accommodate these relationships. Analysis methods that can probe multivariate, nonlinear relationships between Aβ and tau across spatially remote brain regions may provide new insights into the interactions between these two molecular species. CNNs have this capability due to their multi-layered structure; a CNN passes information from input images to predicted output via a series of layers of artificial neurons. In each layer, a receptive field defines the image region over which to relate input information to output values. The size of the receptive field widens with depth in the network, thus integrating spatial dependence between input image regions in the output prediction. Moreover, the neuron activation functions permit nonlinear mappings from input to output, enabling the capture of a much broader range of relationships between Aβ and tau than voxel-wise or ROI-based linear models.

In this paper, a CNN is used to examine the relationship between tau-PET images and Aβ-PET quantification across the AD continuum. [¹⁸F]MK6240 scans were used as input to the CNN, while Aβ CL was used as the scalar output, such that the CNN was trained to map tau images to Aβ CL. We interpreted the importance of each tau voxel in this mapping using saliency (Fig. 1). Crucially, the end-point of this study was not solely the prediction of Aβ CL from a tau-PET image, but rather to provide insight into the mapping between tau topography and Aβ CL.

Results

In this study, we examined the relationship between tau-PET images and Aβ CL using a CNN. Demographic data for the 134 subjects used in this analysis are shown in Table 1. Missing demographic data is denoted by a ^*. Since the data were divided into training, validation and testing datasets during the CNN implementation, demographics were calculated for each separately. Although percentages of females to males were similar in the training and testing datasets, the validation dataset had a smaller proportion of females. Subjects in the test dataset were, on average, older than those in both the training and validation datasets. Each dataset had a similar percentage of Aβ+ subjects, whilst the validation dataset had the highest percentage of Apolipoprotein E4 (APOE4) carriers.

Table 1 Dataset demographics with format ‘percentage (number of samples)’.

Full size table

CNN training

As CNN performance is sensitive to network structure and parameter configuration, several different CNN models were compared. Ten-fold cross-validation was employed, so that ten instances of each model were learned. The best-performing model was identified by calculating the average performance across the ten folds, where performance was evaluated using the root mean squared error (RMSE) between measured Aβ-PET CL, and CNN estimated CL. The best-performing CNN model successfully learned to associate spatial features in tau images with Aβ CL (Supplementary Table S1), though with varied performance across the ten folds (Supplementary Table S2), reflected by the high standard deviation in Supplementary Table S1.

The CNN model learned in the sixth during the cross-validation fold demonstrated the best RMSE between measured and estimated Aβ CL, across both validation and test datasets, and showed consistency in performance across training (RMSE = 11.72, R² = 0.97) and validation (RMSE = 15.08, R² = 0.96) datasets. Furthermore, the model produced an accurate mapping for training and validation subjects across the full range of Aβ CL values (Fig. 2A,B open circles). The optimal CNN model instance produced an RMSE of 29.93 and R² of 0.79 for the hold-out test data. While lower than the validation and training R² values, a strong relationship between actual and estimated Aβ CL was maintained. Importantly, in the testing phase, the model continued to successfully discriminate the full range of Aβ CL values (Fig. 2B triangles). Therefore, this instance of the optimal CNN model was used to analyse tau spatial features associated with changes in Aβ burden and was compared with the standard linear, voxel-wise analysis.

A saliency map is an attribution image in which a voxel’s value indicates the importance of the voxel’s input tau value in the CNN mapping to CL output. Saliency maps were generated for each subject, and a general linear model (GLM) was used to identify voxels with a significant linear association between saliency and CL. Salient clusters were distinguished by requiring significant voxels to be part of a cluster of at least 200 voxels. This analysis was performed on all cross-validation CNN instances of the optimal CNN model, to examine consistency in the learned CNN mappings from tau to CL, to rule out overfitting of the CNN to input noise. The results show markedly consistent and overlapping regions across the ten learned CNN instances, including the bilateral medial temporal lobes, precuneus, middle cingulate, frontal lobes and paracentral lobules (Supplementary Fig. S2). Importantly, although cross-validation instances produced varied performance across the test, validation and training datasets, all CNN instances consistently identified similar salient regions in the mapping from tau to Aβ CL, irrespective of the partitioning of the data. This suggests that the learned CNNs have not overfitted to noise.

Comparison between tau-SUVR and saliency analyses

To compare our CNN approach with standard linear, voxel-wise modeling, a GLM was fitted to each voxel’s tau, converted to SUVR, and Aβ CL. It identified three significant tau clusters associated with Aβ CL (Figs. 3A, 4A), which were spread across the hippocampi, parahippocampal gyri, temporal lobes, parietal lobes, occipital lobes, frontal lobes and cingulate of both hemispheres.

A GLM was used to evaluate the voxel-wise relationship between saliency maps, generated by the best cross-validation instance, and Aβ CL. It revealed five salient clusters associated with Aβ CL (Figs. 3B, 4B): relatively large clusters in the frontal lobes, precuneus, postcentral gyri, paracentral lobules, middle cingulate, precentral gyri, supplementary motor areas (SMAs) and medial temporal structures of both hemispheres; smaller clusters in bilateral occipital lobes and insula of the left hemisphere. Detailed anatomical regions are given in Table 2.

Table 2 Occlusion analysis.

Full size table

The tau-SUVR GLM analysis captured larger clusters spread throughout the brain (Figs. 3A, 4A), the saliency maps identified smaller, more focused, clusters (Figs. 3B, 4B). In summary, although larger tau clusters were related to Aβ CL, for the CNN, sub-regions were more informative for mapping to Aβ CL.

The GLM analysis of saliency maps highlighted regions that were not captured by the tau-SUVR GLM analysis, including regions in the middle cingulate and postcentral gyri (second slices of Fig. 4A,B). Therefore, CNN used unique regions, which were not identified by the tau-SUVR analysis, for mapping to Aβ CL.

Changes in the association between tau and Aβ CL

Occlusion analysis was used to identify the importance of salient tau clusters in the mapping to Aβ, and how this association changes across the CL continuum. The tenet of occlusion analysis states that a cluster that is more important for the estimation of a given Aβ CL will prompt a larger reduction in estimation accuracy after it is removed from the CNN input space. This change in accuracy can be averaged across the whole Aβ CL continuum by calculating the R² value between measured Aβ CL and CNN-estimated Aβ CL, after occlusion, The five salient clusters identified as changing significantly with Aβ were occluded sequentially from the optimal CNN model, and the change in \({R}^{2}\) noted.

Of the five occluded clusters, removal of FPC, MTR and MTL resulted in the largest reduction in R² (Table 2). Figure 5A–C demonstrates these effects graphically, with a rotation of the scatter plots towards the x-axis after occluding the clusters from the input space, indicating reduced accuracy in the estimation of Aβ CL. Figure 5B,C establishes the importance of both left and right medial temporal structures in the CNN estimation of Aβ CL (Table 2). Occlusion of the right medial temporal lobe, denoted MTR, caused the biggest reduction in R² (Fig. 5C).

When all clusters were occluded from the input, the predictive capacity of the CNN was reduced for the training, validation and test datasets to an R² of 0.16, 0.02 and − 0.05, respectively (Table 2, Fig. 5D). This suggests that most of the CNNs predictive capability is captured in these five clusters.

We introduce a metric entitled ‘strength of association’ to evaluate the change in the importance of an identified cluster across the Aβ CL continuum. The strength of association is calculated by smoothing the absolute errors between CNN outputs corresponding to full tau input images and occluded input tau images. In order to reduce the bias from the best cross-validation instance, this metric was calculated for best instance clusters using all the cross-validation instances to calculate the mean strength of association for the CNN model. At low Aβ CL values, the FPC cluster had a higher strength of association than MTL and MTR clusters, that gradually decreased with increasing Aβ CL until an Aβ CL of approximately 25 (Fig. 5E, double-ended green arrow). At this point, the MTR and MTL began to dominate the mapping in all cross-validation models, becoming increasingly important in the CNN mapping of high CL values (Fig. 5E, red arrow). In summary, the CNN network used more tau information from the FPC and both MTR and MTL to predict Aβ CL in low and high Aβ subjects, respectively.

Since the FPC is a large cluster, containing more voxels than any other cluster and demonstrating high strength of association across the Aβ CL continuum, sub-clusters of the FPC were sequentially occluded from the CNN input space to gain more insight into the impact of the FPC on Aβ CL estimation (Fig. 6). Sub-clusters in the cingulate, precuneus and frontal lobe had the largest strength of association at low Aβ CL, with the cingulate and precuneus gradually decreasing in strength with increasing Aβ CL. Conversely, the frontal lobe region showed an increasing strength of association with increasing Aβ CL. Other sub-clusters of the FPC, spread in the parietal lobe, paracentral lobule, postcentral gyrus and SMA, remained at low strengths across the Aβ CL continuum, except the precentral gyrus, that showed an increasing association over Aβ CL, to become one of the strongest sub-clusters at Aβ CL values of 25 and above.

Discussion

In this study, we examined the relationship between tau-PET images and the Aβ burden using a CNN. The association between tau accumulation and Aβ burden has been investigated in several studies^{9,10,11,12,32,34,35,36,37}. However, they were carried out with an assumption of spatial independence in tau accumulation across brain regions. Such an assumption is in contradiction to the growing body of research establishing the spatial dependence of tau accumulation in the natural progression of AD^30,31,32,33. The use of a CNN, designed to extract spatial features and learn dependencies between spatially remote brain regions, enabled us to relax this assumption. Moreover, CNN analysis is a data-driven approach without the assumption of a particular spatiotemporal evolution of tau⁶; recent research has suggested the importance of the re-examination of tau topographic staging due to its heterogeneity³⁸.

The relationship between Aβ and tau is often examined by separating patient data into Aβ^– and Aβ⁺ groups, thresholded using Aβ burden; the capacity of a CNN to identify non-linear mappings between Aβ and tau allowed us to treat Aβ quantification as a continuum, avoiding both binarizing subjects into two groups and a priori assumptions regarding the threshold. Rather, the use of a CNN enabled an exploration of associations between tau and Aβ without such preconceptions.

In our exploration of the relationship between the [¹⁸F]MK6240 uptake pattern and Aβ CL value, both tau-SUVR voxel-wise linear models and a CNN based nonlinear model were implemented, mapping tau-PET voxels to Aβ CL values to examine the potential of CNN models to delineate the association between these two molecular species. A voxel-wise GLM on tau-SUVR identified a positive correlation between tau accumulation and Aβ CL value throughout the brain after controlling for age and sex. In contrast, the CNN-based model identified focal clusters in the medial temporal, precentral gyri, postcentral gyri, SMAs, paracentral lobules, superior and middle frontal gyri, precuneus, superior and inferior parietal lobules, and middle cingulate, as the regions in which information from the tau scan was most salient for predicting Aβ CL value. This was elucidated by the removal of these clusters resulting in significantly reduced predictive power of Aβ CL value from tau-PET images, demonstrating that the small clusters carry more information to predict reliably Aβ burden. A possible explanation for these focal clusters is that as CNNs integrate spatial dependence between input image regions in predicting the output, they may not use duplicate information from brain regions while generating the Aβ CL.

The CNN-based nonlinear mapping between tau-PET and Aβ CL values produced markedly different topographic patterns from those emerging from the linear tau-SUVR analysis. Although the medial temporal and some frontal lobe structures were common to both saliency-based CNN analysis and tau-SUVR analysis, the CNN used information from additional brain regions, including regions in the middle cingulate and postcentral gyri, associated with the Aβ burden that was not captured by tau-SUVR analysis. Further investigation is required to determine why the CNN model identified these specific regions as important for the prediction of Aβ CL.

Our GLM analysis of tau-SUVR images showed that the level of tau accumulation in medial temporal lobes is linked with Aβ CL value, a region postulated as an initial tau accumulation site during AD evolution^6,34. This association was not limited to the medial temporal lobes, with spread throughout the brain showing a pattern similar to Braak staging, with initiation in medial temporal structures that spreads to the neocortical regions. These results show the level of tau accumulation not only in medial temporal lobes, but also in other cortical regions, is related to Aβ burden as seen in previous studies^9,10,11,12.

The CNN model was used to examine how changes in tau topography are associated with changing global Aβ burden. We employed occlusion analysis, in which salient clusters were sequentially occluded from the input space, and the resultant impact on the CNN estimation of Aβ CL was determined. Occlusion analysis indicates the uniqueness of information provided by the occluded region—if the CNN estimation of Aβ CL is significantly altered, it suggests that the occluded region is providing information not available from other input clusters. Additionally, the strength of association metric that was introduced to quantify changes in CNN Aβ CL estimation accuracy does not make any assumption of linearity between input tau values the Aβ CL. The strength of association measures clearly demonstrate non-linear mappings between input tau values and Aβ CL estimates, with the relative importance of clusters changing across the continuum.

In this study, we considered both Aβ and tau as continuous variables, avoiding the standard dichotomous paradigm. As per the CNN analysis, at low Aβ levels, tau clusters in the frontal lobes, parietal lobes and cingulate (FPC cluster) were more associated with Aβ burden, driven primarily by sub-clusters in the cingulate, precuneus and frontal lobe. At high Aβ levels, information from both medial temporal regions was most prominent in the prediction.

The variable importance of brain regions at different Aβ CL values provides a proxy for longitudinal studies that show temporal changes of tau topography with the Aβ burden. Aβ accumulation initiates in the precuneus, superior and inferior parietal lobules, cingulate gyrus and prefrontal cortex^6,39, and our CNN-driven approach has captured tau sub-clusters in this area that are predictive of low Aβ burden. Further, information from medial temporal regions was the most important in predicting high Aβ levels. These pseudo-longitudinal results are consistent with the local/remote hypotheses of Aβ, in which Aβ drives local and remote effects⁴⁰. Further, The FPC cluster overlaps with regions of the default mode network, including the superior and middle frontal gyri, precuneus and inferior parietal lobule⁴¹.

The FPC cluster contains regions, such as the SMAs, precentral gyri and paracentral lobules that have not been shown to exhibit an elevated tau signal. However, the strength of association for these clusters remained flat at a low value across Aβ CL values, except for the sub-cluster in the precentral gyrus; further work is required to elucidate why the CNN has identified those sub-clusters in the FPC cluster.

After the occlusion of all significant salient clusters, there remains minor residual predictive power (Fig. 5D). Saliency analysis captures predictive brain regions that are common across the cohort. However, there may be salient regions unique to an individual that leaves residual predictive power after removing common salient clusters. The limited predictive power remaining after removal of common clusters suggests that the CNN analysis has identified cohort-wide regions predictive of Aβ burden. Although it is argued that tau spreads follow a stereotypical pattern, studies have shown an inter-individual tau heterogeneity, of which the residual predictive power is suggestive^33,42,43.

As with many other deep learning applications in biomedical imaging, data scarcity is a challenge in training the network. The training dataset performance is higher than the validation dataset, which may be thought to indicate overfitting. However, the CNN has performed well on the hold-out test dataset, successfully differentiating the full range of Aβ CL subjects. Additionally, all cross-validation instances showed overlapping spatial features in bilateral medial temporal, frontal and parietal lobes. These results suggest that our model has captured spatial features of tau images that are genuinely associated with changes in Aβ CL values, and are necessary to accurately map a tau image to CL.

The limitations of the current study include: (1) clinical diagnosis classifications were not used in this study and subjects with a range of Aβ values, both Aβ+ and Aβ−, were used; (2) all conclusions were drawn on cross-sectional analysis. Longitudinal analysis is required to investigate the insights provided by this study; (3) age and sex were used as confounding variables, however, other factors may impact tau uptake, including education level and genetics; (4) this analysis was performed on a cohort with a limited number of samples that may not cover the full spectrum of AD tau patterns; (5) while the saliency technique used to interpret the CNN results can identify regions useful for predicting of Aβ CL, it cannot be used to infer any conclusion with regard to the accumulation of tau, only that tau in these regions are used by the model to drive its prediction; (6) the findings may be dependent on the chosen PET image reconstruction characteristics. To enhance the CNN analysis, we used highly converged image reconstructions with less partial volume effect that are of higher resolution and noisier than those used for clinical viewing. This may limit the reproducibility of the study if constrained to using standard clinical PET reconstruction settings.

Irrespective of the above limitations, the study has highlighted that a deep learning approach reveals distinct information to standard voxel-wise analyses. Focused tau clusters were identified that were strongly predictive of Aβ CL values. In future studies, it will be interesting to analyse the models retrained on input tau images without clusters identified from this CNN analysis. Moreover, future studies will be targeted at more generalized methods, expanding the analysis to a range of scanners, and reversing the method to see if the topography of Aβ plays a role connecting to a variety of tau measures, including blood and CSF biomarkers as the tau markers.

Materials and methods

Participants

Participants were drawn from the Australian Dementia Network (ADNET) study, the Australian Imaging, biomarker and Lifestyle study (AIBL) and healthy controls from the Traumatic Brain Injury (TBI) study. A total of 134 subjects, 99 from ADNET (mean age 71.57 ± 7.69 years, 50 females), 25 from the traumatic brain injury study (mean age 63.08 ± 12.60 years, 11 females) and 10 from AIBL (mean age 79.2 ± 5.41 year, 6 females), were included (Table 1). The human scans were approved by the Austin Health Human Research Ethics Committee (HREC/18/Austin/201) and all the experiments were performed in accordance with relevant guidelines and regulations.

Image acquisition and processing

PET scans were performed on a Siemens Biograph 128 mCT PET/CT scanner at the Melbourne Brain Centre Imaging Unit, the University of Melbourne. Subjects were scanned for Aβ and tau on two different days. A low-dose CT scan was carried out prior to each PET acquisition for attenuation correction. For Aβ scans, subjects were injected with [¹⁸F]NAV4694 radiotracer 50 min prior to 20 min of continuous scanning. Scanning tau with [¹⁸F]MK6240 radiotracer used a 20-min acquisition 90 min after injection. The Siemens Ordered Subset Expectation Maximization algorithm with Time-of-Flight (12 iterations, 21 subsets, no smoothing, resolution of full width at half maximum = 4.3 mm) was used to reconstruct all PET scans in high-resolution, as this reconstruction method maintains the natural variability in the data without smoothing⁴⁴. Aβ PET images were spatially normalized using the CapAIBL software⁴⁵. The standardized uptake value ratio (SUVR) was computed using the ratio of PET retention computed inside the neocortical Aβ CL mask and the whole cerebellum. The SUVR was then transformed into Aβ CL value using the published transform for [¹⁸F]NAV4694¹⁴.

The skull was stripped from tau images to remove off-target binding in non-brain regions. CT scans, acquired for attenuation correction, were used to generate a skull stripping mask using the FSL Brain Extraction Tool, that was subsequently transformed to the PET domain to extract the PET image brain⁴⁶. Skull stripped images were manually checked for both registration and stripping faults. To evaluate the topography of [\({}^{18}\mathrm{F}]\)MK6240 bindings, tau-PET images were subject-wise, non-linearly normalized to the FSL MNI152 1 mm template using Advanced Normalization Tools after using the CT image as an intermediate step between the PET image and the template. The resultant tau PET images were each 158 × 198 × 158 voxels with 1 mm voxel size in all dimensions. Prior to input into the CNN, each tau-PET image was scaled to the range [0, 1], to examine the relative change of tau topography with Aβ burden and stabilize the training without exploding or vanishing gradients⁴⁷. The [¹⁸F]-MK6240 scans were normalised using the cerebellar cortex, identified from the Automated Anatomical Labeling Atlas 3 (AAL3)⁴⁸, to generate SUVR images analysed using voxel-wise general linear models for comparison with the CNN outcome.

Since tau is known to accumulate in medial and lateral temporal structures⁶, all brain slices are displayed as oblique axial slices throughout the results.

Deep learning framework

The CNN accepted three-dimensional, skull stripped, normalized and scaled standardized uptake value (SUV) tau image data for each participant as input and, using the RMSprop optimizer, adjusted weights within the network to find the optimal mapping of the tau input to the Aβ CL. The CNN network structure of this study was influenced by the U-net architecture, which is widely used in image segmentation¹⁶. Since CNN model performance is sensitive to network structure and the associated parameters, several different CNN models were trained with 10 separate training/validation partitions (109 training sets, 12 validation sets; 10-fold cross-validation), selecting the model with the lowest root mean squared error (RMSE) across the partitions. More details of the selected CNN network structure and the training parameters are provided in Supplementary Sect. S.1.

All 10-fold cross-validation instances of the selected CNN model were evaluated on 13 separate, hold-out test sets. Out of the 10 instances of the selected model, the instance with the best performance, determined by the RMSE value evaluated using both validation and testing sets, was used for further analysis.

Interpretation of CNN via saliency maps

After CNN training, the learned model requires interpretation to garner insight into how it maps the tau-PET topography to the associated Aβ burden. Saliency maps of equal dimension to the MNI normalized input tau images were generated for each subject²⁷, using training, validation and testing datasets and across all ten cross-validation model instances. The saliency maps provide a measure of the importance of the voxels in the prediction of the output CL Aβ value. As saliency maps, being a voxel-wise measure, are visually noisy²⁶, they were smoothed using a 2 mm FWHM Gaussian kernel before the analysis. Further details about saliency map computation are provided in Supplementary Sect. S.2.

Comparison of CNN-based mapping with SUVR analyses

To evaluate the voxel-wise relationship between CNN-generated saliency values and Aβ CL values, GLM analyses were carried out using statistical parametric mapping (SPM) version 8. Saliency value was used as the dependent variable in GLM. Age and sex were identified as confounding variables and controlled. The following models were tested:

1.
Tau-SUVR values are estimated by CL Aβ values, controlling for age and sex.
2.
Saliency values are estimated by CL Aβ values, controlling for age and sex. The nonlinearity of the CNN method is encapsulated in the saliency maps.

SPM analysis (t-contrast) was carried out on both linear models to identify significant clusters with p < 0.05, controlling for multiple comparisons using the family-wise error (FWE) rate, with a minimum cluster extent of 200 voxels imposed.

Interpretation of the identified saliency map clusters

The GLM analysis of the saliency maps identified statistically significant clusters. To determine the importance of best cross-validation instance clusters in the mapping between tau and Aβ CL value, each cluster was removed (‘occluded’) from the input images in turn, and new CNN outputs were predicted for all subjects. The importance of a cluster was assessed by quantifying the change in the coefficient of determination (R²) after its removal from the input data space. Out-of-sample R² values were calculated on validation and testing datasets⁴⁹. Since the CNN approach is a non-linear regression method, R² has been used as a proxy value to evaluate this change of CNN outputs.

To evaluate the change of the importance of each cluster across the Aβ CL continuum, a ‘strength of association’ metric was introduced, quantifying the importance of each cluster and sub-cluster in mapping the corresponding Aβ CL level. Absolute errors were calculated between CNN outputs corresponding to full tau input images and occluded input tau images and these errors were smoothed with local weighted regression to generate the strength of association. To reduce the bias of the best cross-validation instance on the results, the mean strengths of association were calculated for the best cross-validation instance clusters using all the cross-validation instances. The identified salient clusters were divided into sub-clusters and the strengths of association of those sub-clusters were also analysed.

Ethics approval

Ethics approval and consent to participate in this study were approved by the Austin Health Human Research Ethics Committee (HREC/18/Austin/201).

Informed consent

This study was approved by the Austin Health Human Research ethics Committee (HREC/18/Austin/201) and given the retrospective nature of the study and the use of anonymized consented patient data under Austin HREC, requirements for informed consent were waived.

Conclusion

A data-driven deep learning approach, unconstrained by standard definitions of pathological regions or reference regions, driven by relative spatial positioning rather than uptake levels, has revealed new relationships between tau topography and Aβ burden. This relationship does not start late in the natural history of AD, rather it occurs at minimal or low levels of Aβ burden. The differential importance of tau accumulation regions with the Aβ load may be considered as a proxy for longitudinal change and may provide insight into Alzheimer's disease evolution.

Data availability

The data that support the findings of this study are available from AIBL but restrictions apply to the availability of these data, which were used under license for the current study, and so are not publicly available. Data are however available from the authors upon reasonable request and with permission of AIBL (https://aibl.csiro.au/adni/index.html).

References

Jack, C. R. et al. Hypothetical model of dynamic biomarkers of the Alzheimer’s pathological cascade. Lancet Neurol. 9(1), 119–128 (2010).
Article CAS PubMed PubMed Central Google Scholar
Villemagne, V. L. et al. Amyloid β deposition, neurodegeneration, and cognitive decline in sporadic Alzheimer’s disease: A prospective cohort study. Lancet Neurol. 4422(13), 1–11 (2013).
Google Scholar
Brier, M. R. et al. Tau and Ab imaging, CSF measures, and cognition in Alzheimer’s disease. Sci. Transl. Med. 8(338), 1–10 (2016).
Article Google Scholar
Jagust, W. Imaging the evolution and pathophysiology of Alzheimer disease. Nat. Rev. Neurosci. 19(11), 687–700 (2018).
Article CAS PubMed PubMed Central Google Scholar
Villemagne, V. L., Doré, V., Burnham, S. C., Masters, C. L. & Rowe, C. C. Imaging tau and amyloid-β proteinopathies in Alzheimer disease and other conditions. Nat. Rev. Neurol. 14(4), 225–236 (2018).
Article CAS PubMed Google Scholar
Braak, H., Braak, E. Neuropathological stageing of Alzheimer-related changes. Acta Neuropathol. 1–4 (1991).
Thal, D. R., Rüb, U., Orantes, M. & Braak, H. Phases of Aβ-deposition in the human brain and its relevance for the development of AD. Neurology 58(12), 1791–1800 (2002).
Article PubMed Google Scholar
Delacourte, A. et al. The biochemical pathway of neurofibrillary degeneration in aging and Alzheimer’s disease. Neurology 52(6), 1158 (1999).
Article CAS PubMed Google Scholar
Schöll, M. et al. PET imaging of tau deposition in the aging human brain. Neuron 89(5), 971–982 (2016).
Article PubMed PubMed Central Google Scholar
Johnson, K. A. et al. Tau positron emission tomographic imaging in aging and early Alzheimer disease. Ann. Neurol. 79(1), 110–119 (2016).
Article PubMed Google Scholar
Pereira, J. B., Harrison, T. M., La Joie, R., Baker, S. L. & Jagust, W. J. Spatial patterns of tau deposition are associated with amyloid, ApoE, sex, and cognitive decline in older adults. Eur. J. Nucl. Med. Mol. Imaging. 47(9), 2155–2164 (2020).
Article CAS PubMed PubMed Central Google Scholar
Pontecorvo, M. J. et al. Relationships between flortaucipir PET tau binding and amyloid burden, clinical diagnosis, age and cognition. Brain 140(3), 748–763 (2017).
PubMed PubMed Central Google Scholar
Klunk, W. E. et al. The Centiloid project: Standardizing quantitative amyloid plaque estimation by PET. Alzheimer’s Dement. 11(1), 1-15.e4 (2015).
Article Google Scholar
Bourgeat, P. et al. Implementing the centiloid transformation for 11C-PiB and β-amyloid 18F-PET tracers using CapAIBL. Neuroimage 183(March), 387–393 (2018).
Article CAS PubMed Google Scholar
Jack, C. R. et al. NIA-AA research framework: Toward a biological definition of Alzheimer’s disease. Alzheimer’s Dement. 14(4), 535–562 (2018).
Article Google Scholar
Ronneberger, O., Fischer, P. & Brox, T. U-net: Convolutional networks for biomedical image segmentation. Lect. Notes Comput. Sci. (Including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinform.). 9351, 234–241 (2015).
Google Scholar
Choi, H., Ha, S., Kang, H., Lee, H. & Lee, D. S. Deep learning only by normal brain PET identify unheralded brain anomalies. EBioMedicine 43, 447–453 (2019).
Article PubMed PubMed Central Google Scholar
Wang, Y. et al. 3D conditional generative adversarial networks for high-quality PET image estimation at low dose. Neuroimage 2018(174), 550–562 (2017).
Google Scholar
Suk, H. I., Lee, S. W. & Shen, D. Hierarchical feature representation and multimodal fusion with deep learning for AD/MCI diagnosis. Neuroimage 101, 569–582 (2014).
Article PubMed Google Scholar
Payan, A., Montana, G. Predicting Alzheimer’s disease a neuroimaging study with 3D convolutional neural networks. in ICPRAM 2015—4th Int. Conf. Pattern Recognit. Appl. Methods Proc., vol. 2, 355–362 (2015).
Punjabi, A., Martersteck, A., Wang, Y., Parrish, T. B. & Katsaggelos, A. K. Neuroimaging modality fusion in Alzheimer’s classification using convolutional neural networks. PlosOne. 14(12), 1–14 (2019).
Article Google Scholar
Böhle, M., Eitel, F., Weygandt, M., Ritter, K. Layer-wise relevance propagation for explaining deep neural network decisions in MRI-based Alzheimer’s disease classification. Front. Aging Neurosci. 10(JUL) (2019).
Oh, K., Chung, Y. C., Kim, K. W., Kim, W. S. & Oh, I. S. Classification and visualization of Alzheimer’s disease using volumetric convolutional neural network and transfer learning. Sci. Rep. 9(1), 1–16 (2019).
Article Google Scholar
Bach, S., Binder, A., Montavon, G., Klauschen, F., Müller, K.R., Samek, W. On pixel-wise explanations for non-linear classifier decisions by layer-wise relevance propagation. PLoS One. 10(7) (2015).
Sundararajan, M., Taly, A., Yan, Q. Axiomatic attribution for deep networks. in 34th Int. Conf. Mach. Learn. ICML 2017, vol 7, 5109–5118 (2017).
Smilkov, D., Thorat, N., Kim, B., Viégas, F., Wattenberg, M. SmoothGrad: Removing noise by adding noise. (2017).
Simonyan, K., Vedaldi, A., Zisserman, A. Deep inside convolutional networks: Visualising image classification models and saliency maps. in 2nd Int. Conf. Learn. Represent ICLR 2014—Work Track Proc. 1–8 (2014).
Zeiler, M. D. & Fergus, R. Visualizing and understanding convolutional networks. Lect. Notes Comput. Sci. (Including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinform.). 8689 LNCS(PART 1), 818–833 (2014).
Google Scholar
Adebayo, J. et al. Sanity checks for saliency maps. Adv. Neural Inf. Process Syst. 2018-Decem(NeurIPS), 9505–9515 (2018).
Google Scholar
Jones, D. T. et al. Tau, amyloid, and cascading network failure across the Alzheimer’s disease spectrum. Cortex 97, 143–159 (2017).
Article PubMed PubMed Central Google Scholar
Hoenig, M. C. et al. Networks of tau distribution in Alzheimer’s disease. Brain 141(2), 568–581 (2018).
Article PubMed Google Scholar
Sepulcre, J. et al. In vivo tau, amyloid, and gray matter profiles in the aging brain. J. Neurosci. 36(28), 7364–7374 (2016).
Article CAS PubMed PubMed Central Google Scholar
Franzmeier, N., Dewenter, A., Frontzkowski, L., et al. Patient-centered connectivity-based prediction of tau pathology spread in Alzheimer’s disease. Sci. Adv. 6(48) (2020).
Vemuri, P. et al. Tau-PET uptake: Regional variation in average SUVR and impact of amyloid deposition. Alzheimer’s Dement Diagnosis Assess. Dis. Monit. 6, 21–30 (2017).
Google Scholar
Cho, H. et al. In vivo cortical spreading pattern of tau and amyloid in the Alzheimer disease spectrum. Ann. Neurol. 80(2), 247–258 (2016).
Article CAS PubMed Google Scholar
Lockhart, S. N. et al. Amyloid and tau PET demonstrate region-specific associations in normal older people. Neuroimage 150(February), 191–199 (2017).
Article CAS PubMed Google Scholar
Iaccarino, L. et al. Local and distant relationships between amyloid, tau and neurodegeneration in Alzheimer’s Disease. NeuroImage Clin. 2018(17), 452–464 (2017).
Google Scholar
Vogel, J.W., Young, A.L., Oxtoby, N.P., et al. Four distinct trajectories of tau deposition identified in Alzheimer’s disease. Nat. Med. (2021).
Rowe, C. C. et al. Imaging β-amyloid burden in aging and dementia. Neurology 68(20), 1718–1725 (2007).
Article CAS PubMed Google Scholar
Masters, C. L. et al. Amyloid plaque core protein in Alzheimer disease and Down syndrome. Proc. Natl. Acad. Sci. USA. 82(12), 4245–4249 (1985).
Article ADS CAS PubMed PubMed Central Google Scholar
Alves, P. N. et al. An improved neuroanatomical model of the default-mode network reconciles previous neuroimaging and neuropathological findings. Commun. Biol. 2(1), 1–14 (2019).
Article Google Scholar
Ossenkoppele, R. et al. Tau PET patterns mirror clinical and neuroanatomical variability in Alzheimer’s disease. Brain 139(5), 1551–1567 (2016).
Article PubMed PubMed Central Google Scholar
Ossenkoppele, R. et al. Distinct tau PET patterns in atrophy-defined subtypes of Alzheimer’s disease. Alzheimer’s Dement. 16(2), 335–344 (2020).
Article Google Scholar
Williams, R. et al. Phantom measurement predicts the impact of image reconstruction on Florbetapir SUVR quantification. J. Nucl. Med. 60(supplement 1), 2001 (2019).
Google Scholar
Bourgeat, P. et al. Comparison of MR-less PiB SUVR quantification methods. Neurobiol. Aging. 36(S1), S159–S166 (2015).
Article CAS PubMed Google Scholar
Muschelli, J. Recommendations for processing head CT Data. Front. Neuroinform. 13(September), 1–9 (2019).
Google Scholar
Dai, Z., Heckel, R. Channel normalization in convolutional neural networks avoids vanishing gradients. arXiv. 1–11 (2019).
Rolls, E. T., Huang, C. C., Lin, C. P., Feng, J. & Joliot, M. Automated anatomical labelling atlas 3. Neuroimage 2020(206), 116189 (2019).
Google Scholar
Campbell, J. Y. & Thompson, S. B. Predicting excess stock returns out of sample: Can anything beat the historical average?. Rev. Financ. Stud. 21(4), 1509–1531 (2008).
Article Google Scholar

Download references

Acknowledgements

The authors acknowledge the facilities and scientific and technical assistance of the National Imaging Facility, a National Collaborative Research Infrastructure Strategy (NCRIS) capability, at the Melbourne Brain Centre Imaging Unit, the University of Melbourne. The first author would also like to acknowledge the Rowden White scholarship for its assistance in his research. The authors acknowledge the contributions of the research teams of AIBL, ADNET and TBI.

Funding

The research was supported by the Australian Federal Government through NHMRC and NIH grants and Cerveau Technologies.

Author information

Authors and Affiliations

Department of Biomedical Engineering, The University of Melbourne, Melbourne, VIC, Australia
Gihan P. Ruwanpathirana, Leigh A. Johnston & Catherine E. Davey
Melbourne Brain Centre Imaging Unit, The University of Melbourne, Melbourne, VIC, Australia
Gihan P. Ruwanpathirana, Robert C. Williams, Leigh A. Johnston & Catherine E. Davey
Department of Molecular Imaging and Therapy, Austin Health, Melbourne, VIC, Australia
Christopher C. Rowe
Florey Institute of Neuroscience and Mental Health, Melbourne, VIC, Australia
Colin L. Masters & Christopher C. Rowe
Florey Department of Neuroscience and Mental Health, The University of Melbourne, Melbourne, VIC, Australia
Colin L. Masters & Christopher C. Rowe

Authors

Gihan P. Ruwanpathirana
View author publications
You can also search for this author in PubMed Google Scholar
Robert C. Williams
View author publications
You can also search for this author in PubMed Google Scholar
Colin L. Masters
View author publications
You can also search for this author in PubMed Google Scholar
Christopher C. Rowe
View author publications
You can also search for this author in PubMed Google Scholar
Leigh A. Johnston
View author publications
You can also search for this author in PubMed Google Scholar
Catherine E. Davey
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

R.C.W. conceived the idea, and G.P.R., R.C.W. and L.A.J. designed the research. G.P.R. wrote the code. G.P.R., R.C.W., L.A.J. and C.E.D. analysed the data. All authors interpreted the results. G.PR. drafted the manuscript, with significant input from C.E.D. and L.A.J. All authors critically revised the manuscript. C.E.D., R.C.W. and L.A.J. supervised the project.

Corresponding author

Correspondence to Catherine E. Davey.

Ethics declarations

Competing interests

CCR was the recipient of a research grant from Cerveau who supplied the MK6240 tau tracer precursor for research use.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ruwanpathirana, G.P., Williams, R.C., Masters, C.L. et al. Mapping the association between tau-PET and Aβ-amyloid-PET using deep learning. Sci Rep 12, 14797 (2022). https://doi.org/10.1038/s41598-022-18963-6

Download citation

Received: 04 February 2022
Accepted: 23 August 2022
Published: 30 August 2022
DOI: https://doi.org/10.1038/s41598-022-18963-6
Springer Nature Limited

Mapping the association between tau-PET and Aβ-amyloid-PET using deep learning

Abstract

Similar content being viewed by others

Deep learning detection of informative features in tau PET for Alzheimer’s disease classification

A deep learning MRI approach outperforms other biomarkers of prodromal Alzheimer’s disease

Dissociation of tau pathology and neuronal hypometabolism within the ATN framework of Alzheimer’s disease

Introduction