Multiparametric computer-aided differential diagnosis of Alzheimer’s disease and frontotemporal dementia using structural and advanced MRI
To investigate the added diagnostic value of arterial spin labelling (ASL) and diffusion tensor imaging (DTI) to structural MRI for computer-aided classification of Alzheimer's disease (AD), frontotemporal dementia (FTD), and controls.
This retrospective study used MRI data from 24 early-onset AD and 33 early-onset FTD patients and 34 controls (CN). Classification was based on voxel-wise feature maps derived from structural MRI, ASL, and DTI. Support vector machines (SVMs) were trained to classify AD versus CN (AD-CN), FTD-CN, AD-FTD, and AD-FTD-CN (multi-class). Classification performance was assessed by the area under the receiver-operating-characteristic curve (AUC) and accuracy. Using SVM significance maps, we analysed contributions of brain regions.
Combining ASL and DTI with structural MRI resulted in higher classification performance for differential diagnosis of AD and FTD (AUC = 84%; p = 0.05) than using structural MRI by itself (AUC = 72%). The performance of ASL and DTI themselves did not improve over structural MRI. The classifications were driven by different brain regions for ASL and DTI than for structural MRI, suggesting complementary information.
ASL and DTI are promising additions to structural MRI for classification of early-onset AD, early-onset FTD, and controls, and may improve the computer-aided differential diagnosis on a single-subject level.
• Multiparametric MRI is promising for computer-aided diagnosis of early-onset AD and FTD.
• Diagnosis is driven by different brain regions when using different MRI methods.
• Combining structural MRI, ASL, and DTI may improve differential diagnosis of dementia.
KeywordsClassification Dementia Differential diagnosis Perfusion Diffusion tensor imaging
Arterial spin labelling
Area under the ROC curve
Cerebral blood flow
Cognitively normal controls
Diffusion tensor imaging
Mini-Mental State Examination
Support vector machine
T1-weighted structural MRI
Alzheimer's disease (AD) and frontotemporal dementia (FTD) are major diseases underlying dementia, especially in younger patients (age < 65 years) . Establishing an accurate diagnosis in the early stage of the disease can be difficult. Although clinical symptomatology differs between the diseases, symptoms in the early stage may be unclear and can overlap [2, 3]. The current clinical criteria, which entail qualitative inspection of neuroimaging, fail to accurately differentiate AD from FTD . However, early and accurate differential diagnosis of AD and FTD is very important, mainly because it gives patients access to supportive therapies [5, 6]. In addition, early diagnosis supports new research into understanding the disease process and developing new treatments [5, 6].
In this difficult case of differential diagnosis between AD and FTD, methods for computer-aided diagnosis may be beneficial. These methods make use of multivariate data analysis techniques that train a model (classifier) based on neuroimaging or related data, resulting in an objective diagnosis. In addition, computer-aided diagnosis can be more accurate than using only clinical criteria , as it potentially makes use of subtle group differences. Using structural T1-weighted (T1w) MRI to find characteristic patterns of brain atrophy, computer-aided diagnosis methods yielded accuracy of up to 84% for differentiation of AD and FTD [8, 9, 10].
Besides using structural MRI, evidence of neurodegeneration can be measured with advanced MRI techniques such as arterial spin labelling (ASL) and diffusion tensor imaging (DTI). ASL can non-invasively measure brain perfusion in terms of cerebral blood flow (CBF) [11, 12]. Recent studies have shown differences in perfusion patterns for FTD and AD indicating that this technique is promising for differential diagnosis [13, 14, 15, 16]. In addition, some classification studies showed an added value of ASL over atrophy measurements for AD diagnosis in individual patients, although others did not [13, 17, 18, 19]. Using DTI, the fractional anisotropy (FA) can be quantified, which is related to the degradation of white matter (WM) bundles. WM degradation has been shown to be more prominent in FTD than in AD, especially in frontal brain regions [14, 20, 21]. In classification studies, DTI generally shows a slight added value to atrophy measurements [22, 23, 24, 25, 26, 27, 28].
As ASL and DTI measure aspects of the neurodegenerative process that are different from brain volume changes, we hypothesise that these techniques have an added diagnostic value over structural MRI. Although ASL and DTI have been shown to be potential markers for differential diagnosis of AD and FTD, their combined added value for computer-aided differential diagnosis has not yet been evaluated. This study aims to investigate the added diagnostic value of ASL and DTI to structural MRI for classification of AD, FTD, and controls.
Materials and methods
We retrospectively included 24 AD patients, 33 FTD patients, and 34 cognitively normal (CN) controls. Patients who visited the memory clinic of our institution between February 2011 and June 2015 were considered for inclusion. Patients underwent neurological and neuropsychological examination as part of their diagnostic work-up. Patients with a Mini-Mental State Examination (MMSE) score ≥ 20 were included if they had undergone MR imaging with a standardised protocol including structural T1w MRI, ASL, and DTI. Patients with psychiatric or neurological disorders other than dementia were excluded. The reference standard was a diagnosis of AD or FTD established by consensus of a multidisciplinary team according to the clinical criteria [2, 3, 29]. Controls were recruited from patient peers and through advertisement, and had no memory complaints, history of neurological or psychiatric disease, or contra-indications for MRI.
This study was approved by the local medical ethics committee. Eighty-seven participants signed informed consent; consent from the remaining four patients was waived because of the retrospective nature of the study.
Image acquisition and processing
MRI acquisition parameters
3D IR FSPGR
2D single-shot EPI
Scan parameters (TI/TR/TE)
450 ms / 7.9 ms / 3.1 ms
1525 msa / 4632 ms / 10.5 ms
N.A. / 7925 ms / 82 ms
1 mm isotropic
3.3 mm isotropic
1.9 × 1.9 in-plane
240 × 240 × 176
512 sampling points on 8 spirals
128 × 128
Reconstructed voxel size
0.9 × 0.9 × 1.0 mm (sagittal)
1.9 × 1.9 × 4.0 mm (axial)
0.9 × 0.9× 2.5 mm or 0.9 × 0.9 × 2.9 mm (axial)
Number of excitations
Interleaved fast spin echo
No. b0 volumes (b-value = 0 s/mm2)
For image processing, the Iris pipeline  was applied to obtain voxel-based measures of structural MRI, ASL, and DTI (see Appendix A for a detailed description). From structural MRI, we derived tissue segmentations—WM, grey matter (GM), cerebrospinal fluid—and a brain mask. In a group template space, we derived features based on voxel-based morphometry (VBM) within a mask of the 1) GM (VBM-GM), 2) WM (VBM-WM) and 3) supratentorial brain (VBM-Brain). For ASL, CBF was quantified using a single-compartment model and partial volume correction. The CBF voxel values of the GM in the template space were used as features for classification. For DTI, tensor fits were performed to derive FA maps. The FA voxel values in WM in the template space were used as features for classification.
The following images were visually inspected (E.E.B., 5 years of experience): GM segmentation, WM segmentation, brain mask, template space registration, ASL registered to structural MRI, CBF map, DTI registered to structural MRI, and FA map. Any errors in the image processing were corrected until visual inspection revealed no more unacceptable results.
Analysis and statistics
GM combination: VBM-GM and CBF
WM combination: VBM-WM and FA
Full combination: VBM-Brain, CBF, and FA
For multi-class classification (AD-FTD-CN), pairwise classifiers were combined by multiplying the posterior probabilities. Using fourfold cross-validation, the mean area under the receiver operating characteristic curve (AUC), the mean accuracy, and standard deviations over 50 iterations were computed. The multi-class AUC was evaluated over pairs of classes , and the multi-class accuracy equalled the correctly classified rate.
Differences in mean AUC and accuracy were tested: 1) CBF versus VBM-GM, 2) FA versus VBM-WM, 3) GM combination versus VBM-GM, 4) WM combination versus VBM-WM, 5) Full combination versus VBM-Brain. This was done using non-parametric permutation tests: the difference in performance of the two classifications was compared (α ≤ 0.05) to a null distribution that was estimated using 500 permutations in which the labels were randomly distributed over the samples.
For detection of features that contributed significantly to the SVM, we calculated statistical significance maps (p-maps). These maps were computed on all data using an analytical expression that approximates permutation testing . Clusters of significant voxels were obtained by applying a slightly conservative p value threshold (α ≤ 0.01). We did not correct for multiple comparisons, as permutation testing has a low false-positive detection rate . The clusters’ locations were identified by visual inspection.
Age mean ± SD (range) [years]
67.1 ± 7.5 (52.4–81.3)
64.7 ± 8.8 (40.7–79.7)
64.7 ± 6.5 (46.5–78.8)
MMSE mean ± SD (range)b
24.1 ± 3.8 (15–30)a
25.3 ± 3.7 (15–30)a
28.7 ± 1.3 (25–30)
Age mean ± SD (range) [years]
67.3 ± 7.8 (52.4–81.3)
64.5 ± 8.2 (43.5–79.7)
66.6 ± 4.3 (58.1–78.8)
MMSE mean ± SD (range)b
24.1 ± 4.3 (15–29)a
25.1 ± 4.1 (15–30)a
28.4 ± 1.3 (25–30)
Age mean ± SD (range) [years]
66.9 ± 7.4 (60.8–79.4)
64.9 ± 9.6 (40.7–78.6)
61.4 ± 8.6 (46.5–75.5)
MMSE mean ± SD (range)b
24.2 ± 3.2 (20–30)
25.5 ± 3.4 (20–30)
29.3 ± 1.1 (27–30)
P values of the non-parametric permutation tests to test statistical differences between classifiers based on a) mean area under the ROC curve (AUC) and b) mean accuracy
CBF vs. VBM-GM
FA vs. VBM-WM
GM combination vs. VBM-GM
WM combination vs. VBM-WM
Full combination vs. VBM-Brain
a) Mean area under the ROC curve (AUC)
b) Mean accuracy
For AD-CN classification, mean AUCs were 92% (VBM-GM), 87% (VBM-WM), 94% (VBM-Brain), 89% (CBF), 89% (FA), 95% (GM combination), 91% (WM combination), and 98% (Full combination). Classification accuracy was slightly lower than AUC in general. The performance using CBF and FA features was similar to that of the VBM features. The feature combinations yielded slightly higher performance than the VBM features, but differences were not significant.
For FTD-CN classification, AUCs using VBM were somewhat higher than for AD-CN, but combination with FA and CBF did not improve performance. AUCs were 95% (VBM-GM), 96% (VBM-WM), 95% (VBM-Brain), 87% (CBF), 91% (FA), 93% (GM combination), 95% (WM combination), and 96% (Full combination).
For differential diagnosis of AD versus FTD, AUCs were 78% (VBM-GM), 76% (VBM-WM), 72% (VBM-Brain), 81% (CBF), 80% (FA), 84% (GM combination), 81% (WM combination), and 84% (Full combination). Combination with CBF and FA features improved performance over the use of VBM features only.For multi-class diagnosis of AD, FTD, and CN, mean AUCs were 85% (VBM-GM), 83% (VBM-WM), 84% (VBM-Brain), 82% (CBF), 83% (FA), 87% (GM combination), 85% (WM combination), and 90% (Full combination). Classification accuracy was lower, but it should be noted that for this three-class diagnosis, the accuracy for random guessing would be only ~33%. For multi-class classification, AUCs were highest for the combination methods. The method that combined VBM-Brain with CBF and FA yielded a significantly higher AUC (90 vs. 84%, p = 0.03) and accuracy (75 vs. 70%, p = 0.05) than VBM-Brain by itself. This is reflected in the examples of confusion matrices for one iteration of the cross-validation (Appendix C; Table C1), which show a higher number of correctly classified patients and controls for Full combination than for VBM-Brain. However, combining VBM with ASL or DTI may also reduce the number of correctly classified patients, e.g. GM Combination has a lower number of correctly classified FTD patients than VBM-GM, while accuracy is improved.
For VBM-WM (Fig. B1), we observed most clusters of significantly contributing voxels in the temporal lobe and around the ventricles. For AD-CN and FTD-CN classification, a smaller cluster of significant voxels in the corpus callosum was found. The temporal lobe clusters were present mainly in the left hemisphere, especially for AD-FTD differentiation.
For VBM-Brain (Fig. B2), p-maps were very smooth as the feature is formed by the Jacobian determinant of the spatially smooth deformation to template space. Smoothness is lost in VBM-GM and VBM-WM by multiplying the Jacobian determinant with the probabilistic tissue segmentations. For AD-CN, the classification was driven mainly by periventricular and left temporal lobe features. For FTD-CN, the temporal lobe contributed with the largest clusters of significant voxels. For AD-FTD, small clusters were found in the middle frontal gyrus, temporal lobe and periventricular regions.
For CBF (Fig. 4), p-maps showed small clusters of significant voxels in multiple brain regions. For AD-CN, significant voxels were observed mainly in the GM of the parietal lobe, precuneus, posterior cingulate gyrus, posterior temporal lobe and the insula. For FTD-CN, the main regions with significant voxels were the posterior cingulate gyrus, superior frontal gyrus, the straight gyrus, lingual gyrus and the putamen. For AD-FTD, the classification relied mainly on voxels from the posterior cingulate gyrus, parietal lobe, caudate nucleus, insula, temporal lobe and the cuneus.
For FA (Fig. 5), clusters of voxels in the corpus callosum and around the globus pallidus and putamen contributed significantly to the AD-CN classification. In addition, clusters of voxels in the visual and motor tracts contributed. For FTD-CN, the clusters of significant voxels were observed mainly in the anterior temporal lobe, the frontal WM, the corpus callosum, and language-associated tracts (uncinate fasciculus, superior longitudinal fasciculus). For the differential diagnosis of AD-FTD, fewer voxels were significant with only a cluster of significant voxels in the uncinate fasciculus.
Differential diagnosis of early-onset AD and FTD was improved (p = 0.03-0.05) by combining voxel-based features of ASL and DTI with those of structural MRI, however improvement was only borderline significant. For all classifications, ASL and DTI by themselves yielded performance similar to or slightly higher than structural MRI. While combining ASL and DTI with structural MRI improved differential diagnosis, no added value was observed for the classification of AD versus controls nor for the classification of FTD versus controls.
Classification performance was similar to that previously published on other data sets for pairwise differentiation of AD and FTD [8, 9], and slightly higher than that for multi-class classification . The combination of ASL and DTI for classification of AD, FTD, and controls has not been assessed before, and therefore cannot be directly compared to literature results. The techniques have been applied separately to pairwise classifications. In concordance with our results, most studies using DTI obtained good classification performance [23, 24, 26, 27], but indicated no significant improvement over structural MRI [22, 25, 28]. In contrast to our current and previous work , most ASL-based classification studies showed a significant added value to structural MRI [13, 17, 18]. This is partly due to the higher performance of structural MRI in our studies. Additionally, not all studies avoid overestimation of classification performance by using cross-validation. For ASL, this overestimation might be larger than for structural MRI, because of lower signal-to-noise ratio and robustness. Conclusions obtained with or without cross-validation can therefore be expected to differ.
This work is, to the best of our knowledge, the first to perform multiparametric classification of structural MRI, ASL, and DTI. Multiparametric classification on other modalities has previously used feature-level combination (e.g. one large feature vector) or classifier-level combination (e.g. combining classifier posterior probabilities). In this study, we averaged posterior probabilities of the individual classifiers, since we had previously found this to outperform feature-level approaches .
The SVM significance maps showed that the brain regions contributing to the classifications corresponded to those associated with AD or FTD, which indicates that the classifier makes plausible decisions. For structural MRI, the temporal lobes showed large clusters of significant voxels. While the medial temporal lobe (i.e. hippocampus, amygdala) largely contributed to the classifications of AD versus controls and FTD versus controls, the differentiation between AD and FTD was based mainly on anterior temporal lobe features, which corresponds to the literature on atrophy in AD [27, 36, 37, 38, 39] and FTD [27, 39]. ASL and DTI showed less influence of the temporal lobe. In the frontal and language-associated regions, DTI contributed to the classifications involving FTD. While frontal atrophy is expected in FTD [27, 39], no frontal lobe contribution was observed. ASL p-maps showed significant areas in the parietal lobe for classifications involving AD . While parietal lobe atrophy is often proposed as a differential marker [10, 27, 39], we did not find significant clusters in the VBM p-maps, which is in agreement with many VBM studies, e.g. [10, 41]. In addition to the parietal lobe, CBF in the cingulate gyri and subcortical structures—insula and caudate nucleus —showed significant features for AD and FTD classification. Finally, DTI captured the contribution of the corpus callosum for all classifications [20, 21]. Since the clusters of voxels influencing the classifications showed different brain regions for ASL and DTI compared to structural MRI, neuropathological processes with a spatial distribution other than atrophy are likely to be depicted.
Both the improved performances for differential diagnosis and the involvement of different brain regions suggest that ASL and DTI have additional diagnostic value to structural MRI and could improve diagnosis of individual AD and FTD patients. However, suboptimal image quality of these techniques in general, e.g. low signal-to-noise ratio, may have limited their diagnostic power when used separately. Similar to our findings, studies using data from the Alzheimer's Disease Neuroimaging Initiative 2 (ADNI 2) have shown that ASL and DTI separately provide information that is not available on structural MRI, but do not show better diagnostic power .
A limitation of this study is that the diagnosis was based on clinical criteria rather than post mortem histopathological examination. Although diagnosis was typically confirmed by follow-up, it is possible that some of the patients were misdiagnosed. Additionally, the size of our data set (24 AD, 33 FTD, 34 controls) was modest albeit comparable to that of other studies. Studies performing classification of AD and FTD using structural MRI data are typically of similar size [9, 13] (only larger in ). To obtain these group sizes, we did not limit inclusion to young-onset dementia, but included five AD and six FTD patients who were older than 70 years. In young-onset dementia, computer-aided differential diagnosis of FTD and AD would be most clinically relevant, as these patients show larger overlap of symptoms . Also, we pooled the patients of several FTD subgroups (bvFTD, SD, and PNFA), which could have influenced the classification results and the regions involved in classification. The modest data size did not allow for validation on a separate validation set; instead, cross-validation was used. In addition, potential vascular white matter damage in the AD group, e.g. infarcts and white matter hyperintensities, might have influenced the classification performance of DTI. However, we expect this effect to be small, as patients were excluded when they had a history of cerebrovascular accidents (CVA) or CVA reported in their MRI examination; additionally, they were relatively young.
Regarding these limitations and the results being only borderline significant, this study primarily has exploratory value. Future research on a larger and more specific presenile cohort is needed. To assess the generalisability of our conclusions, evaluation on multi-centre data and a separate validation set is necessary as well. With our current work, we presented a computer-aided diagnosis methodology based on structural MRI, ASL, and DTI which is ready to be evaluated on a larger data set when available.
In conclusion, we postulate that ASL and DTI are promising for multiparametric computer-aided diagnosis, since combining these techniques with structural MRI improved differentiation of early-onset AD and FTD in our study.
We would like to thank Inés Mérida and Sandrine Lacomme for their contributions to the early phase of this study. In addition, we acknowledge the European COST Action “Arterial spin labelling Initiative in Dementia (AID)” (BM1103).
The scientific guarantor of this publication is Esther E. Bron. W.J. Niessen declares relationships with the following companies: Quantib BV. Other authors of this manuscript declare no relationships with any companies whose products or services may be related to the subject matter of the article. This study has received funding through an Erasmus MC grant on “Advanced MR neuroimaging in presenile dementia”. W.J. Niessen and S. Klein acknowledge funding from the European Union Seventh Framework Programme (FP7/2007–2013) under grant agreement no. 601055, VPH-DARE@IT.
E.E. Bron and S. Klein have significant statistical expertise. Institutional Review Board approval was obtained. Written informed consent was obtained from eighty-seven subjects (patients) in this study; consent from the remaining four patients was waived by the Institutional Review Board because of the retrospective nature of the study. Some study subjects or cohorts have been previously reported in  and . First, we previously evaluated ASL for classification of dementia patients (n = 29) and controls (n = 29) on a subset of the data . Second, we evaluated the diagnostic value of regional measures of ASL using a subset of 13 Alzheimer’s disease patients, 19 frontotemporal dementia patients, and 22 controls . The current work extends this prior work to computer-aided differential diagnosis of Alzheimer’s disease and frontotemporal dementia, showing the added value of both ASL and DTI. Methodology: retrospective, diagnostic or prognostic study, performed at one institution.
- 2.McKhann GM, Knopman DS, Chertkow H et al (2011) The diagnosis of dementia due to Alzheimer’s disease: recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease. Alzheimers Dement 7:263–269CrossRefPubMedPubMedCentralGoogle Scholar
- 6.Prince M, Bryce R, Ferri C (2011) World Alzheimer Report 2011, The benefits of early diagnosis and intervention. Alzheimer’s Disease InternationalGoogle Scholar
- 8.Möller C, Pijnenburg YAL, Tijms B, Hafkemeijer A (2016) Alzheimer disease and behavioral variant frontotemporal dementia: automatic classification based on cortical atrophy for single-subject diagnosis. Radiology 1–11Google Scholar
- 9.Raamana PR, Rosen H, Miller B et al (2014) Three-class differential diagnosis among Alzheimer disease, frontotemporal dementia, and controls. Front Neurol 5:1–15Google Scholar
- 18.Mak HK-F, Qian W, Ng KS et al (2014) Combination of MRI hippocampal volumetry and arterial spin labeling MR perfusion at 3-Tesla improves the efficacy in discriminating Alzheimer’s disease from cognitively normal elderly adults. J Alzheimer Dis 41:749–758Google Scholar
- 21.Lu PH, Lee GJ, Shapira J et al (2014) Regional differences in white matter breakdown between frontotemporal dementia and early-onset Alzheimer’s disease. J Alzheimer Dis 39:261–269Google Scholar
- 31.Chang C-C, Lin C-J (2011) LIBSVM: A library for support vector machines. ACM TIST 2:27–27Google Scholar
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.