Introduction

Alzheimer’s disease (AD) is a neurodegenerative disease characterized by progressive decline of cognition caused by deposition of amyloid-β protein. Repeated failures of a clinical trial for disease-modifying therapy in AD emphasize the need for precise diagnosis in the early phase (Mangialasche et al., 2010). Genetic testing and pathological biomarkers for AD have advanced, but there are many limitations for clinical application. The centerpieces of AD diagnosis are clinical access, cognitive function tests, laboratory tests, and brain imaging.

Magnetic resonance (MR) brain imaging is a useful tool for diagnosing AD by identifying cerebral degeneration or other structural abnormalities that can cause a cognitive decline. The most common finding of brain imaging in AD is parieto-temporal lobe dominant cortical atrophy, which usually is accompanied by hippocampal atrophy. However, these signs can be unremarkable in early AD. In the preclinical stage, brain atrophy in specific regions can be detected before the presence of AD symptoms (Tan et al., 2014), but brain atrophy in AD can vary and be combined with other damage (Poulakis et al., 2018). Neurodegeneration due to mixed pathology interferes with a precise diagnosis of AD by imaging. A quantitative analysis of cortical thickness, connectivity, and brain volume can help to assess dementia and discriminate AD from other types of dementia (Vernooij et al., 2018). Difficulty of the analysis process and the use of tool for it may be a factor limiting to access for those unfamiliar with it, and interpreting results of the analysis may have less clinical significance due to differences between individuals. Although there are methodical difficulties and some applicatory limitations by individual, the availability of these analyses is increasing through the spread of high-resolution MR and the technical advancement of imaging analysis.

An easy, fast, and accurate measurement of brain volume can help clinicians to make an early diagnosis of AD or other degenerative diseases. Especially, regional brain volumes and their ratios can provide a clinical clue for detecting mild cognitive impairment (MCI) and AD dementia (ADD) (Risacher et al., 2017). We developed an automated program to measure regional volumes and their ratios among normal control (NC), MCI, and ADD patients using brain MRI imaging. The aim of the present study was to validate the automated program by comparisons with previous volumetric methods and to compare the regional volumes and their ratios including volume ratio to cerebellar volume among NC, MCI, and ADD groups. Finally, we evaluated the diagnostic performance of regional volume ratio alone or combined with MMSE for discrimination of MCI and ADD from NC.

Methods

Patients

We conducted a retrospective observational study from patients with mild to moderate ADD, MCI, and NC. The participants were enrolled at the dementia clinics of Seoul St. Mary’s Hospital, Yeouido St. Mary’s Hospital, and Chung-Ang University Hospital between January 2011 and December 2018. A diagnosis of ADD was based on the criteria of the National Institute of Neurological and Communicative Disorders and Stroke – Alzheimer’s Disease and Related Disorders Association or the Diagnostic and Statistical Manual of Mental Disorder, fifth edition (DSM-5). MCI was diagnosed on the basis of the Petersen amnestic MCI criteria (Petersen, 2004). Subjects with depression documented on the basis of a score greater than 7 on the short form of the Geriatric Depression Scale (GDS) were excluded.

NCs were cognitively and functionally normal, independent, and fulfilled the health-screening exclusion criteria of Christensen et al. (Christensen et al., 1990). They had MMSE scores above 1.5 SD of age- and education-adjusted norms.

Demographic information on age, sex, and educational history were collected. All enrolled subjects were evaluated with the MMSE, Clinical Dementia Rating (CDR), and GDS.

Magnetic resonance imaging

The imaging data were collected with T1-weighted (T1W) three-dimensional (3D) MR images acquired using a 1.5 T MR scanner (Signa HDxt, GE Medical Systems, Milwaukee, WI, USA) or a 3 T MR scanner (Philips Intera Achieva, Amsterdam, Netherlands) equipped with an 8-channel sensitivity-encoding head coil. An axial MPRAGE sequence was used to acquire T1W 3D images with the following MR parameters: TR = 1,780 ms, TE = 2.2 ms, FA = 9°, FOV = 256 × 256 × 256 mm, voxel size 1 × 1 × 1 mm, and thickness = 1.0 mm.

Automated volumetry program

The automated software for volumetric analysis of brain MR images is named Quick Brain Volumetry (QBraVo) and was programmed by J.M.L. and K.C.K. QBraVo, based on the 8th version of the Statistical Parametric Map (SPM8) package (Wellcome Trust Centre for Human Neuroimaging, London, UK; http://www.fil.ion.ucl.ac.uk/spm) and MATLAB (The Mathworks, Inc., Natick, MA, USA), provides a simple interface to input data and quickly measures a volume of segmented brain in regional divisions.

QBraVo registers native MR images into a standardized stereotaxic space using a 12-parameter transformation process called normalization. The normalized images undergo the unified segmentation procedure implemented in SPM8. In brief, the unified algorithm combines voxel intensity, imaging noise, and non-linear registration in tissue probabilistic models, being used as a priori information for the tissue classes. Then, the segmentation procedure automatically divides the structural image into gray matter (GM), white matter (WM), and cerebrospinal fluid (CSF). However, the classification is probabilistic in the sense that a probability value of belonging to each of the tissue classes is assigned to each voxel in the output images. For regional division of the brain, the regional masks were manually mapped onto the MNI template. Based on neuroanatomical boundaries, the regional masks display border lines dividing the brain bilaterally into 6 frontal (orbital, anterior, anterior medial, dorsolateral, inferior, and posterior medial), 3 temporal (anterior, medial, and lateral), and 2 parietal (medial and lateral) areas as well as occipital, central, cerebellar, and ventricular areas (Fig. 1). Anatomical sub-divisions of the brain have the advantage of easy access over functional sub-divisions. The volume of regional brain is the sum of tissue proportions for each region in all voxels multiplied by voxel size. Finally, QBraVo calculates volumes of the 26 brain regions (GM plus WM), total brain volume (TBV), total cerebrospinal fluid volume (TCV), ventricular volume, and total intracranial volume (TIV). TBV is the sum of all GM and WM volumes, and TIV is the sum of TBV and TCV.

Fig. 1
figure 1

Regional mask with borderlines dividing the brain into 26 regions and CSF space. F: Frontal lobe, T: Temporal lobe, P: Parietal lobe

Though QBraVo uses SPM8-based algorithms for measurements of brain volume, several factors were advanced for improved reliability and usability in clinics. A more accurate template (avgT1_Dartel_IXI550_MNI152.nii) replaced the previous template (icbm_avg_152_t1_tal_lin.nii) used for normalization in SPM8. QBraVo integrated multiple processes into an automated analytic course. Furthermore, QBraVo used anatomic division of the brain that might have advantages in accuracy for regional volume analysis due to larger areas and clear borders.

Validation

Assessments of consistency and accuracy were based on intra- and inter-rater reliability of QBraVo. First, the intra-rater reliability of QBraVo was measured using MR images for all enrolled subjects for validation. Agreement of measured TBVs between repeated analyses for each subject was evaluated using test–retest reliability. Second, QBraVo was validated by inter-rater reliability with manual volumetry, which has been considered the gold standard of volume measurements, by the Intraclass Correlation Coefficient (ICC). The manual measurements of brain volume were conducted by a skilled researcher with expertise in MR image analysis (J. H. C.). The TBV, TCV, and TIV quantities measured using QBraVo were compared with those obtained using manual volumetry. For comparisons with other volumetric programs, ICCs of TBV, TCV, and TIV were measured with manual volumetry using SPM8 and FreeSurfer (The General Hospital Corporation, Boston, MA, USA).

Regional volume analysis

The regional volumes and Volume to TIV ratio (V:TIV) measured using QBraVo were compared among ADD, MCI, and NC groups. The V:TIV represents atrophic degree in the region. Regional brain Volume to TBV ratio (V:TBV) and regional CSF Volume to TCV ratio (V:TCV) also were compared. Comparisons of V:TBV or V:TCV among the groups demonstrate the relative atrophic degree to total brain atrophy and can reveal regional severity of degeneration in AD. We found that the volume ratio between the most and the least degenerated regions could help to diagnose ADD and MCI, and the standardized atrophy volume ratio (SAVR), which is a regional volume divided by that of the least degenerated region, was designed to discriminate among the groups. In this study, SAVR was used to compare regional volume with that of the cerebellum for discrimination of ADD and MCI from NC, and this calculation was similar to that for the standardized uptake value ratio (SUVR) in amyloid PET. The SAVR of the most degenerated brain region was analyzed using MMSE, which is the most commonly used screening test for general cognitive function, for diagnosis MCI or ADD. Using a combined predictive model with MMSE and SAVR, the effectiveness of SAVR was evaluated to assist in diagnosis MCI or ADD.

Statistical analysis

All statistical analyses were conducted with SPSS software version 24.0 for Windows (SPSS Incorporated, Chicago, IL, USA) and the R Statistical Environment (R Foundation for Statistical Computing, Vienna, Austria). ICCs were calculated to determine the degrees of intra-rater and inter-rater reliability for validation of QBraVo. One-way analysis of variance (ANOVA) was used to compare mean differences among the 3 groups. For multiple test correction, Tukey’s range test was used in subgroup comparison. Linear by linear association tests were used to compare frequencies for categorical variables. Diagnostic performance was assessed using AUC, sensitivity, specificity, and diagnostic accuracy in ROC curves. Combined predictive models for ADD vs. NC, MCI vs. NC, and ADD vs. MCI were created using a logistic regression that included MMSE and SAVR of the most degenerated region. The ROC curves were compared using the Delong method. Statistical significance was set as p < 0.05.

Standard protocol approvals, registrations, and patient consents

The study protocol was approved by the institutional review board and the ethical standard committee at our institution. The ethics board determined that participant consent was not required for the retrospective observations.

Results

Patients

During the study period, 56 NCs, 44 MCI patients, and 45 ADD were enrolled. Clinical characteristics of the subjects are summarized in Table 1. Patients with ADD and MCI were significantly older than the NC subjects (71.8 ± 8.7 vs. 71.5 ± 6.3 vs. 67.9 ± 5.9; ADD vs. NC, p = 0.017; MCI vs. NC, p = 0.032). However, differences in sex distribution and educational duration were not significant among the groups.

Table 1 Clinical characteristics and measured brain volumes of subjects

The MMSE total score were significantly different among ADD, MCI, and NC (21.5 ± 5.3, 26.2 ± 2.3, and 28.2 ± 1.6, respectively; p < 0.001). Global CDR also significantly differed among groups (ADD = 0.84 ± 0.55, MCI = 0.48 ± 0.15, NC = 0.02 ± 0.09; p < 0.001). Though patients with mild to moderate ADD were included, their mean CDR was relatively low. Among total ADD subjects, the numbers of subjects with CDR 0.5, CDR 1, and CDR 2 were 26 (57.8%), 14 (31.1%), and 5 (11.1%), respectively.

The TIV calculated by QBraVo showed no significant difference among ADD, MCI, and NC groups (1349.6 ± 129.8, 1387.7 ± 111.9, and 1344.33 ± 3.8, respectively; p = 0.15). Though TBV did not significantly differ among them (ADD: 1109.5 ± 110.4, MCI: 1158.9 ± 89.8, NC: 1134.7 ± 114.7; p = 0.09), TBV to TIV ratio was statistically smaller in ADD than in NC (ADD: 82.2 ± 3.1, MCI: 83.6 ± 3.5, NC: 84.3 ± 3.8; ADD vs. NC: p = 0.009). In addition, TCV and TCV to TIV ratio were larger in ADD than in NC.

Validation of QBraVo

The average QBraVo run time for a single data analysis was 5 min 36 s, producing quick analytic volumetric results for optimal patient care. The ICC for TBV of QBraVo to measure intra-rater reliability was 0.999 by test–retest reliability. ICCs for volumetric measurement methods were compared with the manual measurements as shown in Table 2. The ICC of QBraVo was superior to those of SPM8 and FreeSurfer in all brain locations. The inter-rater reliability of QBraVo was high, with ICCs of 0.97 in TBV, 0.89 in TCV, and 0.93 in TIV. The ICCs for SPM8 were 0.94 in TBV, 0.37 in TCV, and 0.73 in TIV, while the ICCs of FreeSurfer were 0.79 in TBV, 0.39 in TCV, and 0.70 in TIV.

Table 2 Inter-rater reliability of automated volumetric methods compared with manual measurements of brain volume

Regional volume analysis

Comparisons of the regional volumes and their ratios among NC, MCI, and ADD are listed in Table 3. Medial temporal and anterior temporal volumes were significantly smaller in ADD than in NC. Ventricular volume was larger in ADD and MCI than in NC. Anterior frontal, orbital frontal, and medial temporal volume were smaller in ADD than in MCI.

Table 3 Comparisons of regional brain volumes and ratios

Differences between the groups were more significant for V:TIV than for raw regional volume. Except for the inferior frontal, posterior medial frontal, medial parietal, occipital area, and cerebellum, V:TIVs were smaller in ADD than in NC. Medial temporal V:TIV was smaller and ventricular volume was larger in MCI than in NC.

Through comparisons of V:TBV or V:TCV, medial temporal lobe and ventricle were more degenerated than other brain regions in MCI or ADD compared to NC. Cerebellum was the least degenerated region in MCI or ADD.

Table 4 shows the performances of reginal brain volumes and volume ratios, which are represented as AUCs with sensitivity and specificity at the optimal cut-off, for discriminating ADD or MCI from other groups. Although the raw regional volumes were better than V:TIV in some regions for distinguishing ADD from MCI, the discriminating performance was generally better for V:TIV than for raw regional volume. Medial temporal V:TIV had the best performance for discriminating ADD or MCI from NC (ADD vs. NC: AUC = 0.771, sensitivity = 64.4%, specificity = 83.9%; MCI vs. NC: AUC = 0.684, sensitivity = 36.4%, specificity = 94.6%). To distinguish ADD from MCI, orbital frontal lobe showed the highest performance (volume, AUC = 0.696, sensitivity = 60.0%, specificity = 75.0%).

Table 4 Discriminating performance of regional volumes and ratios among ADD, MCI, and NC

The discriminating performance of V:TBV or V:TCV was lower than that of V:TIV in most brain regions. However, cerebellar V:TBV had better performance than V:TIV and increased in opposite order to V:TIV, with NC < MCI < ADD. These results suggest that the cerebellum was less degenerated than other regions (cerebellar V:TBV for ADD vs. NC: AUC = 0.717, cerebellar V:TIV for ADD vs. NC: AUC = 0.431).

Comparisons between MMSE, SAVR, and the combined model with MMSE and SAVR are shown in Table 5. Though MMSE had a high diagnostic performance for ADD vs. NC (AUC: 0.941, sensitivity: 82.2%, specificity: 90.1%, cut-off: 25.5), it did not show high performance for MCI vs. NC (AUC: 0.771, sensitivity: 65.9%, specificity: 73.2%, cut-off: 27.5) or ADD vs. MCI (AUC: 0.817, sensitivity: 82.2%, specificity: 65.9%, cut-off: 25.5). For discriminating MCI from NC, medial temporal SAVR (AUC = 0.747, diagnostic accuracy = 73%) was not superior to MMSE by comparison of the ROC curves. However, diagnostic ability of the combined model (MMSE + SAVR) was significantly superior for discriminating MCI from NC (AUC: 0.844, sensitivity: 65.9%, specificity: 94.6%) (Fig. 2.A). For ADD vs. NC, medial temporal SAVR showed lower performance than MMSE (AUC: 0.808, sensitivity: 60.0%, specificity: 96.4%). Diagnostic performance of the MMSE + SAVR combined model was the best for ADD vs. NC (AUC: 0.950, sensitivity: 86.7%, specificity: 98.1%). Although diagnostic performance of the combined model was not statistically superior to that of MMSE, its AUC and diagnostic accuracy were better than those of MMSE (Fig. 2.B). The performances for discriminating ADD from MCI were not significantly different among MMSE, orbital frontal SAVR, and the combined model. The combined model for ADD vs. MCI showed AUC: 0.855, sensitivity: 82.2%, and specificity: 81.0%.

Table 5 Comparisons of discrimination performances among MMSE, SAVR, and the combined predictive models of the most degenerated brain regions
Fig. 2
figure 2

ROC curves of MMSE, medial temporal SAVR, and the combined prediction models for discrimination of MCI from NC (A) and of ADD from NC (B). MMSE: Mini Mental Status Examination, SAVR: Standardized Atrophy Volume Ratio, ADD: Alzheimer’s Disease Dementia, MCI: Mild Cognitive Impairment, NC: Normal Control

Discussion

Brain volume has clinical implications beyond the individual characteristics of ethnicity or sex. Brain volume is closely related to aging and can reflect cognitive function or brain activity (Kiraly et al., 2016; Qing & Gong, 2016). In dementia risk analysis, brain volume has been used as an indicator of cognitive reserve (van Loenhoud et al., 2018). Due to the correlation of lower brain volume with cognitive decline (Vibha et al., 2018), brain atrophy enables suspicion of dementia. Furthermore, it is hypothesized that a decrease in brain volume indicates neuropathological progression in AD (Reiter et al., 2017). Accurate longitudinal measurements of brain volume can reveal the status of a degenerating brain. In this study, we measured regional brain volume using an automated program that we developed for quick and accurate measurements and effectively discriminated ADD and MCI from NC using medial temporal volume normalized with cerebellar volume.

It has been demonstrated that AD can be classified into four atrophy subtypes of medial temporal dominant, parieto-occipital dominant, diffuse cortical, and mild atrophy (Ten Kate et al., 2018). The hippocampal sparing variant in AD, which displays parieto-occipital dominant or mild atrophy, has been suspected to have limitation in assessment of hippocampal atrophy (Murray et al., 2011). However, hippocampal sparing AD also demonstrated medial temporal atrophy including that of the hippocampus, though its volume loss can be less than those of other subtypes (Ten Kate et al., 2018). In the present study, medial temporal volume and SAVR showed high accuracy to discriminate ADD from NC. However, they exhibited lower performance to distinguish between MCI and NC because hippocampal sparing variants are more frequent in MCI than in AD (Ten Kate et al., 2018). However, measurement of medial temporal volume decline increased the accuracy of diagnosis of MCI patients in our study.

Medial temporal lobe atrophy and hippocampal atrophy have been reported as diagnostic markers specific to ADD (Lane et al., 2018). The results of this study using QBraVo correspond well with those observations. The medial temporal V:TIV had the highest accuracy for diagnosing ADD, with a sensitivity of 64.4% and specificity of 83.9%. It can be hypothesized that QBraVo can not only measure accurate brain volume, but also precisely conduct regional discrimination. The previous automated programs had low accuracy of measurement in the hippocampus (Guenette et al., 2018). Measurement of medial temporal volume could effectively substitute for hippocampal volume in the current study.

In this study, medial temporal lobe was the most degenerative region relative to total brain atrophy in ADD and MCI. On the contrary, the least degenerative region in ADD and MCI was the cerebellum, which showed AD pathology in the final stages in an earlier autopsy study (Braak & Braak, 1991). Progression of deposition of β-amyloid occurs from the neocortex and hippocampus in the early phase to the cerebellum in the late phase. The volume reduction according to the increase in CDR was the least significant in the cerebellum (Ramos Bernardes da Silva Filho et al., 2017). In addition, there is the report that atrophy was observed after MCI state in cerebellum except vermis and paravermian lobules (Toniolo et al., 2018). Despite this, the cerebellum might be involved in cognition and presented reduced activation on functional imaging (Jacobs et al., 2018). A previous study has reported cerebellar GM volume to be lower in MCI than in NC (Möller et al., 2013). Further evaluations of cerebellar volume change are needed to prove the role of the cerebellum in AD.

In the validation of QBraVo, it took about 5 to 6 min to obtain results after input of preprocessed MR images. The runtimes of QBraVo were considerably shorter than those of the other methods. Manual volumetry commonly takes several days with a skilled analyst. FreeSurfer requires approximately 20 to 40 h to process both hemispheres. About 8 h is required for volume measurements using Inbrain (MIDAS Information Technology Corporation, Seongnam, Republic of Korea), which has been recently commercialized. We demonstrated that QBraVo is faster than previously used methods for volume measurements. Furthermore, QBraVo showed excellent reproducibility and relatively high accuracy that were confirmed by a significant correlation with the results of manual volumetry in the present study. This suggests QBraVo as an easy and rapid tool to measure brain volume compared to the traditional volumetric methods.

Compared with manual volumetry, the ICC was higher for TBV than for TCV or TIV. This is because the analysis of QBraVo is based on T1W images, which are limited in differentiating surface CSF from skull bone. This is similar to the results of an SPM validation in an earlier study (Heinen et al., 2016). Overestimation by SPM is caused by probabilistic segmentation that can include tissues outside the subarachnoid space in CSF (Nordenskjöld et al., 2013). Because the regional mask was created with strict limitations on the sub-arachnoid space, QBraVo showed improved reliability not only in TBV, but also in TCV compared to SPM and FreeSurfer. Although volume measurements in regions including surface CSF have relatively lower accuracy, QBraVo showed excellent ICCs in TCV and TIV.

In the present study, although subjects with ADD or MCI were older than the NCs, other clinical characteristics did not differ among them. Though old age can affect brain volume decrements in ADD and MCI, the effects of aging were not statistically significant by multiple regression analysis.

In the analysis of regional brain volume, discrimination performance by V:TIV was generally better than that by raw regional volume, which implies that brain volume normalized by TIV could adjust for head size variation among subjects. However, raw regional volume showed better performance than V:TIV in some regions for distinguishing between ADD and MCI, possibly because brain volume in MCI was larger than in ADD.

The V:TBV or V:TCV did not show higher performance than V:TIV for discriminating among the groups. However, performances of V:TBV/V:TCV in more or less degenerated regions were similar to that of V:TIV. In MCI and ADD, the more degenerated regions were medial temporal, anterior temporal lobe, and ventricle, and the less degenerated regions were posterior medial frontal lobe and cerebellum.

Ventricular volume and ratios also showed good performance to distinguish ADD and MCI. It has been documented that WM changes are associated with ventricular enlargement in AD (Coutu et al., 2016). Researchers have theorized that subcortical WM changes occur relatively quickly in AD. However, further investigations are needed to verify this hypothesis.

The SAVR of the most degenerated regions showed larger AUC and higher diagnostic accuracy than V:TIV for differentiating among the groups. In comparison of ROC curves, medial temporal SAVR was inferior to MMSE for ADD vs. NC but was not inferior to MMSE for MCI vs. NC. The difference of performance between orbital frontal SAVR and MMSE was not significant for ADD vs. MCI. Accordingly, it was posited that SAVR can clarify the difference of regional atrophy between groups. Especially, SAVR was effective for discrimination of ADD and MCI from NC, which suggests that normalization of regional brain volume by cerebellar volume is more sensitive than normalization by TIV in AD spectrum neurodegenerative cognitive disorders. In an earlier study, hippocampal volume to neocortical volume ratio was a predictor of cognition and subtype in AD (Risacher et al., 2017). Volume comparisons with the cerebellum have not been used previously. The combined model using MMSE and medial temporal SAVR had higher performance for discriminating MCI from NC than did MMSE alone, which suggests that SAVR is an advantageous method to assess AD in the early phase.

Cerebellar volume may change later in time compared with other regions of the brain in AD. This timing gap of the change between the comparison regions can make that SAVR exhibits steeper changing which can more clearly distinguish the differences between the groups in the earlier phase. Although there were many studies using regional volume normalized with total brain volume or intracranial volume, few studies attempted to compare between regions. The newly attempted metrics of SAVR in this study could increase the accuracy of diagnosis of MCI with MMSE, which suggests that the relative ratio of regional brain volume is possible as a biomarker of AD. Changes in brain volume have been used to predict the AD converter from NC or MCI in previous studies (Tabatabaei-Jafari et al., 2019; Vogel et al., 2018). In the future studies, we can evaluate whether longitudinal measurements of SAVR predict AD progression, distinguish AD from other dementia, such as dementia with Lewy body and frontotemporal dementia, or classify AD subtypes.

Changes in brain volume was not recommended as a single biomarker for AD (Lombardi et al., 2020), so there have been attempts to improve accuracy of volume measurements, such as analysis volume of GM and WM separately, and subdivision of hippocampus (Moon et al., 2017; Parker et al., 2019). It is suggested that ratio-based metrics including SAVR can increase accuracy of volume analysis. Recent study demonstrated that combined diagnostic model with other factors including genetic profile and cognitive functions and extracting multi-parameter from brain images can increase diagnostic accuracy in AD (Saribudak et al., 2020; Zhang & Liu, 2018). Based on this study, various metrics of regional brain volume ratio such as SAVR can be used for AD analyses in the future, and the combined model with multiple parameters can improve AD diagnosis and classification.

Quantitative information about brain volume can improve diagnostic capabilities in a clinic. However, AD diagnosis using volumetric data alone shows decreased accuracy. Volumetric data should be added to visual assessment by radiologists, clinical information, and a neurocognitive evaluation for better results.

Previously, many analytic studies about brain volume have been conducted with SPM or freesurfer. Although various tools for image analysis and automated programs have recently been applied, few studies have confirmed the superiority in volume measurement itself. Since development of these various tools can increase the accessibility of image analyses, it can be an important field as much as the analysis itself. Currently, several automated programs are commercially available, which can cost a lot to use and seem to be focused on GM measurements in volume analysis. QBraVo was developed with a focus on measuring anatomical regional volume similar to the approach used for macroscopic image interpretation, so it can have a low sense of difference with actual image reading and be practical to use. We expect that various comparative study of regional brain volume will be possible using QBraVo which will be released free of charge in the future with high accuracy and rapid results. We have a plan that QBraVo will be compared with other automated programs later.

Although the brain volume measurements using the automated analytic program were precise in previous validations, those results could have been affected by the imaging analytic program and the MR sequence parameters (Haller et al., 2016). To overcome these biases, strict standardization should be applied to all analysis processes. Analysis in specific regions, which can be conducted with insufficient precision, are necessary to be corrected in a regular range.

The limitations of our study are as follows. First, the number of enrolled subjects was relatively small to obtain high statistical power; therefore, the results of this study should be carefully interpreted. Second, the enrolled patients with ADD could be heterogeneous because their diagnoses were clinical without pathologic confirmation. though some subjects with ADD or MCI were confirmed with F18-florbetaben PET. Third, while subjects with a brain lesion were excluded, vascular damage was not evaluated in this study. Vascular lesions can affect not only brain volume, but also signal intensity on MR. Fourth, we did not measure volume of GM, WM, or hippocampus, which can provide more specific information for AD. Further studies on these topics are needed. Finally, regional brain volume may vary depending on multiple factors, so particular attention should be paid to interpret. Especially, cerebellum may display variable volume between individuals. Additional studies for verification should be conducted.

Conclusions

Our results suggest that SAVR is an effective method for regional volume comparisons in AD. In addition, the combined model with SAVR and MMSE could make more accurate guidance for diagnostic decision-making compared with SAVR alone. However, SAVR should be validated in other large populations and evaluated for diagnosis of other degenerative diseases including dementia with Lewy body and frontotemporal dementia.

In our present study, we confirmed that QBraVo was rapid, easy to use, and showed high reliability to measure brain volume. We suggest that medial temporal lobe to cerebellum volume ratio is an effective method to diagnosis ADD and MCI independently or in combination with the simple cognitive screening test MMSE.