Comparing multiband and singleband EPI in NODDI at 3 T: what are the implications for reproducibility and study sample sizes?

Bouyagoub, Samira; Dowell, Nicholas G.; Gabel, Matt; Cercignani, Mara

doi:10.1007/s10334-020-00897-7

Comparing multiband and singleband EPI in NODDI at 3 T: what are the implications for reproducibility and study sample sizes?

Research Article
Open access
Published: 14 December 2020

Volume 34, pages 499–511, (2021)
Cite this article

Download PDF

You have full access to this open access article

Magnetic Resonance Materials in Physics, Biology and Medicine Aims and scope Submit manuscript

Comparing multiband and singleband EPI in NODDI at 3 T: what are the implications for reproducibility and study sample sizes?

Download PDF

Samira Bouyagoub ORCID: orcid.org/0000-0002-5046-8264¹,
Nicholas G. Dowell¹,
Matt Gabel¹ &
…
Mara Cercignani^1,2

3247 Accesses
7 Citations
2 Altmetric
Explore all metrics

Abstract

Objective

The reproducibility of Neurite orientation dispersion and density imaging (NODDI) metrics from time-saving multiband (MB) EPI compared with singleband (SB) has not been considered. This study aims to evaluate the reproducibility of NODDI parameters from SB and MB acquisitions, determine the agreement between acquisitions and estimate the sample sizes required to detect between-group change.

Methods

Brain diffusion MRI data were acquired using SB and MB (acceleration factors 2 (MB2) and 3 (MB3)) on 8 healthy subjects on 2 separate visits. NODDI maps of isotropic volume fraction (FISO), neurite density (NDI) and orientation dispersion index (ODI) were estimated. Region-of-interest analysis was performed; variability across subjects and visits was measured using coefficients of variation (CoV). Intraclass correlation coefficient and Bland–Altman analysis were performed to assess reproducibility and detect any systematic bias between SB, MB2 and MB3. Power calculations were used to determine sample sizes required to detect group differences.

Results

Both NDI and ODI were reproducible between visits; however, FISO was variable. All parameters were not reproducible across methods; a systematic bias was observed with the derived values decreasing as the MB factor increases. The number of subjects needed to detect a between-group change is not significantly different between methods; however, ODI needs considerably higher sample sizes than NDI.

Conclusions

Both SB and MB yield highly reproducible NDI and ODI measures, but direct comparison of these parameters between methods is complicated by systematic differences that exist between the two approaches.

In vivo human whole-brain Connectom diffusion MRI dataset at 760 µm isotropic resolution

Article Open access 29 April 2021

Reduction of bias in the evaluation of fractional anisotropy and mean diffusivity in magnetic resonance diffusion tensor imaging using region-of-interest methodology

Article Open access 11 September 2019

Scan–rescan and inter-vendor reproducibility of neurite orientation dispersion and density imaging metrics

Article Open access 27 December 2019

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Diffusion MRI (dMRI) is an established non-invasive MRI technique that is instrumental in characterising tissue microstructure by probing the diffusion properties of water molecules within the tissue over distances of a length scale comparable to that of cellular structures [1]. Several advanced diffusion models have been developed over the years to generate indices that quantify specific tissue properties relating to the geometry and organisation of neurites [2]. In contrast to the standard diffusion tensor imaging (DTI, [3]), these complex models require larger datasets acquired at multiple b values and higher angular resolution, often resulting in prohibitively long acquisition times for clinical applications. Among these models, neurite orientation dispersion and density imaging (NODDI) [4] has become very popular amongst multicompartment dMRI models, as it was designed to characterise axons and dendrites, within clinically feasible acquisition times. NODDI combines a hierarchical three-compartment model with a high angular resolution diffusion-weighted imaging (HARDI) protocol to differentiate the MRI signal from tissue and free water (isotropic compartment) within a voxel and intraneurite and extraneurite space within the tissue compartment.

NODDI estimates a number of quantitative parameters that characterise tissue microstructure voxelwise across the image. Orientation dispersion index (ODI) describes the organisation and orientation of neurites (axons and dendrites), neurite density index (NDI) is derived from the intraneurite volume fraction within the tissue compartment of a voxel and the volume fraction that undergoes isotropic diffusion (FISO) which is generally assumed to represent the CSF compartment within a voxel. NODDI also estimates fibre orientation vectors. NODDI’s ability to decouple NDI and ODI helped shed more light on the source of diffusion anisotropy, previously quantified in conventional DTI using the fractional anisotropy index (FA) [5]. This is particularly useful in the regions of crossing fibres which are often problematic for DTI [6]. However, the NODDI model has also attracted some criticism regarding the use of some assumptions that, if violated, can result in a bias in its parameters [7, 8]. These include the assumption of a common T2 for all compartments [9] and a common default value for the intrinsic diffusivity that is reasonable for white matter tissue but sub-optimal in gray matter [10, 11]. Although the model has some limitations, it is important to acknowledge its advantages in terms of feasibility, and the recent analysis shows agreement between NODDI metrics and histologically equivalent metrics [12]. NODDI has found a number of applications in neuroimaging from studying multiple sclerosis [12, 13], Alzheimer’s disease [14] and healthy neurodevelopment [15], to characterising myelination [16, 17], inflammation [18] and first-episode psychosis [19]. As NODDI usage increases, efforts have been made to establish the reliability and reproducibility of its indices. It has been demonstrated that NODDI is sensitive to field strength [20] and acquisition parameters such as the maximum b value and the number of diffusion-encoding directions [21], although the impact varies depending on the anatomical region of interest.

NODDI data can be acquired in clinically feasible times using the conventional echo planar imaging (EPI) approach that involves exciting a single slice at a time (singleband SB). However, with the introduction of the multiband (MB) or simultaneous multislice EPI [22, 23], the acquisition time for NODDI can be reduced further. In the MB technique, acquisition is accelerated by simultaneously exciting multiple slices using a single radio-frequency (RF) pulse, without significantly compromising the spatial resolution or the signal-to-noise ratio (SNR). By increasing the number of simultaneously excited slices, known as the MB factor, it is possible to reduce the repetition time (TR) and hence the total acquisition time. However, a known issue with multiband is the non-uniform noise caused by the geometrical arrangement of the receiver coils (the g factor) and this causes SNR to be different across different brain regions [24]. Although MB was introduced to increase temporal resolution for functional MRI [25], diffusion MRI can also benefit by significantly reducing the scan time. MB has already been incorporated in standard dMRI protocols such as the Human Connectome Project's diffusion MRI scanning protocol, which uses MB with acceleration factor 3. It is known that MB comes at a price of reduced SNR and increased T1-weighting and with the increased usage of MB in combination with NODDI it is important to study the effects of MB sequences on diffusion data and derived diffusion metrics. Duan et al. [26] investigated the reliability of MB-derived DTI measures and showed moderate to good repeatability, which varied between ROIs depending on their size and location; but this was a test–retest study and did not compare the results with SB-derived measures. Another work by Mitsuda et al. [27] examined the effects of MB-EPI sequence (MB factors of 2,3 and 4) on DTI measures from data acquired using 1.5 T scanner and 12 channel head coil compared with SB-EPI data. This study showed significant differences in FA and ADC between SB data and MB data with higher acceleration factors of 3 and 4. Bernstein et al. [28] conducted a bootstrap analysis to compare diffusion metrics derived from SB with those derived from MB (factor 3) both at similar and reduced TR in order to study the effects of MB reconstruction and TR shortening separately. The study revealed a bias in the MB-derived maps and demonstrated an increase in uncertainty for each parameter when the TR is short. Olson et al. [29] examined the effects of slice crosstalk on diffusion parameters in simultaneous multislice imaging. They found that interslice leakage between simultaneously excited slices had an effect on the reproducibility of diffusion metrics from higher level dMRI models more than DTI metrics.

These findings, combined with the observation that MB potentially introduces artefact, [30] and a signal-to-noise penalty, suggest that maps derived from NODDI might also be affected.

In this paper, we investigate the reproducibility of the NODDI indices and explore the intersession and intersubject variations in healthy volunteers, determine the agreement between MB and SB acquisition methods and to establish any systematic differences between MB and SB derived parameters. Finally, we perform power calculations in order to estimate the sample sizes required to detect a between-group change for both acquisition methods.

Materials and methods

Subjects and scan sessions

Eight healthy participants were recruited in this study: 7 male, median age 34 (range 22–40) years. Each participant was scanned on two separate visits, arranged between 2 and 7 days apart. We limited the gap between visits to a minimum of 2 days and maximum of 7 days; this is short enough to ensure that there is no physiological change in the participants for DWI, but long enough to take into account the different state of the scanner that is a potential source of variability. Each visit included NODDI acquisition with MB2 (Multiband with acceleration factor 2), MB3 (Multiband with acceleration factor 3) and conventional single band (SB). This study falls within the ethical approval granted as part of a larger methodological development study approved by the Brighton and Sussex Medical School Ethics Committee; all participants provided written informed consent.

The images were acquired using a Siemens 3 T Prisma scanner (Siemens, Erlangen, Germany) with a maximum gradient strength of 80 mT/m and a 32-channel head coil. The same pulse sequence developed by the University of Minnesota Center for Magnetic Resonance Research was used to acquire single band (SB) and MB data (sequence version R016a). Diffusion-weighted data were acquired with single-shot, twice-refocused pulsed gradient spin-echo echo EPI using acquisition parameters that are typically used in NODDI studies. Sequence parameters were: TR = 7210, 4100 and 2800 ms for SB, MB2 and MB3 acquisitions, respectively; echo time (TE) = 82.80 ms; field of view = 240 × 240 (mm2); matrix size = 96 × 96; number of slices = 60; slice thickness = 2.5 mm; total acquisition time = 13, 9 and 7 min for SB, MB2 and MB3 data, respectively. Two b value shells were acquired with b = 800 and 2600 s/mm2, with 30 and 60 non-colinear diffusion-weighted directions, respectively. Eight volumes with no diffusion weighting (i.e. b=0) were acquired (b0 images). Further b0 images were acquired in the opposite phase encoding direction in order to estimate and correct for susceptibility induced distortions [31]. Images were acquired using generalised auto-calibrating partially parallel acquisition (GRAPPA, reduction factor=2), which not only reduces scan time, but also improves image quality by reducing EPI signal distortions.

Image analysis

All diffusion-weighted images were first corrected for movement and eddy current distortions using FMRIB software library (FSL, version 5.0.7, Oxford, UK). FSL’s topup tool was used to correct for susceptibility and FSL’s Eddy command was used to correct for eddy current distortions [32]. The corrected data were then fitted to the NODDI model using the toolbox (http://mig.cs.ucl.ac.uk/mig/mig/index.php/?n=Tutorial.NODDImatlab/) run in Matlab 2012b (The MathWorks, Inc., Natick, MA) using a high performance computing cluster of 128 cores to generate voxelwise whole brain maps of FISO, NDI and ODI. The resulting NODDI parameter maps were normalised to the Montreal Neurological Institute (MNI) space using the Advanced Normalisation Tools (ANTs, version 2.1.0; http://stnava.github.io/ANTs) in order to perform region-of-interest (ROI) analysis. This involved calculating the diffeomorphic transformation required to warp the mean b0 image to the MNI152 T2-weighted image template, (with spatial resolution of 2x2x2 mm3). A selection of ROIs was chosen for analysis for which the mean and standard deviation was calculated for each NODDI parameter; the ROIs were obtained from the ICBM-DTI-81 white-matter labels atlas [33] for white matter regions and the non-linear MNI-ICBM152 atlas [34] for the gray matter regions. A selection of brain regions was chosen to reflect areas of different microstructural properties (e.g. fibre density) and challenges (e.g. partial volume effect). The ROIs selected for this study included body of corpus callosum (BCC), genu of corpus callosum (GCC), corticospinal tracts (CST), external capsules (EC) and optic radiation (OR) from the white matter. Although the frontal lobe (FL) and the occipital lobe (OL) were chosen to represent the cortical gray matter; and the caudate, putamen and thalamus were selected from the deep gray matter regions. The cerebellum was also included in this study. Figure 1 illustrates the size and location of these ROIs on the brain.

In order to perform the ROI-based statistical analyses, the mean values for the NODDI parameters NDI, FISO and ODI were extracted for each ROI.

Comparing raw diffusion signal between SB and MB acquisition methods

In this section, we evaluate the impact of MB sequences on the raw diffusion signal (in the non-diffusion weighted and diffusion weighted images) prior to NODDI fitting and explore if any differences can be observed between MB-acquired and SB-acquired images. Before any quantitative comparison, a visual inspection was performed on the b0 images after movement and eddy distortion corrections and co-registration to identify the presence of image artefacts from either GRAPPA or MB acceleration techniques.

SNR

Before performing any ROI-based image analysis on the NODDI metrics, a voxel-wise SNR calculation was performed for each NODDI dataset, using the MRtrix3 software [35]. This involves calculating the ratio of the mean signal and the standard deviation (SD) of the eight b=0 volumes. The resulting SNR map was used to calculate the mean SNR within each ROI. We tested for the statistical significance of the differences between mean SNR for SB, MB2 and MB3 using paired t test. The SNR maps were also visually inspected to identify any differences between SB, MB2 and MB3 approaches.

Signal variability across diffusion weighting directions

Higher diffusion weighting is more sensitive to complex microstructure architecture resulting in greater signal variation. To account for that, NODDI requires the higher b value shell to be sampled at twice the angular resolution of the lower b value shell [4]. We investigated whether increasing the MB factor affects the variability between diffusion signal measured over different directions. To do this, we calculated the standard deviation for the signal at b=2600 across data acquired in all 60 directions and compared the results of SB with MB2 and MB3. In order to account for the intrinsic differences in signal intensity between SB, MB2 and MB3, the diffusion weighted images for each dataset were first normalised by dividing them by the dataset’s mean b0 image.

Variability between subjects and between visits

To characterise the variability of the NODDI parameters, we used the coefficients of variation as a measure of variability between visits and subjects using the equation CoV (%) =100% × SD/mean.

Between-subjects variability: for each acquisition method (SB, MB2 and MB3), CoV between subjects within the data from the first visit was calculated.

Between-visits variability: for each acquisition method (SB, MB2 and MB3), we calculated the CoV between data from the first visit and the second visit. For this measure, SD is calculated between two measurements as: \({\text{SD}} = \sqrt {\frac{{\sum \left( {X1 - X2} \right)^{2} }}{2 \times N}}\) with: N being the number of the subjects and X1 and X2 being the two measurements (obtained from each ROI from visits 1 and 2, respectively) for each subject.

Reproducibility between visits

The reproducibility of each NODDI measure obtained by each acquisition method between visits was quantified by means of the intraclass correlation coefficient (ICC) with the 95% confidence interval (CI). ICC estimates were calculated separately for SB, MB2, MB3, using NODDI parameters from visit 1 and visit 2. ICC considers both the within-subject variance due to measurement error and the variance because of biological differences between subjects. Ideally, the contribution from measurement error would be much smaller than from the subjects and in this case, ICC tends to 1. ICC estimates were calculated using SPSS statistical package version 25 (SPSS Inc, Chicago, IL), based on the single measures, absolute agreement, two-way mixed effects model. An ICC<0.50 was considered as poor reproducibility, 0.50–0.75 moderate, 0.75–0.9 good and >0.9 as excellent reproducibility, following the stratification introduced by Koo and Li [36].

Agreement between acquisition methods

To determine the agreement between SB and MB acquisition methods, NODDI parameters resulting from these acquisitions are compared against each other using ICC as an index to reflect both correlation and agreement. ICC estimates were calculated across measurements within the same scanning visit (SB-versus-MB2, SB-versus-MB3 and MB2-versus-MB3); this was done using the NODDI measures from visit 1. Further to using ICC, Bland–Altman plots were also generated to compare SB with MB2, MB2 with MB3 and SB with MB3 using data from visit 1; a similar Bland–Altman analysis was also performed separately for data acquired in visit 2. The Bland–Altman plots show the difference between measurements against the mean of the measurements, the bias and the 95% limits of agreement (LoA). LoA for each NODDI parameter were defined as the mean of paired differences ± 1.95 × its standard deviation (SD). Bland–Altman plots make it possible to visualise the agreement between the acquisition methods, detect systematic bias and identify any relationship between the absolute differences and the mean value for each parameter [37]. Finally, in order to visualise and locate the origin of any possible bias in each metric, mean images across subjects for each NODDI metric from visit 1 were calculated for SB, MB2 and MB3 data and difference images were generated between SB- and MB-derived NODDI maps.

Power and sample size calculations

Detecting physiological differences between groups (e.g. patients and controls) is a common aim for research studies that employ NODDI. Although the expected group differences can be estimated from previous (analogous) studies, it is not possible to determine the necessary group sizes to reliably detect these differences unless the reproducibility of the measurement technique is known. Here we use our quantification of reproducibility to determine the sample size required for detecting a reduction of 5% and 10% in NDI, ODI and FISO for each ROI using the mean and standard deviation (across subjects) of each NODDI measure from visit 1. The calculation was performed using Gpower [38], based on a t test involving the difference in the means between two independent groups with a two-tailed significance level of 0.05, power of 0.9 and equal sample sizes. Finally, we compared the estimated sample sizes for all parameters between methods (SB, MB2 and MB3) using a paired t test to determine whether there are any significant differences between methods in the number of subjects needed to detect a between-group change.

Results

Visual comparison

Prior to performing NODDI fitting, a visual inspection has been carried out on all the pre-processed diffusion data. Figure 2 shows co-registered slices from SB, MB2 and MB3 images taken from a representative subject during visit 1; the figure illustrates there are no appreciable differences in artefacts or image quality.

SNR

Figure 3 shows the average SNR for each ROI for all the tested acquisitions (averaged across subjects, SD across subjects is shown as error bars). As expected, the multiband factor has the overall effect of reducing SNR, particularly when increasing the acceleration factor from 2 to 3, but not significantly.

Inspecting the voxel-wise SNR (see the supplementary material), we have observed that the SNR was more uniform across the brain in SB; whereas in MB SNR had a greater dependence on tissue-type and location with lowest values recorded in the deeper areas of the brain. Furthermore, comparing mean SNR for SB, MB2 and MB3, has revealed a significant decrease in SNR, particularly in white matter ROIs and deep gray matter ROIs as the MB factor increases (p<0.05).

Signal variability across diffusion weighting directions

Figure 4 shows the variability between diffusion signal between b = 2600 data points measured over 60 different directions, from a representative subject in visit 1. The results show that the variability was increased in GM and slightly decreased in WM, as the MB factor increased.

Variability between subjects and between visits

CoVs for NDI, FISO and ODI are shown in Fig. 5 (see Table 1S in the supplementary material for a tabular breakdown of these CoV measures). The results show that between subjects CoV measurements (Fig. 5a–c) are approximately 2 times higher than between visits CoV measurements (Fig. 5d–f). NDI exhibited the lowest variation of all NODDI parameters between visits and subjects with CoV<2% and <4.5%, respectively. The highest CoVs for NDI were: between visits 1.64% (SB), 1.67% (MB2) and 1.97% (MB3); and between subjects 4.36% (SB), 4.17% (MB2) and 4.40% (MB3). Similarly, ODI showed a low variability between scan visits for all ROIs and regardless of the acceleration factor with the following highest CoVs: 2.50% (SB), 1.88% (MB2) and 1.63% (MB3). On the other hand, ODI showed relatively larger between-subjects variability (CoV between 6.5% and 10%) in the white matter regions with prevalence of tightly packed parallel fibres (e.g. GCC, BCC and CST) than the gray matter regions. FISO shows the largest variation with the following highest CoVs: between visits 16.69% (SB), 15.09% (MB2) and 10.58% (MB3); and between subjects 24.15% (SB), 25.17% (MB2) and 28.06% (MB3).

There is greater variation between subjects than between visits for all NODDI parameters. This is expected because a normal physiological variation within the brain microstructure is known to give rise to variability in quantitative MR measurements in general, even in homogenous groups like those studied here [39].