Introduction

Microvascular blood flow, also known as perfusion, plays an important role in regulating physiology. In the brain, perfusion indexes such as cerebral blood flow (CBF) and cerebral blood volume (CBV) have been shown useful in the diagnosis and prognosis of neurological diseases [13].

To date, three major magnetic resonance (MR) imaging techniques are available for cerebral perfusion measurement: dynamic susceptibility contrast (DSC) imaging [4], arterial spin-labelling (ASL) imaging [5], and intravoxel incoherent motion (IVIM) imaging [6]. With DSC imaging, images are repeatedly acquired to trace the signal evolution following the passage of intravenously administered paramagnetic contrast agent. Multiple perfusion indexes (e.g., CBV, CBF, mean transit time, and time to maximum) can be derived from the temporal dynamics [79]. Except for being impracticable in subjects with impaired renal function [10, 11], DSC imaging imposes little invasiveness and has been routinely performed in many hospitals. With ASL imaging, flow contrast is generated by using radiofrequency pulses tailored to label the protons electromagnetically in arterial blood. Most existing ASL methods provide CBF measurement only [12], although variants have been proposed to estimate arterial CBV [13] and arterial transit time [14]. The caveat of ASL imaging is the inherently low signal-to-noise ratio (SNR). First proposed by Le Bihan et al [6], IVIM imaging measures the signal drop caused by intravoxel incoherent motion at varied magnitudes of diffusion weighting (quantified by b-values). Perfusion indexes are derived by modelling the signal attenuation as a composite outcome of interstitial water diffusion (characterized by diffusion coefficient D) and intravascular capillary blood flow:

$$ \frac{S(b)}{S_0}=\left(1-f\right) \exp \left(-bD\right)+f \exp \left(-bD*\right) $$
(1)

where S0 is the signal intensity obtained with b = 0. D* is the pseudo-diffusion coefficient to account for the capillary blood flow that takes up a volume fraction f. IVIM imaging has been applied in several body organs [1518], and was assessed for measurement sensitivity in the rat brain [19] and recently in the healthy human brain [20]. Nonetheless, feasibility study remains relatively scarce in the human brain [6, 21].

As compared with its counterpart techniques, IVIM imaging does not rely on tracer delivery (e.g., transit of arterial protons for ASL imaging and gadolinium chelates for DSC imaging). This is a desirable feature as bolus dispersion and elongated transit time that stem from variable or abnormal flow dynamics or routes can confound tracer-based perfusion imaging. For example, ASL measurement of white matter perfusion has been challenging mainly because of the long transit time [22]. On the other hand, previous studies suggest that IVIM imaging is SNR demanding [23, 24] although the relationship between SNR and perfusion that can be reliably measured remains unclear. The purpose of this study was to numerically and experimentally investigate the robustness of IVIM MR imaging in measuring perfusion indexes in the human brain.

Materials and methods

Subjects

The institutional review board approved this study. Eighteen healthy volunteers (eight women, 10 men; age = 21-36 years) were recruited. Written informed consent was obtained from all subjects beforehand.

MR imaging

All MR imaging was performed on a 3-Tesla whole body clinical system (Tim Trio, Siemens, Erlangen, Germany). The body coil was used to transmit radiofrequency pulses. A 12-channel phased-array head coil was used to receive signals. IVIM/ASL/DSC images were obtained from the same location (field of view = 200 mm, in-plane matrix = 128 × 128, GRAPPA acceleration factor = 2, slice thickness = 5 mm, 12 axial slices, inter-slice gap = 1 mm). Specifically, IVIM imaging was based on a single-shot, twice-refocused, spin-echo echo-planar readout (TR = 2 s, TE = 100 ms, 12 repetitions after a dummy scan, diffusion time = 42 ms, ratio of diffusion time to gradient duration ~ 1). Diffusion weighting was applied along three orthogonal directions (in separate scans) with 12 b-values: 0, 15, 30, 45, 60, 100, 250, 400, 550, 700, 850, 1,000 s/mm2. The maximum b-value was chosen such that SNR was adequate and non-Gaussian diffusion was not prominent. ASL imaging was based on the pseudocontinuous labelling scheme [25] (TR = 4.4 s, TE = 18 ms, labelling duration = 2 s, post-labelling delay = 1.5 s, 50 pairs of tag and control images, single-shot gradient-echo echo-planar readout). Reference scans were included for coil sensitivity correction and labelling efficiency calibration [26]. For DSC imaging, single-shot gradient-echo echo-planar images were repeatedly acquired (TR = 1 s, TE = 25 ms, 120 measurements). At the 20th acquisition, a bolus of 0.1 mmol/kg bodyweight Gadobutrol (Gadovist, 1.0 mmol/ml) was intravenously administered followed by a 15-ml saline flush, at a rate of 4 ml/s using a power injector. T1-weighted anatomic images were acquired by using three-dimensional magnetization prepared rapid gradient echo. ASL imaging and IVIM imaging were performed back-to-back and in a semi-random order across subjects but always before DSC imaging.

Data processing

To extract IVIM indexes, a two-step fitting procedure was performed voxel by voxel based on equation (1). D was first obtained by fitted the signals obtained with b = 400 – 1,000 s/mm2 to a mono-exponential function given that perfusion effect is negligible in this regime [24]. The signals obtained with the rest of b-values were then fitted to equation (1) to extract f and D*. Data obtained with different directions of diffusion encoding were processed separately and then averaged.

Following the procedure summarized in [26], the ASL images were corrected for spatial variation of coil sensitivity, pair-wise subtracted, averaged, and then converted to CBFASL maps. Following the procedure summarized in [27], DSC signal time curves were converted to concentration time curves and fitted with a gamma-variate function to remove recirculation. CBVDSC was then calculated by dividing the time-integral of concentration in tissue by the time-integral of arterial input function.

All echo-planar images were averaged to create a local template for each subject. The T1-weighted anatomic images were voxel-wise calculated for the probability of belonging to gray matter, white matter, and cerebrospinal fluid. Gray matter mask was created at a threshold of 0.9. To mitigate partial volume effect, white matter mask was created at a threshold of 0.9 after the original probability map was smoothed by a three-dimensional Gaussian kernel with a full-width-half-maximum of 3 mm [22]. The abovementioned maps (CBFASL, CBVDSC, D, f, D*, SNRb1000) and masks were all coregistered to the local template for subsequent comparison and analysis.

According to [28], the product of f and D* is proportional to CBF, whereas f is linearly associated with CBV by definition. As such, we examined the correlation between fD* and CBFASL, and between f and CBVDSC, on a voxel-wise and per-subject basis. All post-processing and data analysis were performed by using SPM (http://www.fil.ion.ucl.ac.uk/spm/) and custom-designed programs in the environment of MATLAB (The MathWorks Inc., Natick, MA, USA).

Computer simulation

Based on equation (1), a series of IVIM signals were generated with the same b-values used in MR experiment and with varied combinations of f (0.01 to 0.20, in steps of 0.01), D* to D ratio (1 to 50, in steps of 1), and SNRb1000 (10 to 100, in steps of 10). The D was assumed to be 0.8 × 10-3 mm2/s for gray matter, 1.2 × 10-3 mm2/s for axial diffusion in white matter, and 0.4 × 10-3 mm2/s for radial diffusion in white matter. Rician noise was added according to the required SNR levels. For each combination, 1,000 samples were generated for IVIM model fitting, yielding 1,000 sets of D/f/D* estimates. Accuracy was assessed by comparing the mean value with the theoretical value. Precision was assessed by computing the coefficient of variation.

Statistical analysis

The correlation between modalities (fD* versus CBFASL and f versus CBVDSC) was assessed by the Pearson’s correlation coefficient at a significance level of 0.01. Bootstrapping [29] was employed to assess the robustness of IVIM measurement. In short, a specific number of repetitions were randomly selected out of the 12 repetitions (with replacement) and then averaged. This was done for every b-value to create a sample set, which was fitted with the IVIM model to render an estimate of D/f/D*. On a basis of 10 estimates, coefficient of variation was computed and served as a measure of precision. The R2 value of model fitting was also recorded. The procedure was carried out for repetitions of 8, 6, 4, and 2. The different numbers of repetitions can be related to SNR based on their square roots. The IVIM images obtained with b = 1,000 s/mm2 were computed for their standard deviation and mean across the 12 repetitions. The ratio of mean to standard deviation was used as an estimate of SNR (SNRb1000).

Results

Computer simulations

Figure 1 shows the variance of D/f/D* estimates obtained with varied combinations of f and D* to D ratio, at two levels of SNRb1000 (30 and 100). Overall, the variance of f and D* is smaller in gray matter (Fig. 1a) than in white matter (Fig. 1b), decreasing with the increase in f, D* to D ratio, and SNR. The variance of D also decreases with the increase in SNR but is largely independent of f and D* to D ratio.

Fig. 1
figure 1

Simulated variance of IVIM-derived diffusion coefficient (D), blood volume fraction (f), and pseudo-diffusion coefficient (D*). Gray matter and white matter are shown in subfigures (a) and (b), respectively. For a given signal-to-noise ratio at b = 1,000 s/mm2 (SNRb1000), variance was computed in terms of the coefficient of variation (CV) from 1,000 random samples, for each combination of f (0.01 to 0.20, in steps of 0.01) and D* to D ratio (1 to 50, in steps of 1). Two levels of SNRb1000 are shown (30 and 100)

Figure 2 shows the comparison of D/f/D* estimates and their theoretical values. The estimate of D has high accuracy (error <5 %) regardless of SNR, gray/white matter, and axial/radial diffusion. In contrast, f and D* are overestimated at low SNR and gradually converge to the theoretical values when SNR increases. As compared with gray matter, white matter, particularly the radial diffusion, demands higher SNR to achieve a comparable level of accuracy. Again, variance decreases with the increase of SNR (see the error bars that indicate the standard deviation of 1,000 estimates). For a given SNR level, the variance is notably smaller for f than for D*. Note that in white matter, the coefficient of variation of D* is greater than 100 % even when SNRb1000 = 100.

Fig. 2
figure 2

Simulated accuracy of IVIM-derived diffusion coefficient (D), blood volume fraction (f), and pseudo-diffusion coefficient (D*). Gray matter (blue squares) and white matter (red upward triangles = axial diffusion in white matter, black downward triangles = radial diffusion in white matter) are shown separately. D was assumed to be 0.8 × 10-3, 1.2 × 10-3, and 0.4 × 10-3 mm2/s for gray matter, axial diffusion in white matter, and radial diffusion in white matter, respectively. Gray matter and white matter were assumed to have an f of 0.08 and 0.03, respectively, and the same D* (1.2 × 10-2 mm2/s) for simplicity. Gray dotted lines indicate the assigned values. Signal-to-noise ratio was defined at b = 1,000 s/mm2 (SNRb1000)

Experimental SNR and parametric estimates

Figure 3 shows the typical SNRb1000 obtained with our experimental setting. SNRb1000 is higher in the cortical gray matter than in the deep gray matter partly as a consequence of coil sensitivity. For white matter, SNRb1000 is also dependent on the fibre orientation with respect to diffusion encoding. In Fig. 3b, the SNRb1000 histogram is obtained based on the masks in Fig. 3a. The average SNRb1000 is ~8 for a single measurement, and can be related to the SNR when multiple repetitions are averaged. For example, the SNR averaged over 12 repetitions is 8 times the square root of 12, which is ~30 (one of the values we chose to demonstrate in Fig. 1).

Fig. 3
figure 3

Typical spatial distribution (a) and histogram (b) of signal-to-noise ratio at b = 1,000 s/mm2 (SNRb1000) based on our imaging settings. The histogram was computed by using the masks shown in (a)

Figure 4 summarizes the parametric estimates obtained from experimental data. Error bars indicate the standard deviation across 18 subjects. Note that D/f/D* were derived for three directions of diffusion encoding separately and then averaged, which reduces variance by a factor equal to the square root of 3. Figure 4a shows the percentage of voxels where the R2 values of model fitting are higher than 0.9. The percentage increases with the number of averages, and is consistently higher in gray matter than in white matter. Figure 4b shows the variance in the estimates of D/f/D*. The estimates have lower variance (i.e., higher precision) in gray matter than in white matter. By extrapolating the plots in Fig. 4b (for number of averages = 12), the variance of f/D* estimates is ~25 %/30 % in gray matter and ~50 %/60 % in white matter, which is in a reasonable agreement with the numerical simulations shown in Figs. 1 and 2.

Fig. 4
figure 4

(a) Percentage of voxels in which R2 of IVIM model fitting is above 0.9. (b) Variance of IVIM-derived diffusion coefficient (D), blood volume fraction (f), and pseudo-diffusion coefficient (D*). On a per-subject basis, 10 bootstrap samples were generated (see Materials and Methods), from which coefficient of variation (CV) was computed to assess variance and mean R2 was computed. Gray matter (blue lines) and white matter (red lines) are shown separately. The error bars indicate the standard deviation across 18 healthy volunteers

Cross-modal comparison

Figure 5 shows the comparison between f and the DSCCBV in a representative subject. Correlation is observed in gray matter (r = 0.48, p < 10-5) but not in white matter (r = 0.02, p = 0.58). Note that the correlation appears to decrease when DSCCBV is below 0.02 or beyond 0.10. Figure 6 shows the comparison between fD* and ASLCBF in the same subject shown in Fig. 5. Correlation is absent regardless of gray/white matter, presumably due to the large variance (i.e., low precision) in the D* estimate. These observations are highly consistent across subjects (f versus DSCCBV: r = 0.29 – 0.48 in gray matter, -0.02 – 0.05 in white matter; fD* versus ASLCBF: r = -0.01 – 0.12 in gray matter, -0.06 – 0.04 in white matter). Table 1 summarizes the mean and standard deviation of D/f/D*, DSCCBV, and ASLCBF.

Fig. 5
figure 5

Voxel-wise comparison between IVIM-derived blood volume fraction (f) and DSC-derived cerebral blood volume (CBVDSC). Data are from a representative healthy volunteer. Each circle denotes a voxel

Fig. 6
figure 6

Voxel-wise comparison between IVIM-derived blood flow estimate (the product of blood volume fraction f and pseudo-diffusion coefficient D*) and ASL-derived cerebral blood flow (CBFASL). Data is from the same healthy volunteer as shown in Fig. 5. Each circle denotes a voxel

Table 1 Measured IVIM indexes, DSCCBV, and ASLCBF. Diffusion coefficient (D), blood volume fraction (f), and pseudo-diffusion coefficient (D*) were derived based on the data of 12 averages. Values are expressed as mean ± standard deviation (across 18 subjects)

Discussion

Of the two IVIM-derived perfusion indexes, blood volume fraction f is notably more robust than pseudo-diffusion coefficient D* (Figs. 1, 2, and 4b). Both f and D* can be better estimated when blood volume or blood flow is sufficiently large. The threshold, however, is dependent upon SNR and for white matter, also upon the fibre orientation with respect to the direction of diffusion encoding. Based on our imaging setting (particularly, the b-values and number of averages), SNRb1000 is ~30 and with which ~10 % error in f and >100 % overestimation in D* are expected (Fig. 2). On the other hand, diffusion coefficient D can be estimated with high accuracy (error < 5 %) and precision (variance < 10 %) irrespective of gray/white matter. This suggests that perfusion has a negligible contribution to Eq. (1) when b ≥ 400 s/mm2 and it is appropriate to use the two-step fitting for the IVIM model. It is worth noting that IVIM-derived indexes are highly variable when D* to D ratio is below 10 (Fig. 1). Given that D can be reliably measured in a wide range of f and D* and that low D*/D ratio is most likely due to low D*, IVIM imaging may not be applicable to organs where blood flow is slow (e.g., the prostate). On the other hand, several recent studies [3033] managed to differentiate cancer pathologies in various organs by using IVIM-derived perfusion indexes. However, it may not be straightforward to generalize findings from organ to organ considering the different microvasculature and/or flow rate in different organs.

Cross-modal comparison reveals a statistical correlation between f and CBVDSC in gray matter (r = 0.29 – 0.48, p < 10-5), but not in white matter. IVIM model is not robust when CBV is low (CBVDSC < 0.02). The correlation also drops when CBVDSC is beyond 0.10 because these voxels are likely to contain large vessels. Large vessels usually do not comply with the assumption of randomly oriented vasculature. Coherent flow leads to phase accumulation (not signal decay) and in its presence, intravascular diffusion will be attributed to D instead of D*. Taken together, f will be subject to misestimate and low precision. Wirestam et al [21] also reported a correlation between CBVIVIM and CBVDSC (r = 0.563, p < 10-5), but two major differences should be noted between their study and ours. First, their comparison was based on regions of interest, presumably due to limited SNR (36 b-values, but only one measurement). Second, they pooled the regions of interest in gray/white matter and across subjects for correlation analysis. The inherent difference between gray matter and white matter could have inflated and dominated the computed correlation. By contrast, our comparison was on a per-subject and voxel-wise basis, and was performed for gray matter and white matter separately. Our analysis should more correctly reflect the correlation between modalities, and is more applicable for per-measurement assessment. In addition, our data showed no correlation between fD* and CBFASL. This is likely because of the large variance (i.e., low precision) in D* estimate (Figs. 1, 2, and 4b). As mentioned previously, the D* estimate is more SNR-demanding than the f estimate. Based on computer simulation, Pekar et al [24] also pointed out the SNR issue in IVIM imaging, but did not address the relationship between SNR and f/D*.

IVIM imaging does not rely on tracer delivery, which makes it a potential candidate for measuring slow or delayed perfusion such as in white matter. Unfortunately, our data showed substantial errors and variability in the IVIM-derived perfusion indexes in white matter. In particular, D* can be overestimated by 300 % along with low precision (coefficient of variation ~60 %); f is relatively accurate (~10 % error), but its variability is still large (coefficient of variation ~50 %). A potential application to take advantage of this tracer-free feature might be f measurement in tumours with blood-brain-barrier disruption in which case contrast agent leaks into interstitial space, leading to complex interplay between T1 and T2* effects. CBVDSC measured under such a circumstance has been known to be subject to error unless the T1 effect can be removed or corrected for [34].

There are a few limitations in this study. First, the data presented are based on healthy volunteers in their 20s and 30s. Nevertheless, our results provide quantitative assessment that helps parameter adjustment for IVIM imaging. Second, we did not convert f and D* to CBV and CBF quantities. The purpose of this study is to evaluate the robustness of IVIM-derived perfusion indexes. According to [28], the calculation of CBV and CBF primarily relies on f and D*, whose precision and accuracy can be inferred from our data. Third, in our fitting we did not consider non-Gaussian diffusion [35] that can be characterized by including a diffusion kurtosis coefficient to Eq. (1). Given that the kurtosis term contributes at high b-values (usually beyond 1,000 s/mm2), our IVIM imaging was based on relatively low b-values and should not include notable kurtosis effect. The suitability of our model fitting without the kurtosis term can also be revealed by the high percentage of voxels where R2 is above 0.9 with the IVIM model (~95 % in gray matter and ~85 % in white matter). The expense of using the extended model is adding an additional degree of freedom to the nonlinear least-squares fitting that is already SNR demanding. It is noteworthy that in addition to kurtosis, non-Gaussian diffusion has also been investigated with two-exponential model (slow vs. fast diffusing compartments or intra- vs. extra-cellular compartments) [36, 37] and stretched exponential model (multiple compartments and thus continuous distribution of diffusion coefficients) [38].

In summary, we have numerically and experimentally assessed the robustness of IVIM imaging in measuring cerebral perfusion indexes. A minimum SNRb1000 of 30 is recommended such that reliable f can be obtained as a noninvasive measure of cerebral blood volume. However, D* has limited robustness and should be interpreted with caution.