Parameter estimation
Two major approaches for estimating only D and f have been proposed: one is based on a specialized model-fitting procedure, and one uses a special case of the IVIM model (Eq. 1), both assuming b values in certain ranges and that D* ≫ D. In the former approach, often referred to as segmented fitting, estimation is done in two steps [15,16,17]. In the first step, data from b values below a certain threshold (bthr) are omitted. If bthr is large enough, the signal from the perfusion compartment is considered to be of negligible size and the IVIM model simplifies to a monoexponential model:
$$S(b) = S_{0} \left( {1 - f} \right)e^{ - bD} = Ae^{ - bD}$$
(2)
In the second step, f is estimated as f = 1 − A/S(0), where S(0) is the measured signal at b = 0. In the latter approach, a simplified version of the IVIM model (sIVIM) is considered:
$$S(b) = S_{0} \left( {\left( {1 - f} \right)e^{ - bD} + f\delta \left( b \right)} \right),$$
(3)
where \(\delta \left( b \right)\) is the discrete delta function, i.e., \(\delta \left( {b = 0} \right) = 1\) and \(\delta \left( {b \ne 0} \right) = 0\) [16, 18, 19]. The model is valid for the b values b = 0 and b ≥ bthr.
Four specific approaches for estimating D and f were considered in this study:
-
1.
Segmented fitting, where D is estimated from b values ≥ 120 s/mm2 (i.e., bthr = 120 s/mm2) and f from the intercept A (Eq. 2), as described above
-
2.
Least-squares fitting of the sIVIM model (Eq. 3)
-
3.
Bayesian fitting of the sIVIM model using the marginal posterior modes
-
4.
Bayesian fitting of the sIVIM model using the posterior means.
Segmented fitting was performed using a custom-made MATLAB function with nonlinear least-squares fitting of D. Least-squares fitting of the sIVIM model was done with the MATLAB function fit with default arguments.
The Bayesian model fitting based on Eq. 3 was performed using a previously published MATLAB function for Bayesian IVIM model fittingFootnote 1 [21], which was adapted to the sIVIM model. The implementation uses a Markov chain Monte Carlo setup to sample the posterior parameter distribution from which the marginal posterior mode or posterior mean was estimated. Uniform prior distributions were used for all parameters.
For all four estimation approaches, the parameter estimates of D, f, and S0 were constrained to the ranges [0 5] µm2/ms, [0 1] and [0 2 Smax], respectively, where Smax is the maximum measured or simulated signal value depending on the context. For the segmented model fit, the constraint on f was applied by setting negative estimates to zero. For the Bayesian methods, the constraints were applied by setting the prior distributions to zero outside the specified ranges. Additional detailed information about the parameter estimation approaches can be found in the supplementary information.
Patients
MR imaging data from patients with liver metastases from small-intestine NET was obtained from a previously published randomized clinical trial of embolization methods [25]. Patients were randomly assigned to either hepatic artery embolization or radioembolization treatment and were examined with MRI before and one and 3 months after treatment. Among the 11 patients in the previous study, one failed to undergo the MR examination due to cardiac pacemaker, and one was examined with a different MR protocol, resulting in nine patients for analysis. For detailed descriptions of inclusion criteria and treatments, the reader is referred to the previous paper [25]. The MR examinations included in the study we report here were those performed before (baseline) and 3 months after treatment.
MR imaging
Respiratory-triggered diffusion weighted images (DWIs) of the upper abdomen were acquired on a Philips Achieva dStream 3T with software release 5.1.7 (Best, The Netherlands) using a single-shot spin-echo echoplanar imaging (SE-EPI) sequence with five b values (0, 120, 350, 575, 800 s/mm2, Δ ≈ 26 ms, δ ≈ 16 ms). For b > 0, three orthogonal diffusion-encoding directions were acquired. Number of directions × number of signal averages = total number of measurements at each b value, were 1 × 6 = 6, 3 × 3 = 9, 3 × 3 = 9, 3 × 6 = 18 and, 3 × 6 = 18, respectively. Other imaging parameters were: TE = 54 ms, TR = 2600 ms, half scan = 0.70, acquisition pixel size = 3 × 3 mm2, reconstructed pixel size = 1.8 × 1.8 mm2, slice thickness = 6 mm, and slice gap = 0.6 mm. Phase encoding was performed in the anterior–posterior direction, with sensitivity-encoding (SENSE) factor = 2 and a resulting bandwidth of 13.7 Hz/mm. Regions of interest (ROIs) were produced by manual delineation of the tumor border using the DWI with b = 0. ROIs were also drawn in healthy liver and spleen in the same images, avoiding large vessels. ROIs in liver and spleen were drawn such that their size was similar to the overall average tumor size. This strategy was employed to get approximately the same number of voxels for each tissue type. SNR was calculated as the signal in the image with b = 0 divided by the standard deviation of the noise, taking into account the effects of averaging. The noise level was estimated from the residuals of a monoexponential fit of data with b > 0. The resulting median SNR estimates in tumor, liver, and spleen were 16, 20, and 18, respectively.
Simulations
Simulated data were generated from the sIVIM model (Eq. 3) for the same b values and total number of measurements at each b value as for the in vivo acquisitions at three SNR levels: 10, 20, and 40. At each level 10,000 data series with Rician noise were generated based on values of D and f randomly drawn from uniform distributions with bounds [0.5, 1.5] µm2/ms and [0, 0.3], respectively. SNR refers to the measurement at b = 0 after averaging. The noise level after averaging was thus lower at the higher b values due to the larger number of averages.
Statistical analysis
The quality of parameter estimates obtained from the different estimation approaches using simulated data were compared in terms of bias and variability. This was done by studying the quantiles of the distribution of differences between estimated and simulated parameter value. For parameter estimates based on in vivo data, where the true parameter values are unknown, the relative bias and variability were studied by comparison of results between estimation approaches. To evaluate whether the b-value threshold was sufficiently high, D and f were estimated excluding data with b = 120 s/mm2 and compared with estimates based on all b values.
The ability of different estimation approaches to differentiate between tumor and healthy liver tissue was studied by constructing a classifier for each approach separately based on kernel density estimation. The performance of classifiers was quantified using a leave-subject-out cross-validation where the classifier was trained on data from all but one patient. Data from that patient was then used for testing. The training and testing procedure was repeated such that all patients were used for testing once. The classifier was trained on voxel parameter data from all patients such that for each tissue type, the tissue-specific probability density function (pdf) was estimated using the MATLAB function ksdensity with a Gaussian-shaped kernel and default arguments. Classification was performed on all voxel data from the test patient by identifying the tissue type with the highest probability based on the estimated tissue-specific pdf. The analysis was performed both in one dimension for D and f separately and in two dimensions with D and f combined. The proportion of correct classifications was averaged across the repetitions to calculate the overall performance of the classifier. The classification was performed for each time-point separately. MATLAB 2016b (MathWorks, Natick, USA) was used for all calculations and visualization.