Introduction

Whole-body diffusion-weighted imaging (WB-DWI) is increasingly being used in oncological applications and has proven a valuable diagnostic tool in cancers such as multiple myeloma, melanoma, lymphoma and metastatic disease, as well as in screening applications1,2. Usage areas span tumor detection, tumor characterization and therapy response assessment3,4. Widespread clinical use of WB-DWI has however been limited and the technique is primarily used in specialized centers. One reason is that DWI data sets are typically affected by distortions and misalignments, meaning DWI and structural images acquired in the same scan session are not geometrically aligned. This mismatch can lead to difficulties when segmenting and characterizing organs and lesions in DWI and apparent diffusion coefficient (ADC) data sets, manually and automatically2,3. This also hinders the integration of DWI data into workflows for large-scale analysis of multi-parametric oncology data, in which geometric alignment between input images are essential for automatic and correct tumor evaluation5,6,7.

DWI visualizes differences in water diffusivity by incorporating diffusion sensitizing gradients into the magnetic resonance (MR) sequence. The b-value, set by the strength and timings of these gradients, makes the sequence sensitive to different scales of microscopic motion. In WB-DWI, two or three b-values are in general collected and followed by mono-exponential calculation of quantitative ADC maps1. To obtain ADC maps less sensitive to perfusion effects, while keeping a reasonable acquisition time and sufficient signal-to-noise ratio (SNR), the lowest acquired b-value for WB-DWI tends to lie in the range 50–100 s/mm2 1,8. Currently, the most widely used sequence for WB-DWI image acquisition is diffusion-prepared single-shot spin echo planar imaging (EPI), collected with short-TI inversion recovery (STIR) fat suppression and in free breathing1. Multiple stations are collected to cover the head to mid-thigh and stations are combined into single volumes as a post-processing step. EPI has the advantage of a fast readout and as such it is less sensitive to macroscopic patient motion. The sequence is however sensitive to static magnetic field inhomogeneity, resulting in image distortions due to magnetic field susceptibility differences between tissues and at tissue air transitions9. The distortion occurs because of the EPI sequence’s low bandwidth in the phase encoding direction, with the magnetic field inhomogeneity causing signal phase shifts during readout. Susceptibility-induced distortion is the same regardless of applied b-value10 and the effect is more pronounced at higher field strengths11. As the static magnetic field inhomogeneity is dependent on the subject within the field-of-view (FOV), it can also give central frequency shifts between acquired stations. A second, separate, source of distortion is eddy currents, modifying the applied gradient fields and resulting in an image distortion that is typically dependent on the applied b-value10. Being a multi-station acquisition, WB-DWI can also suffer from inter-station misalignments due to patient movement.

A number of post-processing methods for correction of susceptibility-induced geometric distortion have been proposed, most developed for brain imaging applications. These include B0 field mapping12, registration approaches13 and usage of a reverse polarity gradient (RPG) acquisition14,15. For body applications, less has been evaluated. The RPG method has however been tested in e.g. prostate16,17, cervix18, breast19,20 and head and neck21 applications with good outcomes, while field mapping has been used in prostate imaging22 with improved geometric accuracy as a result. The usage of the RPG method has been described as more robust to large distortions compared to field mapping15. For geometric improvements of WB-DWI, Blackledge et al.23 corrected inter-station misalignments by aligning stations using histogram matching of overlapping slices. Whole station images were translated in the phase-encode direction giving improved WB-DW images in terms of alignment of structures at station boundaries. Alignment to structural images was not assessed. Ceranka et al.24 assessed rigid and deformable registration approaches for aligning WB-DWI and structural data acquired in the same scan session. In addition to these approaches, it has been shown that volume shimming can potentially be omitted at 1.5T, with the center frequency set equal for all WB-DWI stations25. Due to larger susceptibility-induced distortions, however, this approach cannot be used at 3T26.

The aim of this work was to assess the performance of the RPG method for correction of susceptibility-induced geometric distortion of WB-DWI healthy volunteer data acquired at 1.5T and 3T. The performance was assessed in terms of geometrical alignment between DWI and structural data, impact on ADC values and visual evaluation of image quality. The RPG method was developed for b = 0 s/mm2 (b0) DWI data. As ADC maps less sensitive to perfusion are recommended for clinical use we have, for clinical applicability, also assessed whether b = 50 s/mm2 (b50) DWI data can be used for RPG correction instead of the standard b0 data.

Methods

Subjects

Twenty-six healthy volunteers were included in this prospective study, with 20 subjects scanned at 1.5T (age mean 46.8 ± 15.4 years, body mass index mean 26.4 ± 4.5 kg/m2, 8 females) and 20 subjects scanned at 3T (age mean 50.1 ± 16.9 years, body mass index mean 25.4 ± 4.0 kg/m2, 12 females). 14 subjects were dually scanned at 1.5T and 3T. Study approval was obtained from the regional ethics committee (Uppsala Regional Ethics Review Board, ethics approval 2017/524) and all participants gave their written informed consent to participate. The study was conducted in accordance to the relevant guidelines and regulations.

Data acquisition

Images were acquired at 1.5T (Achieva, Philips Healthcare, Best, The Netherlands, gradient system: 33 mT/m maximum amplitude, 180 T/m/s maximum slew rate) and at 3T (Signa PET/MR, GE Healthcare, Milwaukee, WI, USA, gradient system: 44 mT/m maximum amplitude, 200 T/m/s maximum slew rate). To reduce gut mobility, subjects fasted for 4 h and an intramuscular injection of Buscopan (2 mg/mL, 1 mL injection) was administered prior to scanning. For both scanners, DWI and structural data were collected station-wise using head volume and phased array surface coils. Five or six axial stations were collected per subject to cover the head to mid-thigh. DWI data was acquired using a diffusion-weighted spin echo EPI sequence with STIR fat suppression in free breathing27, while an axial T1-weighted Dixon scan was acquired for anatomical reference (Table 1). For the standard EPI sequence, phase encoding was in the anterior–posterior (AP) direction. For RPG distortion correction, a reverse EPI sequence was acquired immediately following the standard sequence with phase encoding in the posterior-anterior (PA) direction.

Table 1 MR image acquisition parameters at 1.5T and 3T.

ADC maps were calculated station-wise from b50, b = 400 s/mm2 (b400) and b900 images using a mono-exponential log-linear least square fit (MATLAB R2019b, the MathWorks, Inc., Natick, Massachusetts, US). DWI and ADC data were combined into single volumes in MATLAB. Overlapping slices were removed, with an equal number of slices removed from adjacent stations, and no intensity blending was performed.

Geometric distortion correction

The steps involved in the geometric distortion correction are illustrated in Fig. 1. The RPG method for correction of susceptibility-induced geometric distortion exploits the fact that two EPI data sets acquired with opposing phase-encoding gradients exhibit exactly opposite distortions. An undistorted data set can then be calculated by finding the mid-way between the two distorted data sets. In this work distorted DW images were corrected as described by Holland et al.15. In short, AP and PA b0 images are iteratively aligned using nonlinear registration in a multi-resolution framework in the form of decreasing Gaussian smoothing kernels. The obtained 3D deformation field map contains the voxel-wise displacement in the phase-encoding direction. As higher b-value data acquired in the same scan session exhibit the same susceptibility-induced geometric distortion as b0, the deformation map can be applied to the whole diffusion data set.

Figure 1
figure 1

Illustration of the steps involved in the RPG geometric distortion correction. The CMTK tools epiunwarp, reformatx and imagemath were used as indicated. Epiunwarp calculates the 3D deformation field map from b0 AP and PA input images. The resulting deformation map then corrects all b-value images using the reformatx and imagemath tools. Reformatx applies the deformation map to each b-value image, while a voxel-wise multiplication with the Jacobian of the deformation is carried out using imagemath, correcting for signal pile-up.

The Computational Morphometry Toolkit (CMTK) open source software28 was used for station-wise geometric distortion correction. The tools epiunwarp, reformatx and imagemath were combined in MATLAB for batch processing (Fig. 1). Regularization parameters were set to 0 (λ1) and 1000 (λ2), and the Gaussian smoothing kernel was set to start at a width of 16 mm, decreasing in steps of 0.25 mm until achieving original image resolution. At each smoothing level the number of optimization iterations was set to 5. The 1.5T images were rescaled to have approximately the same intensity range as the 3T images, meaning the same regularization parameters could be used for both scanners. The resulting deformation map was used to correct b0, b50, b400 and b900 images. To assess the feasibility of using b50 data instead of b0 data, the distortion correction was repeated with AP and PA b50 images and the resulting deformation map used to correct b50, b400 and b900 images.

Evaluations

Geometrical alignment

The geometric correspondence between b0 and structural data was assessed using the mutual information (MI) similarity metric (MATLAB)29. Structural images were resampled using linear interpolation to match the resolution of the diffusion images. MI between b0 and resampled structural images was calculated station-wise for non-corrected and distortion corrected data. Each station was assigned to an anatomical region; head & neck, neck & chest, abdomen, abdomen & pelvis or pelvis & legs.

All image data was screened for incidental findings by a radiologist (20 y experience in WB-DWI). Benign abnormalities including cysts, gallstones and scoliosis were found at 1.5T (n = 6) and 3T (n = 7). These structures were manually segmented in 3DSlicer30, independently on structural MR, non-corrected DWI and distortion corrected DWI. For DWI segmentation, b50 (cysts and gallstones) or b900 (scoliosis) images were used depending on which data set gave the best image contrast for a given abnormality. For scoliosis, the spinal canal in the cervical and thoracic regions was segmented. Structural region of interest (ROI) data was resampled using linear interpolation to match the DWI ROI resolution. The segmentations were evaluated with Dice Similarity Coefficient (DSC), Euclidean distance of geometric centers (ED) and Average Hausdorff distance (AVD), comparing non-corrected and distortion corrected DWI segmentations with their structural counterpart (MATLAB).

ADC values

To assess what impact the distortion correction had on ADC, ADC values were measured by manual ROI segmentation of healthy tissues (3DSlicer). Tissues were selected as to cover the whole body: cerebral cortex, cerebellum, liver (segment VI), spleen, kidneys, psoas muscle, vertebral disk, pelvic bone (body of ilium), urinary bladder and femur. ROIs were outlined on five consecutive axial slices for all tissues except for the cerebellum and vertebral disks. For the cerebellum, three consecutive axial slices were used, while five vertebral disks were segmented with one axially drawn ROI per disk (L1-L5). ROIs were drawn to include as much as possible of the tissue while excluding tissue borders and major vessels. Segmentations were first performed with access to all non-corrected whole body images (DWI, ADC and structural MR). These segmentations were then overlaid on distortion corrected DWI and ADC images, and adjusted to exclude tissue borders and major vessels if needed. For each subject, the non-corrected and corrected mean ADC (ADCmean) were extracted for each tissue.

Visual evaluation

Visual evaluation was performed on whole body b50 images in two steps assessing (a) spine misalignment at station boundaries and (b) overall image quality. Prior to evaluation, images were corrected for inter-station signal intensity differences by using histogram matching of overlapping slices23. Sagittal whole body images were visually inspected in terms of spine misalignment at station boundaries (3DSlicer). For each station boundary, in a whole body data set, the subject was scored 0 or 1 if the spine was aligned or not aligned. A total score for each subject was calculated by summing the scores from each station boundary, i.e. a perfectly aligned spine at all station boundaries gave a total score of 0, while non-alignment at all station boundaries gave a total score of 3. This was performed blinded, and independently, for non-corrected and distortion corrected data. Overall image quality was independently assessed by three radiologists (2, 7 and 20 years of experience with WB-DWI). Scrollable movie clips of axial b50 images were prepared with non-corrected and distortion corrected images side by side. Each radiologist scored the distortion corrected data set as having equal or worse overall image quality compared to the corresponding non-corrected data set. If a distortion corrected data set was scored as worse, the amount of artefacts introduced was specified on a 3-point scale: (1) minor artifacts, most likely not affecting clinical use, (2) moderate artifacts, renders the image of borderline clinical use or (3) severe artifacts, image not clinically useful. The radiologist was also asked to specify the body parts and/or tissues affected and the type of artefacts introduced.

Distortion correction based on b50 data

To evaluate whether b50 data can be used for RPG distortion correction instead of b0 data, the same evaluations as for the b0-based correction was performed: station-wise calculation of MI between b50 and structural images, comparison between DWI and structural images by means of ROI-based metrics of segmented abnormalities, assessment of the impact on healthy tissue ADC values, and visual evaluation of spine alignment and image quality.

Distortion correction of high b-value images

High b-value whole-body images in the range of 800–1000 s/mm2 tend to be used clinically for visual analysis1. The effect of the RPG distortion correction on these images was therefore assessed by station-wise calculation of MI between b900 and structural images, and visual evaluation of spine alignment and image quality.

Distortion at different field strengths

The ROI-based analysis of segmented abnormalities and visual evaluation of spine misalignment enabled the extent of distortion at 1.5T and 3T to be compared. For this purpose, a subset analysis was performed for subjects scanned at both field strengths (n = 14).

Statistical tests

Paired metrics were compared using paired, two-sided, Wilcoxon signed-rank tests (significance level 0.05). Bland–Altman plots were used to assess comparability between non-corrected and corrected ADC values for segmented tissues.

Results

Examples of the susceptibility-induced geometric distortion as observed for b0 and b50 sagittal WB-DW images at 1.5T and 3T are shown in Fig. 2. The distortion corrected images obtained using b0 and b50 as reference scans for the RPG distortion correction and corresponding structural images are also displayed. To visualize the effect of the RPG correction on higher b-value images, corresponding non-corrected and corrected b900 images are provided as Supplementary Fig. S1.

Figure 2
figure 2

Representative examples of the susceptibility-induced distortion and the effect of distortion correction for sagittal b0 and b50 images at 1.5T (top row, ag) and 3T (bottom row, hn). For the b0 images, non-corrected (NC) anterior–posterior (AP) (a, h), NC posterior-anterior PA (b, i) and b0-based distortion corrected (DC-b0) AP (c, j) images are displayed. For the b50 images, NC AP (d, k), NC PA (e, l) and b50-based distortion corrected (DC-b50) AP (f, m) images are displayed. Corresponding structural images are also shown (g, n). DW images are shown in inverted grey-scale with station boundaries marked with a dotted line. Inter-station signal intensity correction was performed for the purpose of visualization23. Distortion is visible for the spine at station boundaries for NC images, and can be seen to be of opposing directions along the phase-encode direction for non-corrected AP and PA images. For the distortion corrected data the spine is linked at all station boundaries for both the b0- and b50-based corrections and its shape better matches that of the structural image.

Geometrical alignment

Overall, MI between b0 DWI and structural data increased when distortion correction was performed (Fig. 3). The distortion correction increased MI for all anatomical regions at 1.5T (head & neck: P < 0.0001, neck & chest: P = 0.0014, abdomen: P = 0.0057, abdomen & pelvis: P = 0.0014, pelvis & legs: P = 0.0060) and at 3T (all regions P < 0.0001).

Figure 3
figure 3

Mutual Information (MI) measured station-wise for non-corrected (NC), b0-based distortion corrected (DC-b0) and b50-based distortion corrected (DC-b50) data at 1.5T (left) and 3T (right). For the b0-based correction MI has been calculated between b0 and structural images, while for the b50-based correction MI has been calculated between b50 and structural images. An increased MI was obtained for the distortion corrected data for all anatomical regions (Wilcoxon signed-rank test, P < 0.05).

When comparing DWI and structural data by means of ROI-based metrics, the largest improvements were obtained for spinal segmentations, while abdominal segmentations showed small or no improvement (Supplementary Table S1). Supplementary Fig. S2 shows examples of ROIs drawn at 1.5T and 3T for two subjects.

ADC values

Bland–Altman plots of the difference in ADCmean between non-corrected and b0-based distortion corrected measurements (ΔADCmean) are shown in Fig. 4. Overall, small numerical differences in ADCmean were measured for all healthy tissues for both scanners, with ΔADCmean values close to zero. The absolute median difference between non-corrected and corrected ADCmean was small across all healthy tissues, but ADC bias assessed using paired Wilcoxon signed-rank tests identified a non-zero bias for the cerebral cortex (P = 0.0010) and kidneys (P = 0.00085) at 1.5T, and for the kidneys (P = 0.013), psoas muscle (P = 0.0077) and pelvis (P = 0.046) at 3T (Table 2). The number of adjusted ROIs for the distortion corrected data differed between scanners: at 1.5T the majority of ROIs was not adjusted (n = 14), while at 3T all subjects required at least one ROI update (Table 2).

Figure 4
figure 4

Bland Altman plots of the difference in ADCmean (ΔADCmean) of healthy tissues measured between non-corrected (NC) and b0-based distortion corrected (DC-b0) data, and between NC and b50-based distortion corrected (DC-b50) data, at 1.5T and 3T. For the pelvis the body of ilium was segmented.

Table 2 Median absolute ΔADCmean measured for selected healthy tissues at 1.5T (top) and 3T (bottom) with the IQR in parentheses.

Visual evaluation

Distortion correction improved spine alignment at station boundaries at both field strengths (Table 3). At 3T, large signal dropout was seen for one subject at the neck and for two subjects at the chest (Fig. 5), meaning the total number of evaluable station boundaries at these locations were 19 and 18, respectively. Without distortion correction, 14 subjects (70%) had at least one station boundary with spine misalignment at 1.5T, with the majority of cases being in the neck (n = 13, 62%). After distortion correction, the spine was aligned at all station boundaries for all subjects. At 3T, all subjects had at least one spine misalignment for non-corrected data, with most discrepancies arising in the chest (n = 17, 94%). The distortion correction decreased the number of misalignments, but 5 subjects (25%) had one remaining misalignment after correction (Fig. 5). With distortion correction, the average misalignment score decreased from 1.1 to 0 at 1.5T, and from 1.8 to 0.3 at 3T.

Table 3 Visual assessment of spine misalignment at station boundaries for NC, DC-b0 and DC-b50 data, at 1.5T (top) and 3T (bottom). At both field strengths, b50 and b900 images are evaluated. The total number of subjects affected by spine misalignments are displayed, as well as the number of subjects with spine misalignments at the different anatomical locations along the spine (neck, chest and abdomen). In parentheses the total number of evaluable subjects are displayed. At 3T, large signal dropout for 1 subject at the neck and 2 subjects at the chest meant that the total number of station boundaries that could be evaluated at these locations were 19 and 18, respectively. The average misalignment score across all subjects is also given.
Figure 5
figure 5

Example of a subject at 3T for which susceptibility changes cause both a signal void at the chest and large geometric distortion at the abdominal station boundary. Sagittal b50 anterior–posterior (AP) images are shown in inverted grey-scale: (a) non-corrected (NC), (b) b0-based distortion corrected (DC-b0) and (c) b50-based distortion corrected (DC-b50). The corresponding structural image is also shown (d). Station boundaries are marked with a dotted line and inter-station signal intensity correction was performed for the purpose of visualization23. The signal drop-out resulted in spine misalignment not being assessed at the chest station boundary. The geometric distortion seen as a spine misalignment at the abdominal station boundary could not be corrected by either the b0- or b50-based distortion corrections.

When assessing axial slices, the radiologists scored the overall image quality as worse for the majority of subjects at 1.5T (mean n = 15 subjects, 73%) and at 3T (mean n = 13 subjects, 63%) (Table 4). Worse image quality was attributed to minor artefacts in the majority of these subjects (score 1: 73% at 1.5T, 76% at 3T). Localized moderate or severe artefacts were however also present (score 2: 27% at 1.5T, 21% at 3T, score 3: 0% at 1.5T, 3% at 3T). Artefacts were mainly noted in the spleen and/or kidneys and the causes attributed to registration artefacts. Typical artefacts are shown in Fig. 6 (1.5T) and 7 (3T).

Table 4 Rating and scores given by three radiologists when comparing the image quality of non-corrected and distortion corrected axial b50 images side by side. The number of distortion corrected scans scored as ‘equal’ or ‘worse’ compared to the corresponding non-corrected scans are shown for DC-b0 and DC-b50 data. The scores given for scans marked as having worse image quality are also shown. The mean rating and score for the three readers’ assessments are indicated. In parentheses, percentages are given.
Figure 6
figure 6

Typical registration artefacts caused by the RPG method as observed at 1.5T. Axial b50 images (a) axial b50 images overlaid on corresponding structural images (b) and sagittal b50 images (c) are shown. In each column (left to right) the non-corrected (NC) anterior–posterior (AP), NC posterior-anterior (PA), b0-based distortion corrected (DC-b0) AP and b50-based distortion corrected (DC-b50) AP images are displayed. Axial and sagittal b50 images in inverted grey scale show spleen registration artefacts as marked by arrows (a, c). Although artefacts are introduced, there is an improvement in the geometrical alignment. This is exemplified by zoomed axial b50 images in yellow color scale overlaid on corresponding structural images (b), with misalignment in the NC AP image indicated with an arrow. For the sagittal images the station border is marked with a dashed line. The radiologists’ assessments of overall image quality gave a mean score of 1.7 and 1.0 for the DC-b0 and DC-b50 images, respectively.

Distortion correction based on b50 data

The evaluations (geometric alignment, ADC values and visual assessments) of the b50-based geometric distortion correction were in line with that of the b0-based distortion correction. When b50 data was used for distortion correction, an increased MI was obtained for all body parts at 1.5T (head & neck: P < 0.0001, neck & chest: P = 0.00089, abdomen: P = 0.00062, abdomen & pelvis: P = 0.0057, pelvis & legs: P = 0.00072) and at 3T (head & neck, neck & chest, abdomen, and abdomen & pelvis: P < 0.0001, pelvis & legs: P = 0.00014) (Fig. 3). When comparing ROI-based metrics of segmented incidental findings on DWI with structural MR, larger improvements were seen for spinal segmentations compared to abdominal segmentations (Supplementary Table S1).

Small numerical differences between non-corrected and distortion corrected ΔADCmean measurements were seen (Fig. 4). ROI placement was the same as for the b0-based correction at both field strengths. After distortion correction, the spine was aligned for all subjects at all station boundaries at 1.5T, while 4 subjects (20%) had one remaining misalignment after correction at 3T (Table 3, Fig. 5). The radiologists scored the overall image quality as worse for the majority of subjects at 1.5T (mean n = 16, 80%) and at 3T (mean n = 15, 75%) (Table 4). As for the b0-based distortion correction, worse image quality was mainly attributed to minor artefacts (score 1: 79% at 1.5T, 78% at 3T), but localized moderate or severe artefacts were present (score 2: 15% at 1.5T, 18% at 3T, score 3: 6% at 1.5T, 4% at 3T). At both field strengths, artefacts were mainly noted in the spleen and kidneys, and were attributed to registration artefacts in all cases. Typical artefacts were similar for the b0- and b50-based distortion corrections (Figs. 6 and 7).

Figure 7
figure 7

Typical registration artefact caused by the RPG method as observed at 3T. Axial b50 images (a) axial b50 images overlaid on corresponding structural images (b) and sagittal b50 images (c) are shown. In each column (left to right) the non-corrected (NC) anterior–posterior (AP), NC posterior-anterior (PA), b0-based distortion corrected (DC-b0) AP and b50-based distortion corrected (DC-b50) AP images are displayed. Axial b50 images in inverted grey scale show the kidney registration artefact as marked by arrows (a). Although artefacts are introduced, there is an improvement in the geometrical alignment. This is exemplified by zoomed axial b50 images in yellow color scale overlaid on corresponding structural images (b) and by sagittal b50 images in inverted grey scale (c). Misalignments in NC AP images are indicated with arrows in (b) and (c). For the sagittal images the station border is marked with a dashed line. The radiologists’ assessments of overall image quality gave a mean score of 1.0 and 1.7 for the DC-b0 and DC-b50 images, respectively.

Distortion correction of high b-value images

The RPG distortion correction gave small numerical changes in b900 station-wise MI at both field strengths (Supplementary Figure S3). The exception being the head & neck, for which a larger MI improvement was seen (P < 0.0001 for both field strengths and for both the b0- and b50-based distortion correction). At 1.5T, using the b0-based distortion correction, MI did not change for the pelvis & legs (P = 0.91), while a small systematic decrease in MI was measured for the neck & chest (P = 0.00013), abdomen (P = 0.00027) and abdomen & pelvis (P = 0.00029). Usage of the b50-based distortion correction gave better geometrical alignment between b900 and structural data at 1.5T: increased MI for the abdomen (P = 0.0034) and pelvis & legs (P = 0.036), and no change in MI for the neck & chest (P = 0.35) and abdomen & pelvis (P = 0.48). At 3T, the b0-based distortion correction gave an increased MI for the abdomen & pelvis (P = 0.0064) and pelvis & legs (P = 0.029), while no difference in MI was measured for the neck & chest (P = 0.18) and abdomen (P = 0.20). Again, the b50-based distortion correction gave a slight improvement in geometrical alignment to structural images: increased MI for the abdomen (P = 0.038), abdomen & pelvis (P = 0.00017) and pelvis & legs (P = 0.0026), and no change in MI for the neck & chest (P = 0.75). In general, the b900 images at 3T are of low SNR, which is evident in the larger spread of MI-values compared to the MI-value spread of b0 and b50 images (Fig. 3, Supplementary Fig. S3).

The visual assessments of b900 images gave results that were in line with the low b-value image evaluations. A small number of spine misalignments remained after b0- or b50-based distortion corrections (Table 3) and the radiologists mainly reported minor artefacts (Supplementary Table S2). At both field strengths, localized registration artefacts were noted and occurred mainly in the spleen. The artefacts had the same appearance as for the low b-value images (Figs. 6 and 7), with no new artefacts introduced.

Distortion at different field strengths

Comparing DWI and structural data by means of ROI-based metrics, indicated larger distortions at 3T compared to 1.5T for non-corrected data; lower DSC, and larger ED and AVD were measured at 3T (Supplementary Table S1, Supplementary Fig. S2). The distortion correction was also seen to give larger improvements at 3T compared to 1.5T.

Visual assessment of spine misalignment at stations boundaries for subjects scanned at both field strengths (n = 14) are displayed per patient as Supplementary Table S3. A higher number of misalignments are seen at 3T, indicating larger distortions at this field strength.

Discussion

In this work we assessed the performance of the RPG method for correction of susceptibility-induced geometric distortion of WB-DWI healthy volunteer data acquired at 1.5T and 3T. Geometric distortions were present in the majority of acquired data, with larger distortions observed at 3T compared to 1.5T. In terms of improved geometric accuracy, the RPG method performed well at both 1.5T and 3T: an increase in MI between low b-value DWI and structural MR data was measured for distortion corrected data, for all anatomical regions, and a lower number of spine misalignments was visually observed at station boundaries. Overall, improvements in geometrical accuracy could aid WB-DWI gain a wider clinical acceptance and is also important in other aspects; it would benefit the development of detection and segmentation techniques in particular for multispectral data and aid implementation for radiotherapy usage. An improved geometrical accuracy could also facilitate improved co-registration of WB-DWI data acquired at different time points, for example during treatment response assessments. Artefact-reduction has been described as a hurdle for usage of ADC as a primary endpoint biomarker8 and WB-DWI is not yet included in the Quantitative Imaging Biomarker Alliance (QIBA) DWI profile longitudinal claim statement31. Providing a reduction in image distortion could aid standardization efforts.

This study further showed that the RPG correction can be used with b50 EPI data as input, instead of the standard b0 EPI data. The susceptibility-induced distortion corrected by the RPG method is thought to be the same regardless of b-value10, giving validity to this approach. With regards to clinical usage, this is of importance as a perfusion insensitive lowest b-value is recommended and b0 data tend not to be collected. Although no comparison was made with a b0-based distortion correction, Winter et al.21 also showed that the RPG method using AP and PA b = 150 s/mm2 images reduces distortion in head and neck EPI-based DWI acquisitions.

The RPG distortion correction of high b-value images did not improve the overall geometrical alignment in terms of MI between b900 and structural images to the same extent as for b0 and b50 images. For stations covering the chest and abdomen, the correction gave a decreased or unchanged MI at 1.5T, and an improved or unchanged MI at 3T. Reasons for the poorer performance seen for b900 data are likely due to eddy current induced distortions and differences in motion between acquired b-value images. The b900 images are likely to be affected by eddy currents due to the strong gradient pulses used for data acquisition, and more likely to be affected by breathing motion artefacts due to the increased number of signal averages used in the acquisition (9 in the current study). The head & neck and pelvis & legs stations showed the largest improvements for b900 data, corresponding to stations less affected by breathing motion. In addition to these effects, high b-value images are inherently of low SNR, affecting the accuracy of the MI metric calculation. This latter effect was in particular seen at 3T, for which a larger MI-value spread was measured for b900 images compared to low b-value images. Although the overall geometrical alignment gave mixed results for b900 images, visual assessment of image quality showed that the RPG distortion correction did not produce additional artefacts for b900 images compared to low b-value images. Moreover, a localized improvement of the distortion correction was seen for the spinal cord at both field strengths, both in terms of a reduced number of spine misalignments and an improved geometrical alignment of incidental findings. This highlights that a potential clinical application of the distortion correction is likely to be application dependent.

Bland–Altman plots exhibited small numerical differences between non-corrected and corrected ADCmean values for healthy tissues. Across all healthy tissues, a median absolute difference of 0.0040 × 10–3 mm2/s was measured for both the b0- and b50-based correction at 1.5T, while at 3T median absolute differences of 0.011 and 0.0095 × 10–3 mm2/s were measured for the b0- and b50-based corrections, respectively. Published results on test–retest ADC measurements in free-breathing WB-DWI are scarce, but Kwee et al.32 showed that the mean ADC bias of healthy liver parenchyma was in the range of − 0.01–0.09 × 10–3 mm2/s at 1.5T. The liver ADC bias measurements in this work can hence be considered small (mean absolute ΔADCmean 0.003 × 10–3 mm2/s at 1.5T). An overall trend of larger ADCmean differences was seen at 3T compared to 1.5T. To some extent, this is attributable to the ROI adjustment needed for distortion corrected data at 3T for which at least one ROI adjustment was needed per subject. The ADC bias assessment showed that the RPG method has the potential to introduce a non-zero bias, as observed for a subset of healthy tissues. This bias is however numerically very small in terms of ΔADCmean and unlikely to have a clinical impact.

The visual assessment of spine misalignment at station boundaries showed that non-corrected datasets had a larger number of misalignments compared to corrected datasets. The shape of the whole spine improved after correction and better matched its structural counterpart, in particular at 3T. Visual assessment of axial b50 and b900 images showed that the RPG method can give rise to localized registration artefacts. This was particularly evident for the spleen and kidneys, corresponding to tissues exhibiting large distortions in this study. The spleen most likely as it lies close to the air-tissue interface of the lung, meaning large susceptibility-induced distortions in this area of the body. The kidneys were for many subjects imaged by two stations, meaning the kidney as a whole was affected by central-frequency shifts between stations. As a free-breathing WB-DWI acquisition was used, the abdominal organs are also affected by motion. For WB-DWI it has been shown that imaging in free-breathing is preferably used, giving an intra-voxel coherent motion pattern that exhibits an improved image SNR compared to breath-hold or navigator triggered imaging33. As the RPG method requires an additional image acquisition an inherent limitation of the technique is however the potential for gross subject movement, other than respiratory motion, between the AP and PA data acquisitions. To minimize the risk of this type of movement occurring, the PA acquisition followed immediately after the AP acquisition in the current study.

A limited number of incidental finding abnormalities were found and used to further assess the geometric correspondence between DWI and structural MR data. Overall, the distortion correction gave a numerical improvement in the ROI-based performance metrics assessed, in particular at 3T. The results however indicated that the improvement is position dependent: larger improvements were seen in the spine, but small or no improvement was seen for abdominal lesions. This is partly due to breathing motion differences between abdominal DWI and structural MR acquisitions, which was not corrected for in this study.

The extra scan time for the PA phase-encoded sequences in this study was relatively long as both b0 and b50 images were acquired: 0:50 and 1:35 min per station at 1.5T and 3T, respectively. In addition to an increased likelihood of gross subject motion, the additional scan time is a prohibitive factor to achieve patient compliance during the already long total scan time for whole body MR protocols. In practice, however, an EPI sequence giving only one set of images of PA phase-encoding is needed (e.g. b0 or b50), meaning the scan time can be reduced compared to our study protocol. For further time saving, it would also be possible to take advantage of the fact that the distortion corrected image can be composed from corrected AP and PA data sets. For the imaging protocol used in this study, this would entail acquiring low b-value images with an NSA of 1 for the AP and PA data acquisitions and as such the RPG method would give a negligible penalty in acquisition time compared to the standard protocol of acquiring one AP acquisition with an NSA of 2. Although not available at our imaging center, there are also sequences available that are optimized to allow for shorter acquisition times that have been implemented by some MR scanner manufacturers (e.g. PROGRES, GE Healthcare). The extra time needed for these purpose-built sequences is one TR (5.6 s at 1.5T and 3.5 s at 3T in the current study), making the RPG method more feasible for clinical applications.

A limited amount of work has been published on distortion correction of WB-DWI. Blackledge et al. however assessed a correction method based on histogram matching of overlapping slices23, while Ceranka et al. looked at image registration between DWI and structural images acquired in the same scan session24. Both approaches require the acquisition of overlapping slices and that inter-station intensity standardization is performed prior to correction. Intensity standardization can be problematic at 3T for which large signal dropout can be obtained and common approaches relying on signal intensity registration23,34 would not work adequately. An advantage of the RPG method is that no pre-processing of the acquired data is needed. In addition to these approaches, it has been shown that volume shimming can potentially be omitted at 1.5T, with the center frequency set equal for all WB-DWI stations25. During protocol development in this study this approach was however tested at 1.5T, but no improvements in station distortion was observed.

This study has some limitations. We did not take into account distortion due to eddy currents. Eddy currents have the potential to affect higher b-value images, in particular the b400 and b900 images acquired in this study. This can give rise to distortion in the images that are not present in lower b-value images, and can introduce errors in ADC measurements9. Clinically, high b-value images tend to be used for visual evaluation of WB-DWI1. Although the RPG method corrects for susceptibility-induced distortion in these images, the effect of eddy current distortion was not assessed in this work. Registration-based methods for eddy current correction have however been proposed35,36 and could be tested for WB-DWI. Modifications to these single-station brain registration algorithms are likely needed for utilization in multi-station WB-DWI. Furthermore, the RPG method is dependent on the scale of the distortion of input data. If the distortion is too large, the RPG method is not capable of restoring the true signal position and/or magnitude. This was seen in some cases at 3T and it is possible that subject-specific tuning of RPG registration parameters (λ2 and Gaussian smoothing kernel) could have improved the results in these cases. Subject-specific parameter tuning was however not performed in this work, in which we instead showed that improved geometric accuracy can be obtained even though station-wise signal intensities and the amount of susceptibility-induced distortion varies in a WB-DWI examination. The RPG method hence proved robust with regards to registration parameter selection, which can be considered a strength for clinical applicability. A further problem with the RPG method can arise due to fold-over artefacts, which can give rise to singularities in the Jacobian determinant maps and as such inaccuracies in the registration15. To avoid fold-over artefacts a large FOV is used in WB-DWI. There is however the potential that very large patients will extend outside the FOV in the phase-encode direction. Depending on the amount of fold-over, the RPG method might not be usable in these cases. All subjects included in this study were however smaller than the FOV, meaning fold-over artefacts were not observed and this effect could not be assessed.

In conclusion, an improved geometric accuracy was obtained when applying the RPG distortion correction on WB-DWI data acquired at 1.5T and 3T. The b0- and b50-based distortion corrections had a very similar performance, which indicates that a b50 EPI acquisition can be used for RPG correction of WB-DWI data and facilitates clinical use.