How reliable are ADC measurements? A phantom and clinical study of cervical lymph nodes

Moreau, Bastien; Iannessi, Antoine; Hoog, Christopher; Beaumont, Hubert

doi:10.1007/s00330-017-5265-2

How reliable are ADC measurements? A phantom and clinical study of cervical lymph nodes

Magnetic Resonance
Open access
Published: 23 February 2018

Volume 28, pages 3362–3371, (2018)
Cite this article

Download PDF

You have full access to this open access article

European Radiology Aims and scope Submit manuscript

How reliable are ADC measurements? A phantom and clinical study of cervical lymph nodes

Download PDF

Bastien Moreau¹,
Antoine Iannessi¹,
Christopher Hoog¹ &
…
Hubert Beaumont²

2868 Accesses
28 Citations
3 Altmetric
Explore all metrics

Abstract

Objective

To assess the reliability of ADC measurements in vitro and in cervical lymph nodes of healthy volunteers.

Methods

We used a GE 1.5 T MRI scanner and a first ice-water phantom according to recommendations released by the Quantitative Imaging Biomarker Alliance (QIBA) for assessing ADC against reference values. We analysed the target size effect by using a second phantom made of six inserted spheres with diameters ranging from 10 to 37 mm. Thirteen healthy volunteers were also scanned to assess the inter- and intra-observer reproducibility of volumetric ADC measurements of cervical lymph nodes.

Results

On the ice-water phantom, the error in ADC measurements was less than 4.3 %. The spatial bias due to the non-linearity of gradient fields was found to be 24 % at 8 cm from the isocentre. ADC measure reliability decreased when addressing small targets due to partial volume effects (up to 12.8 %). The mean ADC value of cervical lymph nodes was 0.87.10^-3 ± 0.12.10^-3 mm²/s with a good intra-observer reliability. Inter-observer reproducibility featured a bias of -5.5 % due to segmentation issues.

Conclusion

ADC is a potentially important imaging biomarker in oncology; however, variability issues preclude its broader adoption. Reliable use of ADC requires technical advances and systematic quality control.

Key Points

• ADC is a promising quantitative imaging biomarker.

• ADC has a fair inter-reader variability and good intra-reader variability.

• Partial volume effect, post-processing software and non-linearity of scanners are limiting factors.

• No threshold values for detecting cervical lymph node malignancy can be drawn.

Robustness of apparent diffusion coefficient–based lymph node classification for diagnosis of prostate cancer metastasis

Article Open access 15 December 2023

Developing and testing a robotic MRI/CT fusion biopsy technique using a purpose-built interventional phantom

Article Open access 22 November 2022

Task-based assessment of neck CT protocols using patient-mimicking phantoms—effects of protocol parameters on dose and diagnostic performance

Article Open access 05 November 2020

Discover the latest articles, news and stories from top researchers in related subjects.

Medical Imaging

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Recent advances in medical imaging technology and drug therapeutics have accelerated the emergence of new quantitative imaging biomarkers (QIB) [1, 2]. The multiplication of these QIBs is unfortunately not always accompanied by stringent validations establishing that QIBs are well designed to characterize a disease and its changes with therapy. This lack of validation creates a situation where QIBs are routinely used but with limited knowledge of their performances, precluding a larger adoption in clinical trials.

Apparent diffusion coefficient (ADC) can quantify the level of free water diffusion restricted by an increase in tissue cellularity. Applications of ADC in cancer imaging has motivated intensive research and ADC is now one of the main QIBs derived from diffusion MRI.

Several studies have documented the incremental value of ADC assessment as a complement or substitute to standard sequences for the detection of malignant tumours [3], the degree of malignancy [4, 5] or to evaluate response to treatment [6,7,8].

Since lymph node involvement is pivotal in oncological imaging [9], ADC has been tested for its detection of malignant adenomegalies [10, 11]. Results are discordant [12, 13].

Previous literature comprises heterogeneous studies protocols and results [14]. Several sequential unitary processes are necessary to output an ADC assessment, the lack of reliability of any of these unitary processes is likely to degrade the final ADC assessment. It is therefore particularly relevant to study if ADC qualifies as a quantitative biomarker.

Over the last decade, a multidisciplinary community has organized retrospective investigations of QIBs starting by documenting methodologies [2]. In 2007, the Radiological Society of North America (RSNA) launched QIBA (Quantitative Imaging Biomarker Alliance [15]), a specialized working group aiming at improving the value and usefulness of QIBs in reducing variability across devices, patients, and practices.

One of QIBA aims consists in releasing ‘Profiles’, which are documents standardizing imaging protocols to obtain optimal, reliable and reproducible biomarker measures according to the current state of the art. The QIBA diffusion imaging profile is still a work in progress [16].

QIBA also proposes a standardized protocol for quality control in diffusion imaging, using a diffusion phantom [17, 18] consisting of a volume of 0 °C stabilized water as the reference value for ADC assessment [19, 20].

The main objective of this study was to evaluate the variability of ADC measurements in vitro on a phantom and in vivo on cervical lymph nodes. The secondary objective was to understand and quantify ADC measurement errors, in view of correcting them in future studies.

Methods

We first tested QIBA metrics for quality control (QC) of ADC image quality, and then performed a reliability analysis of ADC measurements. Finally we measured ADC values of cervical lymph nodes in healthy volunteers.

This prospective study was conducted at the Centre Antoine Lacassagne, cancer centre in Nice, France, between March and November 2016. We used a GE MRI scanner 1-5T MR450W and ADW Volume Share 5 4.6 software to process images (GE Healthcare).

Quality control test

We used a DIN 6858-1 PET-CT phantom (PTW) consisting of a cylindrical Plexiglas body filled with a mixture of ice and water. Three smaller cylinders were inserted into the body, one of which was filled with water at 0 °C (Fig. 1, left side).

Homogeneity of temperature inside the cylinder was thermometer-controlled according to the process defined into the QIBA profile to achieve thermal equilibrium (>1 h) over the entire MRI exam period. For each b value, four successive acquisitions spaced in time from more than 12 min were performed, allowing retrospective checks.

The diffusion protocol was 3three directions, DW SS-EPI with b=0, 100, 600, 800 s/mm², TR=9,451 ms, TE=80 ms, Number of average = 2, FOV 320*320 mm, contiguous slice thickness of 4 mm, encoding frequency axis R/L.

Four successive acquisitions were made for each b value, the phantom symmetry axis was laser-centred to the magnetic field positioning the 0 °C water cylinder at the center of the scanner. Acquisitions of the phantom were performed horizontally (x-axis) and vertically (y-axis). We measured circular regions of interest (ROIs) of 2.5 cm diameter and composed of 123 voxels (Fig. 2). Mean ADC and standard deviation (SD) were computed.

According to the equations in Table 1, we computed the measurement repeatability (R), estimated by the coefficient of variation (CV_R) and the repeatability coefficient (RC_R), the accuracy (ADC Bias estimate), ADC noise estimate and b-value dependency.

Table 1 Definition of quality control metrics according to QIBA DW-MRI profile

Full size table

The signal-to-noise ratio (SNR) was computed using formula F (shown in Table 1) and involved computing the ‘Temporal Noise Image’ from the diffusion mapping at b = 0, with a 2-cm circular ROI.

Results were compared to QIBA ‘s references values [16].

In addition, we analysed the planar spatial correlation of ADC measures in shifting ROIs along the x and y axis. The ADC reference value was measured at the image center using formula C (see Table 1). We used circular ROIs of 2.2-cm diameter and 2-cm shifts from the centre either to the right (x-axis) or to the bottom (y-axis) of the image.

Measurement variability

SPHERE phantom study

A second phantom was used (NEMA NU2-2012 (PTW)), called SPHERE Phantom (Fig. 1, right side). The SPHERE phantom embedded six different spheres (diameters 10, 13, 17, 22, 28 and 37 mm), filled with room temperature water.

We simulated clinical conditions in using the cervical level of the routine whole-body MRI, i.e. axial DW SS EPI with b=50 and b=1,000 s/mm², TR=10,384 ms, TE set to minimum (around 70 ms for all scans). Number of averages=2, parallel imaging factor=2, FOV=400*400 mm, contiguous 5-mm slice thickness, encoding frequency axis R/L. The phantom was laser centred, equidistant from all spheres. Four acquisitions were made at 1-day intervals. All values were averaged over 4 days.

ADC measures were obtained from spherical volumes of interest (VOIs) centred on spheres (Fig. 3).

The relative ADC error was computed for each sphere size, considering that the reference ADC value was from the 37-mm sphere. We analysed the correlation between VOI size and precision of measurements in computing the CV_R. Additional analysis documented the measurement error, first in measuring bias, second in computing the CV_R through several concentric VOIs of decreasing size in the largest sphere, according to Table 1 (Formula A). Then partial volume effect was quantified by calculating the relative error within a VOI with a diameter equal to 80 % the diameter of a sphere compared to a VOI of identical size within the largest sphere. The mean and SD of ADC values were computed for all VOIs size.

In vivo study

Informed consent was obtained from 13 healthy volunteers. Exclusion criteria were chronic disease, history or ongoing symptoms of infection like fever, cough, rhinorrhoea, dysphagia and odynophagia, history of cervical surgery, claustrophobia and all usual contraindications for MRI. Demographic status and smoking habits were recorded for the 13 volunteers. Volunteers were scanned using the same machine as the phantom study. The acquisition was performed with a neck phased-array coil and the volunteer was instructed to breath normally.

Technical settings of diffusion sequence for volunteers were identical to those of the SPHERE phantom.

Two readers assessed ADC values of lymph nodes: a senior radiologist with more than 6 years of experience in cancer imaging and a junior radiologist.

Lymph node volumes were manually segmented on the b1000 scan, and the graphic was exported to the ADC map (Fig. 4). At least four lymph nodes were selected per volunteer, including the largest. VOIs were segmented in delineating hyper-intense diffusion areas on b1000 scans while excluding lymph nodes hilum. Each node was segmented twice by each observer using the same acquisition with an interval of 7–60 days (mean 41 days) [21]. Mean and SD ADC values were recorded.

Inter- and intra-observer agreements were calculated according to the Bland Altman method using R CRAN software. Bias and limit of agreement (LoA) were computed. Inter- and intra-observer differences in segmenting lymph node volumes and ADC values were analysed using the sum of Wilcoxon rank for paired values test. A p-value < 0.05 was considered significant.