Comparison of clinical MRI liver iron content measurements using signal intensity ratios, R 2 and R 2*

Runge, Jurgen H.; Akkerman, Erik M.; Troelstra, Marian A.; Nederveen, Aart J.; Beuers, Ulrich; Stoker, Jaap

doi:10.1007/s00261-016-0831-7

Comparison of clinical MRI liver iron content measurements using signal intensity ratios, R ₂ and R ₂*

Open access
Published: 18 July 2016

Volume 41, pages 2123–2131, (2016)
Cite this article

Download PDF

You have full access to this open access article

Abdominal Radiology Aims and scope Submit manuscript

Comparison of clinical MRI liver iron content measurements using signal intensity ratios, R ₂ and R ₂*

Download PDF

Jurgen H. Runge ORCID: orcid.org/0000-0003-4190-3890¹,
Erik M. Akkerman¹,
Marian A. Troelstra¹,
Aart J. Nederveen¹,
Ulrich Beuers² &
…
Jaap Stoker¹

2099 Accesses
13 Citations
Explore all metrics

Abstract

Purpose

To compare three types of MRI liver iron content (LIC) measurement performed in daily clinical routine in a single center over a 6-year period.

Methods

Patients undergoing LIC MRI-scans (1.5T) at our center between January 1, 2008 and December 31, 2013 were retrospectively included. LIC was measured routinely with signal intensity ratio (SIR) and MR-relaxometry (R ₂ and R ₂*) methods. Three observers placed regions-of-interest. The success rate was the number of correctly acquired scans over the total number of scans. Interobserver agreement was assessed with intraclass correlation coefficients (ICC) and Bland–Altman analysis, correlations between LIC_SIR, R ₂, R ₂*, and serum values with Spearman’s rank correlation coefficient. Diagnostic accuracies of LIC_SIR, R ₂ and serum transferrin, transferrin-saturation, and ferritin compared to increased R ₂* (≥44 Hz) as indicator of iron overload were assessed using ROC-analysis.

Results

LIC MRI-scans were performed in 114 subjects. SIR, R ₂, and R ₂* data were successfully acquired in 102/114 (89%), 71/114 (62%), and 112/114 (98%) measurements, with the lowest success rate for R ₂. The ICCs of SIR, R ₂, and R ₂* did not differ at 0.998, 0.997, and 0.999. R ₂ and serum ferritin had the highest diagnostic accuracies to detect elevated R ₂* as mark of iron overload.

Conclusions

SIR and R ₂* are preferable over R ₂ in terms of success rates. R ₂*’s shorter acquisition time and wide range of measurable LIC values favor R ₂* over SIR for MRI-based LIC measurement.

Non-invasive measurement of liver iron concentration using 3-Tesla magnetic resonance imaging: validation against biopsy

Article 24 November 2017

Comparison of Inline R2* MRI versus FerriScan for liver iron quantification in patients on chelation therapy for iron overload: preliminary results

Article 26 May 2021

Assessment of liver iron overload by 3 T MRI

Article 21 February 2017

Various diseases are associated with increased liver iron content (LIC), which may induce or contribute to liver damage [1–3]. Serial measurement of LIC during long-term follow-up and treatment is highly desirable, but repeated invasive measurements are not recommended due to risks of complications of serial liver biopsies. Surrogate biochemical markers including serum ferritin and transferrin-saturation are widely used, but are flawed by limited specificity. Thus, accurate non-invasive MRI-based methods of LIC measurement are used in clinical practice for patients (suspected) with increased LIC [4, 5].

Several types of MRI LIC measurement have been described in the literature. Straightforward in–out phase gradient echo (GRE) shows signal loss at the later echo time (TE) but is only qualitative and easily confounded by the presence of hepatic steatosis. Quantitative approaches include (i) signal intensity ratio (SIR) measurement (e.g., the Gandon method) and (ii) MR-relaxometry. The Gandon method (henceforth referred to as “SIR”) utilizes the liver-to-muscle SIR on differently weighted MRI-scans [6]. This method allows easy and free calculation of the LIC_SIR, by entering ROI values in an online tool [7]. Hence, assuming the acquisition and placement of regions-of-interest (ROIs) are performed correctly, the method is robust to observer influences. A major limitation is its upper limit of detection of 350 µmol/g (equal to 20 mg/g): changes above that threshold cannot be measured.

MR-relaxometry relies on the calculation of tissue relaxation rates (R ₂ and R ₂*, the inverse of relaxation times T ₂ and T ₂*), which increase as iron accumulates and are sensitive to changes in LIC values well above the SIR-threshold. One commercialized R ₂ approach using single-echo spin-echo (SE) MRI is the FDA-approved St. Pierre method [FerriScan^®], performed in 10 min in free-breathing [8]. The per-scan analysis price is ~$300, on top of the costs of the MRI-scan itself. Alternative free-of-charge approaches are available for R ₂ using free-breathing or respiratory triggered SE-MRI and for R ₂* using single breath-hold GRE MRI [9].

Recent developments in MR-relaxometry include multipeak fat corrections and the use of complex instead of magnitude-only data fitting [10], assessment of the effect of fat suppression on R ₂* [11] and the comparison of advanced data fit models [12] and analysis approaches [13].

A comparative study of LIC_SIR, R ₂, and R ₂* in 94 patients with β-thalassemia reported high correlations [14]. However, success rates, interobserver agreement, and applicability for diseases other than β-thalassemia were not investigated, nor were serum markers assessed. The latter may be useful to screen for elevated LIC (i.e., >36 µmol/g), saving expensive and limited MRI time. We hypothesize that R ₂* is preferable over SIR and R ₂ in terms of success rate, acquisition time, and range of detection and over serum values in terms of accuracy in detecting elevated LIC.

In our center, the clinical LIC protocol has included SIR, R ₂, and R ₂* since 2005, with regular weekly clinical referrals since 2008. The SIR measurement is recommended by the national guideline for hemochromatosis [15]. It is supplemented by R ₂ and R ₂* measurements to fill the gap caused by the SIR method’s hard cut-off at 350 µmol/g. To investigate our hypothesis, we (i) assessed SIR, R ₂, and R ₂* LIC measurements and their success rates and interobserver agreement; and (ii) compared the diagnostic accuracies of LIC_SIR, R ₂, and surrogate serum markers for correctly predicting elevated LIC based on increased R ₂*_.

Materials and methods

Ethical

All data used for this study were acquired in clinical setting and were anonymized prior to analysis. Informed consent was waived by the Medical Research Ethics Committee of the AMC Amsterdam.

Patients

All MRI-based LIC measurements performed between January 1, 2008 and December 31, 2013 were retrospectively included in this study. As additional measurements were added to the protocol in 2014, only measurements up to end 2013 were included. Clinical diagnosis and—when available—serum markers of iron metabolism (total iron, transferrin, transferrin-saturation, ferritin) were collected and subsequently anonymized by a colleague not otherwise involved in this study.

MRI

MRI-scanning was performed supine, feet first on a 1.5T Avanto MRI-scanner (Siemens AG, Erlangen, Germany) using phased-array coils (body array and spine coil) for localizers and R ₂ and R ₂* measurements and the body coil for the SIR measurement [6]. Use of the body coil provided an as homogenous B₁ field as possible, reducing variation in SIR measurements due to variations of flip angles between patients. For R ₂* and R ₂, the B₁ variation is eliminated via the data fit. Breath-hold imaging (localizers, SIR and R ₂*) was performed in expiration. Three 10-mm slices with a variable slice gap to cover the liver were equally positioned for all three LIC measurements. Especially for the GRE-based SIR and R ₂* measurements, careful B₀ shimming is important to achieve a homogenous B₀ field, ensuring correct measurements. Shimming was performed with a shim box covering the field-of-view in the feet-head direction and the contours of the abdomen (i.e., excluding the arms) in the left–right and anterior-posterior directions. The SIR measurement according to Gandon et al. requires five (T1, PD, T2, T2+, and T2++) image weightings with specific TR/TE combinations [6]. Table 1 contains an overview of the relevant scan parameters. Of note, the TE interval used for R ₂* was shorter (1.41 ms) than the standard in- and out-of-phase interval (2.26 ms).

Table 1 MRI parameters

Full size table

Data analyses

After inclusion all measurements were checked for correct TRs, TEs, and RF coils using DICOM header information as for SIR measurements, specific TR/TE combinations and the use of the body coil are mandatory. Image quality was assessed by a research trainee (JHR, 4 years of experience) and an abdominal radiologist (JS, 20 years of experience) using a 3-point scale (good/adequate/inadequate). The type of artifact(s) was noted. Measurements with incorrect scan parameters or inadequate image quality were classified unsuccessful.

ROI-placement

SIR, R ₂, and R ₂* data were processed using custom-made software that allowed ROI-placement, LIC_SIR calculation, and R ₂ and R ₂* data fitting. Three blinded observers (JHR, MAT, and EMA) with four, a half and 9 years of experience, respectively, independently placed regions-of-interest (ROIs) for three slices per scan. First, the liver parenchyma was masked on R ₂* source data, excluding a rim near the liver edge (Fig. 1 A). Next, non-liver voxels (e.g., vessels, gall bladder) inside the liver contour were masked (Fig. 1 B). By subtracting ROI-2 from ROI-1, only liver parenchyma remained (Fig. 1 C). Liver ROIs were copied from the R ₂* data for SIR analysis, with two additional ROIs in both paraspinal muscles, carefully avoiding areas of signal intensity loss close to the lung (Fig. 1 D). This also allowed a check to identify whether patients had moved between R ₂* and SIR measurements, in which case new ROIs were placed. Ghosting artifacts caused by aortic blood flow were present in SIR measurements before November 2012 (when saturation slabs were added). Separate ROIs were placed to remove these artifacts from the liver and muscle ROIs (Fig. 1 E, F). Some reports indicate that susceptibility artifacts may affect R ₂* measurements when using a single ROI in liver segments VII or VIII [16]. Due to the limited number of slices, we did not formally assess segmental variations of R ₂, R ₂*, or LIC_SIR in this study.

The respiratory triggering applied for R ₂ data acquisition resulted in slight changes in slice positioning so that new ROIs were placed using R ₂ source data as described above.

LIC_SIR

The calculations published by Gandon et al. were entered into the aforementioned program [7, 17], which automatically chooses the most reliable SIR (i.e., T1, PD, T2, T2+, or T2++) which is converted to LIC_SIR. The mean LIC_SIR of three slices was used and, when one or more values exceeded the 350 µmol/g threshold, the final value was noted as >350 µmol/g. In two subanalyses, the R ₂ and R ₂* values and the individual SIR ratios in patients with LIC_SIR >350 µmol/g were evaluated.

R ₂*

In magnitude images, the noise is distributed in a non-Gaussian manner. This is known as Rician noise [18]. At high signal levels, the non-zero mean has a negligible effect on the average signal, but near the noise level, a noise bias exists which needs to be taken into account when fitting R ₂*. We explored three different fit routines: a truncated exponential fit (A) [19, 20], an exponential + constant fit (B) [9, 21], and an exponential + Rician noise (C).

The truncated exponential method A is considered the reference standard, but is time-consuming, where methods B + C do not require further manual input. We compared method B and C with method A as reference using Bland–Altman analysis and R ₂* data from a single reader (EMA). Based on this comparison (mean paired difference ($ \bar{d} $) was 0.8 Hz for A–C and 33.6 Hz for A–B), we employed method C (Rician noise bias) for the remaining analyses [22, 23].

R ₂* calculation was thus performed with a monoexponential model (Eq. 1) with a Rician noise factor. In Eq. 1, E _R describes the Rice distribution (Online Resource 1), where σ is a noise parameter and $ S_{0} \times {\text{e}}^{{ - {R_{2}} ^{*} \times {\text{TE}}}} $ reflects the true magnitude value. Data were averaged inside the ROI before data fitting (average-then-fit).

$$ S\left({\text{TE}} \right) = E_{\text{R}} \cdot \left( {S_{0} \cdot e^{{ - {R_{2}}^{*} \cdot {\text{TE}}}} ,\sigma } \right) $$

(1)

The effect of intrahepatic fat on R ₂* was assessed by applying a biexponential model in a subset (n = 10) with definite presence of fat, as identified by the presence of a oscillating signal intensity decay over time. R ₂* values with and without correction were compared using Bland–Altman analysis. The ($ \bar{d} $) was 0.1 Hz—indicating low overall fat content in this cohort—and deemed negligible compared to the subset mean of 70 Hz. Monoexponentially fitted R ₂* values were used for all comparisons.

R ₂

For R ₂ calculation an average-then-fit routine was applied using a biexponential model as shown in Eqs. 2 and 3. In Eq. 2, S _T (TE) is the signal intensity without noise at time TE, S ₀ is the signal intensity at TE = 0, and R ₂ is the relaxation rate. The subscripts a and b indicate fast and slow relaxation components, respectively. For R ₂, Rician noise bias was approximated by the Pythagorean addition of an extra fit parameter, the noise factor ‘ν’ in Eq. 3.

$$ S_{\text{T}} \left( {\text{TE}} \right) = S_{{{\text{0}},a}} \cdot e^{{ - R_{2,a} \cdot {\text{TE}}}} + S_{{{\text{0}},b}} \cdot e^{{ - R_{2,b} \cdot {\text{TE}}}} $$

(2)

$$ S\left( {\text{TE}} \right) = \sqrt {S_{\text{T}} \left( {\text{TE}} \right) + \nu^{2} }. $$

(3)

In the biexponential model, an iron-dense and an iron-sparse component are assumed, with short and long R ₂, respectively. For further comparisons with LIC_SIR and R ₂*, the bulk R ₂ was calculated (Eq. 4) in accordance with the literature [8, 9, 14].

$$ R_{2} = \frac{{S_{{{\text{0}},a}} \cdot R_{2,a} + S_{{{\text{0}},b}} \cdot R_{2,b} }}{{S_{{{\text{0}},a}} + S_{{{\text{0}},b}} }} $$

(4)

Comparison with the literature

The relations between the LIC_SIR, R ₂, and R ₂* were compared to published regression analysis results based on either biopsy-proven LIC (LIC_BIOPSY) [8, 9, 19–21] or LIC_SIR [14].

Statistical analyses

Data are described as number (%) or median (interquartile range, IQR). Results of observers were compared using a Friedman test and Wilcoxon Signed-Rank test as post hoc. Success rates are defined as the number of correctly acquired scans of at least “adequate” quality divided by the total number of measurements. These were compared using a McNemar test. Correlations were assessed with Spearman’s correlation coefficients (r _S), interobserver agreement with two-way random, and absolute intraclass correlation coefficients (ICCs). Both were graded according to Landis et al. [24]. Bland–Altman analysis was performed to compare accuracy between the three MRI methods for a single observer and compare the performance of the three observers [22]. In a separate analysis, the calculated R ₂ and R ₂* values were converted to $ {\text{LIC}}_{R_{2}(\ast)} $ values in μmol/g using the formulas provided by St. Pierre et al. and Garbowski et al. [8, 20] as these were established with image analysis protocols similar to ours.

ROC-analyses were performed for LIC_SIR, R ₂, and serum values with significant correlation with R ₂* to establish their diagnostic accuracy to identify increased R ₂*, i.e., ≥44 Hz [9]. R ₂* was chosen as a reference value as it had the best success rate and shortest acquisition time. The optimal cut-off value for R ₂ was found by optimizing the Youden index, while for LIC_SIR we used the established cut-off value of >36 µmol/g. P values of <0.05 were accepted as statistically significant. Statistical analyses were performed using SPSS Version 22 (IBM Corp, Armonk, NY), MedCalc Statistical Software version 16.2.0 (MedCalc Software bvba, Ostend, Belgium; https://www.medcalc.org; 2016), and GraphPad Prism 5.0 (GraphPad Software, La Jolla, CA).

Results

Patients

Between January 1, 2008 and December 31, 2013, a total of 114 patients (M/F: 74/40) underwent 144 MRI-scans for routine LIC measurement. Patient characteristics and clinical indications for LIC measurement are described in Table 2. Thirty patients had multiple measurements. To prevent a repeated measurements effect on correlation assessment between LIC_SIR, R ₂, and R ₂*, only the 114 baseline measurements were used. SIR, R ₂, and R ₂* data were available for 108/114 (95%), 72/114 (63%), and 113/114 (99%) baseline measurements.

Table 2 Patient characteristics

Full size table

MRI success rates

Five SIR measurements were classified unsuccessful because a surface coil was used, one due to erroneous TR/TE combinations. Furthermore, image quality was inadequate (respiration artifacts) in a single patient (only R ₂ and R ₂* acquired). Hence, SIR was successful in 102/114 (89%), R ₂ in 71/114 (62%), and R ₂* in 112/114 (98%) subjects. The success rate of R ₂ was lower than that of SIR and R ₂* (P < 0.0001, each). Missing datasets were presumed to not have been scanned, with time constraints and respiratory triggering problems as the major cause of the low success rate of the R ₂ measurement. For subsequent analyses, only successful baseline measurements were used.

Interobserver agreement

LIC_SIR and R ₂ values differed between observer 1 and the other observers (Table 3). However, these differences (median values: 80–85 µmol/g and 33–34 Hz for R ₂) would be negligible in clinical practice. This was confirmed by high ICCs for SIR, R ₂, and R ₂* of 0.998, 0.997, and 0.999, respectively. Bland–Altman analysis between pairs of observers showed a single outlier for SIR, while R ₂ and R ₂* showed differences up to 5% for higher values, reflecting the uncertainties in the data fit at very high LIC (Online Resource 1).

Table 3 MRI interobserver agreement: median (IQR) values

Full size table

LIC_SIR, R ₂, and R ₂*

Median (IQR) LIC _SIR , R ₂, and R ₂* (given for observer 1 and LIC_SIR <350 µmol/g) were 84 (30–205), 33 (23–48), and 123 (56–321). LIC_SIR correlated positively with R ₂ and R ₂* with r _S of 0.90 (95% confidence interval (CI) 0.84–0.94, P < 0.0001, n = 57) and 0.98 (95% CI 0.97–0.99, P < 0.0001, n = 87), respectively. R ₂ correlated positively with R ₂*: r _S of 0.95 (95% CI 0.93–0.97, P < 0.0001, n = 71). Figure 2 A, B shows scatter plots of (SIR-based or biopsy-proven) LIC against R ₂ and R ₂*. Solid lines indicate regression analysis results (95% CI bands as dashed lines). In our patient cohort, R ₂ increased linearly with LIC_SIR (Eq. 5), while R ₂* appeared to have a clear non-linear relationship with LIC_SIR, well described by a quadratic polynomial (Eq. 6).

$$ R_{2} = 15.5 + 0.107 \cdot {\text{LIC}}_{\text{SIR}} $$

(5)

$$ {R_{2}}^{*} = 42.7 + 0.142 \cdot {\text{LIC}}_{\text{SIR}} + 4.02 \times 10^{ - 3} \cdot {{\text{LIC}}_{\text{SIR}}}^{2} $$

(6)

The LIC_SIR upper threshold of 350 µmol/g was reached in 15/102 (15%) measurements. In these measurements, only the T1W SIR correlated with R ₂*, with r _S of −0.72 (95% CI −0.9 to −0.31, P = 0.003, n = 15). Figure 3 shows the T1 W SIR against R ₂*, indicating that for LIC_SIR >350 µmol/g, the discriminatory value of the T1W SIR becomes progressively smaller.

Comparison with the literature

Figure 2 A, B also shows published regression lines between either LIC_SIR or LIC_BIOPSY and R ₂ (Fig. 2 A) and R ₂* (Fig. 2 B). Contrary to our finding, these lines indicate a linear increase of R ₂* as LIC increases, and a non-linear increase of R ₂ as LIC increases. To assess whether this is caused by LIC_SIR or by R ₂ or R ₂*, we applied established conversion formulae to convert our R ₂ (Eq. 7) and R ₂* (Eq. 8) values to LIC values [8, 20]. We then compared these LIC_R2* and LIC_R2 values to our LIC_SIR values.

$$ {\text{LIC}}_{{R_{2} }} \, (\upmu {\text{mol/g}}) = 17.91 \cdot \left( {29.75 + \sqrt {\left( {900.7 - 2.283 \cdot R_{2} } \right)} } \right)^{1.424} $$

(7)

$${\text{LIC}}_{{R_2}^{*}} \, (\upmu {\text{mol/g}}) = \frac{0.029 \cdot {R_{2}}^{{*}^{1.014}}}{5.585\cdot 10^{-2}}$$

(8)

These established conversion formulae show a non-linear relation between R ₂ and true LIC (Eq. 7) and linear relation between R ₂* and true LIC (Eq. 8). Hence, the scatter plot between LIC_R2* and LIC_SIR also revealed a quadratic relation, and that between LIC_SIR and LIC_R2 a linear one (data not shown).

Diagnostic accuracies of LIC_SIR, R ₂, and serum values

Serum total iron, transferrin, transferrin-saturation, and ferritin were available for 56, 56, 54, and 96 out of 114 measurements. All four correlated significantly with R ₂*, with best correlation for ferritin at r _S = 0.80 (P < 0.0001, n = 94).

Increased R ₂* (≥44 Hz) was present in 91 subjects. Of the MRI and serum methods, R ₂ and ferritin had best diagnostic accuracies to detect increased R ₂* (Table 4). Figure 4 A–C shows true and false positive and negative results of R ₂ (Fig. 4 A), LIC_SIR (Fig. 4 B), and ferritin (Fig. 4 C) for establishing increased R ₂*.

Table 4 Diagnostic accuracy values to correctly identify increased R ₂* (≥44 Hz)

Full size table

Discussion

This study shows that for routine clinical MRI-based LIC measurements SIR and R ₂* are more often successful than R ₂. Interobserver agreement was near perfect (ICC > 0.9) for all methods. R ₂ and R ₂* methods provided relaxation rates when the SIR-threshold (>350 µmol/g) was already exceeded. This gives them an advantage over SIR in subjects with transfusional hemosiderosis (at least 55% of our population), when LIC values can easily surpass 350 µmol/g. The combination of high success rate, high interobserver agreement, ability to detect changes in LIC over a wide range of LIC values, and single breath-hold acquisition favors the R ₂* method for LIC measurement.

In our study, the relationship between R ₂* and LIC_SIR was quadratic and remained quadratic when R ₂* was expressed as a LIC value using a previously published (biopsy-proven) conversion formula. Other authors report linear relationships. Given the physics of the R ₂*–iron relationship, which is basically linear [25], this discrepancy arises either from our R ₂* acquisition and analysis or from the reference standard. To rule out the former, we compared three fit routines. The exponential + Rician noise factor fit provided identical results in a fraction of the required time to the established and widely applied but labor-intensive method of manual truncation before exponential fitting.

With respect to reference standard, St. Pierre et al. [8], Wood et al. [9], Hankins et al. [19], Garbowski et al. [20], and Anderson et al. [21] all used biopsy-determined LIC_BIOPSY as reference standard, whereas we and Christoforidis et al. [14] used the LIC_SIR according to Gandon. Given the similarity of our MRI protocols, it is unsurprising that Christoforidis’ and our data points show considerable overlap. Arguably, their linear relation between LIC_SIR and R ₂* could also be described by a quadratic polynomial.

Apart from the linear relationship, the other authors report much steeper increase of R ₂* as LIC increases [9, 19–21]. Anderson et al.’s very steep increase could be due a long TE1 of 2.2 ms compared to all other studies (range of TE1: 0.8–0.99 ms) that hampers the ability to accurately estimate high R ₂* values. The fact that the control values of R ₂* in subjects without iron overload in those studies but also in this paper hover around 40 Hz is a further argument that the observed difference in LIC–R ₂* does not arise from the R ₂* acquisition or analysis but from the reference standard.

Hence, the most likely cause of the deviating quadratic relation between R ₂* and estimated LIC is the piecewise sampling of the LIC range with five differently weighted GRE-sequences for LIC_SIR. This has artificially imposed a quadratic behavior on the actually linear relationship between R ₂* and true LIC_BIOPSY. If one looks at the fundamental GRE signal equation (Eq. 9), where PD is proton density and α is flip angle and applies this to the liver-to-muscle signal intensity ratio, the PD and sin(α) terms drop out. By taking the natural logarithm, we find Eqs. 10 and 11. The latter proves that the relationship between R ₂* and SIR is logarithmic. Indeed, plotting Fig. 3 with a log-scale for the signal intensity ratio on the y-axis linearized the line (data not shown).

$$ S\left( {\text{TE}} \right) = \frac{{{\text{PD}} \cdot \sin \left( \alpha \right) \cdot \left( {1 - e^{{ - {\text{TR}}/T_{1} }} } \right)}}{{\left( {1 - \cos \left( \alpha \right) \cdot e^{{ - {\text{TR}}/T_{1} }} } \right)}} \cdot e^{{ - {R_{2}}^{*} \cdot {\text{TE}}}} $$

(9)

$$ \ln \left( {\frac{{S_{\text{LIVER}} }}{{S_{\text{MUSCLE}} }}} \right) = f\left( {{\text{TR}},\alpha ,T_{1} } \right) + {\text{TE}} \cdot \left( {{R_2}^{*}_{{,\,{\text{LIVER}}}} - {{R_2}^{*}_{,\,\text{MUSCLE}}}} \right) $$

(10)

$$ {{R_2}^{*}_{,\,\text{LIVER}}} = \frac{{\ln \left( {\frac{{S_{\text{LIVER}} }}{{S_{\text{MUSCLE}} }}} \right) - f\left( {{\text{TR}},\alpha ,T_{1} } \right)}}{\text{TE}} + {{R_2}^{*}_{,\,\text{MUSCLE}}} $$

(11)

For R ₂, single- and multiecho SE acquisitions are possible: multiecho SE decreases R ₂ due to residual signal of stimulated echoes at a given TE. Single-echo SE increases R ₂ because long TEs cause increased sensitivity to diffusion, hence increased signal loss at a given TE. Reported single-echo SE R ₂ values [8, 9] were concordantly higher for the same estimated LIC compared to multiecho SE results as in this study and in [14]. In terms of R ₂ data fitting, we as many others applied a biexponential model and we did not assess non-exponential decay models as for instance proposed by Jensen et al. [26].

The main limitation of our study is the lack of biopsy confirmation. In our center, liver biopsy for iron determination is seldom performed. Both the national, European and American guidelines recommend reluctance in performing biopsy and underline the high sensitivity of MRI [15, 27, 28]. Moreover, differing processing steps to obtain LIC_BIOPSY are reported, compromising generalizability. In Gandon’s method, paraffin-embedded liver biopsy specimens are dewaxed using a protocol with a triple xylene wash to remove lipid solids from the sample. This approach was shown to have an elevating effect on the dry weight liver iron calculation compared to processing fresh tissue samples [29]. Another limitation is the fact that we did not perform multipeak fat-correction on complex data [10]. This was not feasible with only magnitude data available. Comparison to other literature is further hampered by the use of different image acquisition and postprocessing protocols which directly influence the calibration curves between the reference standard and the index test. We have opted to compare our findings to calibration curves obtained with similar postprocessing protocols.

ROC-analyses showed that R ₂ and ferritin have the highest diagnostic accuracy to identify increased R ₂* (≥44 Hz). Both ferritin (≥524 µg/L) and R ₂ (≥18.3 Hz) had positive predictive values of 100%, but the wide distribution of ferritin levels for R ₂* ≥ 44 Hz indicates that it cannot be used confidently to follow-up treatment nor accurately determine the LIC. In contrast, R ₂ shows a different picture with a close distribution around the regression line. In addition, ferritin lacks the spatial information that MRI provides, allowing segmental LIC measurement and follow-up.

R ₂ datasets were missing (i.e., not scanned) in 42/114 (37%) subjects. As R ₂ is part of our routine scan protocol, this illustrates that the long and artifact-prone R ₂ series is skipped first by the radiographer. This makes the R ₂ series less suited as first choice for LIC measurement.

Our results favor the use of R ₂* measurements for daily clinical practice with the use of an exponential + Rician noise fit method to save time in analysis. The recommendation to (only) use R ₂* comes with cautions. It requires careful consideration of scan parameters which should be kept equal for all measurements. Ideally, routine quality control with phantom testing should be performed.

In conclusion, as R ₂* can be obtained in a single breath-hold with excellent success rates, high interobserver agreement, and ability to detect changes over a wide range of LIC values and is available from all major vendors without additional per-scan costs, it is our first choice for LIC measurement.

Abbreviations

LIC:: Liver iron content

References

Tavill AS, AASLD, ACG (2001) Diagnosis and management of 2 hemochromatosis. Hepatology 33:1321–1328. doi:10.1053/jhep.2001.24783
Article CAS PubMed Google Scholar
Pietrangelo A (2003) Haemochromatosis. Gut 52(Suppl 2):ii23–ii30. doi:10.1136/gut.52.suppl_2.ii23
CAS PubMed PubMed Central Google Scholar
Queiroz-Andrade M, Blasbalg R, Ortega CD, et al. (2009) MR imaging findings of iron overload. Radiographics 29:1575–1589. doi:10.1148/rg.296095511
Article PubMed Google Scholar
Bravo AA, Sheth SG, Chopra S (2001) Liver biopsy. N Engl J Med 344:495–500. doi:10.1056/NEJM200102153440706
Article CAS PubMed Google Scholar
Sirlin CB, Reeder SB (2010) Magnetic resonance imaging quantification of liver iron. Magn Reson Imaging Clin N Am 18:359–381. doi:10.1016/j.mric.2010.08.014
Article PubMed PubMed Central Google Scholar
Gandon Y, Olivie D, Guyader D, et al. (2004) Non-invasive assessment of hepatic iron stores by MRI. Lancet 363:357–362. doi:10.1016/S0140-6736(04)15436-6
Article CAS PubMed Google Scholar
Gandon Y. Rennes—hemochromatosis. Y Gandon, Rennes. 10-06-2001. http://www.radio.univ-rennes1.fr/Sources/EN/Hemo.html. Accessed October 16, 2015.
St Pierre TG, Clark PR, Chua-anusorn W, et al. (2005) Noninvasive measurement and imaging of liver iron concentrations using proton magnetic resonance. Blood 105:855–861. doi:10.1182/blood-2004-01-0177
Article CAS PubMed Google Scholar
Wood JC, Enriquez C, Ghugre N, et al. (2005) MRI R₂ and R₂* mapping accurately estimates hepatic iron concentration in transfusion-dependent thalassemia and sickle cell disease patients. Blood 106:1460–1465. doi:10.1182/blood-2004-10-3982
Article CAS PubMed PubMed Central Google Scholar
Hernando D, Kramer JH, Reeder SB (2013) Multipeak fat-corrected complex R₂* relaxometry: theory, optimization, and clinical validation. Magn Reson Med 70:1319–1331. doi:10.1002/mrm.24593
Article PubMed PubMed Central Google Scholar
Krafft AJ, Loeffler RB, Song R, et al. (2015) Does fat suppression via chemically selective saturation affect R₂*-MRI for transfusional iron overload assessment? A clinical evaluation at 1.5T and 3T. Magn Reson Med. doi:10.1002/mrm.25868
PubMed Google Scholar
Yokoo T, Yuan Q, Senegas J, Wiethoff AJ, Pedrosa I (2015) Quantitative R₂* MRI of the liver with rician noise models for evaluation of hepatic iron overload: Simulation, phantom, and early clinical experience. J Magn Reson Imaging 42:1544–1559. doi:10.1002/jmri.24948
Article PubMed Google Scholar
Ibrahim EH, Khalifa AM, Eldaly AK (2016) MRI T₂* imaging for assessment of liver iron overload: study of different data analysis approaches. Acta Radiol. doi:10.1177/0284185116628337
PubMed Google Scholar
Christoforidis A, Perifanis V, Spanos G, et al. (2009) MRI assessment of liver iron content in thalassamic patients with three different protocols: comparisons and correlations. Eur J Haematol 82:388–392. doi:10.1111/j.1600-0609.2009.01223.x
Article PubMed Google Scholar
Swinkels DW, van Bokhoven MA, Castel A, et al. (2007) Richtlijn Diagnostiek en behandeling van hereditaire hemochromatose. Utrecht: Nederlandse Internisten Vereeniging en Nederlandse Vereniging voor Klinische Chemie
Google Scholar
Meloni A, Luciani A, Positano V, et al. (2011) Single region of interest versus multislice T₂* MRI approach for the quantification of hepatic iron overload. J Magn Reson Imaging 33(2):348–355. doi:10.1002/jmri.22417
Article PubMed Google Scholar
Gandon Y. Gandon calculations. Y Gandon, Rennes. 10-06-2001. http://www.radio.univ-rennes1.fr/Images/Externe15.js. Accessed October 16, 2015
Gudbjartsson H, Patz S (1995) The Rician distribution of noisy MRI data. Magn Reson Med 34:910–914
Article CAS PubMed PubMed Central Google Scholar
Hankins JS, McCarville MB, Loeffler RB, et al. (2009) R₂* magnetic resonance imaging of the liver in patients with iron overload. Blood 113:4853–4855. doi:10.1182/blood-2008-12-191643
Article CAS PubMed PubMed Central Google Scholar
Garbowski MW, Carpenter JP, Smith G, et al. (2014) Biopsy-based calibration of T₂* magnetic resonance for estimation of liver iron concentration and comparison with R₂ Ferriscan. J Cardiovasc Magn Reson. doi:10.1186/1532-429X-16-40
PubMed PubMed Central Google Scholar
Anderson LJ, Holden S, Davis B, et al. (2001) Cardiovascular T₂-star (T₂*) magnetic resonance for the early diagnosis of myocardial iron overload. Eur Heart J 22:2171–2179. doi:10.1053/euhj.2001.2822
Article CAS PubMed Google Scholar
Bland JM, Altman DG (1986) Statistical methods for assessing agreement between two methods of clinical measurement. Lancet 1:307–310. doi:10.1016/S0140-6736(86)90837-8
Article CAS PubMed Google Scholar
Akkerman EM, Runge JH, Troelstra MA, Nederveen AJ, Stoker J (2015) Non-linear relationship between estimated liver iron concentration and R₂*. ISMRM 3268
Landis JR, Koch GG (1977) The measurement of observer agreement for categorical data. Biometrics 33:159–174
Article CAS PubMed Google Scholar
Ghugre NR, Wood JC (2011) Relaxivity-iron calibration in hepatic iron overload: probing underlying biophysical mechanisms using a Monte Carlo model. Magn Reson Med 65:837–847. doi:10.1002/mrm.22657
Article PubMed Google Scholar
Jensen JH, Chandra R (2002) Theory of nonexponential NMR signal decay in liver with iron overload or superparamagnetic iron oxide particles. Magn Reson Med 47:1131–1138. doi:10.1002/mrm.10170
Article CAS PubMed Google Scholar
European Association for the Study of the Liver (2010) EASL clinical practice guidelines for HFE hemochromatosis. J Hepatol 53:3–22. doi:10.1016/j.jhep.2010.03.001
Article Google Scholar
Bacon BR, Adams PC, Kowdley KV, Powell LW, Tavill AS (2011) Diagnosis and management of hemochromatosis: 2011 practice guideline by the American Association for the Study of liver diseases. Hepatology 54:328–343. doi:10.1002/hep.24330
Article PubMed PubMed Central Google Scholar
Butensky E, Fischer R, Hudes M, et al. (2005) Variability in hepatic iron concentration in percutaneous needle biopsy specimens from patients with transfusional hemosiderosis. Am J Clin Pathol 123:146–152. doi:10.1309/PUUXEGXDLH26NXA2
Article PubMed Google Scholar

Download references

Acknowledgments

The authors would like to acknowledge Paul F. Groot for anonymizing the data and Shandra Bipat for providing advice on statistical analyses.

Author information

Authors and Affiliations

Department of Radiology, Academic Medical Center, University of Amsterdam, Meibergdreef 9, 1105AZ, Amsterdam, The Netherlands
Jurgen H. Runge, Erik M. Akkerman, Marian A. Troelstra, Aart J. Nederveen & Jaap Stoker
Department of Gastroenterology & Hepatology, Academic Medical Center, University of Amsterdam, Meibergdreef 9, 1105AZ, Amsterdam, The Netherlands
Ulrich Beuers

Authors

Jurgen H. Runge
View author publications
You can also search for this author in PubMed Google Scholar
Erik M. Akkerman
View author publications
You can also search for this author in PubMed Google Scholar
Marian A. Troelstra
View author publications
You can also search for this author in PubMed Google Scholar
Aart J. Nederveen
View author publications
You can also search for this author in PubMed Google Scholar
Ulrich Beuers
View author publications
You can also search for this author in PubMed Google Scholar
Jaap Stoker
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jurgen H. Runge.

Ethics declarations

Conflict of Interest

The authors declare that they have no conflict of interest.

Ethical standard/informed consent

This was a retrospective study using data obtained in routine clinical practice that were anonymized before analysis. In light of the respective nature of the study, the obligation to obtain informed consent was waived by the Medical Ethical Committee of the AMC Amsterdam.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (PDF 311 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Runge, J.H., Akkerman, E.M., Troelstra, M.A. et al. Comparison of clinical MRI liver iron content measurements using signal intensity ratios, R ₂ and R ₂*. Abdom Radiol 41, 2123–2131 (2016). https://doi.org/10.1007/s00261-016-0831-7

Download citation

Published: 18 July 2016
Issue Date: November 2016
DOI: https://doi.org/10.1007/s00261-016-0831-7

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Comparison of clinical MRI liver iron content measurements using signal intensity ratios, R 2 and R 2*

Abstract

Purpose

Methods

Results

Conclusions

Similar content being viewed by others

Non-invasive measurement of liver iron concentration using 3-Tesla magnetic resonance imaging: validation against biopsy

Comparison of Inline R2* MRI versus FerriScan for liver iron quantification in patients on chelation therapy for iron overload: preliminary results

Assessment of liver iron overload by 3 T MRI

Materials and methods

Ethical

Patients

MRI

Data analyses

ROI-placement

LICSIR

R 2*

R 2

Comparison with the literature

Statistical analyses

Results

Patients

MRI success rates

Interobserver agreement

LICSIR, R 2, and R 2*

Comparison with the literature

Diagnostic accuracies of LICSIR, R 2, and serum values

Discussion

Abbreviations

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of Interest

Ethical standard/informed consent

Electronic supplementary material

Supplementary material 1 (PDF 311 kb)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation

Comparison of clinical MRI liver iron content measurements using signal intensity ratios, R ₂ and R ₂*

LIC_SIR

R ₂*

R ₂

LIC_SIR, R ₂, and R ₂*

Diagnostic accuracies of LIC_SIR, R ₂, and serum values