Introduction

Autoimmune liver diseases (AILD) include three, sometimes overlapping, conditions: autoimmune hepatitis (AIH), primary sclerosing cholangitis (PSC), and autoimmune sclerosing cholangitis (ASC) [1,2,3]. These diseases can have a significant impact on liver function and health, often causing progressive fibrosis and, in some cases, cirrhosis and end-stage liver disease [3, 4]. More accurate diagnosis and monitoring could potentially allow more precise patient management and improve outcomes [1, 4, 5].

There is increasing evidence that quantitative MRI techniques, such as MR elastography (MRE) and T1 mapping, can be used to non-invasively measure the severity of chronic liver disease and predict liver histologic fibrosis stage [6,7,8]. Liver stiffness measured from MRE has been shown to correlate with biliary stricture severity in adult patients with PSC [8, 9]. Additionally, MRE-derived liver stiffness has been shown to positively correlate with advanced stages of fibrosis [10, 11]. T1 relaxation times have also been shown to correlate well with histologic markers of pathology and to increase with advanced liver fibrosis and inflammation [7, 12,13,14]. These studies are mainly in adult populations, with only limited data available in pediatric populations [2, 15, 16].

Magnetic resonance cholangiopancreatography (MRCP) has been shown to sufficiently diagnose PSC in children, with sensitivities of 81–84% and specificity up to 100% [17, 18]. Recent evidence has also shown that quantitative metrics obtained from conventional anatomic 3D MRCP images of the biliary tree can be used to assess patients with AILD. Gilligan et al. [19] showed that quantitative analysis of 3D MRCP data can discriminate between pediatric patients with AIH and PSC/ASC and that quantitative MRCP metrics (e.g., number and total length of bile duct strictures/dilations) may serve as non-invasive biomarkers of AILD.

The objective of this study was to investigate relationships between quantitative MRI metrics of liver disease including (1) MRE-derived liver stiffness, (2) T1 relaxation measurements, and (3) quantitative 3D MRCP measurements, and clinical and laboratory data in a pediatric and young adult AILD cohort.

Materials and methods

This single-center, prospective, cross-sectional, Health Insurance Portability and Accountability Act (HIPAA)-compliant pilot study was approved by the institutional review board at Cincinnati Children's Hospital Medical Center (CCHMC).

Patients up to 25 years of age with AILD (including AIH, PSC, and ASC) that were enrolled into an institutional AILD registry were included in this study. A group of pediatric hepatologists at CCHMC assigned the registry participants with a diagnosis of AIH, PSC, or ASC according to clinical guidelines [3, 19,20,21]. Written informed consent, and assent as appropriate, were obtained. Patients with any other form of liver disease were excluded from the registry.

MRI acquisition protocol

All registry participants underwent research MRI examinations of the liver at the time of registry enrollment. Subsequent research MRI examinations using the same protocol and scanner were performed 12 and 24 months later. The current study includes baseline MRI examinations acquired on a 1.5 T scanner (Ingenia; Philips Healthcare, Best, The Netherlands) using a 16-channel phased-array anterior (torso) surface coil as well as scanner table built-in spine coils (12-channels). The imaging protocol and parameters for the broader registry study have been previously published [2, 19]. Pertinent quantitative sequences for the current study include the following:

  1. (1)

    Three-dimensional (3D) fast spin-echo (FSE) MRCP using respiratory triggering; the belt used to detect breathing was placed over the upper abdomen.

  2. (2)

    Iron (T2*)-corrected T1 mapping (cT1) of the liver using a modified Look Locker (MOLLI) pulse sequence approach and complex chemical shift MRI [6].

  3. (3)

    Two-dimensional (2D) gradient recalled echo (GRE) MRE of the liver (active driver frequency = 60 Hz), with the passive driver placed over the right upper quadrant of the abdomen and secured in place with a Velcro strap.

Clinical and laboratory data collection

Pertinent demographic and clinical data were recorded from the AILD registry. The following clinical data were recorded: patient age, sex, clinical diagnosis (AIH, PSC, or ASC), time between diagnosis and research MRI (in months), and presence of inflammatory bowel disease (IBD). The following laboratory measurements also were recorded from the time of the research MRI: total bilirubin, alanine aminotransferase (ALT), aspartate aminotransferase (AST), gamma-glutamyl transferase (GGT), platelets, fibrosis-4 (FIB4) score, AST to platelet ratio index (APRI), alkaline phosphatase (ALP), and matrix metalloproteinase 7 (MMP7), a novel marker of bile duct injury and liver fibrosis in AILD [22]. Methods for calculating FIB4 score and APRI have been previously published [23].

MR image post-processing

Perspectum Ltd. (Oxford, United Kingdom) provided image post-processing tools and analyses for quantitative 3D MRCP metrics and iron-corrected T1 (cT1) relaxation measurements at no cost through a formal research agreement. Their analysts were blinded to all patient information, including clinical diagnosis as well as laboratory, clinical, and other imaging data.

3D MRCP

3D MRCP images were processed by a single operator at Perspectum using the MRCP + tool. This operator has more than 5 years of experience using MRCP + . To achieve this, bile ducts were identified through a series of signal enhancement and thresholding steps to detect tubular structures and generate a 3D model of the biliary tree [19]. The following metrics were then quantified: biliary tree volume; maximum and median diameters of the common bile duct, left hepatic bile duct, and right hepatic bile duct; total number of ducts in the 3D biliary tree model, defined as the total number of branches in the modeled tree; total length of ducts; total number of strictures, defined as local minima that were more than 30% narrower than neighboring maxima, total length of duct strictures; total number of dilations, defined as local maxima that were more than 30% wider than neighboring minima; and total length of duct dilations [19]. Diameters and lengths were reported in millimeters (mm).

cT1

Region-of-interest (ROI) and whole liver cT1 values were provided by Perspectum using LiverMultiScan [6, 24, 25], with values derived by a single observer with more than 5 years of experience using LiverMultiScan. For ROI measurements, three equally sized circular ROIs, with a diameter of 15 mm, were drawn in the right hepatic lobe of the liver on one representative axial slice, while avoiding major blood vessels and bile ducts. Voxel-by-voxel cT1 values were also calculated for the whole liver on four representative axial slices. The mean cT1 for the three circular ROIs and the mean, median, and interquartile range (IQR) of whole liver cT1 measurements across the four axial slices were calculated.

2D GRE MRE

MREplus + prototype software (Resoundant Inc., Rochester, Minnesota) was utilized to measure liver stiffness values from MRE data. Under the supervision of an image analyst in the Department of Radiology who was blinded to clinical data and other imaging data, ROIs were automatically placed on elastograms with 90% confidence threshold masks across four representative axial slices. The weighted mean of the mean liver shear stiffness values (in kPa) across the four axial slices was calculated for each patient.

Representative images of the quantitative MRI sequences assessed in this study are provided in Fig. 1.

Fig. 1
figure 1

13-year-old male with autoimmune sclerosing cholangitis. a Maximum intensity projection 3D MRCP image. b Corresponding 3D biliary tree model extracted from 3D MRCP image using MRCP + . c MR elastogram of the liver (units of kPa). d Iron-corrected T1 (cT1) map of the liver (units of ms)

Statistical analysis

The patient cohort was divided into two groups, (1) AIH and (2) PSC/ASC. Patients with PSC/ASC were grouped into one cohort due to similarities in MRCP findings and clinical outcomes [3, 19]. Continuous variables were summarized as median, first quartile, and third quartile values. Variables that were not normally distributed were log transformed using base10 for analyses (Supplementary Table 1). Group differences in age, MRE liver stiffness, cT1, quantitative MRCP metrics, and laboratory values between the AIH and PSC/ASC groups were examined using the Mann–Whitney U test. Categorical variables were presented as frequency counts and percentages; group differences were determined using Chi-square/Fisher’s exact test, as applicable.

Five variables were considered primary outcome variables: liver stiffness, ROI-based mean cT1, whole liver mean cT1, whole liver median cT1, and whole liver cT1 IQR. The following variables were considered potential predictor variables: demographic data, quantitative MRCP metrics, and serum laboratory values. Univariate associations between outcomes and predictor variables for the entire study cohort and for the AIH and PSC/ASC groups individually were assessed using Pearson correlation coefficients (r). Relationships between three individual outcome variables (liver stiffness, whole liver mean cT1, and whole liver cT1 IQR) and potential predictor variables were further examined using multiple linear regression models, adjusting for patient age, sex, the presence of IBD, specific diagnosis (AIH vs. PSC/ASC), and time from diagnosis to research MRI examination. Stratified multiple linear regression models by diagnosis (AIH vs PSC/ASC) were also run, while also adjusting for patient age, sex, presence of IBD, and time from diagnosis to research MRI examination.

p values were not adjusted for multiple comparisons as this was an exploratory assessment. p less than 0.05 was considered statistically significant. All statistical testing was carried out on SAS version 9.4 (SAS Institute, Cary, North Carolina).

Results

Seventy-three patients were eligible for inclusion in the current study. Fifteen patients were excluded for reasons detailed in Fig. 2. Of the remaining 58 patients, 12 had poor quality baseline MRI examinations (i.e., did not allow quantitative 3D MRCP processing due to artifacts), which were replaced with the next available research MRI examination at follow-up year 1 or year 2. Substitution of examinations was deemed acceptable because this is a cross-sectional study and includes patients at varying timepoints in their disease course at registry enrollment, ranging from newly diagnosed to end-stage liver disease.

Fig. 2
figure 2

Patient/MRI examination selection flow diagram. MRI magnetic resonance imaging, AIH autoimmune hepatitis, ASC autoimmune sclerosing cholangitis, PSC primary sclerosing cholangitis

Detailed patient demographic information and laboratory value summary statistics are given in Table 1. The median age of the AIH group (n = 30) was 16.0 years, and the median age of the PSC/ASC group (n = 28) was 15.5 years; cohort age was not statistically significantly different (p = 0.56). There were no differences in laboratory values between cohorts, with the exception of platelet count (p = 0.01) and MMP7 levels (p = 0.04).

Table 1 Demographic and laboratory value summary statistics for the study cohort

Summary statistics for liver stiffness, cT1 measurements, and quantitative MRCP metrics are provided in Table 2. There were no significant differences in liver stiffness or cT1 values between the AIH and PSC/ASC cohorts. All quantitative MRCP metrics, except median and maximum right hepatic bile duct diameters, were statistically significantly different between the two cohorts.

Table 2 MRI (liver stiffness, cT1, and quantitative MRCP) summary statistics for the overall study cohort (n = 58), subdivided by specific diagnosis

Univariate correlations

Univariate relationships between outcome variables and predictor variables for the entire study cohort are presented in Table 3. Results for univariate correlation analyses for the AIH and PSC/ASC groups are provided in Supplementary Tables 2 and 3.

Table 3 Univariate correlations between liver stiffness and cT1 measurements and predictor variables in overall study cohort (AIH, PSC, and ASC) (n = 58)

Associations between outcome variables and quantitative MRCP metrics

For the overall cohort, positive correlations were detected between ROI-based mean cT1, whole liver mean cT1, whole liver median cT1, and liver stiffness and right hepatic duct diameters (r = 0.30–0.35; p values 0.008–0.02). Whole liver mean cT1, whole liver cT1 IQR, and liver stiffness positively correlated with left hepatic duct diameters (r = 0.27–0.38; p values 0.004–0.04). There were no other significant correlations between cT1 or liver stiffness measurements and quantitative MRCP metrics.

In the subgroup analysis for patients with AIH, there were positive correlations between all five primary outcome variables and right hepatic bile duct diameters (r = 0.45–0.58; p values 0.0009–0.01). There also were positive correlations between all five primary outcome variables and left hepatic bile duct diameters (r = 0.41–0.69; p values < 0.0001–0.02). Whole liver cT1 IQR positively correlated with biliary tree volume, total number and length of strictures, and total number of dilations (r = 0.37–0.46; p values 0.01–0.04). For the PSC/ASC cohort, none of our primary outcome variables correlated with quantitative 3D MRCP measurements (Supplementary Table 3).

Associations between outcome variables and clinical metrics

For the overall cohort, whole liver mean cT1 (r = 0.33; p = 0.01) and cT1 IQR (r = 0.35; p = 0.008) positively correlated with age. There were multiple significant associations between cT1 or liver stiffness measurements and laboratory values. Liver stiffness negatively correlated with platelet count (r = − 0.34; p = 0.01) and positively correlated with all other laboratory values (r = 0.33–0.65; p values < 0.0001–0.02). Whole liver cT1 IQR was associated with all laboratory variables except alkaline phosphatase and MMP7 (r = 0.26–0.66; p values < 0.0001–0.04); whole liver cT1 IQR negatively correlated with platelet count (r = − 0.50; p < 0.0001). ROI-based mean cT1, whole liver mean cT1, and whole liver median cT1 were associated with APRI and FIB4 scores (r = 0.27–0.44; p values 0.0005–0.04). Figure 3 displays the relationships between log transformed FIB4 score and cT1 IQR and log transformed APRI score and liver stiffness.

Fig. 3
figure 3

Relationship between outcome variables and two predictor variables. Linear regression lines with 95% confidence intervals shown in blue for AIH group and red for PSC/ASC group. a Relationship between log transformed fibrosis-4 score and whole liver cT1 IQR shown. Correlation coefficient for entire cohort was 0.66 (p < 0.0001). Correlation coefficient for the AIH group was 0.63 (p = 0.0002), while correlation coefficient for the PSC/ASC group was 0.75 (p < 0.0001). b Relationship between log transformed APRI score and MRE liver stiffness. Correlation coefficient for entire cohort was 0.65 (p < 0.0001). Correlation coefficient for the AIH group was 0.65 (p = 0.0001), while correlation coefficient for the PSC/ASC group was 0.74 (p < 0.0001). cT1 iron (T2*)-corrected T1, IQR interquartile range, APRI AST to platelet ratio index, AIH autoimmune hepatitis, PSC primary sclerosing cholangitis, ASC autoimmune sclerosing cholangitis

For the AIH subgroup, ROI mean cT1, and whole liver mean and median cT1 positively correlated with total bilirubin, APRI, and FIB4 scores (r = 0.39–0.64, p values 0.0001–0.04) and with AST and MMP7 (r = 0.39–0.81, p values 0.001–0.03). Whole liver cT1 IQR positively correlated with age (r = 0.48, p = 0.007). For the PSC/ASC subgroup, whole liver cT1 IQR and liver stiffness positively correlated with ALT, AST, APRI, and FIB4 scores (r = 0.46–0.80, p values < 0.0001–0.01). Whole liver cT1 IQR correlated with total bilirubin and platelet count (r = 0.40 and − 0.53, p = 0.04 and 0.004, respectively). Liver stiffness also positively correlated with GGT (r = 0.69, p < 0.0001).

Multivariable linear regression

After statistical adjustments for patient age, sex, presence of IBD, specific diagnosis (AIH vs. PSC/ASC), and time from diagnosis to MRI examination, only left hepatic bile duct maximum diameter was statistically significantly associated with whole liver mean cT1 (p = 0.01). Eight predictor variables, including seven laboratory values and left hepatic bile duct maximum diameter, were significantly associated with whole liver cT1 IQR (p < 0.0001–0.04). Eleven predictor variables, including eight laboratory values and three quantitative MRCP metrics, were significantly associated with liver stiffness (p < 0.0001–0.04) (Table 4).

Table 4 Multivariable linear regression results across the entire study cohort, associating whole liver mean cT1, whole liver cT1 IQR, and MRE liver stiffness with each independent predictor variablea

Results from stratified [by diagnosis (AIH and PSC/ASC)] linear regression analyses are presented in Supplementary Table 4. Within the AIH group, four quantitative MRCP metrics (p values 0.0003–0.001) and five laboratory values (p values 0.0001–0.02) were associated with whole liver mean cT1. Five quantitative MRCP metrics (p values < 0.0001–0.04) and five laboratory values (p values 0.0002–0.02) as well as age (p = 0.01) were significantly associated with whole liver cT1 IQR. Four quantitative MRCP metrics (p = 0.0003–0.02) and seven laboratory values (p < 0.0001–0.047) were significantly associated with MRE liver stiffness. For the PSC/ASC group, no predictor variables were significantly associated with whole liver mean cT1. Five laboratory values (p < 0.0001–0.01) were significantly associated with whole liver cT1 IQR, while six laboratory values (p < 0.0001–0.001) were significantly associated with MRE liver stiffness. No MRCP metrics were significantly associated with cT1 IQR or MRE liver stiffness.

Discussion

In a cohort of children and young adults with AILD, we have shown multiple significant associations between quantitative MRI markers (cT1, cT1 IQR, liver stiffness and quantitative MRCP) and between these markers and clinical variables. This suggests that each of the MRI markers and clinical markers under study are, to some degree, responsive to changes of inflammation and/or fibrosis in patients with AILD. Further, it is worth noting that there was no significant correlation between measures of cT1 or liver stiffness and time from AILD diagnosis to MRI examination. This suggests that changes in cT1 measurements and liver stiffness over time are not uniform among AILD patients, and instead may be individually specific.

Quantitative MRCP metrics, inclusive of duct diameters and narrowings, would be expected to be reflective of biliary involvement by disease. Our results support this, with significant differences in quantitative MRCP metrics between the AIH and PSC/ASC subgroup. However, for the overall AILD cohort, only right and left hepatic duct diameters, and none of the other metrics, were significantly correlated with cT1 and liver stiffness, measures that may be more indicative of parenchymal disease. These results contradict prior literature demonstrating associations between liver stiffness and biliary abnormalities measured on MRCP by human observers in the setting of PSC. Tafur et al. [8] demonstrated a positive correlation between liver stiffness measurements and intrahepatic bile duct stricture severity in adult patients with PSC. Bookwalter et al. [9] found a positive correlation between liver stiffness and the presence of a common bile duct stricture and the number of segmental duct strictures in the biliary tree, also in adult patients with PSC. Interestingly, our study failed to demonstrate any significant associations between liver stiffness and quantitative MRCP metrics related to numbers and lengths of strictures or biliary tree volume (a putative measure of upstream duct dilation).

Interestingly, in the AIH subgroup, univariate and multivariable analyses demonstrated multiple quantitative MRCP metrics (e.g., biliary tree volume, number of strictures, and length of strictures) to be significantly associated with cT1 measurements (including cT1 IQR) as well as liver stiffness. These results could reflect mild irregularities in biliary tree structure due to periductal fibrosis in patients with AIH, as reported previously in an adult cohort by Lewin et al. [26]. The absence of such associations on the univariate and multivariable levels in PSC/ASC patients may indicate that large duct injury is not a key driver of fibrosis in PSC/ASC or that large duct changes occur early in the disease before significant changes in cT1 or stiffness occur.

In the overall AILD cohort, measures of cT1, cT1 IQR, and liver stiffness all showed positive associations with numerous laboratory markers of liver disease as well as clinical predictive scores of liver fibrosis (APRI, FIB4). Relationships with these predictive scores were of relatively higher strength for cT1 IQR and liver stiffness than with absolute cT1 measurements. This was similarly the case in the PSC/ASC subgroup where correlations between cT1 IQR and MRE liver stiffness and clinical predictive scores of liver fibrosis were r > 0.7. These associations between clinical predictive scores of fibrosis and liver stiffness are not unexpected given prior studies in adults with PSC [27, 28]. However, our findings identify a similar relationship with cT1 IQR, a measure of liver T1 relaxation heterogeneity, suggesting this may be an additional biomarker of liver fibrosis. This potential is further supported by our multivariable analyses which showed cT1 IQR to be less associated with circulating biomarkers of liver inflammation (e.g., AST and ALT) and cholestasis (e.g., GGT and ALP) than liver stiffness is, suggesting cT1 IQR may be less confounded by inflammation and cholestasis as a marker of liver fibrosis.

Our results add to the limited literature evaluating associations between liver T1 relaxation measurements (including cT1) and other markers of chronic liver disease. Previously, Banerjee et al. [6] found a strong positive association between cT1 and increasing fibrosis stage. Hoffman et al. [29] using conventional T1 mapping without T2* correction demonstrated an area under the ROC curve for differentiating early (F0-F2) from advanced (F3-F4) fibrosis of 0.67 for reader 1 and 0.64 for reader 2. Alongside cT1, cT1 IQR has previously been shown to diagnose histologically confirmed disease and decrease following standard of care treatment in pediatric AIH patients [15]. Interestingly, our study suggests that cT1 IQR, as opposed to cT1, may be a better performing biomarker of AILD, although further investigations are needed.

Our study has limitations. First, it is cross-sectional in design, including 58 subjects with AILD. As a result, small but significant correlations could have gone undetected. Such small correlations are unlikely to be clinically useful, however. Second, 25 of the 73 (~ 34%) available baseline MRI examinations could not undergo quantitative MRCP post-processing due to image quality issues, most often motion artifacts. Utilization of faster MRCP methods, such as 3D FSE sequences with compressed sensing, that shorten the acquisition time may reduce the number of datasets that are rejected for quantitative MRCP post-processing. Third, the cT1 metrics were not corrected for fat. The MOLLI sequence used is sensitive to fat as well as to the other features of liver disease such as inflammation and fibrosis. However, cT1 is not routinely corrected for fat because in patients with low liver fat, the signal is dominated by the free-water signal that is generated by the biological processes of inflammation and fibrosis. Thus, there is no need for such fat correction in order to interpret the change in the cT1 signal in response to biology in these populations [30]. Finally, quantitative MRCP metrics and laboratory values were correlated with imaging markers of chronic liver disease and histopathological correlations were not evaluated. Correlating these predictor variables as well as our primary outcomes (particularly liver stiffness and cT1 IQR) with meaningful clinical outcomes would further help establish the value of these various imaging techniques.

In conclusion, we have demonstrated significant associations among automated quantitative MRCP metrics, MRE-derived liver stiffness, T1 relaxation (cT1) measurements, and liver-related laboratory measurements in children and young adults with AILD. Quantitative MRCP metrics related to the diameters of the right and left hepatic bile ducts were significantly associated with measures of cT1 and liver stiffness, especially in AIH patients. Both cT1 IQR and liver stiffness are highly associated with circulating laboratory biomarkers of liver disease, and in particular clinical fibrosis scores. Our results also suggest that T1 relaxation heterogeneity (cT1 IQR) may be a novel marker of liver fibrosis which may be less impacted by cholestasis and inflammation than MR elastography.