Introduction

Gadoxetic acid (Gd-EOB-DTPA) enhanced magnetic resonance imaging (MRI) is increasingly applied for several indications, including detection of premalignant and secondary liver lesions, characterization of liver lesions, and assessment of liver (dys)function [1, 2]. Clinically relevant primary liver lesions include focal nodular hyperplasia (FNH), hepatocellular adenoma (HCA), and hepatocellular carcinoma (HCC) [3].

FNH mostly occurs in female patients between 35 years and 50 years, has no risk of malignant transformation or bleeding, and does not require follow-up [4]. HCA also mostly occurs in young women. HCA does have a risk of malignant transformation (± 5%) and bleeding (15–20%). Therefore, HCA may require further follow-up or treatment, depending on their molecular subtype, size, location, and patient’s sex [4,5,6]. HCC is the malignant counterpart of HCA, mostly seen in men between 70 years and 79 years. Treatment is recommended according to the European Association for the Study of the Liver (EASL) guidelines [7].

Adequate differentiation between FNH and HCA/HCC has clinical consequences. Overdiagnosis of FNH as HCA/HCC can lead to unnecessary diagnostic procedures (including biopsy) and/or treatment with potential complications. Conversely, underdiagnosing HCA/HCC as FNH can result in inadequate treatment and thus to malignant transformation and/or metastasis. Clinical characteristics such as sex and underlying liver disease are helpful but do not fully discriminate, and there are no biomarkers to differentiate FNH and HCA/HCC.

According to current EASL guidelines, hyper- or isointensity in the hepatobiliary phase (HBP) of Gd-EOB-DTPA enhanced MRI has high sensitivity and specificity for FNH, whereas HCA/HCC is generally characterized by hypointensity in the HBP [4, 8, 9]. However, more than 80% of B-catenin-activated HCA, the molecular subtype with the highest risk of malignant transformation, are hyper- or isointense in the HBP [1, 10,11,12,13]. Hyper- or isointensity is also seen in approximately 10% of HCC [1, 10, 14,15,16].

Quantitative analysis of the extent of hyper- or isointensity has been suggested to improve the differentiation between lesions hyper- or isointense in the HBP [17,18,19]. However, quantitative HBP analysis may be hampered by heterogeneity, often caused by imaging patterns within the lesions [20, 21]. For example, the inflammatory HCA subtype is characterized by the “atoll sign”, illustrated by a hyperintense peripheral rim on T2-weighted images [22, 23].

Data combining evidence on hyper- or isointensity in the HBP with MR imaging patterns are limited [24]. Therefore, the aim of the current study was to identify imaging characteristics to aid the differentiation between FNH and HCA/HCC that were hyper- or isointense in the HBP on Gd-EOB-DPTA enhanced MRI.

Materials and methods

Study design and inclusion in participating center

A retrospective cohort study was performed in accordance with the Declaration of Helsinki and the STROBE guidelines. The need for ethical approval was waived. If required by regulations of participating centers, patients not occurring in objection registers were sent a letter detailing study procedures and aims and could opt out of the study. All patients with pathology-proven FNH, HCA, or HCC (biopsies or resection specimens) between 2010 and 2020 were identified through local pathology databases. Radiologists in participating centers reviewed all Gd-EOB-DTPA-enhanced MRIs. All lesions hyper- or isointense compared to the surrounding liver in the HBP by visual interpretation were included, including partially hyper- to isointense lesions.

Central inclusion and imaging techniques

All hyper- or isointense lesions were centrally pseudonymously reviewed by two expert radiologists (more than twenty years of experience) blinded from the pathological classification. Included were patients with hyper- or isointense regions in the lesions compared to the surrounding liver, not attributable to pre-contrast signal intensity, therefore due to contrast uptake in the HBP. In case of doubt, subtraction images were produced.

In general, MRI scans were conducted using 1.5-Tesla or 3.0-Tesla scanners. Typically, a spoiled 3D gradient fat-saturated T1-weighted sequence was produced before and during the different dynamic phases after contrast injection. The HBP consisted of a similar sequence as the dynamic sequence, and was performed between 10 min and 20 min after contrast injection. In- and opposed-phase T1-weighted sequence was performed using the Dixon technique, if available. T2-weighted 2D spin-echo sequence was performed with or without fat saturation, using a single shot or/and multishot protocol. Diffusion-weighted sequences included typically minimally two b-values with the first between 0 s/mm² and 150 s/mm² and the second from 600 s/mm² to 800 s/mm².

Imaging characteristics

The supplementary material shows full signs, definitions, and scoring process. Patterns in the HBP were raster, perilesional hypointense rim [25], lesional hyper- or isointense rim [26], atoll fingerprint, white-bordered flower (present/absent), partification (no, craquelure with loosening part, focal/mosaic or nodule-in-nodule) [27, 28], and extent of hyper- or isointensity (limited/extensive). Patterns on T2-w were sinusoidal dilatation (present/absent) [23] and partification (no, focal/mosaic or nodule-in-nodule). Patterns scored in the HBP and T2-weighted images were scar [29], liquid (present/absent), and extent of heterogeneity (no/limited or extensive).

Miscellaneous imaging features were: T1-w in-phase hyperintensity not due to fat, hemosiderin (present/absent), and intralesional fat (absent, less than 50%, or more than 50%). In dynamic phases, intralesional arterial hypointense rim (present/absent) and venous washout (present, absent, equivocal, or pseudo-washout) were scored [9, 30, 31]. If diffusion-weighted imaging (DWI) was available, diffusion restriction was scored (present, absent, or equivocal). Lesion contour was defined as spherical, cauliflower, or aspecific.

Finally, the final diagnosis (i.e., FNH or HCA/HCC) and treatment recommendation (i.e., no follow-up, follow-up, or histology (either biopsy or resection)) were formulated. HCA and HCC were grouped because both are hepatocellular lesions with (risk of) malignant transformation generally assumed to be hypointense in the HBP. Treatment recommendations were based on imaging characteristics alone, not taking lesion size and location into account.

Pathological and clinical data

Pathological classification was based on a review of pathology reports. The time between MRI imaging and biopsy and/or resection was a median of 0 months (IQR: 0–3 months). The electronic supplementary material describes morphological requirements for each diagnosis. For the differentiation between FNH and HCA, either a pattern of glutamine synthetase (GS) staining or, in the case of HCA, a positive C-reactive protein or positive serum amyloid A staining was required [32]. B-catenin-activated HCA was characterized by nuclear staining of B-catenin and/or overexpression of GS (n = 5) and/or molecular evidence of mutations in the CTNNB1 gene (n = 9). If available, the locus of the mutation was noted.

Clinical data and baseline characteristics collected were lesion size in centimeters and lesion location (left or right liver lobe). Pathological and clinical data were collected using RedCap. Underlying liver disease was defined as non-alcoholic fatty liver disease or non-alcohol steatohepatitis, cirrhosis and/or fibrosis, and metabolic diseases including hepatocyte nuclear factor 1A maturity-onset diabetes of the young and glycogen storage disorders.

Statistical analysis

Statistical analyses were conducted using SPSS® statistics version 28.0 (IBM). Classification and regression tree (CART) analysis was conducted using the recursive partitioning and regression trees (Rpart) package in RStudio version 4.2.1 (Posit). The Rpart package uses ten-fold cross-validation to determine the optimal place to prune the regression tree. Categorical variables were expressed as numerators/denominators with percentages and continuous variables as means with standard deviations (SDs) or medians with interquartile ranges (IQRs). For all analyses, available case analysis was conducted.

Interobserver and intraobserver variability were calculated using Cohen’s kappa statistic. Levels of agreement were interpreted using cut-off values defined by Landis and Koch: < 0.00 poor, 0.00–0.20 slight, 0.21–0.40 fair, 0.41–0.60 moderate, 0.61–0.80 substantial, and 0.81–1.00 almost perfect [33].

In univariable analyses, imaging features were compared between patients with a final pathological diagnosis of FNH and HCA/HCC using Pearson’s chi-squared test or Fisher’s exact test. p-values < 0.05 were considered statistically significant. Diagnostic odds ratios with 95% confidence intervals for the diagnosis FNH, compared to HCA/HCC were calculated. Sensitivity, specificity, positive predictive value, and negative predictive value were calculated for significant imaging features. Haldane–Anscombe correction was performed in the case of zero cells.

For multivariable analyses, imaging features with a p-value of > 0.25 in univariable analysis were excluded. A Spearman’s correlation matrix of imaging features was derived. In cases of a correlation coefficient of 0.80 and higher, indicative of collinearity, one of the correlated imaging features was excluded from the multivariable analysis. The tree diagram obtained by CART analysis and odds ratios obtained by multivariable logistic regression were combined in a visual representation made in the draw.io® (JGraph Limited). Sensitivity analyses assessed the occurrence of imaging features suggestive of FNH in B-catenin-activated HCA.

Results

In the six participating centers, hyper- or isointense lesions were observed in the HBP in 113/113 (100%), 95/211 (45%), and 43/90 (48%) of patients with pathology-proven FNH, HCA, and HCC, respectively. Hyper- or isointense lesions were included for central revision with subtraction images. After central inclusion based on the central assessment of pathological classification and hyper- and isointensity, 124/251 patients (49%) were included for full analysis (Fig. 1).

Fig. 1
figure 1

Flowchart of the inclusion process

The majority (108/124, 87%) of patients were female and the mean age was 40.3 years (SD 9.8 years). Underlying liver disease was present in 29/124 patients (23%) and 66/124 patients (53%) had a known history of oral contraceptive use. Two patients (1.6%) had a history of anabolic steroid (ab)use and one patient (0.8%) of transdermal estrogen use (Table 1).

Table 1 Baseline characteristics

Baseline imaging and pathological characteristics

In total, 124 patients with 128 hyper- or isointense lesions were included. The mean lesion size was 6.8 (SD 3.3) centimeters and most lesions arose from the left 71/128 (55%) hemiliver.

Based on imaging, the 128 lesions were classified as FNH in 66 patients (52%) and HCA/HCC in 62 patients (48%, Table 1). In 12 of the 66 patients (18%) diagnosed with FNH on imaging, biopsy was recommended. Interobserver agreement was moderate and intraobserver agreement was substantial for the majority of the imaging characteristics (Supplementary Tables 1 and 2).

Final pathology diagnoses were evenly distributed between the two groups (FNH in 64 patients, 50%, and HCA/HCC in 64 patients, 50%). Two patients were classified as HCA/HCC on imaging and turned out to be FNH on pathology. Four patients classified as FNH on imaging turned out to be HCA/HCC on pathology, in one of these patients the treatment recommendation formulated by the expert radiologists was to biopsy the lesion. Thus, the sensitivity of imaging for the diagnosis of FNH was 96.9% (95% CI: 90.7–99.5%) and the specificity was 93.8% (95% CI: 86.1–98.0%). The positive predictive value was 93.9% (95% CI: 86.5–98.1%) and the negative predictive value was 96.8% (95% CI: 90.4–99.5%).

Imaging characteristics exclusively occurring in HCA/HCC

Imaging characteristics were compared between the 64 patients with a pathological diagnosis of FNH and the 64 patients with a pathological diagnosis of HCA/HCC (Table 2). Several imaging characteristics were exclusively observed in HCA/HCC, these were: (i) the presence of a raster pattern in the HBP, (ii) the presence of an atoll fingerprint pattern in the HBP, (iii) the presence of sinusoidal dilatation on T2-w, (iv) the presence of hemosiderin, (v) the presence of T1-w in-phase hyperintensity not due to fat, (vi) the presence of venous washout, and (vii) the presence of a nodule in nodule participation in the HBP and on T2-w (Fig. 2).

Table 2 Occurrence and univariable analysis of imaging characteristics in FNH and HCA/HCC
Fig. 2
figure 2

CART and multivariable logistic regression analysis. Figure shows imaging characteristics exclusively identified in HCA/HCC (top part of the figure), and combines the results of the multivariable logistic regression and CART analysis (bottom part of the figure, shaded in gray). The tree structure and division of categorical variables are derived from the CART analysis. The odds ratios are derived from multivariable logistic regression. The occurrence of the identified imaging characteristics in the respective groups is shown in the circles on the right part of the figure

Supplementary Table 3 shows imaging characteristics comparing HCA and HCC: (i) up to (iv) were indicative of HCA, and (v) up to (vii) of HCC. Figure 3 shows a case of a patient with imaging characteristics specific to HCA/HCC. In 12/64 (19%) HCA/HCC patients, none of these imaging characteristics were observed. Ten out of twelve were HCA, one was an uncertain HCA/HCC based on immunohistochemistry, and one was an HCC in a male patient based on morphology.

Fig. 3
figure 3

Case of a patient with imaging features suggestive of FNH occurring in a B-catenin-activated HCA. Large liver mass (B-catenin inflammatory HCA, exon 3, non-S45) in a 21-year-old female using oral contraceptives. On T1-w in-phase and out-of-phase imaging (a, b), we can appreciate a slightly intrinsically hyperintense tumor (226 vs 186 units) which is hypervascular during the arterial and venous phase (c, d). During the HBP (20 min) (e), the tumor is clearly hyper- to isointense to the surrounding liver. Mainly in the center of the lesion, some hypointense lines can be seen, which possibly comprise scar tissue. On T2-w imaging with fat suppression a large rim of hyperintense signal can be visualized and central inhomogeneity (f). This was classified as sinusoidal dilatation with possible atoll configuration. There was no restriction on the apparent diffusion coefficient map (g, h). The combination of extensive hyper- or isointense signal in the HBP (typical for FNH), together with features very typical or unequivocal for HCA/HCC (sinusoidal dilatation and T1-w intrinsic hyperintensity) should prompt a biopsy. The presence of a scar (which is typical for FNH) should not mislead the reader in this situation

Imaging characteristics suggestive of FNH or HCA/HCC

Other imaging characteristics were suggestive of, yet not specific for, FNH or HCA/HCC. The sensitivity, specificity, positive predictive value, and negative predictive value of (subcategories of) these imaging characteristics are shown in Table 3. These imaging characteristics were included in multivariable analyses (Supplementary Table 4).

Table 3 Imaging features exclusively seen in HCA/HCC and/or suggestive for FNH of HCA/HCC

Excluded from multivariable analyses were HBP partification, HBP scar, and HBP liquid, because these imaging features had a correlation efficiency of 0.80 or higher with their T2-w counterparts. Perilesional hypointense rim and lesional hyper-/isointense rim were excluded because these were not associated in the univariable analysis, and diffusion restriction was excluded due to missing data.

The outcomes of regression tree analysis and multivariable logistic regression were similar (CART R2 = 0.766 and Nagelkerke R2 = 0.844, respectively). Three additional imaging features that were independently associated with either FNH or HCA/HCC were identified (Fig. 2): (i) the presence of a scar on the T2-w, indicating FNH, (ii) the presence of less than 50% fat, indicating HCA/HCC, and (iii) a spherical contour indicating HCA/HCC.

Occurrence of imaging features suggestive of FNH in B-catenin-activated HCA

The presence of a T2-w scar, which was associated with FNH in the multivariable analysis and was present in 61/64 (95%) of FNH, was also present in 4/14 (29%) of B-catenin-activated HCA and in two out of five B-catenin activated HCA with a non-S45 exon 3 mutation (Table 4 and Supplementary Table 5).

Table 4 Occurrence of the imaging features suggestive of FNH in B-(I)HCA

Other imaging features suggestive of FNH that occurred in almost all FNH were also frequently present in B-catenin-activated HCA, i.e., extensive hyper-/isointensity, the absence of partification, the absence of diffusion restriction, and the absence of venous washout. Cauliflower contour and HBP White-bordered flower were both rarely observed (1/14, 7.1%) in B-catenin activated HCA, yet were also present in less than 50% of FNH. Figure 3 shows a case of a patient with imaging features suggestive of FNH occurring in a B-catenin-activated HCA.

Discussion

This multicenter study found that imaging has high sensitivity and specificity for differentiation of FNH and HCA/HCC hyper- or isointensity in the HBP. In more than 80% of HCA/HCC, at least one of the imaging features that occurred exclusively in HCA/HCC was present. These features, with acceptable intra- and interobserver reliability, aid the clinically relevant distinction between FNH and HCA/HCC. The absence of these highly specific features mostly occurred in HCA.

Comparing the imaging characteristics suggestive of HCA/HCC identified in this study to the literature, the presence of sinusoidal dilatation on T2-w has been described in inflammatory HCA and rarely in HCC [23, 34]. Some other imaging features were also described for the Liver Imaging Reporting and Data System criteria for HCC, for example, the presence of nodule-in-nodule partification in the HBP and on T2-w [9, 25, 35]. Venous washout exclusively occurred in HCA/HCC, although assessment of venous washout is hampered in lesions hyper- to isointense in the HBP by early uptake of Gd-EOB-DTPA by hepatocytes [36]. In addition, the study identified novel features indicating HCA/HCC, including the raster and atoll fingerprint pattern in the HBP; and features indicating FNH, including the white-bordered flower.

Regarding the imaging diagnosis of FNH, according to EASL guidelines, the diagnosis is generally made by a set of characteristics indicative of FNH, and the absence of characteristics suggesting HCA/HCC warranting further analysis [4]. In multivariable analyses, a T2-w scar was independently associated with FNH with an odds ratio of more than 80. Yet, the confidence interval of the odds ratio was wide. In the absence of other imaging features completely specific to FNH, caution is advised. Undertreatment of HCA/HCC caused by misdiagnosis as FNH could result in malignant transformation [4, 37, 38].

Our study, which is one of the first to assess B-catenin mutated HCA alongside a control group of FNH, contributes to the expanding body of evidence that suggests that the threshold for biopsy and/or resection should perhaps be lowered for lesions showing HBP hyper- or isointensity. Lesions with imaging characteristics exclusively seen in or suggestive of HCA/HCC should be suspected to be HCA/HCC despite the presence of extensive hyper- or isointensity in the HBP and/or a T2-w scar. A T2-w scar was independently associated with FNH in the multivariable analysis, but was also present in 4/14 (29%) of B-catenin-activated HCA. These findings are in line with previous studies that have shown that B-catenin-mutated HCA may show a T2-w scar [39]. Other imaging characteristics suggestive of FNH such as the absence of venous washout or diffusion restriction were also frequently present in B-catenin-activated HCA. Other authors have suggested quantitative HBP analysis to differentiate FNH from HCA/HCC [37]. Our study confirms that hyper- or isointensity in the HBP has a high specificity for FNH, as all included FNH were hyperintense. However, in the current cohort and in previous literature, B-catenin-activated HCA is frequently extensively hyper- or isointense (13/14 B-catenin activated HCA) [40]. In our study, almost half of the HCA and HCC (95/211, 45% of HCA and 43/90, 48% of HCC) were also hyper- or isointense in the HBP, which is much higher than reported in previous studies [1, 10, 14, 15]. Selection bias in patients with HCC who underwent Gd-EOB-DPTA enhanced imaging and in patients with HCA who underwent biopsy and/or resection may be present. Nonetheless, it is striking that 14/48 (29%) of HCA were B-catenin mutated HCA.

This study has some limitations. First, diffusion-weighted imaging was excluded from multivariable analysis because these were often missing due to insufficient quality. Second, the method to characterize B-catenin-activated HCA provided high specificity for B-catenin-activated HCA, but may lack sensitivity [41, 42]. This may have introduced bias as exclusively lesions with strong B-catenin activation (indicated by overexpression of GS-staining) may have been included and limits the number of B-catenin activated HCA identified in the current study [42]. Third, the retrospective study design is prone to selection bias and the sample size may lead to overfitting of the multivariable logistic regression analysis. Thus, caution is advised when extrapolating the imaging characteristics that were exclusively observed in FNH or HCA/HCC in our study to larger and other populations. In addition, external validation of the CART analysis is warranted. Fourth and finally, the differentiation with other types of potentially hyper- or isointense liver lesions was not assessed. These other lesions were excluded from this study as these are hyper- or isointense not due to intracellular uptake in hepatocytes, but through other mechanisms [1].

In conclusion, our study provides a diagnostic algorithm with potential clinical application. For lesion hyper- or isointense in the HBP, raster or atoll fingerprint, sinusoidal dilatation, hemosiderin, T1-w hyperintensity, and/or nodule-in nodule partification should lead to suspicion of HCA/HCC. If these lesions are extensively hyper- to isointense in the HBP, further investigation may be warranted.