Introduction

Nonalcoholic fatty liver disease (NAFLD) is the main form of chronic liver disease in children and adolescents. Obesity and associated metabolic disorders are the staple risk factors for the development of NAFLD, which may be part of the comprehensive clinical picture of metabolic syndrome. With the increasing prevalence of childhood obesity, hepatic steatosis will likely become a major public health issue [1]. The estimated prevalence of hepatic steatosis in Europe ranges from 1.3% to 22.5% for children aged 3–18 years, with a prevalence of 11% among adolescents [2].

As far as we are aware, there are no cohort studies giving insight into the course of NAFLD and nonalcoholic steatohepatitis (NASH) in children or their potential evolution into fibrosis and advanced cirrhosis during childhood and early adulthood [3, 4]. The absence of overt clinical manifestations makes precocious NAFLD diagnosis difficult; as a consequence, the diagnosis of pediatric NAFLD is often incidental, at an average age of 11–13 years [4].

Despite the availability of noninvasive methods to confirm a diagnosis of NAFLD, liver biopsy remains the current gold standard for steatosis and NASH classification. However, biopsy has a number of drawbacks: it is invasive and operator-dependent (and thus subject to sampling variability), requires inter-pathologist agreement and imposes a degree of clinical risk which cannot be overlooked [5]. These limitations make liver biopsy impractical in the routine assessment of liver fat content for patients with uncomplicated NAFLD. Liver enzyme assays systematically underestimate the actual prevalence of the disease and display low prognostic value for the development of NASH [6].

Less invasive methods to exclude hepatic steatosis are required not just as a first step in the diagnostic pathway, but also during the follow-up phase, to monitor the impact of any interventions.

Recently, a few studies have been published investigating new ultrasound (US) technologies for quantitative assessment of liver fat content in children [7, 8] with good results; however, optimal threshold values for these techniques have not yet been defined and approved [9]. Currently, magnetic resonance spectroscopy (MRS) and magnetic resonance imaging proton density fat fraction (MRI-PDFF) represent the only noninvasive diagnostic methods for quantifying hepatic steatosis. Conventional B-mode US is a widely used imaging tool for the noninvasive subjective assessment of hepatic steatosis [10]. However, the US evaluation of fatty liver is based on the operator’s expertise; there is a subjective estimation of the extent of fatty infiltration in the liver, with a sensitivity of 60.9–65.0% for the detection of mild steatosis (fat content>5% but <33%) [11]. Compared to histology, the overall sensitivity and specificity of B-mode US were, respectively, 84.8% and 93.6%, with an area under the receiver operating characteristic curve (AUROC) of 0.93 (95% confidence interval [CI] 0.91–0.95) [12].

Despite these limitations, B-mode US is the preferred first-line diagnostic imaging procedure suggested in patients with nonalcoholic fatty liver disease [13].

Recently, novel liver composition assessment techniques (quantitative ultrasound, QUS) have been investigated for their usefulness in the evaluation of hepatic steatosis based on an analysis of the radiofrequency echoes detected by the transducer and on the automatic calculation of the hepato-renal index. The aims of the present study were to assess the diagnostic accuracy and inter-operator variability of QUS and the diagnostic accuracy of B-mode US in pediatric NAFLD using MRI-PDFF as the reference standard.

Materials and methods

Patients were enrolled between January 2021 and May 2022 at the Department of Pediatrics, Sapienza University of Rome, Italy. The main inclusion criteria were age <18 years and a body mass index (BMI)-for-age>85th percentile (World Health Organization [WHO] growth charts, 5–19 years [14]). The exclusion criteria were the presence of diabetes mellitus (type 1 or 2), thyroid illnesses and liver diseases other than NAFLD. The parents or legal guardians of all patients signed an informed consent form before enrollment. Calibrated scale and stadiometer were used for body weight and height assessment, respectively; waist circumference was taken with an inextensible tape at a point midway between the inferior margin of the lowest rib and the iliac crest [15]. Obesity prevalence was assessed using both the WHO [16] and International Obesity Task Force (IOTF) [17] definitions.

Resting blood pressure was measured by auscultation using a pediatric aneroid non-mercury manometer, with the patient maintaining a seated position for 5 min before evaluation.

The following biochemical parameters were measured using commercial assays: total cholesterol, HDL-cholesterol, LDL-cholesterol, non-HDL-cholesterol, triglyceride, chylomicron remnants, basal glucose and basal insulin. After collecting fasting samples, a standard oral glucose tolerance test was performed and blood samples withdrawn after 2 h; 2-h glucose was measured. Homeostasis model assessment of insulin resistance was calculated from fasting plasma insulin and glucose levels using the formula: insulin×glucose/405 (mU/L×mg/dL) [18].

All participants underwent MRI, B-mode US and QUS using the following software: EzHRI™ (Hepato-Renal Index with automated region of interest [ROI] recommendation), TAI™ (Tissue Attenuation Imaging), and TSI™ (Tissue Scatter Distribution Imaging) (all software, Samsung Medison, Hongcheon, Republic of Korea).

Participants were asked to fast at least 6 h before the B-mode US. An experienced radiologist (V.C. with over 25 years of experience) and a radiologist (G.P., with four years of training) performed a conventional grayscale ultrasound with QUS imaging using a clinical US system (RS85 A, Samsung Medison, Seoul, South Korea) with a curved array transducer.

Routine B-mode US images of the liver were obtained using subcostal and intercostal planes and stored. Hepatic steatosis was diagnosed based on known US findings, including increased parenchymal echogenicity, hepatorenal echo contrast, impaired visualization of the diaphragm line and intrahepatic portal vein wall, and deep attenuation of the liver parenchyma [19]. The severity of hepatic steatosis was qualitatively scored on a 4-point scale using the Hamaguchi scoring system [20] with scores of 0, 1, 2, and 3 indicating absent, mild, moderate, and severe steatosis, respectively.

Studies have shown that the hepatorenal index or hepatorenal ratio is a sensitive and noninvasive test [21,22,23]. It is a simple calculation of the B-mode ratio, or the brightness ratio of the liver parenchyma over the renal cortex in each user-selected ROI. EzHRI functions in much the same way as the conventional hepatorenal index but offers greater convenience and an improved workflow by suggesting initial ROI positions. EzHRI segments the input image into the kidney and liver, based on deep learning technology. It then uses the stochastic analysis method, which extracts three brightness classes from each segmented organ, to extract only the cortex while excluding other structures and artifacts. Lastly, it finds the ROIs in the liver and kidney exhibiting the lowest brightness variance, calculates the average brightness ratio for each ROI, and then measures the heptaorenal index.

TAI is a tool that quantitatively measures attenuation of US signals received from the liver. Attenuation of US signals is due to the gradual loss of signal strength due to absorption, reflection, refraction, and scattering. TAI quantifies attenuation based on changes in the center frequency under the optimal transmission and reception conditions. Attenuation is increased in high-frequency components, resulting in changes in the center frequency of the signal; the severity of fatty liver disease is positively associated with the degree of attenuation [24, 25]. To ensure accurate measurement, it is recommended that the user selects a ROI in the right lobe at the VIII segment, at least 3 cm away from the liver capsule and excluding the major veins, with the patients holding their breath. The reliability of the measurement is expressed as an R2 value. It is recommended that the user performs a measurement in a region with an R2 value of 0.6 or above.

TSI is a tool that quantifies the scattered signal distribution based on backscattered signals. Scattering, the reflection of ultrasonic waves in multiple directions is not affected by their entry angle and produces speckle patterns in US images. This changes the distribution of backscattered US signals, based on the scattering intensity. The distribution can be modeled by applying a statistical distribution. The Rayleigh distribution is used to represent the case of a high density of random scatterers without coherent signal components. Based on this model, TSI represents scattering by quantifying the correlation between the backscattered signals and the Rayleigh distribution [25,26,27].

MRI scans were acquired using a 3-tesla magnet (GE Discovery 750; General Electric Healthcare, Milwaukee, WI) with a peak gradient amplitude of 50 mT/m, a time to peak of 200 μs, and a 32-element body torso-array coil system. For liver fat quantification, the following technique was used: 3-dimensional (D) T2*-weighted with Dixon pulse multiple-echo sequence (MRI-PDFF) with the following parameters: TR, 12.9 ms; TE six different echoes from 1.6 to 9.8 ms field of view, 35–40 cm; matrix, 224×60; bandwidth, 125 kHz; FA, 5°; section thickness, 5 mm with 28 sections, acquisition time 25 s. The images of the iterative decomposition of water and fat with echo asymmetry and least squares estimation (IDEAL Q) sequence were processed using the software provided by the manufacturer (IDEAL-IQ; General Electric Healthcare) to create water, fat, in-phase, out-of-phase, R2* and fat fraction maps instantaneously [5]. Quantification of liver steatosis from the IDEAL sequence was performed by drawing a 2-cm2 ROI in all liver segments, based on the software-generated fat fraction map [28].

Data were collected in a Microsoft Excel-format spreadsheet. Statistical calculations were performed using SPSS software (IBM SPSS Statistics for Windows, IBM Corp., Version 28.0. Armonk, NY), with the exception of Bland-Altman analysis (including plots) which was performed using jamovi (The jamovi project, Version 2.3, Sydney, Australia) [29]. To assess normality, variables were checked in terms of skewness and kurtosis and the Shapiro-Wilk normality test was performed; variables that were not normally distributed underwent appropriate numeric transformation (logarithmic, squared root, or reciprocal functions). After categorizing patients according to the absence (MRI-PDFF<5.6%) or presence (MRI-PDFF≥5.6%) of hepatic steatosis, baseline comparisons between the two groups were performed using Mann-Whitney U test. To calculate the diagnostic performance of TAI, TSI and EzHRI in detecting patients with steatosis defined by MRI-PDFF, ROC curve analysis was performed; optimal cut-off values were defined as those maximizing the Kolmogorov-Smirnov statistic. Agreement between B-mode US and MRI-PDFF in differentiating patients with and without liver steatosis was assessed by means of unweighted Cohen’s kappa (κ). The κ scores were interpreted using Landis and Koch’s criteria [30]. The levels of concordance between the two radiologists using TAI, TSI, and EzHRI were expressed with a two-way mixed-effects single-measure intraclass correlation coefficient for agreement (ICC); ICCs were reported in accordance with Koo et al. [31]. Bland-Altman plots were drawn and mean bias, upper and lower limits of agreement and their 95% CI were plotted. P-values of less than 0.05 were considered statistically significant.

Results

Out of the 36 children and adolescents recruited in the study (17 females), MRI showed a PDFF value ≥5.6% in 22 patients (males: n=13 [59%]). Obesity prevalence ranged from 66.6% (n=24) to 44.4% (n=16), whereas overweight affected 33.3% (n=12) to 55.5% (n=20) of the sample (IOTF and WHO definitions, respectively). The general characteristics of the study sample, categorized according to hepatic steatosis status, are expressed as medians (1st–3rd quartile) in Table 1.

Table 1 Stratification of biochemical and anthropometric characteristics in patients with and without evidence of hepatic steatosis based on magnetic resonance imaging proton density fat fraction (MRI-PDFF). No statistically significant between-group difference was apparent (Mann–Whitney U test; all P> 0.1)

The level of agreement between B-mode US and MRI-PDFF definition of hepatic steatosis was very good for the 1st radiologist (κ=0.94, 95% CI 0.83–1.00) and good-to-very good for the 2nd radiologist (κ=0.82, 95% CI 0.62–1.00). Inter-operator variability of the QUS software was good-to-excellent for EzHRI (ICC 0.91, 95% CI 0.82–0.95) and TAI (ICC 0.94, 95% CI 0.88–0.97) and moderate-to-good for TSI (ICC 0.73; p<0.001; 95% CI 0.53–0.85). Bland-Altman plots for each QUS index are displayed in Figs. 1, 2, and 3; mean difference (mean Δ), limits of agreement and 95% CI are reported in Supplementary Material S1–S3.

Fig. 1
figure 1

Bland–Altman plot of hepato-renal index with automated region of interest recommendation values (1st vs. 2nd radiologist). Solid horizontal line y=0, broken lines mean difference (Δ), upper and lower limits of agreement (LoA), dotted lines 95% confidence interval for mean Δ, upper LoA and lower LoA. See Supplementary Material 1 for further details

Fig. 2
figure 2

Bland–Altman plot of Tissue Attenuation Imaging values (1st vs. 2nd radiologist). Solid horizontal line y=0, broken lines mean difference (Δ), upper and lower limits of agreement (LoA), dotted lines 95% confidence interval for mean Δ, upper LoA and lower LoA. See Supplementary Material 2 for further details

Fig. 3
figure 3

Bland–Altman plot of Tissue Scatter Distribution Imaging values (1st vs. 2nd radiologist). Solid horizontal line y=0, broken lines mean difference (Δ) upper and lower limits of agreement (LoA), dotted lines 95% confidence interval for mean Δ, upper LoA and lower LoA. See Supplementary Material 3 for further details

Using ROC curve analysis, the diagnostic accuracy of QUS methods (values obtained by the 1st radiologist) was investigated against MRI-PDFF (reference method) (Fig. 4). The computed AUROC were 0.98 (95% CI 0.94–1.00) for EzHRI, 0.90 (95% CI 0.77–1.00) for TAI and 0.99 (95% CI 0.98–1.00) for TSI. Data on diagnostic performance of TAI, EzHRI and TSI are given in Table 2.

Fig. 4
figure 4

Receiver operating characteristic curve analysis of TSI, TAI and EzHRI against MRI-PDFF ≥ 5.6% (reference method). EzHRI Hepato-Renal Index with automated region of interest (ROI) recommendation, MRI-PDFF magnetic resonance imaging proton density fat fraction, TAI Tissue Attenuation Imaging, TSI Tissue Scatter Distribution Imaging

Table 2 Diagnostic performance of TSI, TAI, and EzHRI using receiving operating characteristic curve analysis

Discussion

In our study, the sensitivity and specificity of EzHRI were compared with MRI-PDFF, resulting in an AUROC of 0.98 (0.94–1.00). With regard to TSI, several studies have shown significantly better diagnostic performance in the detection of liver fat content >10%, compared to qualitative visual evaluation in B-mode [32]. Thus, TSI can be considered a valid marker for the identification of the presence of hepatic steatosis in children and adolescents, with an AUROC of 0.99 (0.98–1.00) and a superior diagnostic accuracy to TAI and EzHRI. To date, few studies have focused on the aforementioned QUS indices in the assessment of pediatric liver steatosis; one prospective study evaluated the performance of a different attenuation coefficient (i.e., attenuation imaging, ATI) compared to MRI-PDFF [7], while a cross-sectional study evaluated the diagnostic concordance between ATI and B-mode US [8]. Another recent prospective study included four US technologies for the quantitative assessment of liver fat content in children (the acoustic attenuation coefficient, the hepatorenal index, the Nakagami parameter and the shear wave elastography parameter) using MRI-PDFF as the reference standard. The attenuation coefficient and hepatorenal index accurately detected and quantified liver fat in this small sample of children [33].

Authors of one study assessed the diagnostic performance of quantitative liver echogenicity measurement (hepatorenal index) for hepatic steatosis in a pediatric cohort using MRI-PDFF as the reference method and showed moderate diagnostic performance and a positive correlation with MRI-PDFF with an optimal hepatorenal index cutoff value of >1.75. Furthermore, the incorporation of BMI-percentile and standard deviation of the grayscale echogenicity improved diagnostic performance [34].

In previous studies involving an adult population, an EzHRI value of 1.2 was used as the optimal cut-off (sensitivity: 100%) to detect steatosis [21]. In agreement with these results, this current study in pediatric patients found a similar value for EzHRI (AUROC 0.98, specificity 92.9%) as the best cut-off value. A limitation of our study is the small sample size, which limited our ability to discern different stages of hepatic steatosis; nonetheless, the homogeneity of sex representation, our focus on a pediatric population and the availability of a reference method (MRI-PDFF) counterbalance this drawback. As the operator-dependency of B-mode US assessment represents a major issue, the present study explored the inter-rater reliability of QUS techniques, which was good-to-excellent for EzHRI and TAI and moderate-to-good for TSI.

In a prospective study, US was 90% sensitive in detecting steatosis in 20% of hepatocytes but became less sensitive for lower degrees of fat content [35]. Hepatic steatosis is often homogeneously distributed in the liver, but 10–15% exhibit an atypical heterogeneous distribution. Areas of focal steatosis appear hyperechogenic, most often located near the gallbladder or adjacent to the falciform ligament, sometimes diffuse with typical map-like distribution. In contrast, hypoechogenic areas are termed focal sparing areas and represent different concentrations of steatosis.

From a practical standpoint, an US image indicating steatosis implies the presence of at least 20% of hepatic fat. However, below that threshold, steatosis cannot be reliably detected by standard B-mode US.

However, classification of steatosis with US is arduous because the operator must subjectively (qualitatively) assess the amount of fatty infiltration. Especially in mild forms of steatosis, B-mode US displays limited accuracy in assessing mild changes of steatosis over time [10].

Therefore, although B-mode US can offer a fairly accurate diagnosis in cases of moderate-to-severe steatosis (defined histologically as ≥ 30% or 33%), with a sensitivity ranging from 81.8 to 100.0% and a specificity of approximately 98% [36], it has several limitations in identifying mild steatosis (fat content >5% but <33%), with a sensitivity ranging from 60.9 to 65.0% [11]. In short, the qualitative assessment of B-mode US is strongly influenced by the severity of hepatic steatosis, the presence of any coexisting liver disease and the experience of the operator, and from these observations, it follows that it is important to develop US methods to accurately quantify hepatic steatosis.

Conclusion

In conclusion, our results show acceptable reliability for the included QUS methods; furthermore, using MRI-PDFF as the reference method, we propose EzHRI, TAI and TSI cut-off values to diagnose NAFLD in a pediatric setting. These preliminary results require confirmation by further studies investigating the performance of QUS in hepatic steatosis grading. However, the wide availability of QUS software, its ease of use and speed of execution make it a potentially useful tool for the screening and clinical follow-up of pediatric hepatic steatosis.