Introduction

Magnetic resonance (MR) imaging (MRI) offers high tissue contrast, enables non-invasive assessment of the liver, and is integral to diagnosing hepatocellular carcinoma (HCC) [1], liver fibrosis [2], cirrhosis [3], and portal hypertension [4]. Liver segmentation can be used for volumetry, anatomic localization, and the extraction of radiomic features. Accurate liver volumetry is essential for risk assessment, clinical decision-making, and planning of therapeutic procedures. Liver volume of both donor and recipient is an important predictor of the success of liver transplantation [5], and liver resection relies on reliable volume measurements because the outcome depends heavily on the liver remnant [6]. Liver volumetry is also important for therapy planning and dosimetry calculation in Yttrium-90 selective internal radiotherapy [7], and liver volumes are of interest for epidemiological research [8]. Anatomic localization by accurate liver segmentation is key for anatomical guidance in computer-assisted surgery [9] and radiotherapy [8, 10] and is typically accomplished through manual segmentation. Moreover, liver segmentation is a pivotal pre-processing step for lesion detection algorithms [11] and for the extraction of radiomics.

Manual liver segmentation is time-consuming and subject to inter-rater variation [12], which limits its practicality in clinical workflows. Convolutional neural networks (CNNs) based on deep learning have shown promising results in automating segmentation tasks in medical imaging [13] and provide fast processing times. However, overfitting and dataset shift are major problems in deep learning; external evaluation is therefore pivotal to ensure generalizable validity [14], and many deep learning algorithms have shown substantially decreased performance on external data [15]. A recent study underlined the importance of model evaluation on datasets comprising the heterogeneous diagnostic findings encountered in clinical practice [16]. Most proposed automated liver segmentation methods were developed on small datasets and tested only on small internal test sets and therefore do not guarantee generalizable and consistent performance on data from other institutions [14, 15]. Liver image analysis techniques based on radiomics and deep learning, which rely on anatomical segmentations as input, have demonstrated their utility in applications such as characterizing focal hepatic lesions, staging liver fibrosis, and identifying portal hypertension [17].

The aim of this study was to develop and evaluate a deep CNN (DCNN) for automated liver segmentation, liver volumetry, and radiomic feature extraction on portal venous phase contrast-enhanced MRI using a large institutional dataset and assess performance generalizability to external and public testing data.

Materials and methods

Compliance with ethical standards

This retrospective, HIPAA-compliant study was approved by the institutional review boards of the Yale School of Medicine and the Beaujon Hospital in Paris with a full waiver of informed consent and was conducted in accordance with the Declaration of Helsinki.

Data availability

Image data used in this paper cannot be shared publicly due to legal reasons (it would compromise patient confidentiality).

Code availability

All code and the trained segmentation model are publicly available on GitHub: https://github.com/OnofreyLab/volumetry-net

Data

Inclusion of patients and magnetic resonance imaging data

From an institutional HCC database at Yale School of Medicine, all patients >18 years of age with T1-weighted portal venous MR images available for processing were included. All included scans were downloaded from the Picture Archiving and Communication System (PACS) server and de-identified. MRI was acquired between 2008 and 2019 using a standard triphasic institutional imaging protocol as suggested by the LI-RADS committee [1]. T1-weighted 3D gradient echo volumetric interpolated examination (VIBE) sequences with fat saturation [18] were acquired before the administration of various gadolinium-based contrast agents and at 12–18 s (depending on bolus tracking), 60–70 s, and 3–5 min after contrast injection, yielding pre-contrast-, late arterial-, portal venous-, and delayed-phase images, respectively. Imaging was conducted on a range of scanners with different field strengths, including 1.16 T, 1.5 T, and 3 T. Further information about imaging parameters can be found in Supplemental Table 1.

External validation data

To evaluate the model’s generalization performance on a different patient population, an external dataset of de-identified T1-weighted portal venous MR images was made available from the Beaujon Hospital in Clichy, France. MRI was acquired between the years 2015 and 2020 using a standard triphasic institutional imaging protocol.

Public validation data

For additional external validation, the publicly available LiverHccSeg [19] dataset was used.

Image processing and liver segmentation

All images were converted to the Neuroimaging Informatics Technology Initiative (NIfTI) format. Subsequently, all livers were manually segmented under the supervision of two board-certified abdominal radiologists (S.A. and S.H. with 9 and 10 years of experience, respectively) using the software 3D Slicer (v4.11) [20].

Model development

For model development and evaluation, our institutional dataset was randomly split into training, validation, and internal testing subsets containing 70/15/15% of the data, respectively.
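As a minimal illustration of this split (the released repository defines the actual split and any random seed; the variable names and seed below are placeholders), the data can be partitioned as follows:

```python
# Sketch of a 70/15/15 random split into training, validation, and internal test sets.
# case_ids is a placeholder list of de-identified study identifiers; the seed is an assumption.
import numpy as np

rng = np.random.default_rng(seed=0)
indices = rng.permutation(len(case_ids))
n_train = int(0.70 * len(case_ids))
n_val = int(0.15 * len(case_ids))
train_ids = [case_ids[i] for i in indices[:n_train]]
val_ids = [case_ids[i] for i in indices[n_train:n_train + n_val]]
test_ids = [case_ids[i] for i in indices[n_train + n_val:]]  # remaining ~15%
```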

A DCNN was trained to automatically segment the liver on 3D T1-weighted portal venous MR images using the manual liver segmentations as ground truth. The architecture of the DCNN is based on the U-Net [21], including two residual units, and was trained with a dropout rate of 0.3, using mini-batches of size 32 with batch normalization. To avoid model overfitting, we continuously assessed the performance of the model on the validation set during training over 1000 epochs to determine the best-performing model. Full details of the model architecture can be found in Supplement 1. Input images were resampled to 2×2×2 mm voxel spacing and normalized such that the 25th and 75th image intensity percentiles were scaled to –0.5 and +0.5, respectively [22]. During every training iteration, 16 random 3D patches (64×64×32 voxels) were extracted from the input image at a 3:1 liver-to-background ratio to focus the training process on the liver area. The Dice loss [23] was optimized using the Adam optimizer [24] with a fixed learning rate of 0.0001. The resulting model comprised 1,187,921 trainable parameters and was implemented in Python (v3.8.3) using the open-source Medical Open Network for AI (MONAI) (v0.3.0) framework and PyTorch (v1.5.1) on a Linux workstation with an NVIDIA Quadro RTX 8000 GPU.
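A minimal sketch of this setup is shown below, assuming a recent MONAI release (the study used MONAI v0.3.0, where some transform names differ) and illustrative channel widths and strides; the exact configuration is given in Supplement 1 and the released repository.

```python
# Sketch of the preprocessing and training setup described above (illustrative, not the exact code).
import torch
from monai.networks.nets import UNet
from monai.losses import DiceLoss
from monai.transforms import (
    Compose, LoadImaged, EnsureChannelFirstd, Spacingd,
    ScaleIntensityRangePercentilesd, RandCropByPosNegLabeld,
)

# Resample to 2x2x2 mm and map the 25th/75th intensity percentiles to -0.5/+0.5.
train_transforms = Compose([
    LoadImaged(keys=["image", "label"]),
    EnsureChannelFirstd(keys=["image", "label"]),
    Spacingd(keys=["image", "label"], pixdim=(2.0, 2.0, 2.0), mode=("bilinear", "nearest")),
    ScaleIntensityRangePercentilesd(keys=["image"], lower=25, upper=75, b_min=-0.5, b_max=0.5, clip=False),
    # 16 random 64x64x32 patches per image, sampled at a 3:1 liver-to-background ratio.
    RandCropByPosNegLabeld(keys=["image", "label"], label_key="label",
                           spatial_size=(64, 64, 32), pos=3, neg=1, num_samples=16),
])

# 3D U-Net with two residual units, batch normalization, and dropout of 0.3
# (the channel widths and strides below are assumptions).
model = UNet(spatial_dims=3, in_channels=1, out_channels=1,
             channels=(16, 32, 64, 128), strides=(2, 2, 2),
             num_res_units=2, norm="batch", dropout=0.3)
loss_fn = DiceLoss(sigmoid=True)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)
```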

For model inference, a sliding window approach was used to segment the entire image field-of-view. Overlapping 3D patches of size 64×64×32 voxels were extracted at regular increments of 16×16×8 voxels, and the predictions of overlapping patches were averaged using Gaussian weighting according to the patch center. This approach accommodates images with different field-of-view sizes.
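A minimal sketch of this inference step, assuming the model and preprocessing from the sketch above, uses MONAI's sliding-window inferer; an overlap of 0.75 corresponds to the stated 16×16×8-voxel step for 64×64×32 patches, and the Gaussian mode applies the described weighting.

```python
# Sketch of sliding-window inference over the full field-of-view (illustrative).
import torch
from monai.inferers import sliding_window_inference

model.eval()
with torch.no_grad():
    # image: preprocessed tensor of shape (batch=1, channels=1, D, H, W)
    logits = sliding_window_inference(
        inputs=image, roi_size=(64, 64, 32), sw_batch_size=16,
        predictor=model, overlap=0.75, mode="gaussian",
    )
    liver_mask = torch.sigmoid(logits) > 0.5  # binary liver segmentation
```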

Model evaluation and statistical analysis

Segmentation performance

To quantify segmentation accuracy, Dice similarity coefficient (DSC), Modified Hausdorff distance (MHD), and mean absolute distance (MAD) metrics were calculated between the manual and automated liver segmentations. The equations for calculating DSC, MHD, and MAD can be found in Supplement 2.
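For reference, the DSC between the manual segmentation $A$ and the automated segmentation $B$ follows its standard definition (the distance-based metrics are defined in Supplement 2):

$$DSC\left(A,B\right)=\frac{2\left|A\cap B\right|}{\left|A\right|+\left|B\right|}.$$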

Volumetry performance

To assess liver volumetry accuracy, liver volumes were calculated based on the manual and automated liver segmentations. The intraclass correlation coefficient (ICC) (two-way mixed, single measures, ICC(3,1)) was calculated to assess the agreement between manual and automated volumetry. Furthermore, the absolute volume error and relative volume error were calculated as follows:

$$Absolute\ volume\ error=\left|{vol}_{man}-{vol}_{auto}\right|,$$
$$Relative\ volume\ error=\frac{\left|{vol}_{man}-{vol}_{auto}\right|}{vol_{man}},$$

where ${vol}_{man}$ and ${vol}_{auto}$ are the volumes from the manual and automated liver segmentations, respectively.
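As a minimal illustration (assuming binary NIfTI segmentation masks read with nibabel; the file paths are placeholders), liver volume can be computed by counting labeled voxels and multiplying by the voxel volume from the image header:

```python
# Sketch of liver volumetry from binary segmentation masks (illustrative).
import numpy as np
import nibabel as nib

def volume_ml(mask_path: str) -> float:
    """Liver volume in milliliters (ccm) from a binary NIfTI segmentation."""
    seg = nib.load(mask_path)
    mask = np.asarray(seg.dataobj) > 0
    voxel_mm3 = float(np.prod(seg.header.get_zooms()[:3]))  # voxel volume in mm^3
    return float(mask.sum()) * voxel_mm3 / 1000.0           # 1000 mm^3 = 1 ccm

vol_man = volume_ml("liver_manual.nii.gz")      # placeholder paths
vol_auto = volume_ml("liver_automated.nii.gz")
absolute_volume_error = abs(vol_man - vol_auto)
relative_volume_error = absolute_volume_error / vol_man
```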

Radiomic feature reproducibility

A total of 107 radiomic features, comprising 18 first-order statistics features, 24 gray-level co-occurrence matrix (glcm) features, 14 gray-level dependence matrix (gldm) features, 16 gray-level run-length matrix (glrlm) features, 16 gray-level size zone matrix (glszm) features, 5 neighboring gray-tone difference matrix (ngtdm) features, and 14 shape features, were extracted from the manual and automated liver segmentations using the software PyRadiomics (v3.0) [25], and their agreement was assessed using the ICC(3,1). The PyRadiomics settings used for feature extraction are provided in Supplement 3.
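A minimal sketch of this extraction with PyRadiomics is shown below; the extractor is initialized with default settings here, whereas the study's actual settings are listed in Supplement 3, and the file paths are placeholders.

```python
# Sketch of whole-liver radiomic feature extraction with PyRadiomics (illustrative).
from radiomics import featureextractor

extractor = featureextractor.RadiomicsFeatureExtractor()  # default settings (assumption)
extractor.enableAllFeatures()  # first-order, glcm, gldm, glrlm, glszm, ngtdm, and shape classes

# Extract features once with the manual mask and once with the automated mask.
features_manual = extractor.execute("portal_venous.nii.gz", "liver_manual.nii.gz")
features_auto = extractor.execute("portal_venous.nii.gz", "liver_automated.nii.gz")
```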

Statistical analysis

Descriptive statistics were summarized as absolute and relative frequencies (n and %) for categorical variables or mean and standard deviation (SD) or median and interquartile range (IQR) for continuous variables. To assess data consistency across the training, validation, and internal testing sets, we employed a one-way ANOVA test for continuous characteristics, and a Chi-squared test for categorical characteristics. To evaluate the algorithm’s segmentation and volumetry generalizability, a Mann-Whitney U test was used to compare the segmentation performance from the internal to the external and public test sets, and between HCC and hepatic adenoma patients in the external test set. All statistical analyses were carried out in Python (v3.8.3) using the SciPy (v1.5.2) library and p values <0.05 were considered statistically significant.
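A minimal sketch of the hypothesis tests named above, using SciPy with placeholder variable names (the ICC analysis is not shown), could look as follows:

```python
# Sketch of the statistical comparisons described above (illustrative).
from scipy import stats

# One-way ANOVA for a continuous characteristic across training/validation/internal test sets.
f_stat, p_anova = stats.f_oneway(age_train, age_val, age_test)

# Chi-squared test for a categorical characteristic across the three splits.
chi2, p_chi2, dof, expected = stats.chi2_contingency(contingency_table)

# Mann-Whitney U test comparing per-case DSC between internal and external test sets.
u_stat, p_mwu = stats.mannwhitneyu(dsc_internal, dsc_external, alternative="two-sided")
```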

Results

Internal study population

A total of 470 HCC patients with T1-weighted portal venous MR images were included in this study (mean age, 63.9 years ± 8.2 [standard deviation]; 376 males). Patients <18 years of age (n=2), with non-diagnostic MRI (n=14), or without portal venous phase MRI (n=84) were excluded from the study (Fig. 1). Patient characteristics are summarized in Table 1, and MR imaging parameters are reported in Supplemental Table 1. HCC was proven by either imaging criteria or histopathology. Manual segmentation determined reference liver volumes (mean volume, 1687.9 ccm ± 534.7).

Fig. 1
figure 1

Flowchart of patient inclusion

Table 1 Patient characteristics

External study population

For external validation, 20 T1-weighted portal venous MR images were made available for external testing from a collaborating institution. Patients (mean age, 41 years ± 12; 13 males) were diagnosed with either HCC (n=10) or hepatic adenoma (n=10) and diagnoses were based on histopathology. Manual segmentation determined reference liver volumes (mean volume, 1728.4 ccm ± 647.7).

Public study population

A total of 17 HCC patients (mean age, 61 years ± 10.77; 11 males) with T1-weighted portal venous MR images were included for additional external testing. Manual segmentation determined reference liver volumes (mean volume, 1916.3 ccm ± 459.6).

Final segmentation model

The total training time of the network was 6.35 hours. Evaluation using the validation set identified the best-performing model at epoch 880, where the validation DSC reached 0.97. The training and validation loss curves can be found in Supplemental Figure 3.

Algorithm segmentation times demonstrated no substantial differences between the internal (mean time, 0.54 s ± 0.20) and external (mean time, 0.70 s ± 0.70) test sets (p=0.17) or between the internal and public (mean time, 0.604 s ± 0.230) test sets (p=0.08).

Segmentation performance

Segmentation accuracy of the DCNN was similarly high in the internal (mean DSC, 0.968 ± 0.016) and external (mean DSC, 0.961 ± 0.032) test sets (p=0.28). Notably, in the external test set the segmentation performance was similarly high in HCC (mean DSC, 0.970 ± 0.009) and hepatic adenoma (mean DSC, 0.953 ± 0.044) patients (p=0.21). The DCNN demonstrated adequate segmentation performance in the public test set (mean DSC, 0.93 ± 0.03). However, the overall segmentation performance in the public dataset was significantly lower than in the internal (p<0.001), and external (p=0.004) test sets.

Table 2 summarizes all segmentation performance metrics for all datasets. Examples of representative liver segmentations are shown in Fig. 2, and Fig. 3 shows two segmentation failure cases with DSC <0.95 and poor qualitative performance. No substantial segmentation performance differences were noted across different clinical findings (Table 3; e.g., ascites, Supplemental Figure 1) or in cases with image artifacts or reduced image quality (Supplemental Figure 2) indicating robust generalizability.

Table 2 Liver segmentation performance
Fig. 2
figure 2

Representative segmentations. Representative liver segmentations on axial portal venous phase contrast-enhanced magnetic resonance images from the internal (left), external (middle), and public (right) test sets with Dice similarity coefficients of 0.977, 0.964, and 0.944, respectively. The manual liver segmentations are overlaid in yellow, and the automated segmentations are overlaid in blue

Fig. 3
figure 3

Segmentation failure cases. Liver segmentation failure cases from the internal (left), external (middle), and public (right) test sets with Dice similarity coefficients of 0.945, 0.830, and 0.889, respectively. Axial portal venous phase contrast-enhanced magnetic resonance images are shown with the corresponding manual liver segmentations overlaid in yellow and the automated segmentations overlaid in blue. In the example on the left, the algorithm segmented parts of the duodenum as liver tissue (arrow). In the examples in the middle and on the right, the algorithm excluded parts of a large tumor from the liver segmentation (arrows)

Table 3 Statistical evaluation of performance generalizability

Volumetry performance

Liver volumes determined by manual and automated liver volumetry demonstrated ICCs of 0.99 [95%CI 0.99, 1.00] (p<0.001), 0.97 [95%CI 0.93, 0.99] (p<0.001), and 0.85 [95%CI 0.62, 0.94] (p<0.001) for the internal, external, and public test sets, respectively. Absolute volume errors did not differ significantly between the internal and external test sets (median volume error, 31.7 ccm [interquartile range (IQR), 24.7] vs 19.3 ccm [IQR, 32.4]; p=0.12), nor did relative volume errors (median error, 2.0% [IQR, 1.8] vs 1.3% [IQR, 2.0]; p=0.48). Table 4 summarizes the volumetry performance measures for all datasets. Comparisons of liver volumes determined by manual and automated volumetry are shown in the scatterplot in Fig. 4. In the external test set, there were two cases in which the algorithm substantially underestimated the liver volumes, corresponding to the cases with the lowest segmentation performance (DSC 0.830 and 0.949), resulting in absolute volume errors of 792.5 ccm and 260.9 ccm, respectively.

Table 4 Liver volumetry performance
Fig. 4
figure 4

Scatter plots of liver volume measurements on portal venous phase contrast-enhanced magnetic resonance images determined by manual and automated liver volumetry. In the external test set, there were two cases in which the liver volumes were substantially underestimated corresponding to the cases with the lowest Dice similarity coefficients (0.830 and 0.949) resulting in absolute volume errors of 792.5 ccm and 260.9 ccm, respectively

Radiomic feature reproducibility

Radiomic features derived from automated liver segmentations demonstrated significant agreement with those derived from manual segmentations (p<0.05 for all features in all datasets, except the sphericity shape feature in the external test set: p=0.24) in the internal (mean ICC, 0.98 ± 0.04; range 0.80–1.00), external (mean ICC, 0.94 ± 0.10; range 0.16–1.00), and public (mean ICC, 0.91 ± 0.09; range 0.51–1.00) test sets. Supplemental Table 2 reports ICCs and 95% confidence intervals for each individual radiomic feature in each dataset.

Discussion

Using a large dataset, we developed a deep learning algorithm for automated liver segmentation on T1-weighted portal venous phase contrast-enhanced MRI. Accurate liver segmentation is key for anatomical guidance in computer-assisted surgery [9] and radiotherapy [8, 10] and is also a pivotal pre-processing step for subsequent automated lesion detection algorithms [11]. Our segmentation framework attained robust liver segmentation results on internal (mean Dice similarity coefficient (DSC), 0.97), external (mean DSC, 0.96), and public (mean DSC, 0.93) test sets indicating generalizable performance to external data.

Radiological assessment of the liver is an important diagnostic task for hepatocellular carcinoma patients. Accurate and reproducible liver volumetry is key for clinical decision-making and treatment planning. Therefore, methods for automated volumetry enhance workflows and facilitate implementation into widespread clinical practice. Our method yielded fast processing times (mean runtime, 0.54 s) and automated liver volumetry demonstrated acceptable relative volume errors of 2.44% and 2.97%, and excellent agreement [26] with manual volumetry in the internal (intraclass correlation coefficient (ICC), 0.99) and external (ICC, 0.97) test sets and good agreement with manual volumetry in the public test set (ICC, 0.85). Finally, we demonstrate that automated liver segmentation provides robust and reproducible radiomic feature extraction compared to manual segmentation in the internal (mean ICC, 0.98), external (mean ICC, 0.94), and public (mean ICC, 0.91) test sets.

Traditional methods for liver segmentation are based on seeded region growing [27], support vector classification with watershed [28], watershed segmentation coupled with active contouring [29], and convolutional neural networks with graph cut [30]. Each has its strengths and limitations, making them suitable for different scenarios. However, in comparison to traditional image segmentation methods, deep learning-based 3D convolutional neural networks offer superior performance, especially for complex and large-scale segmentation tasks when sufficient labeled training data are available. While traditional methods have the advantage of simplicity and computational efficiency in certain scenarios, the ability of DCNNs to learn intricate features from data has elevated their effectiveness in various medical imaging applications, including liver segmentation. It is worth noting that the success of DCNNs relies heavily on the availability of diverse labeled training data [16] and on computational resources for training. Compared with other automated liver volumetry approaches, our approach showed higher volume agreement with higher ICC values and lower relative volume error rates [31,32,33]. Furthermore, external testing is critical when evaluating deep learning algorithms in order to detect overfitting and dataset shift [14]. External test sets are used infrequently, and most studies report decreased model performance on external testing data [15], indicating limited generalizability. Only a small number of MRI liver segmentation studies [33, 34] evaluated algorithm performance on external testing data. In this study, we demonstrated generalizable performance by testing on a large internal test set (n=71), an external test set (n=20), and the publicly available LiverHccSeg dataset (n=17) for further external validation.

Additionally, heterogeneous and diverse training data are key for generalizable and robust liver segmentation performance [16]. Advanced liver cancer substantially alters liver morphology as it leads to cancer-related tissue changes, heterogeneous liver tissue, and liver shape deformity [35, 36]. Deep learning methods not trained on data comprising the full spectrum of disease-related liver changes failed to segment livers with advanced tumor stages [16]. This contrasts with other methods evaluated on patient cohorts without intrahepatic lesions [34]. Other segmentation methods failed in patients with intrahepatic lesions that altered the liver parenchyma [33]. Our study utilized a diverse dataset of HCC patients with different disease etiologies (such as alcoholic steatohepatitis, non-alcoholic fatty liver disease, hepatitis C, hepatitis B, autoimmune, and cryptogenic) that included varying numbers of tumors of different sizes as well as portal vein thrombosis, tumor thrombi, infiltrative disease, and ascites. Furthermore, we used MRI scans acquired on a range of scanners from different manufacturers with both extracellular and liver-specific contrast agents, and some scans had limited image quality or artifacts, reflecting data encountered in real-world settings.

For the clinical implementation of automated liver segmentation and volumetry tools, simple processing routines and fast processing times are key for use in clinical practice. Our approach does not require extensive preprocessing, such as image registration, and takes less than 1 s of processing time, which compares favorably to other methods with longer processing times [31, 32, 37]. Since our method relies on only a single contrast phase, image artifacts on any other imaging sequence do not affect the quality of the liver segmentation and the resulting volume estimation. In comparison, methods using multiphasic data [37] depend on the image quality of all sequences used and on their registration. Other approaches performed liver segmentation on non-contrast MRI but obtained overall inferior DSC results [34, 38, 39].

Recently, quantitative imaging biomarkers such as radiomics have gained increased interest in liver disease studies [40], and automated segmentation methods are needed to reduce the segmentation effort required for feature extraction. Good reproducibility of radiomic features from automated segmentations was demonstrated in cervical cancer [41] but, to our knowledge, has not been confirmed in the liver. Here, we demonstrated the robustness of radiomic features extracted from automated liver segmentations, which can therefore be used for automated and reproducible radiomic feature extraction. While many studies use segmentations of specific regions or volumes of interest for feature extraction, whole-liver imaging biomarkers allow for a thorough assessment of the entire liver encompassing all its regions and structures [42]. This is particularly relevant in cases with multiple lesions or widespread liver disease. It reduces the risk of missing lesions in the segmentation and may be beneficial in cases with multiple or infiltrative lesions, as they “blend” into the background of the cirrhotic liver [43] and can therefore be difficult to segment. Additionally, this automated liver segmentation approach could be used as pre-processing for other methods, such as deep learning, to extract novel imaging biomarkers from the liver beyond standard radiomics.

This study has limitations. First, the proposed automated liver segmentation method relies on contrast-enhanced T1-weighted MRI; however, in clinical practice, most scans are acquired using intravenous contrast for better detection and characterization of liver lesions. Second, this study tested algorithm performance only in patients with liver pathologies, and further validation in individuals with healthy liver parenchyma is required. Third, the observed discrepancy in model performance between the public dataset and our internal and external test sets could be attributed to several factors, primarily related to differences in the characteristics of the data. The public dataset, collected between 1993 and 2007, encompasses an earlier timeframe compared to our internal and external test sets; advancements in MRI technology, imaging protocols, and the overall quality of medical imaging over the years may contribute to these disparities. Future work will assess the segmentation of liver lobes, liver segments, and liver vessels, and we aim to expand our testing scope by incorporating larger test sets from multiple sites. Additionally, generalization to alternative MRI sequences, such as T2-weighted imaging, requires future validation. To this end, incremental learning or transfer learning strategies can leverage the model developed in this study as a starting point for future deep learning models that generalize across MRI sequences.

In conclusion, the presented deep convolutional neural network can perform accurate and fast liver segmentation on T1-weighted portal venous MR images with generalizable performance to external and public testing data. Automated liver volumetry shows excellent agreement with manual volumetry, and automated liver segmentations can be used for reliable radiomic feature extraction.