Introduction

Liver cancer is the second most frequent cause of cancer-related death worldwide [1] and the fifth leading cause in the United States [2]. In contrast to most other cancer types, liver cancer incidence and mortality rates are rising [3, 4]. Hepatocellular carcinoma (HCC) is the most prevalent form of primary liver cancer, accounting for 70–85% of liver cancers globally [5]. Imaging is a critical tool for diagnosing and staging HCC. Magnetic resonance imaging (MRI) provides high spatial resolution and allows soft-tissue characterization of the liver. The Liver Imaging Reporting and Data System (LI-RADS) [6] was developed for non-invasive diagnosis based on imaging criteria, and in most cases HCC can be detected and diagnosed using contrast-enhanced multiphasic imaging without an invasive biopsy [7].

Various scoring and staging systems have been proposed to quantify mortality risk in HCC and to stratify patients into risk categories [8,9,10,11,12,13,14]. However, these systems use only limited one-dimensional tumor size measurements [9,10,11,12,13], do not exploit quantitative imaging biomarkers, and stratify patients into risk groups based on strict thresholds. One-dimensional tumor size measurements have major limitations in reflecting tumor viability, actual tumor size, and growth potential [15] and are subject to inter-rater variability. Quantitative imaging biomarkers, such as radiomic features, can be extracted from regions and volumes of interest in medical imaging data; however, automated segmentation methods are required for integration into clinical workflows. Fully automated whole-liver segmentation based on deep learning has demonstrated robust, reproducible, and generalizable segmentation performance across disease stages with substantially altered liver morphology, with processing times on the order of seconds [16].

The aim of this study was twofold: (1) to develop and validate an automated machine learning method for mortality risk quantification in HCC patients using only routinely available standard-of-care clinical variables and radiomic features derived from automated liver segmentations on baseline multiphasic contrast-enhanced MRI, and (2) to use the resulting risk score to stratify patients into low-, intermediate-, and high-risk groups.

Materials and methods

Compliance with ethical standards

This HIPAA-compliant study was approved by the Yale School of Medicine institutional review board with full waiver of consent and conducted in accordance with the Declaration of Helsinki.

Code availability

All code for the methodological implementation and the trained framework is publicly available on GitHub: https://github.com/OnofreyLab/hcc-mortality-risk

Data availability

Patient data and imaging data used in this paper cannot be shared publicly due to legal reasons.

Patient inclusion and exclusion

This retrospective study identified all patients with treatment-naïve HCC treated at our institution between 2008 and 2019. HCC was proven either by imaging criteria or by biopsy. We included

  (i) all patients >18 years old

  (ii) who had multiphasic contrast-enhanced MRI at the time of diagnosis.

We excluded all patients

  (i) with missing clinical information,

  (ii) without a triphasic MRI acquisition, or

  (iii) with a non-diagnostic MRI.

The patients were randomly sampled into independent cohorts, with 85% for development and 15% for independent validation.

Clinical data

Clinical data were collected from the hospital’s electronic health record system, and conventional clinical staging scores [8,9,10,11,12,13,14] were calculated. Patients were evaluated for extrahepatic metastases on chest computed tomography (CT) and bone scans at the date of diagnosis, and the following laboratory values were collected closest to the date of imaging: alpha-fetoprotein (AFP), total bilirubin, direct bilirubin, serum albumin, international normalized ratio (INR), partial thromboplastin time (PTT), sodium, and creatinine.

MRI acquisition and radiomics extraction

Multiphasic contrast-enhanced MRI was acquired using a standard institutional imaging protocol with T1-weighted breath-hold sequences before contrast administration (pre-contrast phase) and at 12–18 s (late arterial phase), 60–70 s (portal venous phase), and 3–5 min (delayed phase) after contrast injection. The scans were de-identified and downloaded from the picture archiving and communication system (PACS) server. Subsequently, automated image co-registration and whole-liver segmentation were performed using a convolutional neural network [17], followed by radiomic feature extraction. Full details on the image processing pipeline can be found in Supplement 1.
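Several of the most important features in this study are first-order statistics of Laplacian-of-Gaussian (LoG) filtered images (see the feature naming convention in Fig. 3). As an illustrative sketch only, not the study's pipeline (which is detailed in Supplement 1), such features can be computed as follows; the function name, synthetic volume, mask, and sigma value are assumptions:

```python
import numpy as np
from scipy.ndimage import gaussian_laplace

def log_firstorder_features(volume, mask, sigma=1.0):
    """Illustrative first-order radiomic features of a LoG-filtered volume.

    The LoG filter emphasizes edges and texture at a spatial scale set by
    sigma; statistics are computed only over voxels inside the liver mask.
    """
    filtered = gaussian_laplace(volume.astype(np.float64), sigma=sigma)
    voxels = filtered[mask > 0]
    hist, _ = np.histogram(voxels, bins=32)
    p = hist[hist > 0] / voxels.size          # discrete intensity distribution
    return {
        f"log-sigma-{sigma}_firstorder_Mean": float(voxels.mean()),
        f"log-sigma-{sigma}_firstorder_StdDev": float(voxels.std()),
        f"log-sigma-{sigma}_firstorder_Entropy": float(-(p * np.log2(p)).sum()),
    }

# Synthetic 3D "MRI phase" volume and liver mask for demonstration only
rng = np.random.default_rng(0)
vol = rng.normal(100.0, 20.0, size=(16, 64, 64))
liver = np.zeros(vol.shape, dtype=np.uint8)
liver[4:12, 16:48, 16:48] = 1

features = log_firstorder_features(vol, liver, sigma=1.0)
```

In the full pipeline, analogous features would be extracted from each of the four contrast phases, yielding feature names such as "log-sigma-1.0_firstorder_Mean_art".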

Survival model and graded prognostic assessment

A random survival forest (RSF) [18] is a non-parametric method that uses an ensemble of survival trees to analyze right-censored time-to-event data. In this study, we fit an RSF with overall survival (OS) as the dependent variable and the combination of all clinical and radiomic variables as independent variables (termed "candidate variables"). OS was defined as the period between the imaging date and the date of death from any cause. Patients were censored at their last date of follow-up, at the end-of-observation date, or at the date of liver transplantation. The following RSF settings were used: 1000 estimator trees were grown, with a minimum of 10 samples required to split an internal node, 15 samples required at a leaf node, and \(\sqrt{n}\) variables considered when searching for the best split.

Variable selection proceeded in three steps. First, candidate variables with a Pearson correlation coefficient > 0.9 in the development cohort were excluded to reduce multicollinearity. Second, we fit the RSF on the remaining candidate variables and obtained a ranked list of variable importance scores by ten permutations of random shuffling. Third, we re-fit the final survival model on the 30 most important variables. To aid interpretability of the proposed method, we again computed variable importance scores, by ten permutations of random shuffling, for all included variables. Our method was implemented in Python (v3.8.11) using the scikit-survival package (v0.16.0).

Three risk groups were defined from the risk score predictions of the proposed model using the "rhier" function of the R package "rolr" (v1.0), which applies a hierarchical method based on ordered logrank tests [19], to stratify patients into low-, intermediate-, and high-risk groups. The risk score cutoffs for each risk group were derived from the development cohort, with a minimum of 25 subjects per risk group. The same cutoff values were then applied for risk group stratification in the independent validation cohort.
For performance evaluation of the proposed model and conventional clinical staging scores, we calculated Harrell’s C-index [20] and the area under the time-dependent receiver operating characteristic curve (AUC) [21] at 1–5 years after the date of imaging.
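Harrell's C-index measures the fraction of comparable patient pairs whose predicted risks are correctly ordered with respect to observed survival. A minimal reference implementation, an illustrative sketch rather than the scikit-survival routine used in the study, is:

```python
import numpy as np

def harrell_cindex(time, event, risk):
    """Harrell's concordance index for right-censored data.

    A pair (i, j) is comparable when patient i has an observed death and
    patient j survives longer; it is concordant when i also has the higher
    predicted risk. Ties in risk count as 0.5.
    """
    time, event, risk = map(np.asarray, (time, event, risk))
    concordant = comparable = 0.0
    for i in range(len(time)):
        if not event[i]:
            continue                     # pairs are anchored on observed deaths
        for j in range(len(time)):
            if time[j] > time[i]:        # j outlived i -> comparable pair
                comparable += 1
                if risk[i] > risk[j]:
                    concordant += 1
                elif risk[i] == risk[j]:
                    concordant += 0.5
    return concordant / comparable

# Toy example: predicted risk perfectly ordered against survival -> C-index 1.0
t = [2.0, 5.0, 7.0, 9.0]
e = [1, 1, 0, 1]                          # 1 = death observed, 0 = censored
r = [4.0, 3.0, 2.0, 1.0]
c = harrell_cindex(t, e, r)               # -> 1.0
```

A C-index of 0.5 corresponds to random ordering, and 1.0 to perfect risk discrimination.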

Statistical analysis

Statistical analyses were conducted in SPSS (v27), R (v4.1.1), and Python (v3.8.11). p values < .05 were considered statistically significant, and Bonferroni correction was applied when comparing multiple groups. Cox proportional hazards regression was used to determine the association of the developed risk score and the proposed risk groups with OS. Median OS (mOS) was calculated, and Kaplan-Meier survival curves were plotted for each risk group and compared using a logrank test. Survival rates of the proposed risk groups were calculated at 1, 3, and 5 years after the date of imaging. To assess the generalizability of the risk groups' survival times to new cohorts, the survival times of each risk group were compared between the development and validation cohorts using a logrank test.
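The Kaplan-Meier curves referenced above are product-limit estimates of the survival function. A minimal sketch of the estimator on toy data is shown below; this is illustrative only (the study computed its survival curves in R), and the toy follow-up times are assumptions:

```python
import numpy as np

def kaplan_meier(time, event):
    """Product-limit (Kaplan-Meier) estimate of the survival function.

    Returns the distinct observed death times and the estimated survival
    probability S(t) just after each of them.
    """
    time, event = np.asarray(time, float), np.asarray(event, bool)
    death_times = np.unique(time[event])
    surv, s = [], 1.0
    for t in death_times:
        at_risk = np.sum(time >= t)              # neither dead nor censored before t
        deaths = np.sum((time == t) & event)
        s *= 1.0 - deaths / at_risk              # multiply conditional survival
        surv.append(s)
    return death_times, np.array(surv)

# Toy cohort: follow-up in months; event 1 = death observed, 0 = censored
t = [6, 6, 7, 10, 13, 16, 22, 23]
e = [1, 1, 0, 1, 0, 1, 1, 0]
times, s = kaplan_meier(t, e)
# times -> [6, 10, 16, 22]; s -> [0.75, 0.6, 0.4, 0.2]
```

Risk groups are then compared by applying a logrank test to such curves, as done for the low-, intermediate-, and high-risk groups in this study.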

Results

Patient characteristics

A total of 555 patients (mean age, 63.8 years ± 8.9 [standard deviation]; 118 females) with treatment-naïve HCC and multiphasic contrast-enhanced MRI at the time of diagnosis were included in the study. Patients without MRI at baseline (n = 501), aged < 18 years (n = 2), with missing clinical information (n = 16), without triphasic image acquisition (n = 84), or with non-diagnostic MRI (n = 14) were excluded (Fig. 1). Patient baseline characteristics are summarized in Table 1, and MRI parameters are reported in Supplemental Table 1. HCC was proven either by imaging criteria or by histopathology.

Fig. 1

Flowchart of patient inclusion and exclusion. From an institutional database with 1172 patients, 555 patients (118 females, 437 males, 63.8 ± 8.9 years) with imaging- or histopathologically proven treatment-naïve hepatocellular carcinoma and baseline multiphasic contrast-enhanced magnetic resonance imaging at the time of diagnosis were included in the study

Table 1 Patient baseline characteristics

A total of 287 (51.7%) patients died after a median time of 14.40 months (range, 0.20–97.12 months; interquartile range (IQR), 22.23) after the date of imaging, and patients were followed up for a median of 32.47 months (range, 0.20–118.90 months; IQR, 61.5) after the date of imaging. The median time between the laboratory results and the imaging date was 10 days (IQR, 28.3). First treatments based on the institution's multidisciplinary tumor board decisions were as follows: 192 (34.6%) patients underwent transarterial chemoembolization, 138 (24.9%) thermal ablation, 82 (14.8%) hepatectomy, 68 (12.3%) a combination of transarterial chemoembolization and thermal ablation, 24 (4.3%) sorafenib, 24 (4.3%) best supportive care, 20 (3.6%) transarterial radioembolization with yttrium-90, and 7 (1.3%) liver transplantation. For model development and validation, a total of 471 (85%) patients were randomly allocated to the development cohort and 84 (15%) to the independent validation cohort.

Survival model

Figure 2 summarizes the entire model development pipeline. The proposed model attained C-indices of 0.8503 and 0.8234 in the development and validation cohorts, respectively. Table 2 summarizes all performance metrics for the proposed model and the conventional clinical staging systems. For interpretability of the proposed model, Fig. 3 depicts the variable importance scores of the included variables. On average, the proposed framework required a running time of 1.11 min per patient (automated liver segmentation, 0.70 s; extraction of the 23 included radiomic features, 1.09 min; model prediction, 0.42 s).

Fig. 2

Model development. An automated liver segmentation framework was adopted for radiomic feature extraction after automated image co-registration. To predict overall survival, a random survival forest was fit from a combination of clinical and radiomic variables. Model performance was evaluated using Harrell’s C-index and the area under the time-dependent receiver operating characteristic curve (AUC). Patients were stratified into low-, intermediate-, and high-risk groups based on their predicted risk scores

Table 2 Performance evaluation
Fig. 3

Variable importance scores. The bar chart shows the mean variable importance score (error bars show standard deviation) of each included variable of the final risk prediction model obtained by 10 permutations of random shuffling. Naming convention of radiomic features: The prefix specifies the image type (original image or filter-derived MR image (“log”: Laplacian of Gaussian) with extraction parameters); the suffix specifies the MR contrast phase (“_pre”: pre-contrast phase, “_art”: late arterial phase, “_pv”: portal venous phase, “_del”: delayed phase). Equations for the calculation of each radiomic feature are available in ref. [22]. (AFP: Alpha-fetoprotein; INR: international normalized ratio; PTT: partial thromboplastin time)

Mortality risk predictions and graded prognostic assessment

The distribution of risk scores in the development and validation cohorts is shown in Fig. 4. The mean (± standard deviation) predicted risk score was 121.57 (± 65.31) in the development cohort and 135.64 (± 67.59) in the validation cohort. Cox proportional hazards regression analysis showed a highly significant association between the predicted risk score and OS in the development cohort (coefficient, 0.021658 (p < .00001); HR, 1.022 (95% CI: 1.020, 1.024)) and in the validation cohort (coefficient, 0.021676 (p < .00001); HR, 1.022 (95% CI: 1.016, 1.028)). The cutoff values determined by the hierarchical method to stratify patients into low-, intermediate-, and high-risk groups based on the proposed model's predicted risk scores were 93.08 and 172.73. Detailed results of the Cox proportional hazards regression analysis for each risk group can be found in Supplemental Table 2. Example cases are shown in Fig. 5. In the development cohort, 193 (41%) patients were assigned to the low-risk group, 185 (39%) patients to the intermediate-risk group, and 93 (20%) patients to the high-risk group. In the validation cohort, 27 (32%) patients were allocated to the low-risk group, 32 (38%) patients to the intermediate-risk group, and 25 (30%) patients to the high-risk group. Supplemental Table 3 shows a cross-tabulation analysis of the proposed risk groups across conventional clinical staging systems.
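The reported hazard ratios follow directly from the Cox model coefficients, since the hazard ratio per one-unit increase in the risk score is HR = exp(β). A quick arithmetic check:

```python
import math

# Cox proportional hazards: the hazard ratio per one-unit covariate
# increase is the exponential of the regression coefficient beta.
hr_dev = math.exp(0.021658)   # development-cohort coefficient from the text
hr_val = math.exp(0.021676)   # validation-cohort coefficient from the text
# both round to the reported HR of 1.022
```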

Fig. 4

Distribution of predicted risk scores in the development and independent validation cohorts. Based on the risk score predictions of the survival model in the development cohort, we derived two cutoff points (93.08 and 172.73) to stratify patients into low-, intermediate-, and high-risk groups. We applied the same cutoff points for stratification in the independent validation cohort. For plotting, a Gaussian smoothing kernel was used

Fig. 5

Example cases. Standard-of-care clinical data and axial pre-contrast, late arterial, portal venous, and delayed-phase MRI with corresponding automated liver segmentations overlaid in blue. Low-risk group: a 59-year-old male patient presenting with a focal 4.5 cm lesion in the right liver lobe; the patient was censored after 8.75 years. Intermediate-risk group: a 54-year-old male patient presenting with a focal 3.1 cm lesion in the right liver lobe; the patient died 15 months after diagnosis. High-risk group: a 54-year-old male patient presenting with a focal 5.7 cm lesion in the right liver lobe; the patient died 6.7 months after diagnosis

Survival analysis

A comprehensive table with mOS times and 95% confidence intervals for the various strata can be found in Table 3. The mOS of the entire study cohort was 32.47 (95% CI: 28.40, 37.83) months. In the development and validation cohorts, the mOS was 32.57 (95% CI: 29.13, 38.50) months and 24.57 (95% CI: 14.73, NA) months, respectively. Survival rates (± standard error) for the proposed risk groups in the development and validation cohorts are summarized in Table 4. Survival times between the development and validation cohorts showed no statistically significant difference (p = .29). In the development cohort, mOS was 90.40 (95% CI: 62.97, NA) months in the low-risk group, 25.80 (95% CI: 23.50, 31.90) months in the intermediate-risk group, and 6.40 (95% CI: 5.03, 7.73) months in the high-risk group. In the validation cohort, mOS was NA (95% CI: 48.03, NA) months in the low-risk group, 19.60 (95% CI: 14.23, NA) months in the intermediate-risk group, and 5.40 (95% CI: 3.80, 12.63) months in the high-risk group. Kaplan-Meier curves for the developed risk groups are shown in Fig. 6. The low-, intermediate-, and high-risk groups demonstrated significantly different survival times in both cohorts (development cohort, p < .0001; validation cohort, p < .0001). Notably, no statistically significant difference was found when comparing the survival times of each risk group between the development and validation cohorts (low-risk group, p = 1.0; intermediate-risk group, p = 1.0; high-risk group, p = 1.0), indicating the generalizability of the risk groups' survival times to new data. Complete results of the logrank tests for pairwise OS comparisons between the proposed risk groups can be found in Supplemental Table 4. Kaplan-Meier curves for the conventional staging scores can be found in Supplemental Figure 1.

Table 3 Median overall survival times across staging systems
Table 4 Survival rates (± standard error) for the proposed risk groups
Fig. 6

Kaplan-Meier curves of the proposed risk groups in the development and validation cohorts

Discussion

Using a large dataset (n = 555), we developed and independently validated a fully automated framework for mortality risk prediction in hepatocellular carcinoma patients using routinely available standard-of-care clinical data and radiomic biomarkers from automated liver segmentations on baseline multiphasic contrast-enhanced MRI. This completely data-driven method yielded reliable, fast, and reproducible risk predictions and attained state-of-the-art performance for mortality risk quantification at baseline. The generalizability of our method was confirmed in an independent validation cohort, and its performance was compared against conventional staging systems. In addition, the proposed stratification into low-, intermediate-, and high-risk groups yielded similar overall survival times in the development and validation cohorts, further indicating the generalizability of the method. By using automated liver segmentations, we ensure that the extracted radiomic imaging markers are stable and reproducible, and we substantially accelerate the workflow, with segmentation computation times under a second and no need for human interaction [17]. Finally, we developed our method using a data-driven approach: instead of stratifying patients into risk groups based on one-dimensional image measurements [9,10,11,12,13], we derived risk group cutoff values using the full three-dimensional imaging volume. Such a risk prediction framework could enable personalized follow-up strategies, guide management decisions, and improve clinical workflow efficiency in tumor boards.

We hypothesize that the proposed framework outperformed conventional systems for mortality risk quantification in terms of C-index and AUC because it uses advanced quantitative imaging biomarkers from the whole liver volume and a data-driven approach for mortality risk prediction. Three-dimensional quantitative assessment of tumor burden has been shown to be a stronger predictor of patient survival than one-dimensional tumor size measurements [23], and one-dimensional tumor size measurements, as used in conventional staging systems, have shown major limitations in reflecting viability, actual tumor size, and growth potential [24]. Our approach uses the full 3D MR imaging data for the extraction of quantitative biomarkers, a far richer representation of the MRI study than one-dimensional representations such as diameter-based measurements. The 30 biomarkers with the highest importance scores comprise a combination of clinical features and radiomic features across all four phases of the MRI study (Fig. 3). These biomarkers effectively represent disease through the intensity patterns and textures summarized by the radiomics in the different phases of the imaging study. The performances of the conventional staging systems are in line with previously published studies [25,26,27,28,29,30,31]. Previous works have developed machine learning models for mortality risk quantification; however, direct comparison is challenging due to the different datasets, imaging modalities, and study endpoints used. Mei et al. [32] developed and validated a prognostic nomogram to predict survival in patients with unresectable HCC after hepatic arterial infusion chemotherapy in a cohort of 463 predominantly advanced-stage patients, attaining C-indices of 0.710 and 0.716 in their development and validation cohorts, respectively.
Furthermore, the authors proposed a risk stratification approach to classify patients into three or four risk groups, based on a trisection cutoff and a quartile cutoff point, with significantly different OS times. While their approach did not include quantitative imaging biomarkers, their data-driven design achieved superior performance compared to conventional staging systems in their cohort, similar to the findings of our study. Liu et al. [33] developed and validated a prognostic nomogram to predict overall survival in HCC patients after hepatectomy using radiomic features from tumor segmentations on portal venous phase computed tomography together with clinical and pathological variables in a cohort of 544 Chinese patients, and achieved C-indices of 0.747 and 0.777 in their development and validation cohorts, respectively. Based on the predicted risk scores, patients were allocated into low- and high-risk groups with significantly different OS times. The authors significantly improved performance by integrating radiomic features into the risk prediction model compared to clinical and pathological features alone. Blanc-Durand et al. [34] proposed a scoring system to stratify HCC patients undergoing transarterial radioembolization with yttrium-90 into a low- and a high-risk group. Radiomic features were derived from semi-automatic liver segmentations on pretreatment 18F-fluorodeoxyglucose positron emission tomography images. Their proposed risk score was significantly correlated with OS and could stratify patients into two risk groups with distinct OS times. However, their method was not evaluated on an independent validation cohort, no measures of prognostic power, such as the C-index or time-dependent AUCs, were reported, and their method relies on manually revised semi-automatic liver segmentations. In their multi-institutional study, Ji et al. [35] presented a machine learning framework to predict the time to recurrence after resection.
Their method was developed and evaluated in 470 patients with solitary HCC lesions, used radiomic features derived from manual tumor and peritumoral segmentations on baseline contrast-enhanced computed tomography images combined with clinical variables, and achieved C-indices of 0.733–0.801, outperforming conventional staging systems. The authors proposed a risk stratification approach to allocate patients into low-, intermediate-, and high-risk groups with significantly different OS times. Their study confirms the utility of a data-driven approach and of advanced quantitative imaging biomarkers for time-to-event data. However, their method relies on time-consuming manual tumor segmentation rather than a fully automated segmentation method such as our liver segmentation approach, which yields segmentations in under a second (0.70 s).

Our study has several limitations. First, our method was developed using retrospective data from a single institution; thus, prospective multicenter studies and external validation are warranted before potential clinical translation. However, our method was developed using a large dataset and yielded precise and generalizable results in an independent validation cohort. Furthermore, the large cohort size of this study adds a degree of robustness to the findings, and the relative simplicity of the analytical elements involved enhances the likelihood of reproducibility and the reliability of the findings. Due to these factors, we anticipate that other groups can replicate our study using the publicly available code. Second, we did not control for the effects of different treatment types on OS. Nevertheless, our method is predictive of OS even without any treatment information. Finally, the proposed method relies on multiphasic contrast-enhanced MRI and cannot be applied to patients who underwent other imaging modalities at baseline. In future work, we will incorporate longitudinal data from multiple external contributors and evaluate our method on a larger independent validation cohort.

In conclusion, we present a fully automated framework for mortality risk prediction in hepatocellular carcinoma patients using routinely available standard-of-care clinical data and radiomic features derived from automated liver segmentations on baseline multiphasic contrast-enhanced MRI, outperforming conventional staging systems for mortality risk quantification. The developed method, based on machine learning, can help personalize follow-up strategies, guide management decisions, and improve clinical workflow efficiency in tumor boards.