Deep learning-based 3D quantitative total tumor burden predicts early recurrence of BCLC A and B HCC after resection

Wei, Hong; Zheng, Tianying; Zhang, Xiaolan; Zheng, Chao; Jiang, Difei; Wu, Yuanan; Lee, Jeong Min; Bashir, Mustafa R.; Lerner, Emily; Liu, Rongbo; Wu, Botong; Guo, Hua; Chen, Yidi; Yang, Ting; Gong, Xiaoling; Jiang, Hanyu; Song, Bin

doi:10.1007/s00330-024-10941-y

Deep learning-based 3D quantitative total tumor burden predicts early recurrence of BCLC A and B HCC after resection

Hepatobiliary-Pancreas
Open access
Published: 19 July 2024

(2024)
Cite this article

Download PDF

You have full access to this open access article

European Radiology Aims and scope Submit manuscript

Deep learning-based 3D quantitative total tumor burden predicts early recurrence of BCLC A and B HCC after resection

Download PDF

Hong Wei^1,2^na1,
Tianying Zheng¹^na1,
Xiaolan Zhang³,
Chao Zheng³,
Difei Jiang³,
Yuanan Wu⁴,
Jeong Min Lee^2,5,
Mustafa R. Bashir^6,7,8,
Emily Lerner⁶,
Rongbo Liu¹,
Botong Wu⁹,
Hua Guo⁹,
Yidi Chen¹,
Ting Yang¹,
Xiaoling Gong¹,
Hanyu Jiang¹^na2 &
…
Bin Song ORCID: orcid.org/0000-0002-7269-2101^1,10^na2

545 Accesses
1 Citation
Explore all metrics

Abstract

Objectives

This study aimed to evaluate the potential of deep learning (DL)-assisted automated three-dimensional quantitative tumor burden at MRI to predict postoperative early recurrence (ER) of hepatocellular carcinoma (HCC).

Materials and methods

This was a single-center retrospective study enrolling patients who underwent resection for BCLC A and B HCC and preoperative contrast-enhanced MRI. Quantitative total tumor volume (cm³) and total tumor burden (TTB, %) were obtained using a DL automated segmentation tool. Radiologists’ visual assessment was used to ensure the quality control of automated segmentation. The prognostic value of clinicopathological variables and tumor burden-related parameters for ER was determined by Cox regression analyses.

Results

A total of 592 patients were included, with 525 and 67 patients assigned to BCLC A and B, respectively (2-year ER rate: 30.0% vs. 45.3%; hazard ratio (HR) = 1.8; p = 0.007). TTB was the most important predictor of ER (HR = 2.2; p < 0.001). Using 6.84% as the threshold of TTB, two ER risk strata were obtained in overall (p < 0.001), BCLC A (p < 0.001), and BCLC B (p = 0.027) patients, respectively. The BCLC B low-TTB patients had a similar risk for ER to BCLC A patients and thus were reassigned to a BCLC A_n stage; whilst the BCLC B high-TTB patients remained in a BCLC B_n stage. The 2-year ER rate was 30.5% for BCLC A_n patients vs. 58.1% for BCLC B_n patients (HR = 2.8; p < 0.001).

Conclusions

TTB determined by DL-based automated segmentation at MRI was a predictive biomarker for postoperative ER and facilitated refined subcategorization of patients within BCLC stages A and B.

Clinical relevance statement

Total tumor burden derived by deep learning-based automated segmentation at MRI may serve as an imaging biomarker for predicting early recurrence, thereby improving subclassification of Barcelona Clinic Liver Cancer A and B hepatocellular carcinoma patients after hepatectomy.

Key Points

Total tumor burden (TTB) is important for Barcelona Clinic Liver Cancer (BCLC) staging, but is heterogenous.
TTB derived by deep learning-based automated segmentation was predictive of postoperative early recurrence.
Incorporating TTB into the BCLC algorithm resulted in successful subcategorization of BCLC A and B patients.

MRI radiomics based on deep learning automated segmentation to predict early recurrence of hepatocellular carcinoma

Article Open access 20 May 2024

Deep learning–assisted LI-RADS grading and distinguishing hepatocellular carcinoma (HCC) from non-HCC based on multiphase CT: a two-center study

Article 01 July 2023

Deep learning nomogram based on Gd-EOB-DTPA MRI for predicting early recurrence in hepatocellular carcinoma after hepatectomy

Article Open access 14 February 2023

Discover the latest articles, news and stories from top researchers in related subjects.

Medical Imaging

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Hepatocellular carcinoma (HCC) is the sixth most common malignancy and the third leading cause of cancer-related deaths worldwide [1]. Surgical resection is the mainstay curative treatment for patients with resectable HCC [2], and local ablation is the potentially curative therapy for small early-stage HCC [3]. While patients with early-stage HCC are considered the optimal surgical candidates, as per the Barcelona Clinic Liver Cancer (BCLC) algorithm [4], resection for well-selected intermediate-stage (BCLC B) HCC has been associated with improved survival compared to transarterial chemoembolization (TACE) [5, 6]. Therefore, identifying individuals with BCLC B HCC who would benefit from hepatectomy is pivotal for personalized clinical decision-making.

Tumor burden is a critical factor in determining staging and treatment plans, yet it exhibits considerable heterogeneity within the BCLC stages A and B. As a result, patients assigned to the same BCLC stage may have distinct prognoses while those at different stages might demonstrate similar survival outcomes [2, 7, 8]. One potential explanation is that a one-dimensional measurement is used in the BCLC algorithm to estimate tumor burden, which may under- or overestimate tumor size due to its inaccuracies. Despite being easy-to-use and highly reproducible, this one-dimensional approach might be insufficient to profile the complex morphology of three-dimensional (3D) tumors. Thus, identifying more accurate approaches for tumor burden evaluation is key to optimizing patient risk stratification and refining treatment allocations.

The 3D volumetric analysis may offer a promising solution for accurate tumor burden quantification. Previous research has indicated that the MRI-based 3D quantitative tumor burden demonstrates superior capability compared to one-dimensional measurements for assessing prognosis and treatment response in HCC patients [9,10,11,12]. However, these studies were limited by small sample sizes (ranging from 78 to 162 patients) and the utilization of semi-automated segmentation. Although these semi-automated methods have mitigated some challenges associated with manual segmentation, including time consumption, labor intensity, and dependence on operators’ experience, they still involve user input (i.e., manual adjustment of inaccurate segmentations) and may suffer from inter-reader variability. Fortunately, the recently implemented artificial intelligence (AI) deep-learning (DL) algorithms for segmentation have allowed automated quantification of liver and HCC volumes [13,14,15,16]. This approach has the potential to improve both efficiency and reproducibility over traditional methodologies. Nonetheless, to our knowledge, data remain scarce on the potential of automated segmentation-based volumetric analyses in the prognostication of BCLC A and B HCC following resection.

This study aimed to evaluate the utility of MRI-based 3D quantitative tumor burden using automated DL segmentation algorithms in predicting early recurrence (ER) among patients undergoing curative resection for BCLC A and B HCC. We also assessed whether 3D quantitative tumor burden could be used to subcategorize these patients for more effective prognostication. Radiologists’ visual assessment was used to ensure the quality control of automated segmentation.

Materials and methods

This single-institution, retrospective study was approved by our institutional review board, and the requirement for informed consent was waived.

Patients

Consecutive patients who underwent curative resection for HCC between July 2010 and December 2021 were retrospectively included in this study if they fulfilled the following criteria: (a) age ≥ 18 years, (b) pathological confirmation of HCC, (c) without previous HCC treatment history, (d) without any co-malignancy other than HCC, and (e) contrast-enhanced MRI performed within 1 month before surgery. The exclusion criteria were (a) HCC beyond BCLC stage A or B, (b) ruptured HCC, (c) MR images covering only part of the tumor/liver or of suboptimal image quality (e.g., severe artifacts), (d) inaccurate image segmentation (detailed below), and (e) without any follow-up information (Fig. 1).

Baseline patient characteristics were collected from electronic medical records. The diagnosis of cirrhosis was established based on the Japanese Society of Gastroenterology/Japanese Society of Hepatology joint Clinical Practice Guidelines [17]. Clinical decisions for surgical resection were based on discussions of liver surgery expert panels while fully considering patient performance status, co-morbidities, liver function reserve, estimated future liver remnant volume, and tumor extent. Briefly, surgical resection was considered for patients with localized HCC and preserved liver function in the absence of clinically significant portal hypertension [1, 18, 19]. For patients with borderline tumors (e.g., R0 resection was technically challenging, or remnant liver volume was anticipated to be around 30% in non-cirrhotic patients or 40% in cirrhotic patients), the selection among surgical resection and other alternatives such as locoregional therapies was determined based on multidisciplinary discussions. Notably, some patients received postoperative adjuvant therapies (e.g., TACE, systemic therapy, and radiotherapy), as in our practice adjuvant therapies have been routinely recommended since 2017 for patients at anticipated high-risk of recurrence (e.g., multiple tumors, tumor size > 5 cm, Edmondson grade 3–4, and microvascular invasion (MVI)) [20, 21].

Of note, 222 patients have been reported previously [22]. While the prior work proposed a preoperative prognostic score for overall survival, the current work focused on exploring 3D quantitative tumor burden biomarkers for predicting postsurgical ER of HCC.

MRI acquisition

MRI was performed with various 3.0-T or 1.5-T systems. The choice between extracellular and hepatobiliary MRI contrast agents was at the surgeons’ discretion or based on the multidisciplinary team’s recommendations according to our institutional standard [23]. MRI systems and acquisition protocols are described in Supplementary Material 1 and Table S1.

Image analysis

One-dimensional measurement

One-dimensional measurement of tumors was performed by two abdominal radiologists (H.W. and H.Y.J., with 5 and 8 years of experience in liver MRI, respectively) who were aware that all patients had HCCs but were blinded to other information. Total tumor size (TTS) was defined as the sum of the size of all HCC lesions. Additionally, the single tumor > 7 cm and multiple tumors beyond up-to-seven criteria were assessed to reassign the BCLC staging [8]. Details of one-dimensional measurement are presented in Supplementary Material 2.

Three-dimensional analysis

The 3D automated image segmentation and volumetric quantification analyses were performed using commercial visualization and analysis software (LiverMRDoc; version 2.10.0; SHUKUN) (Fig. 2). Deep-learning algorithms for automated segmentation and volumetric quantification are detailed in Supplementary Material 3 and Fig. S1.

Preparation work before segmentation

Before initiating automated segmentation, one radiologist (H.W.) inspected the MR images and verified the sequence names, the HCC lesions, and the corresponding 3D bounding boxes (i.e., the automated lesion detection annotation) on the AI software platform. For patients (n = 25) with inaccurate 3D bounding boxes (e.g., failing to detect HCC lesions or delineating the whole tumors), manual adjustment was conducted to achieve accurate localization of tumors.

Automated liver and tumor segmentation

Automated segmentation of liver and HCC lesions was performed on each sequence by 3D U-net-based DL algorithms [24]. Briefly, a volumetric segmentation mask derived from portal-venous phase images was used to quantify whole liver volume (with intrahepatic vasculature and focal liver lesions included). A volumetric segmentation mask obtained on the optimal sequence automatedly determined by the AI software was used to quantify tumor volume. For a total of 716 tumors included in this study, tumor volumes were extracted from portal-venous phase (n = 617), delayed phase (n = 33), hepatobiliary phase (n = 32), precontrast phase (n = 17), arterial phase (n = 8), opposed phase (n = 3), in-phase (n = 2), T2-weighted imaging (n = 2), transitional phase (n = 1), diffusion-weighted imaging (n = 1), respectively.

Quality control for automated segmentation

Two radiologists (H.W. and T.Y.Z.) independently and visually inspected the segmented tumors and liver for each patient, with discrepancies resolved by discussion to reach a consensus. Cases with inaccurate segmentations were excluded from the volumetric analyses. Manual adjustment was not performed because this study aimed to focus on the prognostic potential of this automated technique.

Volumetric quantification analyses

Volumetric quantification analyses were performed to calculate total liver volume (TLV, cm³), total tumor volume (TTV, cm³), and total tumor burden (TTB, %). Specifically, TTV was defined as the sum of the volume of all HCC lesions. TTB was defined as the ratio of TTV to the TLV as follows: TTB = TTV/TLV × 100%.

Evaluation of automated segmentation accuracy

To evaluate the accuracy of automated DL segmentation, we randomly selected 35 patients (with 30 single HCCs and 5 multiple HCCs) from the final included cohort for manual tumor segmentation. One radiologist (H.W.), who was blinded to the automated segmentation results, manually segmented tumors using ITK-SNAP (version 3.8.0; www.itksnap.org).

Patient follow-up

Follow-up protocols included serum alpha-fetoprotein (AFP) level, liver function tests, and contrast-enhanced ultrasound, computed tomography, or MRI performed 1 month after surgical resection, every 3 months during the first 2 years, and every 6 months subsequently. Patients were followed up until death or the end date of this study (May 1, 2022). ER was defined as tumor recurrence within the first 2 years after surgery.

Statistical analysis

Categorical variables were compared using the Chi-squared test or Fisher’s exact test, while continuous variables were compared using the Student’s t-test or Mann–Whitney U-test, as appropriate. Inter-reader agreement for one-dimensional measurement was assessed by intraclass correlation coefficient (ICC). Consistency between automated and manual tumor segmentations was evaluated by the dice similarity coefficient (DSC).

The prognostic value of clinical-radiological-pathological characteristics to predict ER was assessed by uni- and multivariable Cox regression analyses. Variables with p < 0.1 at univariable analyses were included in the multivariable analysis using a backward stepwise approach based on the akaike information criterion and five-fold cross-validation.

Survival curves were plotted by the Kaplan–Meier method and compared by the log-rank test. X-tile plots provide a single and intuitive approach for evaluating the association between variables and survival. Given that no well-recognized thresholds have been established for TTS, TTV, and TTB, X-tile plots were used to determine the optimal cutoffs of these variables based on the highest χ² value (minimum p-value) defined by Kaplan–Meier survival analysis and log-rank test. X-tile plots were generated by X-tile software (version 3.6.1; Yale University School of Medicine) [25]. A bootstrap resampling (n = 1000) technique was used to test the robustness of survival risk stratification based on BCLC and modified BCLC systems.

Statistical analyses were performed with R (version 4.3.0; The R Foundation for Statistical Computing), Python (version 3.9.1; https://www.python.org/), and IBM SPSS software (version 26.0; SPSS Inc.). A two-tailed p < 0.05 was considered statistically significant.

Results

Patient characteristics

A total of 592 patients (median age, 54 years; interquartile range (IQR), 46–62 years; 517 men) were included, with 525 (88.7%) and 67 (11.3%) classified in BCLC stage A and B, respectively. There were 333 patients (56.3%) classified as China Liver Cancer stage Ia, 192 (32.4%) as Ib, 59 (10.0%) as IIa, and 8 (1.4%) as IIb. Overall, 17.6% (104/592) of patients underwent postoperative adjuvant therapies. ER occurred in 28.9% (171/592) of patients during a median follow-up period of 52.4 months (IQR, 29.5-78.5 months). Patient characteristics are summarized in Table 1.

Table 1 Patient characteristics

Full size table

Inaccurate image segmentations

Twenty-two cases with inaccurate liver and/or tumor segmentations were excluded from the volumetric analyses. Examples of inaccurate segmentations are shown in Fig. 3.

Evaluation of inter-reader agreement and segmentation accuracy

The inter-reader agreement was excellent for TTS (ICC = 0.987; 95% confidence interval (CI): 0.983, 0.990). For the 40 HCCs (median size, 4.2 cm; IQR, 3.0–7.3 cm) in 35 randomly selected patients, the mean DSC between automated and manual tumor segmentations was 0.85 ± 0.11 (median, 0.88; IQR, 0.82–0.92) on all sequences. DSCs for each sequence are detailed in Table S2 and Fig. S2.

Predictors for ER based on Cox regression analyses

Entire cohort

For the entire cohort (n = 592), 8 parameters, including serum AFP level, postoperative adjuvant therapy, BCLC stage, tumor multiplicity, TTS, TTV, TTB, and the single tumor > 7 cm, and multiple tumors beyond up-to-seven criteria, were significantly associated with ER at univariable Cox regression analyses (p < 0.05 for all) (Table 2). On subsequent multivariate Cox regression analysis, serum AFP level (hazard ratio (HR) = 1.4; 95% CI: 1.0, 1.9; p = 0.05), tumor multiplicity (HR = 1.8; 95% CI: 1.2, 2.6; p = 0.003) and TTB (HR = 2.2; 95% CI: 1.6, 3.0; p < 0.001) were found to be predictive of ER (Table 2). The C-index for TTB in predicting the risk of ER was 0.589 (95% CI: 0.553, 0.625).

Table 2 Predictors for early recurrence based on Cox regression analyses

Full size table

Patients with complete pathological data

For patients who had complete documentation of tumor differentiation and MVI status (n = 385), the multivariable Cox regression analysis showed that TTB remained the most important predictor of ER (HR = 2.0; 95% CI: 1.4, 3.0; p < 0.001), with a C-index of 0.610 (95% CI: 0.566, 0.653). Additional factors retained in the final Cox model based on the akaike information criteria included serum AFP level (HR = 1.7; 95% CI: 1.2, 2.5; p = 0.007), tumor multiplicity (HR = 1.6; 95% CI: 1.0, 2.5; p = 0.06), and MVI (HR = 1.8; 95% CI: 1.2, 2.7; p = 0.003) (Table 2).

Entire cohort plus cases with inaccurate segmentations

After incorporating 22 cases with inaccurate segmentations into the entire cohort (n = 614), TTB remained an independent variable for predicting ER (HR = 1.6; 95% CI: 1.1, 2.3; p = 0.009), with a C-index of 0.591 (95% CI: 0.556, 0.626) (Supplementary Material 4 and Table S3).

Survival analyses

TTB and TTS for ER risk stratification

Using 6.84% as the threshold, TTB stratified all patients into two risk strata for ER (ER rate at 24 months, 26.8% vs. 47.4%; p < 0.001), as well as BCLC A patients (ER rate at 24 months, 25.8% vs. 45.1%; p < 0.001) and BCLC B patients (ER rate at 24 months, 37.7% vs. 58.1%; p = 0.027), respectively (Table 3; Fig. 4A–C). Additionally, TTS gave two risk strata for ER among all patients (ER rate at 24 months, 21.9% vs. 39.9%; p < 0.001) and BCLC A patients (ER rate at 24 months, 21.9% vs. 38.6%; p < 0.001), respectively (Table 3; Fig. 4D, E). However, all BCLC B patients had high TTS and thus could not be stratified into two risk strata for ER according to TTS (ER rate at 24 months, 45.3%) (Table 3).

Table 3 ER rates at 6, 12, 18, and 24 months and hazard ratios according to TTB, TTS, BCLC stage, and modified BCLC stage

Full size table

To identify subgroups of BCLC B patients who had favorable prognosis after resection, combinations of BCLC stage and TTB were analyzed. BCLC B low-TTB patients demonstrated comparable risk for ER to that of BCLC A patients with either low (ER rate at 24 months, 37.7% vs. 25.8%; p = 0.16) or high (ER rate at 24 months, 37.7% vs. 45.1%; p = 0.27) TTB (Table 3; Fig. 4F).

Construction of modified BCLC algorithm for stage A and B HCC

The ER rate at 24 months was 30.0% (95% CI: 25.7, 34.0) for BCLC A patients vs. 45.3% (95% CI: 30.2, 57.2) for BCLC B patients (HR = 1.8; 95% CI: 1.2, 2.7; p = 0.007) (Table 3; Fig. 5A). By integrating TTB into the current BCLC system, a modified BCLC algorithm was constructed: (a) BCLC stage A_n (n = 568), constituting of the original BCLC A patients plus those BCLC B patients with low TTB; and (b) BCLC stage B_n (n = 24), consisting of the remaining BCLC B patients with high TTB. Based on the modified BCLC algorithm, the ER rate at 24 months was 30.5% (95% CI: 26.4, 34.4) for BCLC A_n patients vs. 58.1% (95% CI: 30.7, 74.7) for BCLC B_n patients (HR = 2.8; 95% CI: 1.6, 4.9; p < 0.001) (Table 3; Fig. 5B).

After bootstrap resampling, BCLC B patients demonstrated a mean HR of 1.8 (95% CI: 1.0, 2.6) for ER relative to BCLC A patients (mean p = 0.08), whilst BCLC B_n patients demonstrated a mean HR of 2.9 (95% CI: 1.1, 4.7) for ER relative to BCLC A_n patients (mean p = 0.02) (Fig. 6).

Sensitivity analysis for patients without adjuvant therapy after surgery

For patients who did not receive adjuvant therapies after surgery (n = 488), similar results were also obtained for survival analyses based on TTB, TTS, BCLC, and modified BCLC algorithms (Supplementary Material 5, Table S4, Figs. S3 and 4).

Comparisons of patient characteristics between low and high TTB groups are provided in Supplementary Material 6 and Table S5.

Discussion

This study aimed to evaluate the prognostic impact of MRI-based automated 3D volumetric parameters in predicting ER of BCLC A and B HCC. Our findings demonstrated that TTB was an independent predictor of postoperative ER, even after adjusting for pathological factors. We identified two ER risk groups based on TTB, with high TTB (≥ 6.84%) patients at elevated ER risk. Additionally, BCLC B low-TTB patients had similar ER risk to BCLC A patients, leading to their recategorization in a modified BCLC algorithm. The modified BCLC system resulted in an improved prognostic stratification for BCLC A and B patients.

Conventional one-dimensional tumor measurements, though user-friendly, provide only a basic size estimation and fail to accurately represent the varied growth patterns and irregular shapes of tumors. Conversely, the 3D volumetric quantification offers a more thorough evaluation of the tumor’s extent, characterized by high reproducibility and interobserver agreement [9]. Previous studies have demonstrated that 3D quantitative tumor burden biomarkers outperformed one-dimensional measurements in HCC prognostication [9,10,11,12]. Aligning with these studies, our investigation also revealed that TTB, representing the ratio of total tumor volume to liver volume and the remnant liver function reserve, was independently correlated with ER, whilst TTS was not. Additionally, TTB effectively stratified BCLC B patients into two risk strata, whereas TTS did not. These findings highlight the superior capacity of TTB over traditional diameter-based measurements in assessing ER risk of HCC.

To our knowledge, this is the first study to explore the potential utility of automated 3D quantitative tumor burden for refining BCLC A and B HCC subcategorization. In our study, TTB was the dominant predictor of ER, even after adjusting for pathological factors. Using 1000-bootstrap resampling, the TTB-based modified BCLC system remained significantly associated with ER (mean p = 0.02), unlike the original BCLC system (mean p = 0.08). These findings signify that the modified BCLC system may have a more robust prognostic impact on ER. Additionally, similar findings were obtained after excluding patients who received adjuvant therapies, suggesting the robustness of TTB in ER risk stratification. However, due to the single-institution retrospective study design and the small number of BCLC B patients (n = 67), the modified BCLC system requires further validation and refinement on larger-scale populations before it can be implemented in clinical practice, particularly in terms of generalizability and simplicity.

DL-based automated image segmentation represents a pivotal advancement for precise 3D volumetric analysis. Developed from a dataset of 1889 patients across six tertiary hospitals in China, the models exhibit good accuracy in liver and lesion segmentation. The consistency between automated and manual segmentations of HCC observed in our study underscores the potential of automated segmentation for refining clinical decision-making. To address the limitations of traditional U-Net architecture, incorporating NNU-Net—a self-configuring framework that mitigates the complexities of manual parameter tuning—could yield substantial improvements. Leveraging pre-trained foundation models via transfer learning may further enhance segmentation performance. These strategies aim to enhance model capabilities and streamline the integration of DL segmentation into routine practice, providing a more effective tool for managing HCC patients. However, despite commercial certification, the software used in our study is not yet publicly available.

Noteworthily, a small subset of cases (n = 22) presented challenges during automated segmentation and were excluded from volumetric analysis. The inaccurate segmentation observed in these cases could be attributed to various factors, e.g., the presence of an obscure tumor margin, inhomogeneous signal intensity within the tumor, and additional signal interference introduced by peritumoral parenchyma. To improve the accuracy of automated segmentation, further refinement of the DL algorithms is needed. Once this advanced technique is validated and improved in more widespread populations, it is expected to become an applicable workflow in routine practice.

This study had several limitations. First, our study used a single-institution retrospective cohort, lacking external validation. Expanding our findings to larger-scale multicenter cohorts is necessary to confirm their reproducibility and generalizability. Second, the limited sample size of BCLC B patients warrants further validation of the prognostic utility of TTB in future studies. Third, we only evaluated patients with surgically resected HCC, so our results may not be applicable to nonsurgical patients. Fourth, despite the automated segmentation, there were two steps involving the radiologist’s intervention—the 3D bounding box adjustment (n = 25) and the exclusion of inaccurate segmentations (n = 22), potentially overestimating the automated technique’s performance. Finally, future multicenter studies are encouraged to externally validate the TTB threshold and test their applicability in different populations.

In conclusion, based on 592 patients who underwent curative liver resection for early to intermediate-stage HCC, TTB derived by MRI-based automated DL segmentation was a useful imaging biomarker for postoperative ER prediction and enabled optimized BCLC subclassification.

Abbreviations

3D:: Three-dimensional
AFP:: Alpha-fetoprotein
AI:: Artificial intelligence
BCLC:: Barcelona Clinic Liver Cancer
CI:: Confidence interval
DL:: Deep learning
DSC:: Dice similarity coefficient
ER:: Early recurrence
HBV:: Hepatitis B virus
HCC:: Hepatocellular carcinoma
HR:: Hazard ratio
ICC:: Intraclass correlation coefficient
IQR:: Interquartile range
MVI:: Microvascular invasion
TACE:: Transarterial chemoembolization
TLV:: Total liver volume
TTB:: Total tumor burden
TTS:: Total tumor size
TTV:: Total tumor volume

References

Singal AG, Llovet JM, Yarchoan M et al (2023) AASLD Practice Guidance on prevention, diagnosis, and treatment of hepatocellular carcinoma. Hepatology 78:1922–1965
Article PubMed Google Scholar
Tsilimigras DI, Mehta R, Paredes AZ et al (2020) Overall tumor burden dictates outcomes for patients undergoing resection of multinodular hepatocellular carcinoma beyond the Milan criteria. Ann Surg 272:574–581
Article PubMed Google Scholar
Llovet JM, Kelley RK, Villanueva A et al (2021) Hepatocellular carcinoma. Nat Rev Dis Primers 7:6
Article PubMed Google Scholar
Reig M, Forner A, Rimola J et al (2022) BCLC strategy for prognosis prediction and treatment recommendation: The 2022 update. J Hepatol 76:681–693
Article PubMed Google Scholar
Yin L, Li H, Li AJ et al (2014) Partial hepatectomy vs. transcatheter arterial chemoembolization for resectable multiple hepatocellular carcinoma beyond Milan criteria: a RCT. J Hepatol 61:82–88
Article PubMed Google Scholar
Labgaa I, Taffé P, Martin D et al (2020) Comparison of partial hepatectomy and transarterial chemoembolization in intermediate-stage hepatocellular carcinoma: a systematic review and meta-analysis. Liver Cancer 9:138–147
Article PubMed PubMed Central Google Scholar
Tsilimigras DI, Moris D, Hyer JM et al (2020) Hepatocellular carcinoma tumour burden score to stratify prognosis after resection. Br J Surg 107:854–864
Article CAS PubMed Google Scholar
Wang YY, Zhong JH, Xu HF et al (2019) A modified staging of early and intermediate hepatocellular carcinoma based on single tumour >7 cm and multiple tumours beyond up-to-seven criteria. Aliment Pharmacol Ther 49:202–210
Article PubMed Google Scholar
Chapiro J, Duran R, Lin M et al (2015) Identifying staging markers for hepatocellular carcinoma before transarterial chemoembolization: comparison of three-dimensional quantitative versus non-three-dimensional imaging markers. Radiology 275:438–447
Article PubMed Google Scholar
Tacher V, Lin M, Duran R et al (2016) Comparison of existing response criteria in patients with hepatocellular carcinoma treated with transarterial chemoembolization using a 3D quantitative approach. Radiology 278:275–284
Article PubMed Google Scholar
Fleckenstein FN, Schernthaner RE, Duran R et al (2016) 3D Quantitative tumour burden analysis in patients with hepatocellular carcinoma before TACE: comparing single-lesion vs. multi-lesion imaging biomarkers as predictors of patient survival. Eur Radiol 26:3243–3252
Article PubMed PubMed Central Google Scholar
Jeon SK, Lee DH, Park J et al (2021) Tumor volume measured using MR volumetry as a predictor of prognosis after surgical resection of single hepatocellular carcinoma. Eur J Radiol 144:109962
Article PubMed Google Scholar
Zheng R, Wang Q, Lv S et al (2022) Automatic liver tumor segmentation on dynamic contrast enhanced MRI using 4D information: deep learning model based on 3D convolution and convolutional LSTM. IEEE Trans Med Imaging 41:2965–2976
Article PubMed Google Scholar
Raman AG, Jones C, Weiss CR (2022) Machine learning for hepatocellular carcinoma segmentation at MRI: radiology in training. Radiology 304:509–515
Article PubMed Google Scholar
Li X, Chen H, Qi X, Dou Q, Fu CW, Heng PA (2018) H-DenseUNet: hybrid densely connected UNet for liver and tumor segmentation from CT volumes. IEEE Trans Med Imaging 37:2663–2674
Article PubMed Google Scholar
Perez AA, Noe-Kim V, Lubner MG et al (2022) Deep learning CT-based quantitative visualization tool for liver volume estimation: defining normal and hepatomegaly. Radiology 302:336–342
Article PubMed Google Scholar
Yoshiji H, Nagoshi S, Akahane T et al (2021) Evidence-based clinical practice guidelines for Liver Cirrhosis 2020. J Gastroenterol 56:593–619
Article PubMed PubMed Central Google Scholar
Zhou J, Sun H, Wang Z et al (2023) Guidelines for the diagnosis and treatment of primary liver cancer (2022 edition). Liver Cancer 12:405–444
Article PubMed PubMed Central Google Scholar
European Association for the Study of the Liver (2018) EASL clinical practice guidelines: management of hepatocellular carcinoma. J Hepatol 69:182–236
Bruix J, Takayama T, Mazzaferro V et al (2015) Adjuvant sorafenib for hepatocellular carcinoma after resection or ablation (STORM): a phase 3, randomised, double-blind, placebo-controlled trial. Lancet Oncol 16:1344–1354
Article CAS PubMed Google Scholar
Wang Z, Ren Z, Chen Y et al (2018) Adjuvant transarterial chemoembolization for HBV-related hepatocellular carcinoma after resection: a randomized controlled study. Clin Cancer Res 24:2074–2081
Article CAS PubMed Google Scholar
Wei H, Fu F, Jiang H et al (2023) Development and validation of the OSASH score to predict overall survival of hepatocellular carcinoma after surgical resection: a dual-institutional study. Eur Radiol. https://doi.org/10.1007/s00330-023-09725-7
Jiang H, Wei H, Yang T et al (2023) VICT2 trait: prognostic alternative to peritumoral hepatobiliary phase hypointensity in HCC. Radiology 307:e221835
Article PubMed Google Scholar
Han X, Wu X, Wang S et al (2022) Automated segmentation of liver segment on portal venous phase MR images using a 3D convolutional neural network. Insights Imaging 13:26
Article PubMed PubMed Central Google Scholar
Camp RL, Dolled-Filhart M, Rimm DL (2004) X-tile: a new bio-informatics tool for biomarker assessment and outcome-based cut-point optimization. Clin Cancer Res 10:7252–7259
Article CAS PubMed Google Scholar

Download references

Funding

This work was supported by the National Natural Science Foundation of China (Grant No. 82101997, U22A20343), the China Post-doctoral Science Foundation (Grant No. 2023T160448), the 1.3.5 project for disciplines of excellence, West China Hospital, Sichuan University (Grant No. ZYJC21012, ZYGD22004), Hainan Province Clinical Medical Center, and the Post-doctoral Station Development Project of Sanya.

Author information

Hong Wei and Tianying Zheng contributed equally to this work.
Hanyu Jiang and Bin Song jointly supervised this work.

Authors and Affiliations

Department of Radiology, Functional, and Molecular Imaging Key Laboratory of Sichuan Province, West China Hospital, Sichuan University, Chengdu, Sichuan, 610041, China
Hong Wei, Tianying Zheng, Rongbo Liu, Yidi Chen, Ting Yang, Xiaoling Gong, Hanyu Jiang & Bin Song
Department of Radiology, Seoul National University Hospital, Seoul, 03080, Republic of Korea
Hong Wei & Jeong Min Lee
Shukun Technology Co., Ltd, Beijing, 100102, China
Xiaolan Zhang, Chao Zheng & Difei Jiang
Big Data Research Center, University of Electronic Science and Technology of China, Chengdu, Sichuan, 610000, China
Yuanan Wu
Department of Radiology, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea
Jeong Min Lee
Department of Radiology, Duke University Medical Center, Durham, NC, 27710, USA
Mustafa R. Bashir & Emily Lerner
Center for Advanced Magnetic Resonance in Medicine, Duke University Medical Center, Durham, NC, 27705, USA
Mustafa R. Bashir
Division of Gastroenterology, Department of Medicine, Duke University Medical Center, Durham, NC, 27710, USA
Mustafa R. Bashir
Center for Biomedical Imaging Research, Department of Biomedical Engineering, School of Medicine, Tsinghua University, Beijing, 100102, China
Botong Wu & Hua Guo
Department of Radiology, Sanya People’s Hospital, Sanya, Hainan, 572000, China
Bin Song

Authors

Hong Wei
View author publications
You can also search for this author in PubMed Google Scholar
Tianying Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Xiaolan Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Chao Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Difei Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Yuanan Wu
View author publications
You can also search for this author in PubMed Google Scholar
Jeong Min Lee
View author publications
You can also search for this author in PubMed Google Scholar
Mustafa R. Bashir
View author publications
You can also search for this author in PubMed Google Scholar
Emily Lerner
View author publications
You can also search for this author in PubMed Google Scholar
Rongbo Liu
View author publications
You can also search for this author in PubMed Google Scholar
Botong Wu
View author publications
You can also search for this author in PubMed Google Scholar
Hua Guo
View author publications
You can also search for this author in PubMed Google Scholar
Yidi Chen
View author publications
You can also search for this author in PubMed Google Scholar
Ting Yang
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoling Gong
View author publications
You can also search for this author in PubMed Google Scholar
Hanyu Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Bin Song
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Hanyu Jiang or Bin Song.

Ethics declarations

Guarantor

The scientific guarantor of this publication is Bin Song, M.D.

Conflict of interest

H.W.: no relevant relationships. T.Z.: no relevant relationships. X.Z., C.Z., and D.J.: employees of Shukun Technology Co., Ltd. Y.W.: no relevant relationships. J.M.L.: no relevant relationships. M.R.B.: research grants to the institution from Siemens Healthineers, Madrigal Pharmaceuticals, NGM Biopharmaceuticals, Carmot Therapeutics, and Corcept Therapeutics; associate editor for contrast media section of Radiology; associate editor for continuing medical education of Journal of Magnetic Resonance Imaging. E.L.: no relevant relationships. R.L.: no relevant relationships. B.W.: no relevant relationships. H.G.: no relevant relationships. Y.C.: no relevant relationships. T.Y.: no relevant relationships. X.G.: no relevant relationships. H.J.: stock owner of Kanghong Technology Co., Ltd. B.S.: no relevant relationships.

Statistics and biometry

One co-author (Yuanan Wu, Big Data Research Center, University of Electronic Science and Technology of China, Chengdu, Sichuan 610000, China) has significant statistical expertise.

Informed consent

The requirement for informed consent was waived (retrospective design) by the Biomedical Ethics Review Committee of West China Hospital, Sichuan University.

Ethical approval

The study protocol was approved by the Biomedical Ethics Review Committee of West China Hospital, Sichuan University (Approval Numbers: 2022-651).

Study subjects or cohorts overlap

Some study subjects (37.5%, 222/592) have been previously reported in another study. While the prior work proposed a preoperative prognostic score for overall survival, the current work focused on exploring three-dimensional quantitative tumor burden biomarkers for predicting postsurgical early recurrence of hepatocellular carcinoma.

Methodology

Retrospective
Diagnostic or prognostic study
Single-center study

Additional information

Publisher’s Note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This work was originated from Department of Radiology, West China Hospital, Sichuan University, Chengdu, Sichuan 610041, China.

Supplementary information

Supplementary Materials

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wei, H., Zheng, T., Zhang, X. et al. Deep learning-based 3D quantitative total tumor burden predicts early recurrence of BCLC A and B HCC after resection. Eur Radiol (2024). https://doi.org/10.1007/s00330-024-10941-y

Download citation

Received: 16 January 2024
Revised: 15 May 2024
Accepted: 16 June 2024
Published: 19 July 2024
DOI: https://doi.org/10.1007/s00330-024-10941-y

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Deep learning-based 3D quantitative total tumor burden predicts early recurrence of BCLC A and B HCC after resection

Abstract

Objectives

Materials and methods

Results

Conclusions

Clinical relevance statement

Key Points

Similar content being viewed by others

MRI radiomics based on deep learning automated segmentation to predict early recurrence of hepatocellular carcinoma

Deep learning–assisted LI-RADS grading and distinguishing hepatocellular carcinoma (HCC) from non-HCC based on multiphase CT: a two-center study

Deep learning nomogram based on Gd-EOB-DTPA MRI for predicting early recurrence in hepatocellular carcinoma after hepatectomy

Explore related subjects

Introduction

Materials and methods

Patients

MRI acquisition

Image analysis

One-dimensional measurement

Three-dimensional analysis

Preparation work before segmentation

Automated liver and tumor segmentation

Quality control for automated segmentation

Volumetric quantification analyses

Evaluation of automated segmentation accuracy

Patient follow-up

Statistical analysis

Results

Patient characteristics

Inaccurate image segmentations

Evaluation of inter-reader agreement and segmentation accuracy

Predictors for ER based on Cox regression analyses

Entire cohort

Patients with complete pathological data

Entire cohort plus cases with inaccurate segmentations

Survival analyses

TTB and TTS for ER risk stratification

Construction of modified BCLC algorithm for stage A and B HCC

Sensitivity analysis for patients without adjuvant therapy after surgery

Discussion

Abbreviations

References

Funding

Author information

Authors and Affiliations

Corresponding authors

Ethics declarations

Guarantor

Conflict of interest

Statistics and biometry

Informed consent

Ethical approval

Study subjects or cohorts overlap

Methodology

Additional information

Supplementary information

Supplementary Materials

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation