Preoperative identification of microvascular invasion in hepatocellular carcinoma by XGBoost and deep learning

Jiang, Yi-Quan; Cao, Su-E; Cao, Shilei; Chen, Jian-Ning; Wang, Guo-Ying; Shi, Wen-Qi; Deng, Yi-Nan; Cheng, Na; Ma, Kai; Zeng, Kai-Ning; Yan, Xi-Jing; Yang, Hao-Zhen; Huan, Wen-Jing; Tang, Wei-Min; Zheng, Yefeng; Shao, Chun-Kui; Wang, Jin; Yang, Yang; Chen, Gui-Hua

doi:10.1007/s00432-020-03366-9

Preoperative identification of microvascular invasion in hepatocellular carcinoma by XGBoost and deep learning

Original Article – Clinical Oncology
Open access
Published: 27 August 2020

Volume 147, pages 821–833, (2021)
Cite this article

Download PDF

You have full access to this open access article

Journal of Cancer Research and Clinical Oncology Aims and scope Submit manuscript

Preoperative identification of microvascular invasion in hepatocellular carcinoma by XGBoost and deep learning

Download PDF

Yi-Quan Jiang¹^na1,
Su-E Cao²^na1,
Shilei Cao³^na1,
Jian-Ning Chen⁴^na1,
Guo-Ying Wang¹^na1,
Wen-Qi Shi²,
Yi-Nan Deng¹,
Na Cheng⁴,
Kai Ma³,
Kai-Ning Zeng¹,
Xi-Jing Yan¹,
Hao-Zhen Yang⁵,
Wen-Jing Huan⁵,
Wei-Min Tang⁵,
Yefeng Zheng³,
Chun-Kui Shao⁴,
Jin Wang²,
Yang Yang¹ &
…
Gui-Hua Chen ORCID: orcid.org/0000-0002-9003-943X^6,7

8018 Accesses
2 Altmetric
Explore all metrics

Abstract

Purpose

Microvascular invasion (MVI) is a valuable predictor of survival in hepatocellular carcinoma (HCC) patients. This study developed predictive models using eXtreme Gradient Boosting (XGBoost) and deep learning based on CT images to predict MVI preoperatively.

Methods

In total, 405 patients were included. A total of 7302 radiomic features and 17 radiological features were extracted by a radiomics feature extraction package and radiologists, respectively. We developed a XGBoost model based on radiomics features, radiological features and clinical variables and a three-dimensional convolutional neural network (3D-CNN) to predict MVI status. Next, we compared the efficacy of the two models.

Results

Of the 405 patients, 220 (54.3%) were MVI positive, and 185 (45.7%) were MVI negative. The areas under the receiver operating characteristic curves (AUROCs) of the Radiomics-Radiological-Clinical (RRC) Model and 3D-CNN Model in the training set were 0.952 (95% confidence interval (CI) 0.923–0.973) and 0.980 (95% CI 0.959–0.993), respectively (p = 0.14). The AUROCs of the RRC Model and 3D-CNN Model in the validation set were 0.887 (95% CI 0.797–0.947) and 0.906 (95% CI 0.821–0.960), respectively (p = 0.83). Based on the MVI status predicted by the RRC and 3D-CNN Models, the mean recurrence-free survival (RFS) was significantly better in the predicted MVI-negative group than that in the predicted MVI-positive group (RRC Model: 69.95 vs. 24.80 months, p < 0.001; 3D-CNN Model: 64.06 vs. 31.05 months, p = 0.027).

Conclusion

The RRC Model and 3D-CNN models showed considerable efficacy in identifying MVI preoperatively. These machine learning models may facilitate decision-making in HCC treatment but requires further validation.

Predicting microvascular invasion in hepatocellular carcinoma: a deep learning model validated across hospitals

Article Open access 09 October 2021

Prediction of preoperative microvascular invasion by dynamic radiomic analysis based on contrast-enhanced computed tomography

Article 05 December 2023

Deep convolutional neural network for preoperative prediction of microvascular invasion and clinical outcomes in patients with HCCs

Article 04 August 2021

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Liver cancer is the sixth-most common cancer in the world and the fourth cause of cancer-related death worldwide (Villanueva 2019). Throughout the world, ~ 841,000 people are diagnosed with hepatocellular carcinoma (HCC), and ~ 782,000 people die from HCC each year (Bray et al. 2018). The mainstay treatment for HCC is surgery, including hepatic resection and liver transplantation. Despite receiving radical surgery, patients still have a high risk of recurrence; thus, an accurate preoperative cancer assessment are essential for determining the appropriate surgical approach and management strategy to decrease the recurrence rate.

Recent studies have proposed the importance of a preoperative assessment of microvascular invasion (MVI), which can be used to guide therapy in patients with HCC (Banerjee et al. 2015; Cucchetti et al. 2010; Hyun et al. 2018; Lee et al. 2017; Renzulli et al. 2016; Wang et al. 2018a; Wu et al. 2015; Xu et al. 2019). Studies have shown that MVI is an independent histopathological prognostic factor associated with survival in all-stage HCC patients (Mazzaferro et al. 2009). Furthermore, MVI has been reported to be a better predictor of tumour recurrence and overall survival than the Milan criteria (Lim et al. 2011). For patients with MVI, a more aggressive treatment strategy may be preferred, such as a wide resection margin or anatomical resection for patients receiving hepatic resection (HR), an ablation margin of at least 0.5–1 cm 360° around the tumour for patients receiving ablation, and neoadjuvant therapy before surgery (Hirokawa et al. 2014; Hocquelet et al. 2016; Nakazawa et al. 2007; Nault et al. 2018; Zhao et al. 2017). For liver transplantation (LT) in patients with HCC, MVI status has been recognized as an essential variable for identifying patients who will benefit most from LT (Mazzaferro et al. 2009, 2018).

However, the traditional method of identifying MVI is based on postoperative microscopic examination of surgical specimens even though the most important treatment decisions are commonly determined before surgery. Therefore, exploring new methods that can be used to preoperatively assess MVI to determine the most appropriate treatment strategy for HCC patients is important. Developments in imaging technology have enabled non-invasive assessments of MVI preoperatively (Banerjee et al. 2015; Hyun et al. 2018; Lee et al. 2017; Renzulli et al. 2016; Wang et al. 2018a; Wu et al. 2015; Xu et al. 2019; Zheng et al. 2017).

Advances in imaging technology, together with artificial intelligence (Bi et al. 2019), have allowed researchers to create various diagnostic and treatment models and improved the diagnostic efficacy in liver cancer, dermatology, ophthalmology, lung and breast cancers, neurology, cardiovascular diseases, gastrointestinal endoscopy, and genetic diseases, etc. (Attia et al. 2019; Chilamkurthy et al. 2018; Coudray et al. 2018; Esteva et al. 2017; Gurovich et al. 2019; Kermany et al. 2018; Mori et al. 2018; Rampasek and Goldenberg 2018; Yasaka et al. 2018; Zou et al. 2019). The purpose of the current study is to develop models using eXtreme Gradient Boosting (XGBoost) and deep learning to provide a preoperative non-invasive assessment method for MVI in HCC patients. An artificial intelligence system for hepatology requires a great amount of work, but it is just the beginning of the dramatic change that artificial intelligence will bring about in medicine.

Materials and methods

This retrospective clinical study was approved by our institutional review board. Because of the retrospective nature of the study, patient consent for inclusion was waived. All private information of the included patients was erased.

Case cohort

A retrospective cohort from collected from 2010 to 2018 was analysed. The inclusion and exclusion criteria were as follows: (1) histological diagnosis of HCC; (2) HR or LT received as primary therapy; (3) preoperative four-phase contrast-enhanced computed tomography (CT) performed 2 months at most before LT or HR; and (4) available postoperative pathologic specimens. Details about pathological assessment of MVI and CT imaging protocol are shown in Supplemental methods.

Methods overview

The traditional method of assessing MVI status preoperatively is to manually collect radiological features, radiomics features and clinical variables and develop a predictive model based on such collected information. Such models are more interpretable but require more manpower and materials. Nowadays, deep learning models excel at automated image recognition with high efficiency and accuracy. In the current study, we developed predictive models by XGBoost in the traditional way and also developed a predictive model based on an emerging algorithm, namely, deep learning, and compared the efficacy of the two methods.

Predictive models based on XGBoost (Chen and Guestrin 2016)

Tumor segmentation

Tumor segmentation was manually and independently performed by three radiologists (A, B and C) (all of the radiologists had at least 3 years of experience in HCC diagnosis) for the three phases of the volume data (the AP, PVP, and DP), and the results were reviewed by a radiologist (D) with 20 years of experience in HCC diagnosis. The segmentation boundaries were drawn with ITK-SNAP software (https://www.radiantviewer.com) slice-by-slice for each volume along the visible borders of the lesion. The 3D segmentation of the tumor provides the volume-of-interest (VOI) for the later feature extraction step.

Radiomics feature extraction

Radiomics is defined as the quantitative mapping, that is, the extraction, analysis and modelling of many medical image features in relation to prediction targets. The fundamental principle of radiomics is to extract high-dimension features, e.g., first-, second-, and higher-order statistics, to quantitatively describe the attributes of the VOI based on tomographic data. In the current study, the VOI was the 3D tumor region that was manually segmented from the CT scan. The radiomics features were extracted from the tumor VOI (VOI-full) and 1 cm extended from the VOI boundary (VOI-extension) via standard morphology binary dilation. To guarantee the extension of the tumor boundary inside the liver region, we obtained the liver mask from an automatic liver organ segmentation algorithm and discarded the non-liver regions outside the mask. The segmentation of a typical case is shown in Fig. 2. We used the open source PyRadiomics package for radiomics feature extraction. For each volume of the 3 different phases, we extracted 1217 features from the VOI-full and VOI-extension regions, consisting of a set of 7302 radiomics features.

Radiological feature extraction

The radiological features of the four-phase CT images of all cases were extracted and summarized by the aforementioned radiologists, and during this process, they were blinded to the pathological and clinical data. Next, the controversial cases among the three radiologists (A, B, C) were jointly evaluated until a final consensus was reached, and then they were finally reviewed by the most senior radiologist (D).

The extracted radiological features are a semantic interpretation of the tumor regions by the radiologists organized in a binary format. The summary of the radiological features were as follows: (1) liver morphology (normal vs. cirrhosis); (2) number of hepatic lobes involved (one lobe involved vs. two or more lobes involved); (3) number of tumors (one vs. more than one); (4) peritumoral satellite nodule (absence vs. presence); (5) maximum diameter of tumor (> 5 cm vs. ≤ 5 cm); (6) tumor growth pattern (intrahepatic growth vs. extrahepatic growth); (7) intratumoral hemorrhage (absence vs. presence); (8) intratumoral necrosis (absence vs. presence); (9) tumor margin (smooth vs. nonsmooth); (10) enhancing “capsule” (absence vs. presence); (11) AP hyperenhancement (absence vs. presence); (12) internal arteries (absence vs. presence); (13) peritumoral enhancement (absence vs. presence); (14) mosaic pattern or nodule-in-nodule pattern (absence vs. presence); (15) nonperipheral washout (absence vs. presence); (16) hypodense halos (absence vs. presence); and (17) tumor steatosis (absence vs. presence). The largest nodule was evaluated if multiple nodules existed. The definitions of some radiological features are shown in Supplemental methods.

Clinical variables

Baseline data of the patients including age, sex, background liver disease, diabetes, surgery type, primary tumor size, tumor count, α-fetoprotein (AFP) level, aspartate aminotransferase (AST) level, alanine aminotransferase (ALT) level, prothrombin time (PT), international normalized ratio (INR), serum fibrinogen (FBG) level, platelet (PLT) count, total bilirubin (TBIL) level, serum creatinine (Scr) level, Child–Pugh class, MELD score were recorded.

Feature analysis and predictive model based on XGBoost

Using XGBoost, we developed MVI prediction models based on radiological features (the Radiological Model), radiomics features (Radiomics Model) and a combination of radiological features, radiomics features, and clinical variables (Radiological-Radiomics-Clinical (RRC) Model). Details about XGBoost model are shown in Supplemental methods.

Deep learning: the 3D-CNN predictive model (Wang et al. 2018b)

Convolutional neural networks excel at medical image recognition (Hosny et al. 2018; Litjens et al. 2017). A 3D-CNN Model was developed to assess MVI in an end-to-end training fashion, in which feature extraction and predictive model construction were automatically processed by a single neural network. We developed several empirical principles to process the input data and guide the design of the deep neural networks: (1) the input should be a small volume sample that is mostly covered by the tumour region to exclude interference from nearby tissues; 2) the input should be sampled within the tumour region to force the network to learn the relevant features of the tumour; and (3) the depth of the CNN should not be profound to avoid the overfitting problem due to the limited size of the training cohort. According to these principles, we proposed a CNN as shown in Fig. 1. The network takes three 16 × 64 × 64 patches as input and passes them through several intermediate layers to extract deep features, which are further fused and fed into the decision layers to generate the final MVI assessment result. Details about CNN model are shown in Supplemental methods.

Statistical analysis

The performance of the predictive models was evaluated by the areas under the receiver operating characteristic curve (AUROC) and precision recall curve (AUPRC). The accuracy, sensitivity, specificity, positive predictive value, negative predictive value and f1 score of the models were also calculated and are presented. Hanley and McNeil analysis was performed to compare the efficacy of the proposed models. Recurrence-free survival analyses were performed based on the MVI status predicted by the XGBoost and 3D-CNN models. Recurrence-free survival was defined as the time from the surgery to local, regional, or distant cancer relapse or to death due to HCC.

Results

Of the 1618 patients with a diagnosis of HCC at the * between 2010 and 2018, a total of 405 patients met the inclusion criteria (flow chart is shown in Supplemental Fig. 1). Of the 405 patients, 220 patients (54.3%) were MVI positive, and 185 patients (45.7%) were MVI negative. The baseline characteristics of all patients are presented in Table 1. All patients were randomly assigned to the training set and validation set at a ratio of 8:2. The radiological features and baseline characteristics of patients stratified by MVI status are presented in Table 2 and Supplemental Table 1, respectively.

Table 1 Baseline information of the entire cohort

Full size table

Table 2 Radiological features stratified by MVI status in the training set and validation set

Full size table

Development of an MVI prediction model based on the 3D-CNN

In the current study, a 3D-CNN Model was developed to assess MVI in an end-to-end training fashion. A graphical abstract of the 3D-CNN Model is shown in Supplemental Fig. 2, and the detailed schematic of the 3D-CNN Model developed to predict MVI status is shown in Fig. 1. The performance of the 3D-CNN Model for the identification of MVI is presented in Table 3. The AUROC values of the 3D-CNN Model in the training set and the validation set were 0.980 (95% CI 0.959–0.993) and 0.906 (95% CI 0.821–0.960), respectively (Fig. 2a, b). The AUPRC values of the 3D-CNN Model in the training set and the validation set were 0.99 and 0.90, respectively (Fig. 2c, d). To improve the interpretability of the 3D-CNN model, we attempted to predict the 15 most important variables selected by the XGBoost method and some valuable radiological features of HCC based on the 3D-CNN Model. A high prediction accuracy means that the established CNN model has encoded the interpretable characteristics to assist in the decision-making process in predicting MVI status. The performance of the 3D-CNN Model in predicting these features is presented in Supplemental Table 2. For example, the AUROC, specificity and sensitivity were 0.776, 0.923 and 0.564, respectively, in predicting the tumor margin status using the 3D-CNN Model.

Table 3 Performance of the MVI predictive models in the validation set

Full size table

Development of MVI predictive models based on XGBoost (Chen and Guestrin 2016)

Next, we used traditional methods to access MVI status preoperatively, that is, manually collecting images and clinical information and developing predictive models based on such collected information. We developed MVI prediction models based on radiological features (Radiological Model), radiomics features (Radiomics Model), clinical variables and their combinations (Radiomics-Radiological-Clinical Model, RRC Model) (Fig. 3). The performance of the predictive models generated by XGBoost is also presented in Table 3. The areas under the receiver operating characteristic curves (AUROCs) of the Radiological Model in the training set and the validation set were 0.900 (95% CI 0.776–0.862) and 0.875 (95% CI 0.761–0.925), respectively. The AUROC values of the Radiomics Model in the training set and the validation set were 0.948 (95% CI 0.918–0.969) and 0.873 (95% CI 0.781–0.937), respectively. The AUROC values of the RRC Model in the training set and the validation set were 0.952 (95% CI 0.923–0.973) and 0.887 (95% CI 0.797–0.947), respectively (Fig. 2).

Importance ranking of variables for predicting MVI status by XGBoost

To identify the most vital features in the preoperative assessment of MVI status, all variables, including 17 radiological features, 7302 radiomics features and 19 baseline characteristics of patients, were evaluated for their importance in predicting MVI status by the XGBoost method. Finally, 129 features were found to contribute to the RRC model. Of all the variables, the tumour margin was ranked first and was the only radiological feature ranking in the top 15 features (Fig. 4), and α-fetoprotein (AFP) level was ranked 4th and was the only baseline characteristic ranking in the top 15 features. The remaining important variables were radiomics features (Table 4).

Table 4 The 15 most important features for MVI classification in the RRC Model

Full size table

In the Radiological Model, the five most important radiological features are as follows: margin of tumor, internal arteries, hypo-dense halo, peritumoral enhancement and lobes involved.

Comparison of the predictive models by 3D-CNN and XGBoost

In the training set, the 3D-CNN Model had the highest AUROC value among the other models, whereas the AUROC value of the Radiological Model was the lowest. The AUROC value of the Radiological Model was lower than that of the Radiomics Model (0.900 vs. 0.951, p = 0.0026). The AUROC value of the Radiomics Model was comparable to that of the RRC Model (0.951 vs. 0.965, p = 0.0523), which demonstrated that the radiological features and clinical variables did not provide significant added value to the Radiomics Model. The AUROC value of the 3D-CNN Model was higher than that of the Radiomics Model (0.980 vs. 0.951, p = 0.0148). However, there was no significant difference between the AUROC value of the 3D-CNN Model and that of the RRC Model (0.980 vs. 0.965, p = 0.1444).

In the validation set, there were no significant differences among the AUROC values of the several predictive models. The AUROC values of the Radiomics Model and the Radiological Model were 0.888 vs. 0.873, p = 0.73. The AUROC values of the Radiomics Model and the RRC Model were 0.888 vs. 0.897, p = 0.72. The AUROC values of the 3D-CNN Model and the Radiomics Model were 0.906 vs. 0.888, p = 0.66. The AUROC values of the 3D-CNN Model and the RRC Model were 0.906 vs. 0.897, p = 0.83.

Recurrence-free survival analysis based on predicted MVI status

The median recurrence-free survival (RFS) of the entire cohort was 22 months. The median RFS of patients with MVI was 6 months. The median RFS of patients without MVI was not available because less than half of the patients experienced recurrence. Kaplan–Meier survival analyses were performed based on the MVI status predicted by the RRC Model and 3D-CNN Model (Fig. 5) within the training set and the validation set. Based on the MVI status predicted by the RRC Model and the 3D-CNN Model, the mean RFS was significantly better in the predicted MVI-negative group than that in the predicted-MVI positive group (in the training set, RRC Model: 55.30 months vs. 19.99 months, p < 0.001; 3D-CNN Model: 50.24 months vs. 23.95 months, p < 0.001. In the validation set, RRC Model: 69.95 months vs. 24.80 months, p < 0.001; 3D-CNN Model: 64.06 months vs. 31.05 months, p = 0.027).

Discussion

A preoperative noninvasive assessment of MVI may be essential to guide treatment strategies. In this study, we developed models based on image analysis by XGBoost and 3D-CNN, which may enhance the accuracy of preoperative non-invasive assessment of MVI in HCC patients. These machine learning models shown considerable efficacy in identifying MVI preoperatively.

Several studies have utilized radiological features or radiomics features to predict the status of MVI in HCC. Studies have reported that radiological features like the tumour margin, internal arteries, peritumoural enhancement and hypodense halos are essential in predicting MVI (Banerjee et al. 2015; Renzulli et al. 2016, 2018; Zheng et al. 2017), which is consistent with the current study. With the development of computer-assisted diagnosis methods, radiomics analysis has also been adopted to predict MVI status in HCC. In the study by Xu et al. (2019), they developed a regression model based on radiological features, clinical variables and radiomics features to predict MVI status and achieved an AUROC of 0.889 in the internal test set. In the current study, we also developed the RRC Model based on radiological features, radiomics features and clinical variables using a machine learning method, namely XGBoost. The RRC Model achieved an AUROC of 0.897 in the internal validation set, which is similar to Xu et al. study. We also developed models based on radiological features or radiomics features, and there were no significant differences between the Radiological Model and the Radiomics Model. We believe that each of the two methods has its own advantages. Radiological features are easy to understand and practical in clinical work, however, the accuracy of abstract of these features rely on experience of radiologists. Radiomics features are pre-defined by experts and quantified by computer, which are independent of experience of radiologists.

The most important highlight of the current study is that to the best of our knowledge, this is the first study to develop an MVI predictive model based on image analysis using machine learning methods (XGBoost and a convolutional neural network). Both of the models showed substantial efficacy in identifying MVI status. For the construction of the RRC Model developed by XGBoost, we collected comprehensive and detailed data including radiological features, radiomics features (based on manual segmentation) and clinical variables, which required extensive work and manpower. Radiomics is now an advanced technique used for image analysis. However, the shortcoming of radiomics analysis is that the method is based on hand-crafted feature extractors, which rely on expert definition and thus do not represent the most optimal option (Hosny et al. 2018). In contrast, the most important advantage of the 3D-CNN Model is that the model achieved high efficacy in identifying MVI status automatically with minimal manpower, time and materials. For the construction of the 3D-CNN Model, we needed only to input images, and clinical data, radiological features or radiomics features did not need to be collected. This significant efficacy accompanied by high efficiency is the primary driver to advance the application of artificial intelligence in medicine. Another innovative point of the current study is that we provided a new means to explain how deep learning can identify MVI. The greatest deficiency of deep learning or a CNN is the lack of interpretability. End-to-end predictive models are common in previous studies utilizing deep learning. To solve this problem, we extracted the output of the second last decision layer as the features to represent the CNN model. We evaluated the performance of the CNN model regarding the identification of some valuable features of HCC (radiological features and radiomics features) in the CT images, and the results were satisfactory, indicating that the CNN model can predict the status of MVI partly based on the explainable features utilized in daily clinical work.

Limitations existed in the current study. First, the accuracy of the 3D-CNN Model in identifying radiological features and radiomics features requires further improvement as we did not intend to construct a model dedicated to identifying these features. We believe that such models can be easily developed in the future, which may substantially reduce the workload of radiologists. Second, this is a single-centre study with a relatively small sample size, and the results therefore require further validation.

In conclusion, we proposed state-of-the-art models based on image analysis by XGBoost and deep learning to provide a preoperative noninvasive assessment method for MVI in HCC patients. The 3D-CNN model showed considerable efficacy in identifying MVI preoperatively with minimal manpower, time and material requirements. This model may facilitate decision making in HCC treatment. We believe that our model may have a substantial impact on the evaluation of tumour stages and the selection of appropriate treatments for HCC patients. Furthermore, this model may also advance the application of artificial intelligence in the area of hepatology. The validity of the model, as well as the long-term outcomes of patients who received treatments based on the model, requires further investigation.

Data availability

The data and material are available from the corresponding author upon reasonable request.

Code availability

The custom code used to analyse the data is available from the corresponding author upon reasonable request.

Abbreviations

AFP:: α-Fetoprotein
AUROC:: Area under receiver operating characteristic curve
AUPRC:: Area under precision-recall curve
CI:: Confidence interval
HCC:: Hepatocellular carcinoma
HR:: Hepatic resection
LT:: Liver transplantation
MVI:: Microvascular invasion
RFS:: Recurrence-free survival
RRC:: Radiomics-Radiological-Clinical
3D-CNN:: Three-dimensional convolutional neural network
XGBoost:: EXtreme Gradient Boosting
AP:: Arterial phase
PVP:: Portal-venous phase
DP:: Delayed phase
VOI:: Volume-of-interest

References

Attia ZI et al (2019) Screening for cardiac contractile dysfunction using an artificial intelligence-enabled electrocardiogram. Nat Med 25:70–74. https://doi.org/10.1038/s41591-018-0240-2
Article CAS PubMed Google Scholar
Banerjee S et al (2015) A computed tomography radiogenomic biomarker predicts microvascular invasion and clinical outcomes in hepatocellular carcinoma. Hepatology 62:792–800. https://doi.org/10.1002/hep.27877
Article PubMed Google Scholar
Bi WL et al (2019) Artificial intelligence in cancer imaging: clinical challenges and applications. CA Cancer J Clin 69:127–157. https://doi.org/10.3322/caac.21552
Article PubMed PubMed Central Google Scholar
Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A (2018) Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 68:394–424. https://doi.org/10.3322/caac.21492
Article PubMed Google Scholar
Chen T, Guestrin C (2016) Xgboost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. ACM, New York, pp 785–794
Chilamkurthy S et al (2018) Deep learning algorithms for detection of critical findings in head CT scans: a retrospective study. Lancet 392:2388–2396. https://doi.org/10.1016/S0140-6736(18)31645-3
Article PubMed Google Scholar
Coudray N et al (2018) Classification and mutation prediction from non-small cell lung cancer histopathology images using deep learning. Nat Med 24:1559–1567. https://doi.org/10.1038/s41591-018-0177-5
Article CAS PubMed Google Scholar
Cucchetti A et al (2010) Preoperative prediction of hepatocellular carcinoma tumour grade and micro-vascular invasion by means of artificial neural network: a pilot study. J Hepatol 52:880–888. https://doi.org/10.1016/j.jhep.2009.12.037
Article PubMed Google Scholar
Esteva A, Kuprel B, Novoa RA, Ko J, Swetter SM, Blau HM, Thrun S (2017) Dermatologist-level classification of skin cancer with deep neural networks. Nature 542:115–118. https://doi.org/10.1038/nature21056
Article CAS PubMed PubMed Central Google Scholar
Gurovich Y et al (2019) Identifying facial phenotypes of genetic disorders using deep learning. Nat Med 25:60–64. https://doi.org/10.1038/s41591-018-0279-0
Article CAS PubMed Google Scholar
Hirokawa F et al (2014) Outcomes and predictors of microvascular invasion of solitary hepatocellular carcinoma. Hepatol Res 44:846–853. https://doi.org/10.1111/hepr.12196
Article CAS PubMed Google Scholar
Hocquelet A et al (2016) Three-dimensional measurement of hepatocellular carcinoma ablation zones and margins for predicting local tumor progression. J Vasc Interv Radiol 27:1038–1045.e1032. https://doi.org/10.1016/j.jvir.2016.02.031
Article PubMed Google Scholar
Hosny A, Parmar C, Quackenbush J, Schwartz LH, Aerts H (2018) Artificial intelligence in radiology. Nat Rev Cancer 18:500–510. https://doi.org/10.1038/s41568-018-0016-5
Article CAS PubMed PubMed Central Google Scholar
Hyun SH et al (2018) Preoperative prediction of microvascular invasion of hepatocellular carcinoma using (18)F-FDG PET/CT: a multicenter retrospective cohort study. Eur J Nucl Med Mol Imaging 45:720–726. https://doi.org/10.1007/s00259-017-3880-4
Article CAS PubMed Google Scholar
Kermany DS et al (2018) Identifying medical diagnoses and treatable diseases by image-based deep learning. Cell 172(1122–1131):e1129. https://doi.org/10.1016/j.cell.2018.02.010
Article CAS Google Scholar
Lee S, Kim SH, Lee JE, Sinn DH, Park CK (2017) Preoperative gadoxetic acid-enhanced MRI for predicting microvascular invasion in patients with single hepatocellular carcinoma. J Hepatol 67:526–534. https://doi.org/10.1016/j.jhep.2017.04.024
Article CAS PubMed Google Scholar
Lim KC et al (2011) Microvascular invasion is a better predictor of tumor recurrence and overall survival following surgical resection for hepatocellular carcinoma compared to the Milan criteria. Ann Surg 254:108–113. https://doi.org/10.1097/SLA.0b013e31821ad884
Article PubMed Google Scholar
Litjens G et al (2017) A survey on deep learning in medical image analysis. Med Image Anal 42:60–88. https://doi.org/10.1016/j.media.2017.07.005
Article PubMed Google Scholar
Mazzaferro V et al (2009) Predicting survival after liver transplantation in patients with hepatocellular carcinoma beyond the Milan criteria: a retrospective, exploratory analysis. Lancet Oncol 10:35–43. https://doi.org/10.1016/S1470-2045(08)70284-5
Article PubMed Google Scholar
Mazzaferro V et al (2018) Metroticket 2.0 model for analysis of competing risks of death after liver transplantation for hepatocellular carcinoma. Gastroenterology 154:128–139. https://doi.org/10.1053/j.gastro.2017.09.025
Article PubMed Google Scholar
Mori Y et al (2018) Real-time use of artificial intelligence in identification of diminutive polyps during colonoscopy: a prospective study. Ann Intern Med 169:357–366. https://doi.org/10.7326/M18-0249
Article PubMed Google Scholar
Nakazawa T et al (2007) Radiofrequency ablation of hepatocellular carcinoma: correlation between local tumor progression after ablation and ablative margin. AJR Am J Roentgenol 188:480–488. https://doi.org/10.2214/AJR.05.2079
Article PubMed Google Scholar
Nault JC, Sutter O, Nahon P, Ganne-Carrie N, Seror O (2018) Percutaneous treatment of hepatocellular carcinoma: state of the art and innovations. J Hepatol 68:783–797. https://doi.org/10.1016/j.jhep.2017.10.004
Article PubMed Google Scholar
Rampasek L, Goldenberg A (2018) Learning from everyday images enables expert-like diagnosis of retinal diseases. Cell 172:893–895. https://doi.org/10.1016/j.cell.2018.02.013
Article CAS PubMed Google Scholar
Renzulli M et al (2016) Can current preoperative imaging be used to detect microvascular invasion of hepatocellular carcinoma? Radiology 279:432–442. https://doi.org/10.1148/radiol.2015150998
Article PubMed Google Scholar
Renzulli M et al (2018) Imaging features of microvascular invasion in hepatocellular carcinoma developed after direct-acting antiviral therapy in HCV-related cirrhosis. Eur Radiol 28:506–513. https://doi.org/10.1007/s00330-017-5033-3
Article PubMed Google Scholar
Villanueva A (2019) Hepatocellular carcinoma. N Engl J Med 380:1450–1462. https://doi.org/10.1056/NEJMra1713263
Article CAS PubMed Google Scholar
Wang WT et al (2018a) Assessment of microvascular invasion of hepatocellular carcinoma with diffusion kurtosis imaging. Radiology 286:571–580. https://doi.org/10.1148/radiol.2017170515
Article PubMed Google Scholar
Wang X, Girshick R, Gupta A, He K (2018b) Non-local neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition. IEEE, Salt Lake City, UT, pp 7794–7803
Wu D et al (2015) Liver computed tomographic perfusion in the assessment of microvascular invasion in patients with small hepatocellular carcinoma. Investig Radiol 50:188–194. https://doi.org/10.1097/RLI.0000000000000098
Article Google Scholar
Xu X et al (2019) Radiomic analysis of contrast-enhanced CT predicts microvascular invasion and outcome in hepatocellular carcinoma. J Hepatol 70:1133–1144. https://doi.org/10.1016/j.jhep.2019.02.023
Article PubMed Google Scholar
Yasaka K, Akai H, Abe O, Kiryu S (2018) Deep learning with convolutional neural network for differentiation of liver masses at dynamic contrast-enhanced CT: a preliminary study. Radiology 286:887–896. https://doi.org/10.1148/radiol.2017170706
Article PubMed Google Scholar
Zhao H, Chen C, Gu S, Yan X, Jia W, Mao L, Qiu Y (2017) Anatomical versus non-anatomical resection for solitary hepatocellular carcinoma without macroscopic vascular invasion: a propensity score matching analysis. J Gastroenterol Hepatol 32:870–878. https://doi.org/10.1111/jgh.13603
Article CAS PubMed Google Scholar
Zheng J et al (2017) Preoperative prediction of microvascular invasion in hepatocellular carcinoma using quantitative image analysis. J Am Coll Surg 225:778–788. https://doi.org/10.1016/j.jamcollsurg.2017.09.003
Article PubMed PubMed Central Google Scholar
Zou J, Huss M, Abid A, Mohammadi P, Torkamani A, Telenti A (2019) A primer on deep learning in genomics. Nat Genet 51:12–18. https://doi.org/10.1038/s41588-018-0295-5
Article CAS PubMed Google Scholar

Download references

Acknowledgements

The manuscript was edited for proper English language, grammar, punctuation, spelling, and overall style by one or more of the highly qualified native English-speaking editors at AJE.

Funding

This study was supported by the National Natural Science Foundation of China (81570593, 81770648); the Natural Science Foundation of Guangdong Province (2015A030312013, 2015A030313038); the Frontier and Key Technologies Innovation Foundation of Guangdong Province (No. 2014B020228003); the Science and Technology Program of Guangdong Province (2014B030301041, 2014A020211015); Major State Research Development Program (2017ZX10203205-006-001, 2017ZX10203205-001-003); the Science and Technology Program of Guangzhou city (201604020001); and the Major Project of Cooperative Innovation of Health Care of Guangzhou City (158100076).

Author information

Yi-Quan Jiang, Su-E. Cao, Shi-Lei Cao, Jian-Ning Chen and Guo-Ying Wang have contributed equally as first authors.

Authors and Affiliations

Department of Hepatic Surgery and Liver Transplantation Center, The Third Affiliated Hospital, Organ Transplantation Institute, Sun Yat-Sen University, 600 Tianhe Road, Guangzhou, 510630, Guangdong, China
Yi-Quan Jiang, Guo-Ying Wang, Yi-Nan Deng, Kai-Ning Zeng, Xi-Jing Yan & Yang Yang
Department of Radiology, The Third Affiliated Hospital, Sun Yat-Sen University, 600 Tianhe Road, Guangzhou, 510630, China
Su-E Cao, Wen-Qi Shi & Jin Wang
Tencent Youtu Lab, Malata Building, Kejizhongyi Road, Nanshan District, Shenzhen, 518075, China
Shilei Cao, Kai Ma & Yefeng Zheng
Department of Pathology, The Third Affiliated Hospital, Sun Yat-Sen University, 600 Tianhe Road, Guangzhou, 510630, China
Jian-Ning Chen, Na Cheng & Chun-Kui Shao
Tencent Healthcare, Tengxun Building, Kejizhongyi Road, Nanshan District, Shenzhen, 518075, China
Hao-Zhen Yang, Wen-Jing Huan & Wei-Min Tang
Organ Transplantation Research Center of Guangdong Province, Guangzhou, 510630, Guangdong, China
Gui-Hua Chen
Guangdong Key Laboratory of Liver Disease Research, The Third Affiliated Hospital, Sun Yat-Sen University, 600 Tianhe Road, Guangzhou, 510630, Guangdong, China
Gui-Hua Chen

Authors

Yi-Quan Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Su-E Cao
View author publications
You can also search for this author in PubMed Google Scholar
Shilei Cao
View author publications
You can also search for this author in PubMed Google Scholar
Jian-Ning Chen
View author publications
You can also search for this author in PubMed Google Scholar
Guo-Ying Wang
View author publications
You can also search for this author in PubMed Google Scholar
Wen-Qi Shi
View author publications
You can also search for this author in PubMed Google Scholar
Yi-Nan Deng
View author publications
You can also search for this author in PubMed Google Scholar
Na Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Kai Ma
View author publications
You can also search for this author in PubMed Google Scholar
Kai-Ning Zeng
View author publications
You can also search for this author in PubMed Google Scholar
Xi-Jing Yan
View author publications
You can also search for this author in PubMed Google Scholar
Hao-Zhen Yang
View author publications
You can also search for this author in PubMed Google Scholar
Wen-Jing Huan
View author publications
You can also search for this author in PubMed Google Scholar
Wei-Min Tang
View author publications
You can also search for this author in PubMed Google Scholar
Yefeng Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Chun-Kui Shao
View author publications
You can also search for this author in PubMed Google Scholar
Jin Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yang Yang
View author publications
You can also search for this author in PubMed Google Scholar
Gui-Hua Chen
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

YQJ, SEC, SLC, JNC, and WQS contributed to the conception and design of the study, acquisition of the data, and analysis and interpretation of the data. YQJ participated in critical drafting and revising of the article for important intellectual content. YND, NC, KM, KNZ, XJY, HZY, WJH, and WMT contributed substantially to the conception and design of the study and acquisition of the data. YZ, CKS, JW, GYW, YY and GHC contributed to the conception and design of the study and provided final approval of the version to be submitted and any revised versions.

Corresponding authors

Correspondence to Yang Yang or Gui-Hua Chen.

Ethics declarations

Conflict of interest

All authors declare that they have no conflicts of interest related to this manuscript. All authors have neither relevant commercial interests nor financial or material support to disclose. All authors have contributed significantly, and all authors are in agreement with the content of the manuscript.

Ethical approval

This is a retrospective study. All private information about the included patients were erased and the requirement for written informed consent is waived by IRB.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 22 kb)

Supplementary file2. Supplemental Figure 1. Flow chart (PDF 38 kb)

Supplementary file3. Supplemental Figure 2. Graphical abstract (TIF 9713 kb)

Supplementary file4 (DOCX 18 kb)

Supplementary file5 (DOCX 16 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Jiang, YQ., Cao, SE., Cao, S. et al. Preoperative identification of microvascular invasion in hepatocellular carcinoma by XGBoost and deep learning. J Cancer Res Clin Oncol 147, 821–833 (2021). https://doi.org/10.1007/s00432-020-03366-9

Download citation

Received: 30 June 2020
Accepted: 18 August 2020
Published: 27 August 2020
Issue Date: March 2021
DOI: https://doi.org/10.1007/s00432-020-03366-9

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Preoperative identification of microvascular invasion in hepatocellular carcinoma by XGBoost and deep learning

Abstract

Purpose

Methods

Results

Conclusion

Similar content being viewed by others

Introduction

Materials and methods

Case cohort

Methods overview

Predictive models based on XGBoost (Chen and Guestrin 2016)

Tumor segmentation

Radiomics feature extraction

Radiological feature extraction

Clinical variables

Feature analysis and predictive model based on XGBoost

Deep learning: the 3D-CNN predictive model (Wang et al. 2018b)

Statistical analysis

Results

Development of an MVI prediction model based on the 3D-CNN

Development of MVI predictive models based on XGBoost (Chen and Guestrin 2016)

Importance ranking of variables for predicting MVI status by XGBoost

Comparison of the predictive models by 3D-CNN and XGBoost

Recurrence-free survival analysis based on predicted MVI status

Discussion

Data availability

Code availability

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Conflict of interest

Ethical approval

Additional information

Publisher's Note

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation