Development of machine learning-based clinical decision support system for hepatocellular carcinoma

Choi, Gwang Hyeon; Yun, Jihye; Choi, Jonggi; Lee, Danbi; Shim, Ju Hyun; Lee, Han Chu; Chung, Young-Hwa; Lee, Yung Sang; Park, Beomhee; Kim, Namkug; Kim, Kang Mo

doi:10.1038/s41598-020-71796-z

Development of machine learning-based clinical decision support system for hepatocellular carcinoma

Article
Open access
Published: 09 September 2020

Volume 10, article number 14855, (2020)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Development of machine learning-based clinical decision support system for hepatocellular carcinoma

Download PDF

Gwang Hyeon Choi¹^na1,
Jihye Yun²^na1,
Jonggi Choi¹,
Danbi Lee¹,
Ju Hyun Shim¹,
Han Chu Lee¹,
Young-Hwa Chung¹,
Yung Sang Lee¹,
Beomhee Park²,
Namkug Kim² &
…
Kang Mo Kim¹

5750 Accesses
21 Citations
3 Altmetric
Explore all metrics

Abstract

There is a significant discrepancy between the actual choice for initial treatment option for hepatocellular carcinoma (HCC) and recommendations from the currently used BCLC staging system. We develop a machine learning-based clinical decision support system (CDSS) for recommending initial treatment option in HCC and predicting overall survival (OS). From hospital records of 1,021 consecutive patients with HCC treated at a single centre in Korea between January 2010 and October 2010, we collected information on 61 pretreatment variables, initial treatment, and survival status. Twenty pretreatment key variables were finally selected. We developed the CDSS from the derivation set (N = 813) using random forest method and validated it in the validation set (N = 208). Among the 1,021 patients (mean age: 56.9 years), 81.8% were male and 77.0% had positive hepatitis B BCLC stages 0, A, B, C, and D were observed in 13.4%, 26.0%, 18.0%, 36.6%, and 6.3% of patients, respectively. The six multi-step classifier model was developed for treatment decision in a hierarchical manner, and showed good performance with 81.0% of accuracy for radiofrequency ablation (RFA) or resection versus not, 88.4% for RFA versus resection, and 76.8% for TACE or not. We also developed seven survival prediction models for each treatment option. Our newly developed HCC-CDSS model showed good performance in terms of treatment recommendation and OS prediction and may be used as a guidance in deciding the initial treatment option for HCC.

Machine learning-based clinical decision support system for treatment recommendation and overall survival prediction of hepatocellular carcinoma: a multi-center study

Article Open access 05 January 2024

Machine learning-based decision support model for selecting intra-arterial therapies for unresectable hepatocellular carcinoma: A national real-world evidence-based study

Article 06 July 2024

Predictive models and early postoperative recurrence evaluation for hepatocellular carcinoma based on gadoxetic acid-enhanced MR imaging

Article Open access 08 January 2023

Introduction

Hepatocellular carcinoma (HCC) is the third and seventh most common malignancy in men and women worldwide, respectively, and its incidence continues to increase¹. The American Association for the Study of Liver Diseases and the European Association for the Study of the Liver currently endorse the Barcelona Clinic Liver Cancer (BCLC) staging system as a primary prognostic model and a allocating tool of HCC treatment^2,3.

However, there is a significant discrepancy in the initial treatment choice for HCC between the recommendations from the BCLC system and real clinical practice^4,5. This is partially because treatment decision for HCC is highly multifactorial, in which physicians need to take into consideration the HCC stage, baseline liver function, and performance status. Moreover, other factors such as location and distribution of tumour, presence of intermediate nodule, comorbidities, socio-economic status, availability of potential living related-donors, and the invasiveness and feasibility of each treatment option play critical roles in determining the clinical outcomes of patients with HCC. Such complex nature of HCC treatment decision has hindered large-sized clinical studies, because conventional statistical methods fall short of aptly controlling multiple variables and factors.

Recent attempts on applying the artificial intelligence (AI) technique to clinical practice have focused on using AI to develop clinical decision support system (CDSS)^{6,7,8,9,10,11}. For this study, we reasoned that machine learning, an application of AI that self-improves by learning from large amounts of data, would be useful for generating an algorithm for that evaluates multiple pretreatment variables to recommend optimal treatment options for HCC¹². We believed that a good-quality database is essential, and the definition and selection of pretreatment variables are also significantly important for clinically plausible results. In this study, we thus gathered a team of well experienced hepatologists and AI scientists at our centre and developed a CDSS algorithm that can recommend optimal initial treatment for patients with HCC and predict the overall survival (OS) of patients after treatment, based on our centre’s experiences.

Methods

Study population

We retrospectively reviewed hospital records of 1,650 consecutive patients who were newly diagnosed with HCC at Asan Medical Center (Seoul, Korea) between January and October 2010 (Supplementary Fig. 1). Patients who had a treatment history of HCC (N = 356), those who received HCC treatment at other hospitals (N = 138), those who had a metastatic liver cancer (N = 71), those with secondary malignancies that might affect survival (N = 36), those with combined hepatocellular-cholangiocarcinoma (N = 21), and those with incidentally detected HCC after transplantation (N = 7) were excluded from the study. Consequently, the study cohort included 1,021 patients with HCC.

All enrolled patients were diagnosed with HCC using liver protocol computed tomography or magnetic resonance imaging or liver biopsy according to the current guidelines of the American Association for the Study of Liver Diseases¹³. Patients were randomly allocated to the derivation or validation set at a ratio of 4:1. The protocols of this study were approved by the Institutional Review Board of Asan Medical Center (IRB number: 2017-0188), and the requirement for informed consent from patients was waived due to the retrospective nature of the study. All methods were performed in accordance with the relevant guidelines and regulations.

Data collection

We used our institutional database to collect information on the initial treatment option, initial treatment response, and OS of all patients. We retrospectively collected pre-treatment demographic, clinical, and imaging variables (Supplementary Table S1), treatment information and survival status of all the 1,021 patents from our centre’s database. The following demographic factors were assessed: age, sex, Eastern Cooperative Oncology Group (ECOG) score, aetiology of liver disease, presence of potential liver-related donor, body mass index (BMI), occupation, resident area, patients’ educational attainment, maximum tumour size, tumour number, tumour type (infiltrative or nodular), tumour enhancement pattern, tumour distribution, portal vein invasion, hepatic vein or inferior vena cava invasion, bile duct invasion, extrahepatic metastases, presence of dysplastic nodule, radiofrequency ablation (RFA) feasibility, presence of cirrhosis, Child–Pugh class, presence of varix, laboratory findings including alpha-feto protein (AFP) level, within or above the Milan criteria, initial treatment option, initial treatment response, and OS. RFA feasibility was defined as a size or location of the tumour to receive percutaneous RFA successfully without significant complications, evaluated by a single hepatologist, G.H.C. Tumour location adjacent to the large vessel, bile duct, hepatic hilum, liver capsule or extrahepatic organ was classified as an RFA non-feasible lesion. OS was defined as the time form date of imaging diagnosis of HCC to the date of death due to any cause.

Among the 61 initial pretreatment variables, 20 key variables (Table 1) were selected based on the importance scores calculated using the automated classifier model and the survival prediction model in the derivation set. Specifically, 14 variables were patient-related factors (age, BMI, Child–Pugh class, presence of varix, presence of ascites, ECOG score, haemoglobin level, platelet count, albumin level, prothrombin time, alanine aminotransferase [ALT] level, total bilirubin level, creatinine level, and AFP level), and six were tumour-related factors (tumour number, maximal tumour size, tumour distribution, presence of portal vein invasion, presence of metastasis, and RFA feasibility). Using these 20 variables, random forest and random survival forest methods were trained and evaluated again to recommend treatment options and to predict OS respectively in both the derivation and validation sets.

Table 1 Key 20 variables for hepatocellular carcinoma-clinical decision support system model.

Full size table

Treatment options were classified as follows: transplantation, surgical resection, RFA or percutaneous ethanol injection therapy (PEIT), transarterial chemoembolisation (TACE), TACE combined with external beam radiotherapy (EBRT), sorafenib treatment, supportive care, and other therapies, which included combined therapy (e.g. surgical resection with intraoperative RFA, TACE combined with sorafenib), palliative resection, intra-arterial cytotoxic chemotherapy, clinical trials, and EBRT alone. Database review was performed by one hepatologist (G.H.C.) to avoid inter-observer bias.

Machine learning for CDSS development in HCC

The primary outcomes were accuracies of treatment recommendation and survival prediction. The index date was defined as the date when patients underwent their first liver protocol computed tomography or magnetic resonance imaging. The follow-up period for each patient was estimated from the index date to the date of death or the last follow-up date.

Due to large differences in survival between treatments, it was difficult to train a machine learning-based model of treatment recommendation and survival prediction in an integrated way. Therefore, treatment recommendation and survival prediction models were separately designed and trained. Treatment recommendation models were hierarchically designed with six classifiers in the same manner as treatment planning in clinical practice. Supervised learning was adapted to prefer curative modalities using a classifier method. Transplantation option was not included in the treatment decision algorithm due to the medical environment of severe shortage of deceased liver donor. Although transplantation was not included in the classifier model, transplantation was suggested as an option, when it met the Millan criteria. Because factors affecting the prognosis were different for each treatment, we developed survival prediction models for each treatment. Our CDSS system operated by sequentially using treatment recommendation and survival prediction models.

To develop the treatment recommendation and survival prediction model, random forest model was employed. Random forest, which is one of the representative ensemble methods, is widely used because it is powerful and relatively lighter than other ensemble methods^14,15. Random forest constructs a number of tree-type base models and forms an ensemble through a technique called bootstrap aggregating or bagging. As the splitting rules for random forests, Gini impurity and log-rank test were used for treatment recommendation and survival prediction models, respectively. Other possible combinations of hyperparameters of models were investigated by grid search using GridSearchCV library in Scikit-learn package.

Figure 1 shows the schematic diagram for the construction of the CDSS model for HCC. The model comprised six multi-step classifiers and seven OS prediction sub-models. The input variables (N = 20) were processed with the algorithm for treatment recommendation with multi-step classifiers. The CDSS model for HCC was designed to prefer curative modalities (transplantation, resection, RFA or PEIT). Once a treatment option is selected, the model demonstrates the predicted OS curve for each patient. Additionally, if another treatment option is available, our CDSS model for HCC can suggest another predicted OS curve after the alternative treatment. Therefore, the model can predict different OS curves of the same patient with different treatments, which could be helpful when clinicians made treatment decisions in actual clinical setting.

Statistical analyses

Baseline characteristics of the patients were compared using the chi-square test for categorical variables and the Mann–Whitney U test for continuous variables. Survival distributions were compared using the Kaplan–Meier method with a log-rank test. Patients in our follow-up programme who were not confirmed deceased were recorded as censored.

In the initial phase of model development, we fitted a univariate Cox proportional hazards model to the treatment decision and survival endpoints. To select variables, we employed a two-step variable selection approach. The first step was to fit a random forest model to compute a variable importance score, and the second step was to compute a relative selection frequency based on a bootstrap resampling method^16,17.

For the validation data sets, per-patient based analysis was performed from probability values using accuracy, sensitivity, specificity, positive predictive value, and negative predictive value for each classifier. The accuracy was defined as the percentage of correctly classified instances and calculated as follows: accuracy = (TP + TN)/(TP + TN + FP + FN), where TP, TN, FP, and FN are true positive, true negative, false positive, and false negative, respectively. Each survival prediction model was validated using bootstrapping to correct for optimistic bias. Time-dependent concordance (C)-index was used to evaluate predicted survival times, which were ranked in accordance with the observed survival times. All P-values were two-sided and P < 0.05 were considered significant. The outcome of implicit feature selection of the random forest was visualised using the Gini importance¹⁸. SPSS version 21 (SPSS, Inc., Chicago, IL), open-source Scikit-learn package in python version 0.19.1¹⁹, and random Forest SRC package in R version 3.4.1 (R Core Team, Vienna, Austria)²⁰ were used for statistical analyses.

Results

Characteristics of the study patients

We trained our CDSS system using the derivation set (N = 813) and validated it in the validation set (N = 208). Two sets were divided by stratified random splits. The same derivation and validation sets were used for both treatment recommendation and survival prediction models. A total of 460 and 128 patients died during the median follow-up periods of 37.8 (interquartile range [IQR], 8.3–84.7) and 48.6 (IQR, 8.3–83.1) months, respectively. Patients’ baseline demographics of patients are summarised in Table 2. Of the total 1,021 patients (mean age, 56.9 years), 81.8% were male, and 77.0% had positive hepatitis B virus surface antigen. Moreover, 76.3% of patients were classified with Child–Pugh class A, and 75.1% had ECOG score of 0. Regarding tumour-related factors, 41.7% of patients had multiple tumours, and the median maximal tumour diameter was 4.0 cm (IQR 2.3–8.5). Portal vein invasion and distant metastasis were confirmed in 22.8% and 12.2% of patients, respectively. BCLC stages 0, A, B, C, and D were observed in 13.4%, 26.0%, 18.0%, 36.6%, and 6.3% of patients, respectively. As an initial treatment, transplantation was performed in 4.5%, resection in 32.9%, RFA or PEIT in 7.5%, TACE in 31.5%, TACE combined with EBRT in 6.6%, sorafenib treatment in 3.0%, supportive care in 10.1%, and other therapies in 3.8% of patients. Among the other therapies, nine patients underwent resection combined with intraoperative RFA, nine underwent palliative resection, eight underwent EBRT to liver, six underwent TACE combined with sorafenib or cytotoxic chemotherapy, and four underwent intra-arterial cytotoxic chemotherapy. Moreover, three patients were enrolled in clinical trials and underwent systemic therapy. There was no significant difference between the derivation and validation set with respect to patient-, tumour-, or treatment-related variables.

Table 2 Baseline characteristics of the patients, tumors, and initial treatment options.

Full size table

Survival of the study patients according to the initial treatment

Supplementary Figure S2 shows the Kaplan–Meier survival curve according to the initial treatment in all patients. The 5-year survival rates of transplantation, resection, and RFA/PEIT were 86.5%, 73.7%, and 70.5%, respectively. The median survival of TACE, TACE + EBRT, sorafenib treatment, and other therapies and supportive care were 32.7 (95% confidence interval [CI] 27.0–38.4), 9.5 (95% CI 7.3–11.7), 4.2 (95% CI 2.7–5.8), 10.6 (95% CI 5.9–15.4), and 2.3 months (95% CI 1.6–3.1), respectively.

Performance of the treatment recommendation classifier of the CDSS model for HCC

Table 3 shows the accuracy of the six classifier models trained from the derivation set. The recommended treatment from the model was compared with the treatment used in real clinical practice in the validation set. Overall, our CDSS classifier model for HCC was well generalised and showed good performance, and its standard deviations were higher in the lower branches of the treatment (e.g. sorafenib treatment, supportive care, other therapies) as the number of patients were relatively smaller. The accuracies of classifiers 1, 2, 3, 4, and 5 were 81.0% (curative treatments versus not curative treatments), 88.4% (resection versus RFA/PEIT), 76.8% (TACE vs. or not TACE), 76.6% (TACE + EBRT versus not TACE + EBRT), 80.0% (sorafenib treatment versus not sorafenib treatment), and 80.1% (supportive care versus other therapies), respectively. Supplementary Figure S3 shows the importance of the features ranked by the Gini importance that calculates reduced impurity in all trees.

Table 3 Accuracy, sensitivity, specificity, positive predictive value, and negative predictive value for the six classifier models in the validation set.

Full size table

Performance of survival prediction of the CDSS model for HCC

Figure 2 shows predicted survival curves of each recommended treatment in the validation set. The ‘Ground truth curves’ represent the Kaplan–Meier survival curve of patients in the validation set in real clinical practice. The C-index values for the derived models of OS for RFA/PEIT, resection, TACE, TACE + EBRT, sorafenib treatment, supportive care, transplantation, and other therapies were 0.725 (95% CI, 0.708–0.741), 0.695 (95% CI, 0.680–0.709), 0.803 (95% CI, 0.796–0.809), 0.676 (95% CI, 0.658–0.694), 0.684 (95% CI, 0.648–0.720), 0.710 (95% CI, 0.689–0.730), 0.959 (95% CI, 0.949–0.969), and 0.850 (95% CI, 0.835–0.884), respectively. Supplementary Figure S4 shows the importance of the features for OS prediction in each recommended treatment.

Discussion

In the present study, we developed a machine learning-based CDSS algorithm for recommending initial treatment option for HCC by employing clinical data from 1,021 patients. Treatment recommendations made by the CDSS model for HCC showed high accordance with the actual treatment choices, and the OS prediction was also highly associated with the observed 5-year survival rates.

We present a detailed example of the application of the CDSS model for HCC (Fig. 3). A 43-year-old male patient had Child–Pugh class A and a 2-cm-sized single HCC without evidence of vascular invasion and extrahepatic metastasis. The patient’s clinical details were as follows: ECOG score 0, haemoglobin 12.2 g/dL, platelet count 92 × 10⁹/mm3, albumin 3.4 g/dL, ALT 46 U/L, total bilirubin 1.0 mg/dL, creatinine 0.7 mg/dL, and AFP 42.4 ng/mL. The HCC CDSS model recommended resection as the initial treatment and the predicted 3-year and 5-year survival rates were 90.2% and 83.4%, respectively. The CDSS model for HCC also provided an estimated survival rate for RFA and transplantation, for which the predicted 5-year survival rates were 51.5% and 94.7%, respectively. In real clinical practice, this patient initially underwent resection, experienced HCC recurrence 3.2 year after the resection, received subsequent multiple on-demand TACE treatments, and still survived for a total of 6.9 years following resection.

For the development of the CDSS model for HCC, we adapted the machine learning method to overcome the complexity of treatment decision for HCC. We took special care in selecting the proper pretreatment variables and constructing high-quality database in order to ensure that the algorithm training goes well to produce applicable results. We first recruited 61 variables that are known to influence HCC treatment decision in daily clinical practice, and inputted them in a hierarchical classifier model. Through a refinement process, we finally selected 20 key pre-treatment variables with the highest importance in our model, and the resulting CDSS model proved to have good prediction ability for both treatment option and OS in the validation set.

To the best of our knowledge, this is the first description of a machine learning-based CDSS model developed for treatment decision and survival prediction in HCC. Our CDSS model for HCC not only provides the best treatment option, but also suggests alternative treatment and predicts prognosis after each treatment. Our results show that the CDSS model for HCC may be used as a supplementary system for physicians in deciding the treatment option for HCC and explaining their choice to the patients. Future multicentre studies using the HCC-CDSS model would allow for a more powerful comparison in the treatment patterns between centres and recommend treatment options with relative strength according to each centre.

Previous studies that used AI to study HCC have primarily focused on prognosis prediction after resection or TACE^6,9,10,21,22. A recent study employed deep learning to identify multi-omics features associated with the differential survival of patients with HCC²³. Compared to the algorithms used in previous studies, our algorithm focused more on the clinical and radiological parameters and could thus be more easily used in daily clinical practice. The integration of individual genetic information to the HCC CDSS model would enable physicians to make a more accurate selection for HCC treatment in each patient.

The Watson for Oncology, a cognitive computing system trained at the Memorial Sloan Kettering Cancer Center (New York, USA), uses natural language processing and machine learning to provide treatment recommendations. The Watson system processes structured and unstructured data from medical literature, treatment guidelines, medical records, imaging, laboratory and pathology reports, and the expertise of the physicians at Memorial Sloan Kettering to formulate therapeutic recommendations²⁴. However, the Watson system has yet to be adapted for use in HCC, which may partially be due to complexity of factors that affect the treatment decision in HCC.

Our algorithm adapted manual database input in the development, and human efforts are certainly required during data acquisition. However, we already started another AI study, allowing us to automatically learn the radiological information of HCC, and training our algorithm more easily in the future learning process.

Our algorithm could not properly classify patients who received living related donor liver transplantation (LDLT) in the derivation set. As a possible explanation, it was generated in the medical environment of severe shortage of deceased liver donor. LDLT is the main method used when performing liver transplantation in our centre. Decision process of LDLT could be significantly different from that of other treatments, probably because not only availability of living donors and ethical and economic problems but also treatment willingness of the patients could influence significantly more to the decision of LDLT. Therefore, our algorithm could not recommend LDLT in the relevant patients. Hence, a different process in the decision of LDLT in patients with HCC within the Milan criteria should be considered. However, our algorithm could even predict the survival of a patient if he/she has a certain condition that requires LDLT as an initial treatment.

The present study has the following limitations. First, in this study, we trained our algorithm only for the initial treatment option and not for subsequent treatments after recurrence. Moreover, our algorithm was trained using a database from a single centre located in a HBV-endemic area with mostly male patients. Therefore, the HCC-CDSS model may show less power when used in centres with different demographics (e.g. ethnicity, aetiology, level of hospital facility, socio-economic status of the country, and even reimbursement policy), where the optimal treatment option would be different. Second, this study comprised a relatively small sample size included for each treatment specially transplantation, RFA, and sorafenib treatment. Although the c-index of the survival prediction model for these treatments was at an acceptable level, additional validation is required. Therefore, we look forward to expanding our database with the collaboration with diverse medical centres through online web-site, allowing and to make our algorithm to be more suitable for use in diverse clinical environments. Finally, although patient’s preference is one of the important factors in making treatment decisions, this variable was not included in this model. However, this cannot be quantified only by the patient's age and financial status. Therefore, we tried to compensate it by presenting the survival curve of preferred and alternative treatments.

We are more than willing to share our algorithm from the web-site with any centre worldside baseed on collaboration. This algorithm was built basically from our clinical practice and could function differently in other centres, but, hopefully, a future multicentre study could widen the usefulness of our algorithm as a method of efficacy comparison between different centres.

In conclusion, we developed HCC CDSS model for treatment decision and prognosis prediction in patients with HCC. This algorithm is considered benefical to physicians when discussing with HCC patients and when establishing a treatment decision for the appropriate initial treatment based on the estimated survival according to each treatment option, specially in HBV-endemic area. Further CDSS model with the integration of genetic information and automatically acquired imaging data could enable more individualised treatment to each patient.

Abbreviations

AFP:: Alpha-fetoprotein
AI:: Artificial intelligence
ALT:: Alanine aminotransferase
BCLC:: Barcelona Clinic Liver Cancer
BMI:: Body mass index
CDSS:: Clinical decision support system
LDLT:: Living related donor liver transplantation
EBRT:: External beam radiotherapy
ECOG:: Eastern Cooperative Oncology Group
HCC:: Hepatocellular carcinoma
IQR:: Interquartile range
OS:: Overall survival
PEIT:: Percutaneous ethanol injection therapy
RFA:: Radiofrequency ablation
TACE:: Transarterial chemoembolisation

References

Fitzmaurice, C. et al. The global burden of cancer 2013. JAMA Oncol. 1, 505–527. https://doi.org/10.1001/jamaoncol.2015.0735 (2015).
Article PubMed Google Scholar
Heimbach, J. K. et al. AASLD guidelines for the treatment of hepatocellular carcinoma. Hepatology 67, 358–380. https://doi.org/10.1002/hep.29086 (2018).
Article Google Scholar
EASL Clinical Practice Guidelines: Management of hepatocellular carcinoma. J. Hepatol. https://doi.org/10.1016/j.jhep.2018.03.019 (2018).
Leoni, S. et al. Adherence to AASLD guidelines for the treatment of hepatocellular carcinoma in clinical practice: experience of the Bologna Liver Oncology Group. Digest. Liver Dis. 46, 549–555. https://doi.org/10.1016/j.dld.2014.02.012 (2014).
Article Google Scholar
Park, J. W. et al. Global patterns of hepatocellular carcinoma management from diagnosis to death: the BRIDGE Study. Liver Int. 35, 2155–2166. https://doi.org/10.1111/liv.12818 (2015).
Article PubMed PubMed Central Google Scholar
Abajian, A. et al. Predicting treatment response to intra-arterial therapies for hepatocellular carcinoma with the use of supervised machine learning-an artificial intelligence concept. J. Vasc. Intervent. Radiol.: JVIR 29, 850-857.e851. https://doi.org/10.1016/j.jvir.2018.01.769 (2018).
Article Google Scholar
Barbieri, C. et al. An international observational study suggests that artificial intelligence for clinical decision support optimizes anemia management in hemodialysis patients. Kidney Int. 90, 422–429. https://doi.org/10.1016/j.kint.2016.03.036 (2016).
Article PubMed Google Scholar
Villanueva, A. et al. New strategies in hepatocellular carcinoma: genomic prognostic markers. Clin. Cancer Res. 16, 4688–4694. https://doi.org/10.1158/1078-0432.Ccr-09-1811 (2010).
Article CAS PubMed PubMed Central Google Scholar
Cucchetti, A. et al. Preoperative prediction of hepatocellular carcinoma tumour grade and micro-vascular invasion by means of artificial neural network: a pilot study. J. Hepatol. 52, 880–888. https://doi.org/10.1016/j.jhep.2009.12.037 (2010).
Article PubMed Google Scholar
Qiao, G. et al. Artificial neural networking model for the prediction of post-hepatectomy survival of patients with early hepatocellular carcinoma. J. Gastroenterol. Hepatol. 29, 2014–2020. https://doi.org/10.1111/jgh.12672 (2014).
Article CAS PubMed Google Scholar
Gulshan, V. et al. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA 316, 2402–2410. https://doi.org/10.1001/jama.2016.17216 (2016).
Article PubMed Google Scholar
Bishop, C. M. & Bishop, C. M. Pattern recognition and machine learning (Springer, Berlin, 2006).
MATH Google Scholar
Bruix, J. & Sherman, M. Management of hepatocellular carcinoma: an update. Hepatology 53, 1020–1022. https://doi.org/10.1002/hep.24199 (2011).
Article PubMed PubMed Central Google Scholar
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Article Google Scholar
Wyner, A. J., Olson, M., Bleich, J. & Mease, D. Explaining the success of adaboost and random forests as interpolating classifiers. J. Mach. Learn. Res. 18, 1–33 (2017).
MathSciNet MATH Google Scholar
Bland, J. M. & Altman, D. G. Statistics notes: bootstrap resampling methods. BMJ 350, h2622. https://doi.org/10.1136/bmj.h2622 (2015).
Article PubMed Google Scholar
Carpenter, J. & Bithell, J. Bootstrap confidence intervals: when, which, what? A practical guide for medical statisticians. Stat. Med. 19, 1141–1164. https://doi.org/10.1002/(sici)1097-0258(20000515)19:9%3c1141::aid-sim479%3e3.0.co;2-f (2000).
Article CAS PubMed Google Scholar
Zhu, X. & Lang, J. Soluble PD-1 and PD-L1: predictive and prognostic significance in cancer. Oncotarget 8, 97671–97682. https://doi.org/10.18632/oncotarget.18311 (2017).
Article PubMed PubMed Central Google Scholar
Pedregosa, F. et al. Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
MathSciNet MATH Google Scholar
Ishwaran, H. & Kogalur, U.Random Forests for Survival, Regression, and Classification (RF-SRC), R Package Version 2.7.0, https://cran.r-project.org/web/packages/randomForestSRC/citation.html (2018).
Chiu, H. C., Ho, T. W., Lee, K. T., Chen, H. Y. & Ho, W. H. Mortality predicted accuracy for hepatocellular carcinoma patients with hepatic resection using artificial neural network. Sci. World J. 2013, 201976. https://doi.org/10.1155/2013/201976 (2013).
Article Google Scholar
Shi, H. Y. et al. Artificial neural network model for predicting 5-year mortality after surgery for hepatocellular carcinoma: a nationwide study. J. Gastrointestinal Surg. 16, 2126–2131. https://doi.org/10.1007/s11605-012-1986-3 (2012).
Article Google Scholar
Chaudhary, K., Poirion, O. B., Lu, L. & Garmire, L. X. Deep learning-based multi-omics integration robustly predicts survival in liver cancer. Clin. Cancer Res. 24, 1248–1259. https://doi.org/10.1158/1078-0432.Ccr-17-0853 (2018).
Article CAS PubMed Google Scholar
Ferrucci, D. A. Introduction to “This is Watson”. Ibm. J. Res. Dev. https://doi.org/10.1147/Jrd.2012.2184356 (2012).
Article Google Scholar

Download references

Acknowledgements

This work was supported by grants from the National Research Foundation of Korea (NRF, Grant Number: 2018R1A2B6007377) funded by the Ministry of Science, ICT and Future Planning of Republic of Korea.

Author information

These authors contributed equally: Gwang Hyeon Choi and Jihye Yun.

Authors and Affiliations

Department of Gastroenterology, Asan Liver Center, Asan Medical Center, University of Ulsan College of Medicine, 88 Olympic-ro 43-gil, Songpa-gu, Seoul, 05505, Korea
Gwang Hyeon Choi, Jonggi Choi, Danbi Lee, Ju Hyun Shim, Han Chu Lee, Young-Hwa Chung, Yung Sang Lee & Kang Mo Kim
Department of Convergence Medicine and Radiology, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Korea
Jihye Yun, Beomhee Park & Namkug Kim

Authors

Gwang Hyeon Choi
View author publications
You can also search for this author in PubMed Google Scholar
Jihye Yun
View author publications
You can also search for this author in PubMed Google Scholar
Jonggi Choi
View author publications
You can also search for this author in PubMed Google Scholar
Danbi Lee
View author publications
You can also search for this author in PubMed Google Scholar
Ju Hyun Shim
View author publications
You can also search for this author in PubMed Google Scholar
Han Chu Lee
View author publications
You can also search for this author in PubMed Google Scholar
Young-Hwa Chung
View author publications
You can also search for this author in PubMed Google Scholar
Yung Sang Lee
View author publications
You can also search for this author in PubMed Google Scholar
Beomhee Park
View author publications
You can also search for this author in PubMed Google Scholar
Namkug Kim
View author publications
You can also search for this author in PubMed Google Scholar
Kang Mo Kim
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

GH Choi, JH Yun, N Kim and KM Kim were responsible for the study concept and design, data acquisition, analysis, and interpretation, and manuscript drafting. JG Choi, D Lee, JH Shim, HC Lee, YH Chung, YS Lee, and B Park assisted in data acquisition, analysis, and interpretation. GH Choi, JH Yun and B Park performed the statistical analyses. Guarantor of the article: Kang Mo Kim and Namkug Kim.

Corresponding authors

Correspondence to Namkug Kim or Kang Mo Kim.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary file1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Choi, G.H., Yun, J., Choi, J. et al. Development of machine learning-based clinical decision support system for hepatocellular carcinoma. Sci Rep 10, 14855 (2020). https://doi.org/10.1038/s41598-020-71796-z

Download citation

Received: 31 January 2020
Accepted: 04 August 2020
Published: 09 September 2020
DOI: https://doi.org/10.1038/s41598-020-71796-z
Springer Nature Limited

This article is cited by

Machine learning-based clinical decision support system for treatment recommendation and overall survival prediction of hepatocellular carcinoma: a multi-center study
- Kyung Hwa Lee
- Gwang Hyeon Choi
- Kang Mo Kim
npj Digital Medicine (2024)

Development of machine learning-based clinical decision support system for hepatocellular carcinoma

Abstract

Similar content being viewed by others

Machine learning-based clinical decision support system for treatment recommendation and overall survival prediction of hepatocellular carcinoma: a multi-center study

Machine learning-based decision support model for selecting intra-arterial therapies for unresectable hepatocellular carcinoma: A national real-world evidence-based study

Predictive models and early postoperative recurrence evaluation for hepatocellular carcinoma based on gadoxetic acid-enhanced MR imaging

Introduction