Multiple machine-learning tools identifying prognostic biomarkers for acute Myeloid Leukemia

Cheng, Yujing; Yang, Xin; Wang, Ying; Li, Qi; Chen, Wanlu; Dai, Run; Zhang, Chan

doi:10.1186/s12911-023-02408-9

Multiple machine-learning tools identifying prognostic biomarkers for acute Myeloid Leukemia

Research
Open access
Published: 02 January 2024

Volume 24, article number 2, (2024)
Cite this article

Download PDF

You have full access to this open access article

BMC Medical Informatics and Decision Making Aims and scope Submit manuscript

Multiple machine-learning tools identifying prognostic biomarkers for acute Myeloid Leukemia

Download PDF

Yujing Cheng¹,
Xin Yang¹,
Ying Wang¹,
Qi Li¹,
Wanlu Chen¹,
Run Dai¹ &
…
Chan Zhang¹

1108 Accesses
Explore all metrics

Abstract

Background

Acute Myeloid Leukemia (AML) generally has a relatively low survival rate after treatment. There is an urgent need to find new biomarkers that may improve the survival prognosis of patients. Machine-learning tools are more and more widely used in the screening of biomarkers.

Methods

Least Absolute Shrinkage and Selection Operator (LASSO), Support Vector Machine-Recursive Feature Elimination (SVM-RFE), Random Forest (RF), eXtreme Gradient Boosting (XGBoost), lrFuncs, IdaProfile, caretFuncs, and nbFuncs models were used to screen key genes closely associated with AML. Then, based on the Cancer Genome Atlas (TCGA), pan-cancer analysis was performed to determine the correlation between important genes and AML or other cancers. Finally, the diagnostic value of important genes for AML was verified in different data sets.

Results

The survival analysis results of the training set showed 26 genes with survival differences. After the intersection of the results of each machine learning method, DNM1, MEIS1, and SUSD3 were selected as key genes for subsequent analysis. The results of the pan-cancer analysis showed that MEIS1 and DNM1 were significantly highly expressed in AML; MEIS1 and SUSD3 are potential risk factors for the prognosis of AML, and DNM1 is a potential protective factor. Three key genes were significantly associated with AML immune subtypes and multiple immune checkpoints in AML. The results of the verification analysis show that DNM1, MEIS1, and SUSD3 have potential diagnostic value for AML.

Conclusion

Multiple machine learning methods identified DNM1, MEIS1, and SUSD3 can be regarded as prognostic biomarkers for AML.

View this article's peer review reports

Molecular subtypes predict therapeutic responses and identifying and validating diagnostic signatures based on machine learning in chronic myeloid leukemia

Article Open access 06 April 2023

RETRACTED ARTICLE: Ensemble learning-based gene signature and risk model for predicting prognosis of triple-negative breast cancer

Article 14 March 2023

Survival prediction in acute myeloid leukemia using gene expression profiling

Article Open access 03 March 2022

Backgrounds

Acute myeloid leukemia (AML) is a malignant bone marrow disease characterized by clonal expansion and differentiation arrest of bone marrow progenitor cells. Most AML cases still have no clear etiology [1]. AML is the most common acute leukemia in adults, and its survival time is short [2]. In recent years, with the rapid development of molecular targeted therapy and combined therapy, and the widespread application of these two therapies in clinical practice, the survival and prognosis of AML patients have been relatively prolonged and improved [3]. Intensive chemotherapy and gene stem cell transplantation are usually applied to a small number of young patients, and for most patients, the prognosis and survival rate are poor [4]. Although the treatment strategies for AML have been continuously adjusted and improved over the past few decades, the effect of these treatment strategies on the survival and prognosis of patients is still minimal [5]. Therefore, the identification of new and effective prognostic biomarkers is crucial for accurately predicting the prognosis of AML patients and for a deeper and more comprehensive understanding of the pathogenesis of AML.

With the advancement of gene sequencing technology, a series of gene databases have emerged, such as the Cancer Genome Atlas (TCGA) and the Gene Expression Omnibus (GEO). In addition, machine learning algorithms, as one of the main tools of data mining, are now widely used in the medical field. The algorithm establishes a risk model by learning the existing data of patients, which is used to predict the disease, diagnose the severity of the disease, and evaluate the prognosis of the disease [6, 7]. Its main types include the least absolute shrinkage and selection operator (LASSO), random forest graph (RF), support vector machine (SVM), decision tree, and other common algorithms. LASSO is the only property of the absolute value of the penalized regression coefficient [8]. The greater the penalty, the greater the shrinkage of the coefficient, and then remove the unimportant covariates [9, 10]. Support vector machine recursive feature elimination (SVM-RFE) is a supervised machine learning technique widely used in classification and regression. Its purpose is to classify data points by maximizing the margin between classes in high-dimensional space. The features are classified according to the accuracy value, and several features with higher accuracy are selected [11]. The RF algorithm is a method of training and predicting samples by constructing a decision tree. The features with high importance scores are obtained by calculating and sorting the importance scores of features [12]. These machine algorithms can learn and train from data to achieve accurate predictions of future events [13]. These algorithms are gradually being used in the prognosis of lung cancer, breast cancer, liver cancer, gastrointestinal cancer, and other malignant tumors, which has become a hot spot in clinical research [14,15,16,17].

Machine learning algorithms contain a variety of types, and each model has its scope of application. Different types, different volumes, and different characteristics of data have different prediction performance. Therefore, this study aims to screen genes related to AML prognosis based on multiple types of machine learning, and then take the intersection gene of each machine learning result as the key gene for subsequent research. Finally, pan-cancer analysis was performed on key genes to further clarify the correlation between key genes and the occurrence and development of AML or other cancers and then to clarify the diagnostic value of key genes for AML. This study will provide new ideas for the prognosis evaluation of AML patients, and then promote the efficient and accurate individualized treatment of AML.

Methods

Acquisition of data sets

Data on acute myeloid leukemia were obtained from the Gene Expression Omnibus (GEO) database (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi). Due to the excessive amount of search data, we set some conditions to select candidate data sets. The specific screening criteria for the dataset used in this study were acute leukemia, United States, Homo sapiens, adults, and with a total sample size greater than 100 people. Finally, three data sets were selected for subsequent research (GSE63270, GSE15061, and GSE48558). The basic information of each data set is shown in Table 1. The gene set of GSE15061 was obtained for expression difference analysis (setting the threshold: |logFC| > 1.5, P < 0.05). The differential volcano map (http://sangerbox.com/tool.html) was drawn using the SangerBox website.

Table 1 Basic information for all data sets used in this study

Full size table

Screening the key genes related to the prognosis of AML based on multiple machine-learning methods

Kaplan-Meier (KM) survival analysis of differentially expressed genes in the training set was performed using the R-4.1.1 software package (“survival” and “survminer”).

Subsequently, R-4.1.1 different software packages were used to perform machine learning such as LASSO (R package: “glmnet”, “survival”, “survminer”), RF (R package: “randomForest”, “caret”, “varSelRF”), SVM-RFE (R package: “svm”, “caret”, and “randomForest”), and XGBoost (R package: “xgboost”, and “caret”) on genes related to AML prognosis in the training set. In this study, we using the different functions in caret package of the R software (lrFuncs, IdaProfile, caretFuncs, and nbFuncs) to screened the optimal genes. The specific operation process and screening criteria were carried out according to the official manual of R software (http://topepo.github.io/caret/recursive-feature-elimination.html#recursive-feature-elimination-via-caret). The parameters of the machine learning algorithms used in this study were set according to previous studies [18,19,20].

Finally, the key genes were obtained after the intersection of the genes screened by the above machine-learning methods for subsequent analysis.

Pan-cancer analysis of key genes

Using the UCSC XENA database (http://xenabroswer.net/hub), which integrates public data from multiple databases, downloaded data sets that have been uniformly standardized (including the Cancer Genome Atlas (TCGA), Genotype Tissue Expression (GTEx). We extracted the expression of key genes in 33 cancer types, immune subtypes, clinical information, and other data from the downloaded data set. Then SangerBox (http://sangerbox.com/tool.html) was used to analyze the expression of key genes in cancer, survival analysis, immune analysis, and so on.

The correlation matrix heat map was drawn by SangerBox to explore the expression differences at immune checkpoints. The box plot was drawn by the R package (“ggplot2”, “ggsignif”, “ggpubr”, and “RColorBrewer”) to analyze the correlation between key genes and different subtypes of immune cells.

Gene set enrichment analysis (GSEA) of key genes

To identify pathways associated with key genes, we performed GSEA enrichment analyses. GSEA does not require genetic screening, thereby preserving genes that are not significantly different in expression but are functionally important [21]. GSEA analysis was performed by Sangerbox online bioinformatics tool (http://sangerbox.com/tool.html) based on AML mRNA data in the training set (GSE15061), which will identify the signaling pathways that potentially be related to the key genes screened by the machine learning algorithm.

Validation of diagnostic value of key genes

To verify the diagnostic value of key genes for acute myeloid leukemia, GSE63270 and GSE48558 data sets were used for verification. The “barplot” software package in R was used to verify the differential expression of key genes in AML patients, and the ROC (receiver operating characteristic curve) curve was drawn to evaluate the diagnostic value of key genes.

Results

Survival analysis of differentially expressed genes

The difference expression analysis of the training set GSE15061 was performed, and the threshold was set: |logFC| > 1.5, P < 0.05. The results showed that (Fig. 1) there were 171 differentially expressed genes, including 151 down-regulated genes and 20 up-regulated genes. Subsequent survival analysis of differentially expressed genes showed that a total of 26 differentially expressed genes had prognostic differences in the training set (Supplemental Fig. 1).

Screening key genes related to AML prognosis based on multiple machine learning

Firstly, LASSO regression analysis was used to screen 26 genes with prognostic value, and 10-fold cross-validation was performed. According to LASSO regression machine learning (Fig. 2A–B), a total of 20 genes were screened. Then, we use the RF algorithm (parameter settings: ntree = 2000, mtry = 6) to obtain the importance of input variables and screen out the top five genes (Fig. 2C–D). The SVM-RFE algorithm was used to remove the last few feature genes in the weight ranking of the training set in one round, and 22 genes were screened (Fig. 3A). According to the XGBoost model (Fig. 3B), 19 genes were screened and showed good discrimination, with an AUC of 0.964 (Fig. 3C). Finally, the recursive feature elimination in the ‘caret’ package was used to construct different models. The results showed that lrFuncs screened 15 genes (Fig. 4A), IdaProfile screened 26 genes (Fig. 4B), caretFuncs screened 12 genes (Fig. 4C), and nbFuncs screened 24 genes (Fig. 4D).

The important genes screened by each machine learning algorithm have been obtained (Table 2). And the intersection genes of important genes screened by these machine-learning algorithms are also summarized in Table 2. Figure 5 is the Upset diagram, which illustrating shared genes in important genes screened by different machine learning algorithm. The results showed that DNM1, MEIS1, and SUSD3 can be regarded as key genes for subsequent studies.

Table 2 The results of all machine learning algorithms used for screening important genes related to AML prognosis

Full size table

Pan-cancer analysis for key genes associated with AML prognosis

For the three key genes screened by machine learning tools, single gene pan-cancer analysis was performed respectively. The results showed that DNM1, MEIS1, and SUSD3 were differentially expressed between various cancers and normal tissues. MEIS1 and DNM1 are highly expressed in AML (Fig. 6A and B), while SUSD3 is not significantly different in AML (Fig. 6C).

To study the relationship between the expression levels of DNM1, MEIS1, and SUSD3 and the prognosis of 33 kinds of cancer, we carried out a survival correlation analysis. Cox proportional hazard model analysis showed that MEIS1 was not only associated with poor prognosis of AML (p = 8.0e-3, HR = 1.15), but also associated with poor prognosis of GBML, LGG, KIRP, and THCA (Fig. 7A). At the same time, MEIS1 was significantly correlated with the prognosis of HNSC, ACC, and KIRC. In addition to the poor prognosis of AML (p = 6.6e-3, HR = 1.25), SUSD3 was also significantly associated with the poor prognosis of GBML, LGG, and ACC (Fig. 7B). At the same time, SUSD3 was significantly associated with a better prognosis of various types of cancer (SKCM, SKCM-M, BRCA, KIPAN, MESO, SARC, LUAD). DNM1 was significantly associated with a better prognosis of AML (p = 0.04, HR = 0.88) and PAAD (p = 0.02, HR = 0.79) (Fig. 7C). In addition, DNM1 was also significantly associated with poor prognosis in a variety of types (THCA、ACC、LIHC、MESO、COADREAD、BLCA、COAD).

To explore the relationship between the expression levels of key genes and AML immune subtypes, we analyzed the correlation between key genes and AML immune subtypes based on the TCGA data set. The results showed that DNM1 (p < 0.001), MEIS1 (p < 0.001), and SUSD3 (p < 0.001) were significantly associated with AML C1-C6 immune subtypes (Fig. 8A). Based on the TCGA database, the correlation between key gene expression levels and immune checkpoints was explored. The results showed that the expression levels of DNM1, MEIS1, and SUSD3 were associated with many cancer immune checkpoints (Fig. 8B–D). Specifically, the expression level of MEIS1 was not correlated with the AML immune checkpoint (Fig. 8B). Figure 8C shows that the expression level of DNM1 is significantly correlated with the two immune checkpoints of AML (HAVCR2 and PDCD1LG2). Figure 8D showed that the SUSD3 expression level was significantly correlated with the five immune checkpoints of AML (CD274, CTLA4, LAG3, PDCD1, and TIGIT).

Gene set enrichment analysis (GSEA) of key genes

GSEA analysis showed that genes related to DNM1 in the training set were mainly enriched in aminoacyl tRNA biosynthesis, acute myeloid leukemia and other pathways (Fig. 9A). Genes related to MEIS1 were mainly enriched in pathways such as cell cycle, P53 signaling pathway, o glycan biosynthesis, etc. (Fig. 9B). Genes associated with SUND3 are mainly enriched in tryptophan metabolism, NOD-like receptor signaling pathway, chemokine signaling pathway etc. (Fig. 9C).

Validation of diagnostic value of key genes

To further explore the role of key genes as AML biomarkers, we selected two data sets to verify their diagnostic value. The results showed that in the GSE48588 (Fig. 10A–B) and GSE63270 (Fig. 10C–D) datasets, the expression levels of DNM1 and MEIS1 in AML were significantly higher than those in the control group, while the expression levels of SUSD3 in AML were significantly lower than those in the control group. ROC results suggested that the expression differences of DNM1, MEIS1, and SUSD3 have potential diagnostic value for AML.

Discussion

The prognosis of AML is poor. Young and elderly patients have a high risk of recurrence of chemotherapy resistance, and alternative and targeted drugs are needed to improve their survival rate [22]. At present, the treatment of AML mainly includes chemotherapy and molecular targeted therapy, such as FMS-like tyrosine kinase 3 (FLT3) inhibitors, IDH [isocitrate dehydrogenase (NADP+)] inhibitors, and monoclonal antibodies [23]. Despite many treatments, the prognosis of AML is still poor. High-throughput genomic screening methods and computer-aided techniques can be used to predict biomarkers related to disease occurrence and assist in the design of new targeted drugs [24]. Therefore, screening biomarkers related to the prognosis of AML patients through machine learning methods will provide a valuable reference for individualized targeted therapy and prognosis prediction of AML in clinical practice.

In this study, 26 genes with survival differences were screened from the differentially expressed genes in the training set. To further screen out the key genes closely related to the prognosis of AML, this study screened key genes based on a variety of machine learning. The results showed that the machine learning method used in this study identified DNM1, MEIS1, and SUSD3 as key genes significantly associated with AML prognosis based on different screening criteria. To further clarify the correlation between key genes and the occurrence and development of AML and various cancers, this study also performed a single-gene pan-cancer analysis of three key genes based on the TCGA database. The expression of MEIS1 and DNM1 in AML and normal controls was significantly different. MEIS1, SUSD3, and DNM1 were significantly associated with the prognosis of AML patients. Three key genes were significantly associated with AML immune subtypes, and DNM1 and SUSD3 were significantly associated with multiple immune checkpoints of AML. In addition to the strong association between the three key genes and AML, this study also found evidence that they are closely related to the prognosis and immunity of various cancers. More importantly, we also selected two validation datasets to verify the diagnostic value of key genes for AML, and the results showed that DNM1, MEIS1, and SUSD3 had good diagnostic values for AML.

The myeloid tropism leukemia virus integration site 1 (MEIS1) gene is located on 1p13-14 of human chromosome 2 and is widely expressed in various tissues including blood, liver, and brain [25]. MEIS1 is related to the differentiation of leukemia stem cells and the proliferation of leukemia cells [26]. Studies have shown that MEIS1 is often up-regulated in AML patients and can participate in disease progression through a variety of mechanisms [27, 28]. Thorsteinsdottir, U. et al. highlighted the role of Meis1 in regulating human AML cell maintenance and survival in vitro knockdown experiments [29]. Similar to the above study is that our study has also found evidence that MEIS1 expression is associated with AML prognosis, immunity, etc., which further proves that MEIS1 may be a biomarker for predicting AML prognosis.

Dynamins 1 (DNM1) is a member of the GTP-binding protein family. DNM1 is highly expressed in the nervous system of the human body and can regulate nerve activity [30, 31]. Therefore, DNM1 is often reported to play a role in nervous system diseases [32, 33]. However, in addition to neurological diseases, more and more studies have shown that DNM1 plays a role in the development of many cancers [34,35,36]. Previous studies have found that high expression of DNM1 is an independent prognostic biomarker for poor OS in patients with hepatocellular carcinoma [36]. DNM1 is overexpressed in many lung cancers, enhances the growth, migration, and invasion of cancer cells, and reduces the survival rate of lung cancer patients. Activated DNM1 selectively regulates tumor necrosis factor-related apoptosis-inducing ligand (TRAIL-R2) -mediated endocytosis, allowing cancer cells to escape death [37]. Based on the above, DNM1 may be used as a biomarker to predict the prognosis of patients with multiple cancers. In this study, we found for the first time evidence that DNM1 is potentially related to the prognosis of AML patients, further indicating that DNM1 plays a potential role in the occurrence and development of AML, and the specific mechanism of action is worthy of further discussion.

At present, there are few reports on Sushi domain-containing protein 3 (SUSD3), mainly focusing on the mechanism of SUSD3 in the occurrence and development of breast cancer. SUSD3 has extracellular, transmembrane, and cytoplasmic domains. It is highly expressed in breast cancer and estrogen-sensitive tissues such as the liver, breast, myometrium, endometrium, and ovary. Experiments have shown that SUSD3 has a higher level of expression in estrogen receptor (ER) -positive breast cancer cells, and estrogen treatment can further increase its expression [38]. SUSD3 has been reported as one of the potential biomarkers for the prognosis of breast cancer [39, 40]. This study found for the first time that SUSD3 is potentially related to the prognosis of AML and is expected to be a prognostic marker for AML patients.

In addition, we explored potential pathways associated with genes closely related to key genes in the training set through GSEA. Many pathways were found to be potentially related to the development of AML. Previous study has reported that the dysfunction of the typical metabolomics pathway Aminoacyl − tRNA biosynthesis indicates that mitochondrial dysfunction, which leads to a decrease in the detoxification ability of reactive oxygen species produced by AML chemotherapy and radiotherapy [41]. P53 plays a key role in normal and leukemia hematopoiesis and is the core of the complex network of AML-related signaling pathways [42]. NLRP3 inflammasome, a major factor in NOD-like receptor signaling pathway, promotes the progression of AML in an IL-1β-dependent manner. Targeting NLRP3 inflammasome may provide a new therapeutic option for AML [43]. Based on the above, it can be seen that the potential pathways related to DNM1, MEIS1 and SUSD3 participate in the occurrence and development of AML. We hypothesized that DNM1, MEIS1, and SUSD3 are closely related to the prognosis of AML, which may be mediated by the above pathways. However, the above is only speculation, and further functional verification experiments are needed to explore the mechanism of these three key genes in the occurrence and development of AML.

Based on a variety of machine learning, this study has explored three new biomarkers for AML prognosis, which provides a new idea for the clinical development of individualized targeted therapy and prognosis prediction of AML. However, this study still has some shortcomings. First, all the analyses in this study are based on retrospective data in public databases, and large-scale prospective studies and additional functional verification experiments are needed to confirm our findings. Secondly, it is necessary to further explore the specific mechanism of the three key genes in AML and their influence on the prognosis of AML in future research, so as to better explore the molecular mechanism involved in tumorigenesis and AML development.

Conclusion

This study found that DNM1, MEIS1, and SUSD3 were abnormally expressed in AML and were potentially related to its prognosis. They are expected to become new biomarkers and potential therapeutic targets for predicting the prognosis of AML patients. This study will provide a new theoretical basis for the basic research of AML.

Data availability

The datasets generated and/or analyzed during the current study are available in the [GEO] repository, [GSE63270, GSE15061, and GSE48558].

References

Shallis RM, Wang R, Davidoff A, Ma X, Zeidan AM. Epidemiology of acute Myeloid Leukemia: recent progress and enduring challenges. Blood Rev. 2019;36:70–87.
Article PubMed Google Scholar
Allemani C, Matsuda T, Di Carlo V, Harewood R, Matz M, Nikšić M, et al. Global surveillance of trends in cancer survival 2000-14 (CONCORD-3): analysis of individual records for 37 513 025 patients diagnosed with one of 18 cancers from 322 population-based registries in 71 countries. Lancet (London England). 2018;391(10125):1023–75.
Article PubMed PubMed Central Google Scholar
Totiger TM, Ghoshal A, Zabroski J, Sondhi A, Bucha S, Jahn J et al. Targeted Therapy Development in Acute Myeloid Leukemia. Biomedicines. 2023;11(2).
Vakiti A, Mewawalla P. Acute Myeloid Leukemia. StatPearls. Treasure Island (FL) ineligible companies. Disclosure: Prerna Mewawalla declares no relevant financial relationships with ineligible companies.: StatPearls Publishing Copyright © 2023. StatPearls Publishing LLC.; 2023.
Shimony S, Stahl M, Stone RM. Acute Myeloid Leukemia: 2023 update on diagnosis, risk-stratification, and management. Am J Hematol. 2023;98(3):502–26.
Article PubMed Google Scholar
Handelman GS, Kok HK, Chandra RV, Razavi AH, Lee MJ, Asadi H. eDoctor: machine learning and the future of medicine. J Intern Med. 2018;284(6):603–19.
Article CAS PubMed Google Scholar
Komuro J, Kusumoto D, Hashimoto H, Yuasa S. Machine learning in cardiology: clinical application and basic research. J Cardiol. 2023;82(2):128–33.
Article PubMed Google Scholar
McEligot AJ, Poynor V, Sharma R, Panangadan A. Logistic LASSO regression for dietary intakes and Breast Cancer. Nutrients. 2020;12(9).
Zhao Y, Ogden RT, Reiss PT. Wavelet-based LASSO in functional linear regression. Journal of computational and graphical statistics: a joint publication of American Statistical Association, Institute of Mathematical Statistics. Interface Foundation of North America. 2012;21(3):600–17.
Google Scholar
Harezlak J, Coull BA, Laird NM, Magari SR, Christiani DC. Penalized solutions to functional regression problems. Comput Stat Data Anal. 2007;51(10):4911–25.
Article PubMed PubMed Central Google Scholar
Yang Y, Yi X, Cai Y, Zhang Y, Xu Z. Immune-Associated Gene signatures and subtypes to predict the progression of atherosclerotic plaques based on machine learning. Front Pharmacol. 2022;13:865624.
Article CAS PubMed PubMed Central Google Scholar
Lai B, Lai Y, Zhang Y, Zhou M, OuYang G. Survival prediction in acute Myeloid Leukemia using gene expression profiling. BMC Med Inf Decis Mak. 2022;22(1):57.
Article Google Scholar
Verma AA, Murray J, Greiner R, Cohen JP, Shojania KG, Ghassemi M, et al. Implementing machine learning in medicine. CMAJ: Can Med Association J = J de l’Association medicale canadienne. 2021;193(34):E1351–e7.
Article Google Scholar
Lynch CM, Abdollahi B, Fuqua JD, de Carlo AR, Bartholomai JA, Balgemann RN, et al. Prediction of Lung cancer patient survival via supervised machine learning classification techniques. Int J Med Informatics. 2017;108:1–8.
Article Google Scholar
Zhou CM, Xue Q, Wang Y, Tong J, Ji M, Yang JJ. Machine learning to predict the cancer-specific mortality of patients with primary non-metastatic invasive Breast cancer. Surg Today. 2021;51(5):756–63.
Article CAS PubMed Google Scholar
Ji GW, Fan Y, Sun DW, Wu MY, Wang K, Li XC, et al. Machine learning to Improve Prognosis Prediction of Early Hepatocellular Carcinoma after Surgical Resection. J Hepatocellular Carcinoma. 2021;8:913–23.
Article Google Scholar
Christopherson KM, Das P, Berlind C, Lindsay WD, Ahern C, Smith BD, et al. A machine learning Model Approach to Risk-Stratify patients with gastrointestinal Cancer for hospitalization and mortality outcomes. Int J Radiat Oncol Biol Phys. 2021;111(1):135–42.
Article PubMed Google Scholar
Qiu H, Luo L, Su Z, Zhou L, Wang L, Chen Y. Machine learning approaches to predict peak demand days of cardiovascular admissions considering environmental exposure. BMC Med Inf Decis Mak. 2020;20(1):83.
Article Google Scholar
Kang J, Choi YJ, Kim IK, Lee HS, Kim H, Baik SH, et al. LASSO-Based machine learning algorithm for prediction of Lymph Node Metastasis in T1 Colorectal Cancer. Cancer Res Treat. 2021;53(3):773–83.
Article CAS PubMed Google Scholar
Mahmoudian M, Venäläinen MS, Klén R, Elo LL. Stable iterative variable selection. Bioinf (Oxford England). 2021;37(24):4810–7.
CAS Google Scholar
Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci USA. 2005;102(43):15545–50.
Article CAS PubMed PubMed Central Google Scholar
Huang Y, Zhang Z, Sui M, Li Y, Hu Y, Zhang H, et al. A novel stemness classification in acute Myeloid Leukemia by the stemness index and the identification of cancer stem cell-related biomarkers. Front Immunol. 2023;14:1202825.
Article CAS PubMed PubMed Central Google Scholar
Kantarjian H, Kadia T, DiNardo C, Daver N, Borthakur G, Jabbour E, et al. Acute Myeloid Leukemia: current progress and future directions. Blood cancer Journal. 2021;11(2):41.
Article PubMed PubMed Central Google Scholar
Kumar N, Narayan Das N, Gupta D, Gupta K, Bindra J. Efficient automated Disease diagnosis using machine learning models. J Healthc Eng. 2021;2021:9983652.
Article PubMed PubMed Central Google Scholar
Aksoz M, Turan RD, Albayrak E, Kocabas F. Emerging roles of Meis1 in Cardiac Regeneration, Stem cells and Cancer. Curr Drug Targets. 2018;19(2):181–90.
Article CAS PubMed Google Scholar
Kocabas F, Zheng J, Thet S, Copeland NG, Jenkins NA, DeBerardinis RJ, et al. Meis1 regulates the metabolic phenotype and oxidant defense of hematopoietic stem cells. Blood. 2012;120(25):4963–72.
Article CAS PubMed PubMed Central Google Scholar
Collins CT, Hess JL. Deregulation of the HOXA9/MEIS1 axis in acute Leukemia. Curr Opin Hematol. 2016;23(4):354–61.
Article CAS PubMed PubMed Central Google Scholar
Chen CW, Armstrong SA. Targeting DOT1L and HOX gene expression in MLL-rearranged Leukemia and beyond. Exp Hematol. 2015;43(8):673–84.
Article CAS PubMed PubMed Central Google Scholar
Thorsteinsdottir U, Kroon E, Jerome L, Blasi F, Sauvageau G. Defining roles for HOX and MEIS1 genes in induction of acute Myeloid Leukemia. Mol Cell Biol. 2001;21(1):224–34.
Article CAS PubMed PubMed Central Google Scholar
Raimondi A, Ferguson SM, Lou X, Armbruster M, Paradise S, Giovedi S, et al. Overlapping role of dynamin isoforms in synaptic vesicle endocytosis. Neuron. 2011;70(6):1100–14.
Article CAS PubMed PubMed Central Google Scholar
Ferguson SM, Brasnjo G, Hayashi M, Wölfel M, Collesi C, Giovedi S, et al. A selective activity-dependent requirement for dynamin 1 in synaptic vesicle endocytosis. Sci (New York NY). 2007;316(5824):570–4.
Article CAS Google Scholar
Sahly AN, Krochmalnek E, St-Onge J, Srour M, Myers KA. Severe DNM1 encephalopathy with dysmyelination due to recurrent splice site pathogenic variant. Hum Genet. 2020;139(12):1575–8.
Article CAS PubMed Google Scholar
Brereton E, Fassi E, Araujo GC, Dodd J, Telegrafi A, Pathak SJ, et al. Mutations in the PH Domain of DNM1 are associated with a nonepileptic phenotype characterized by developmental delay and neurobehavioral abnormalities. Mol Genet Genom Med. 2018;6(2):294–300.
Article CAS Google Scholar
Yamada H, Takeda T, Michiue H, Abe T, Takei K. Actin bundling by dynamin 2 and cortactin is implicated in cell migration by stabilizing filopodia in human non-small cell lung carcinoma cells. Int J Oncol. 2016;49(3):877–86.
Article CAS PubMed PubMed Central Google Scholar
Raja SA, Shah STA, Tariq A, Bibi N, Sughra K, Yousuf A, et al. Caveolin-1 and dynamin-2 overexpression is associated with the progression of Bladder cancer. Oncol Lett. 2019;18(1):219–26.
CAS PubMed PubMed Central Google Scholar
Tian M, Yang X, Li Y, Guo S. The expression of Dynamin 1, 2, and 3 in Human Hepatocellular Carcinoma and patient prognosis. Med Sci Monitor: Int Med J Experimental Clin Res. 2020;26:e923359.
Article CAS Google Scholar
Reis CR, Chen PH, Bendris N, Schmid SL. TRAIL-death receptor endocytosis and apoptosis are selectively regulated by dynamin-1 activation. Proc Natl Acad Sci USA. 2017;114(3):504–9.
Article CAS PubMed PubMed Central Google Scholar
Moy I, Todorović V, Dubash AD, Coon JS, Parker JB, Buranapramest M, et al. Estrogen-dependent sushi domain containing 3 regulates cytoskeleton organization and migration in Breast cancer cells. Oncogene. 2015;34(3):323–33.
Article CAS PubMed Google Scholar
Zhao S, Chen SS, Gu Y, Jiang EZ, Yu ZH. Expression and clinical significance of Sushi Domain- Containing protein 3 (SUSD3) and insulin-like growth Factor-I receptor (IGF-IR) in Breast Cancer. Asian Pac J cancer Prevention: APJCP. 2015;16(18):8633–6.
Article Google Scholar
Lu N, Guan X, Bao W, Fan Z, Zhang J. Breast cancer combined prognostic model based on lactate metabolism genes. Medicine. 2022;101(51):e32485.
Article CAS PubMed PubMed Central Google Scholar
Cano KE, Li L, Bhatia S, Bhatia R, Forman SJ, Chen Y. NMR-based metabolomic analysis of the molecular pathogenesis of therapy-related myelodysplasia/acute Myeloid Leukemia. J Proteome Res. 2011;10(6):2873–81.
Article CAS PubMed PubMed Central Google Scholar
Prokocimer M, Molchadsky A, Rotter V. Dysfunctional diversity of p53 proteins in adult acute Myeloid Leukemia: projections on diagnostic workup and therapy. Blood. 2017;130(6):699–712.
Article CAS PubMed PubMed Central Google Scholar
Zhong C, Wang R, Hua M, Zhang C, Han F, Xu M, et al. NLRP3 Inflammasome promotes the progression of Acute Myeloid Leukemia via IL-1β pathway. Front Immunol. 2021;12:661939.
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank all authors for their contributions and support.

Funding

This study was supported by the Kunming medical university joint special applied basic research [202001AY070001-111]; the First People’s Hospital of Yunnan Province Clinical Medical Center open project by the Yunnan Province Clinical Research Center for Hematologic Disease and the Yunnan Province Clinical Center for Hematologic Disease [2021LCZXXF-XY12, 2022LCZXKF-XY04, and 2023YJZX-XY02]; Chinese Academy of Medical Sciences (CAMS) Innovation Fund for Medical Sciences(CIFMS) [2016-I2M-3-024]; Kunming University of Science and Technology medical joint special project [KUST-KH2022028Y].

Author information

Authors and Affiliations

Department of blood transfusion, The First People’s Hospital of Yunnan Province. The Affiliated Hospital of Kunming University of Science and Technology, No.157 Jinbi Road, 650034, Kunming, Yunnan, China
Yujing Cheng, Xin Yang, Ying Wang, Qi Li, Wanlu Chen, Run Dai & Chan Zhang

Authors

Yujing Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Xin Yang
View author publications
You can also search for this author in PubMed Google Scholar
Ying Wang
View author publications
You can also search for this author in PubMed Google Scholar
Qi Li
View author publications
You can also search for this author in PubMed Google Scholar
Wanlu Chen
View author publications
You can also search for this author in PubMed Google Scholar
Run Dai
View author publications
You can also search for this author in PubMed Google Scholar
Chan Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization, Y.C. and C.Z.; methodology, X.Y. and Y.W.; software, Q.L. and W.C.; data collection, R.D.; writing, review, and editing, Y.C. All authors have read and approved the manuscript.

Corresponding author

Correspondence to Chan Zhang.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not Applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Cheng, Y., Yang, X., Wang, Y. et al. Multiple machine-learning tools identifying prognostic biomarkers for acute Myeloid Leukemia. BMC Med Inform Decis Mak 24, 2 (2024). https://doi.org/10.1186/s12911-023-02408-9

Download citation

Received: 04 August 2023
Accepted: 14 December 2023
Published: 02 January 2024
DOI: https://doi.org/10.1186/s12911-023-02408-9

Multiple machine-learning tools identifying prognostic biomarkers for acute Myeloid Leukemia

Abstract

Background

Methods

Results

Conclusion

Similar content being viewed by others

Molecular subtypes predict therapeutic responses and identifying and validating diagnostic signatures based on machine learning in chronic myeloid leukemia

RETRACTED ARTICLE: Ensemble learning-based gene signature and risk model for predicting prognosis of triple-negative breast cancer

Survival prediction in acute myeloid leukemia using gene expression profiling

Backgrounds

Methods

Acquisition of data sets

Screening the key genes related to the prognosis of AML based on multiple machine-learning methods

Pan-cancer analysis of key genes

Gene set enrichment analysis (GSEA) of key genes

Validation of diagnostic value of key genes

Results

Survival analysis of differentially expressed genes

Screening key genes related to AML prognosis based on multiple machine learning

Pan-cancer analysis for key genes associated with AML prognosis

Gene set enrichment analysis (GSEA) of key genes

Validation of diagnostic value of key genes

Discussion

Conclusion

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Electronic supplementary material

Supplementary Material 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation