To the Editor,


Acute myeloid leukemia (AML) is a heterogeneous hematological malignancy characterized by malignant proliferation and abnormal differentiation of myeloid blasts [1]. The prognosis of AML remains poor, with 5-year overall survival rate below 50% after diagnosis and 2-year survival rate of only 20%–30% in elderly patients [2, 3]. Currently, molecular and genetic advances have increased our understanding of the pathogenesis and progression of AML [4, 5], and further discovery of genes associated with prognosis can broaden therapeutic options for AML patients. This study aimed to identify novel prognostic genes for AML through a multi-perspective and multi-dimensional analysis of the GEO database and validation using the TCGA database and our center's data.

As shown in the workflow (Fig. 1A), three datasets with survival outcomes were downloaded from the GEO databases, and 1532 patients were included for follow-up analysis (562 in GSE37642, 534 in GSE76009, and 436 in GSE16432). A total of 151 AML patients with RNA-seq data were enrolled from the TCGA database. Five hundred and five differentially expressed genes (DEGs) in the GEO datasets were screened using |logFC|≥ 1.0 and adj. p-value < 0.05 as cut-off criteria. The heat map shows 50 genes with the highest variability (Fig. 1B), and the volcano map displays the up-and down-regulated genes among these differential genes (Fig. 1C). Towards a comprehensive understanding of the potential function of these genes in the co-expression modules, the Gene Ontology (GO) enrichment analysis was performed and divided genes into three categories: biological processes (BP), cellular components (CC), and molecular functions (MF) (Fig. 1D). In BP, genes were enriched in GTPase activity and Golgi vesicle transport. Many genes are involved in the cell leading edge, transport vesicle, and transcription regulator complex in the CC category. In addition, protein serine/threonine kinase activity and Ras GTPase binding were mainly involved in MF. Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis showed that genes were enriched in choline metabolism in cancer (Fig. 1E).

Fig. 1
figure 1

The flow-chart of this study and functional enrichment analysis of differential genes. A Flow chart of our study. B Heat map for the analysis of differential genes in AML patients with different survival status. C Volcano map of differential expressed genes in AML patients with different survival states. D GO enrichment analysis. E KEGG enrichment analysis. *p < 0.05, **p < 0.01, ***p < 0.001. GO gene ontology; KEGG Kyoto Encyclopedia of Genes and Genomes

Two thousand three hundred and forty-one co-expressed genes were extracted from these significant modules based on weighted gene co-expression network analysis (WGCNA) [6] (Fig. 2A). We compared the status of patients (alive or dead) in different sets of genes and found the most significant degree of variability in the gray gene set (Fig. 2B). We also analyzed the correlation of gene significance and module membership in patients with different survival statuses in both gray and blue modules (Additional file 1: Fig. S1). We intersected the eligible differential genes with the genes in the WGCNA gray gene set. Finally, 22 overlapping genes were obtained at the junction of the DEGs list and the two co-expression modules (Additional file 2: Fig. S2). Then, least absolute shrinkage and selection operator (LASSO) regression screening was performed based on the partial likelihood deviation method to obtain the corresponding coefficients with an optimal log λ of −4.35, of which 21 genes were suitable for constructing the model [7] (Additional file 3: Fig. S3A, B).

Fig. 2
figure 2

Co-expression network based on WGCNA to identify modules associated with AML survival status and survival analysis of candidate hub genes. A Clustering dendrogram of co-expression modules screened based on AML-related samples from the GEO database. B Module-trait relationships between gene modules and survival status of AML patients. CG Kaplan–Meier survival analysis based on DLC1 (C), NF1B (D), DENND5B (E), TANC2 (F), and ELAVL4 (G) expression in AML patients from the GEO database. H Kaplan–Meier survival analysis based on DLC1 expression in AML patients from TCGA database. *p < 0.05, **p < 0.01, ***p < 0.001. WGCNA Weighted Gene Co-expression Network Analysis, GEO Gene Expression Omnibus, TCGA The Cancer Genome Atlas

Meanwhile, five genes were identified by the support vector machine algorithm with the optimal five CV accuracy [8] (Additional file 3: Fig. S3C). Moreover, the five shared biomarkers of AML were defined by overlapping the biomarkers derived from the two algorithms (Additional file 3: Fig. S3D). These five specific genes (DLC1, NF1B, DENND5B, TANC2, ELAVL4) are associated with the prognosis of AML. We performed a survival analysis of the above five genes using GEO data (Fig. 2C–G) and found that AML patients with low DLC1 level had a poorer overall prognosis (Fig. 2C, p = 0.003). Median was regarded as cut-off value. Survival data from TCGA data showed similar results (Fig. 2H, p = 0.048). Meanwhile, we analyzed the expression of relevant genes in patients with different survival states (Additional file 4: Fig. S4) and found that DLC1 expression was significantly lower in deceased patients (Additional file 4: Fig. S4A, p = 0.0005). Furthermore, Quantitative real-time PCR was used to detect DLC1 expression in bone marrow samples from 48 newly diagnosed AML patients in our center. The clinical characteristics of these patients are provided in Additional file 6: Table. S1. The Kaplan–Meier curve showed that patients with low DLC1 expression had a worse prognosis (Additional file 5: Fig. S5, p = 0.028).

DLC1 is a downregulated tumor suppressor gene and silenced by epigenetic mechanisms in human cancers, including several hematological malignancies [9,10,11]. Based on the bio-information database and external validation, our study suggests that DLC1 may be a potential marker affecting the prognosis of AML. Its deficiency at initial diagnosis is associated with a poor long-term prognosis. Future AML-matched methylation and RNA-seq are required to better elucidate the mechanism of DLC1 downregulation to guide the development of relevant molecular targeting drugs.