Background

In adults, the most common acute leukemia is acute myeloid leukemia (AML) [1]. AML is a cell clonal malignant proliferative disease of myeloid primordial cells in the hematopoietic system. AML is a highly heterogeneous group of diseases that can be derived from hematopoietic progenitors at various stages of differentiation and development of normal myeloid cells [2,3,4]. The incidence of AML increased proportionally with age, from 1.8 cases per 100,000 people under 65 years old to 13.7 cases per 100,000 people over 65 years old. In developed countries, various half of novelly diagnosed AML sufferers are over 65 years, with a median age of 67 years at diagnosis, and more men have AML than women [5]. There are various 20,000 new AML cases in the US in 2021 [6]. On the other hand, merely about 25% of AML sufferers survived for 5 years or more despite multi-drug combination chemotherapy. Elderly sufferers (> 60 years) and sufferers that could not bear standard induction chemotherapy had unfavorable molecular outcomes, with a median survival of merely 5–10 months and a 5-year overall survival (OS) of 5% [7, 8]. At present, scientific research has moved from the cellular level to the molecular level. For example, targeted drugs [9], nanomaterials [10], and molecular markers [11]. We are committed to finding new molecular markers to more accurately predict the progress and prognosis of AML.

Microchromosome maintenance protein 5 (MCM5) is a key cell cycle regulator located on chromosome 22Q13.1 [12], whose role in DNA replication is verified [13]. MCM5 is a DNA licensing factor as an ingredient of the MCM2-7 complex, a putative replicative helicase vital for the initiation and extension of "once per cell cycle" DNA replication in eukaryotic cells [14, 15]. So far, MCM5 has been reported to be closely correlated with varieties of diseases. For instance, increased level of MCM5 is remarkably correlated with the positive progression and unfavorable prognosis of oral squamous cell carcinoma, and MCM5 can be utilized as a marker for the early diagnosis of oral squamous cell carcinoma [16, 17]. MCM5 was also reported to be an individual prognostic element in lung squamous cell carcinoma [18]. MCM5 can aggravate the HDAC1-mediated malignant progression of lung cancer [19]. MCM5 is correlated with malignant status and unfavorable prognosis in cervical adenocarcinoma sufferers, and regulates the proliferation of cervical adenocarcinoma cells [20]. MCM5 is a new sensitive as well as specific biomarker for the detection of endometrial and ovarian tumors in urine samples [21]. Therefore, we investigated the relationship betwixt MCM5 and AML.

In this study, we investigated the gene level microarray of AML and correlative clinical data in GSE38865, GSE142698, and The Cancer Genome Atlas (TCGA) database. MCM5 was selected as the research objective. Then, we studied the level profile and biological functions of MCM5 in AML, and further analyzed the correlation betwixt MCM5 level and the AML.

Methods

Data acquisition

With gene expression omnibus (GEO, https://www.ncbi.nlm.nih.gov/geo/) database, “AML” and “survival” in the search box as the keyword, we harvested the AML gene level chips and correlative clinical data. GSE38865 [22] was selected as the training data set of this study. We extracted clinical data with prognostic information directly from the matrix file on the correlative gene chip page in the GEO database.

From TCGA (https://portal.gdc.cancer.gov/) we harvested RNA-seq data (TPM) and the correlative clinical information of AML.

Screening of Prognosis-Related Genes

Kaplan–Meier curves and Cox regression analysis to screen genes correlated to survival. The prognosis-related genes screened through “survival” R package with the KM < 0.01 and cox p value < 0.01 were shown in Table S1.

Identify Target Gene

After screening of prognosis-related genes, we utilized multivariate Cox regression analysis to perform individual prognostic analysis (p < 0.01). Thus, we got the genes remarkably correlated with individual prognosis which were exhibited in Table 1.

Table 1 The genes for independent prognostic selected by multivariate Cox regression analysis (p < 0.01)

Next, we analyze the correlation betwixt individual prognostic-related genes and clinical traits through Wilcox.test or Kruskal.test (p < 0.05). The correlation betwixt individual prognostic-related genes and clinical traits was shown in Table 2.

Table 2 The clinical traits related-independent prognostic genes

Then, we utilized GEPIA (http://gepia2.cancer-pku.cn) to explore the relationship betwixt genes in Table 2 (SigNum > 0) and the survival of AML.GEPIA is an interactive web server [23].

Survival Analysis

All AML sufferers with MCM5 level values higher than the median were classified as MCM5 high group, and the remaining sufferers were classified as MCM5 low group. We utilized Kaplan–Meier curve and log-rank test via the R package “survival” and “survminer” to analyze the survival of MCM5 in the prognostic model. Individual prognostic value of MCM5 was verified with multivariate and univariate Cox regression analyses.

Enrichment Analysis

The difference of gene level in AML samples betwixt MCM5 high group and MCM5 low group was compared through setting adjusted p < 0.05 and fold change > 1 threshold using the ‘‘Limma” package. "ClusterProfiler" and "enrichplot" packages were utilized for Kyoto encyclopedia of genes and genomes (KEGG) analysis and Gene Ontology (GO) to further explore the functions of various genes.

Statistical Analysis

Through R software 3.5.0 we performed all statistical analysis. We utilized Fisher exact test as well as the Wilcoxon rank-sum tests, respectively, to verify hypotheses for categorical and continuous variables. According to the median level value of MCM5, the samples in the second cohort were divided into MCM5 high group (n = 10) and MCM5 low group (n = 10). The limma package was utilized to analyze the distinct gene level. Kaplan–Meier method and Cox regression multivariate analysis were utilized for survival analysis, and log-rank test was utilized for comparison betwixt groups. To identify GO and KEGG enrichment terms, we utilized “Clusterprofiler” package. For all statistical analysis, p < 0.05 was considered significant.

Results

Screening Target Genes

We got GSE38865 from GEO database (https://www.ncbi.nlm.nih.gov/geo/). According to the probe information of Illumina HumanWG-6 V3.0 expression beadchip (GPL6884) and Illumina HumanHT-12 V4.0 expression beadchip (GPL10558), 22089 genes of 30 AML samples were annotated. Then, we performed Kaplan–Meier curves and Cox regression analysis to screen genes correlated to survival. The prognosis-related genes were screened through “survival” R package with the KM < 0.01 and coxPvalue < 0.01 (Table S1).

In order to get the genes for independent prognostic, multivariate Cox regression analysis was performed (p < 0.01) in prognosis-related genes. 52 genes for independent prognostic were exhibited in Table 1.

Later, we analyze the correlation betwixt individual prognostic genes and clinical traits (age, gender, Wbc, and NPM1 mutation) through wilcox.test or kruskal.test (p < 0.05). Four clinical traits related-individual prognostic genes (DENND4C, DKC1, LAPTM4B, and MCM5) were shown in Table 2.

GEPIA (http://gepia2.cancer-pku.cn) was utilized to verify the relationship betwixt these 4 genes and the over survival of AML. We found that merely MCM5 was correlated to the survival and prognosis of AML. As presented in Fig. 1, sufferers with high level of MCM5 had worse prognosis than these with low level of MCM5. Therefore, we chose MCM5 for follow-up study.

Fig. 1
figure 1

Survival outcomes in MCM5 high group and MCM5 low groups through GEPIA. AML sufferers in MCM5 high group had worse outcome than that in MCM5 low group

Identify Target Gene MCM5

In GSE38865, all AML sufferers with MCM5 level values above the median were classified as MCM5 high group, and the others were considered to be MCM5 low group. As shown in Fig. 2A, the prognosis of AML sufferers with high MCM5 level was worse than that of sufferers with low MCM5 level. Multivariate and univariate Cox regression analysis exhibited that MCM5 was remarkably correlated with the prognosis of AML sufferers and was an independent prognostic element. (Fig. 2B and C; Table S2).

Fig. 2
figure 2

Survival analysis and Cox regression analysis of MCM5 in GSE38865. A Survival analysis betwixt MCM5 high group and MCM5 low groups in GSE38865. B Univariate Cox regression analysis and C multivariate Cox regression analysis screened out the individual prognostic-related element

MCM5 in TCGA database

To further identify target gene MCM5, we harvested RNA-seq data (TPM) and the correlative clinical information from TCGA-AML (https://portal.gdc.cancer.gov/).

We analyzed these MCM5 level based on the TCGA-AML. The results were consistent with those harvested from GSE38865 data. In Fig. 3, high MCM5 group had worse prognosis than low MCM5 group (Fig. 3A), and MCM5 was an independent prognostic element in AML (Fig. 3B and C).

Fig. 3
figure 3

Survival analysis and Cox regression analysis of MCM5 in TCGA. A Survival analysis betwixt MCM5 high group and MCM5 low groups based on TCGA data. B Univariate Cox regression analysis and C multivariate Cox regression analysis screened out the individual prognostic-related element

MCM5 Overexpression in AML

Then, we want to study whether MCM5 level shows difference in AML and normal samples. GSE142698 (AML = 24; normal = 24) was download from GEO database. We found an increase level of MCM5 in AML blood compared with healthy blood (Fig. 4A). We also found that an increase level in AML bone marrow samples compared with T Acute Lymphoblastic Leukemia (T-ALL) bone marrow samples (Fig. 4B) based on GSE131184 (AML = 76; T-ALL = 49).

Fig. 4
figure 4

The level of MCM5. A The level of MCM5 in AML blood compared with healthy blood based on GSE142698. B MCM5 level in AML bone marrow samples compared with T-ALL bone marrow samples based on GSE131184

Differences in Clinical in AML Betwixt MCM5 High Group and MCM5 Low Groups

The differences in clinical in AML betwixt MCM5 high group and MCM5 low groups were analyzed through “ggpubr” package. There was no significant difference in the age gender, and NPM1 mutation betwixt MCM5 high group and MCM5 low groups (Fig. S1A-C).

Different Expression and Functional Enrichment Analysis for Genes in MCM5 High Group Versus MCM5 Low Group

To discover the genes correlated with MCM5, we analyzed the discrepant level values betwixt MCM5 high group and MCM5 low group (adj.p < 0.05, (FC, log2) > 1 or < 1). There were 34 up-modulated genes and 47 down-modulated genes (Fig. 5A; Table S3). Figure 5B exhibited the top 20 genes in up-modulated and down-modulated groups. We analyzed the enriched GO terms through using the discrepantly expressed genes (DEGs). Among the biological process terms of GO, DEGs are mainly enriched in regulating extracellular matrix organization, negatively regulating response to external stimulus, phase transition of mitotic cell cycle, regulating mitotic cell cycle, and DNA replication (Fig. 5C). Showed by the KEGG analysis results, DNA replication, Kaposi sarcoma-correlated herpesvirus infection, Cell cycle, Glycine, serine and threonine metabolism, and Amino sugar and nucleotide sugar metabolism were the most enriched pathways (Fig. 5D).

Fig. 5
figure 5

DEGs and the results of GO enrichment and KEGG analyses. A Volcano plot show DEGs betwixt MCM5 high group and MCM5 low groups. B Heatmap plot show the top 20 up-modulated DEGs and top 20 down-modulated DEGs betwixt MCM5 high group and MCM5 low groups. C GO result for DEGs. D KEGG result for DEGs

Module Screening Using PPI Network

Last, the correlation analysis of the top 40 DEGs betwixt MCM5 high group and MCM5 low group was performed (Fig. 6A). We could see that MCM5 was positive correlation with KIAA0101 which also up-modulated in MCM5 high group.

Fig. 6
figure 6

Module screening from the PPI network. A The correlation analysis of top 20 up-modulated DEGs and top 20 down-modulated DEGs with the Pearson correlation coefficient. B PPI network of DEGs

KIAA0101 is a proliferating cell nuclear antigen (PCNA) -related element. KIAA0101 has been verified to be over-expressed in a variety of human malignant tumors, such as esophageal squamous cell carcinoma, lung adenocarcinoma, chronic lymphocytic leukemia, and it has been identified as an oncogene [24,25,26,27,28]. Protein–protein interaction (PPI) network was screened in the String database through with the 81 DEGs. All down-modulated genes and most up-modulated genes had interactions in the PPI network (Fig. 6B). In the general network MCM5 acts as a core gene. The graphical abstract is showed in Fig. S2.

Discussion

AML is listed among the most common hematologic malignancies [29, 30]. Despite progress in the treatment of AML, it remains difficult to treat and cure, and the 5-year overall survival rate, especially in sufferers which age is over 60 years, is still very low [31,32,33]. The pathogenesis, diagnosis and treatment of AML are hot topics in current research.

In our research, we analyzed the mRNA and the correlative clinical information in GSE38865. We wanted to find the genes which could predict bad or good prognosis in AML. Through using Kaplan–Meier curves, Cox regression analysis, Wilcox.test or Kruskal.test, and GEPIA, we found MCM5 may the target gene which was an independent prognostic element for AML. From TCGA we also downloaded the AML-related RNA-seq data (TPM) and harvested the correlative clinical information. It was exhibited by results that MCM5 was an independent prognostic element for AML and AML sufferers in MCM5 high group had worse prognosis than that in MCM5 low group. We also found an increase level in AML blood compared with healthy blood (GSE142698) and an increase level in AML bone marrow samples compared with T-ALL bone marrow samples (GSE131184).

MCM5, as a critical cell cycle regulator and DNA replication licensing element, was expressed highly in multiple cancer tissues, such as cervical adenocarcinoma, cervical cancer, renal cell carcinoma, and laryngeal squamous cell cancer [20, 34,35,36]. MCM5 in thyroid cancer cells was also reported as a target of BET inhibitors. MCM5 may serve as an adverse prognostic biomarker for lung cancer[18, 37, 38]. In addition, Chen Liu even reported that compared with leukocytes, MCM5 was highly expressed in AML cell lines (KG-1a, NB4 and HL60)[12].

To find the genes correlated with MCM5, we analyzed the deferent expressed genes betwixt MCM5 high group and MCM5 low group. We found CCL1, MLLT11, and PIM1 were up-modulated in MCM5 high group. CCL1 is involved in immune-cell recruitment and, like other chemokines, is involved in nociceptive processing[39]. The chemokine CCL1 activates the AMFR-SPRY1 pathway, which facilitates differentiation of pulmonary fibroblasts into myofibroblasts and drives pulmonary fibrosis[40]. MLLT11 acted as an oncogene in multiple cancers, for example osteosarcoma[41], bladder cancer[42], and lung cancer[43]. Importantly, report exhibited that MLLT11 is a unfavorable prognostic biomarker for AML, adult normal cytogenetics AML, and adult myelodysplastic syndrome[44]. PIM1 plays a critical role in the development of many hematopoietic and non-hematopoietic malignancies, including prostate cancer and acute myeloid leukemia[45, 46]. AML with high level of PIM1 was reported to show an unfavorable prognosis. PIM1 facilitates the proliferation and inhibits apoptosis of AML cells, but also enhances the chemotactic ability of leukemia cells[47, 48].

Likewise, the GO pathways mainly enriched in regulation of extracellular matrix organization, DNA replication, mitotic cell cycle phase transition, regulation of mitotic cell cycle, mitotic cell cycle phase transition, and negative regulation of response to external stimulus. DNA replication, Kaposi sarcoma-correlated herpesvirus infection, Cell cycle, Amino sugar and nucleotide sugar metabolism, serine and threonine metabolism, and Glycine.

PPI network also revealed that MCM5 is highly linked with KIAA0101. KIAA0101 is a proliferating cell nuclear antigen (PCNA) -related element. KIAA0101 has been reported to be over-expressed in many human malignant tumors and has been identified as an oncogene[27, 49]

Conclusions

In conclusion, we demonstrated that MCM5 was an independent prognostic element for AML. Low level of MCM5 was a good prognostic element for AML sufferers. In addition, the results of GO term enrichment, KEGG analysis and PPI network involvement in AML showed that MCM5 may regulate DNA replication and cell cycle of AML cells which provided an insight into the pathogenesis correlated with different MCM5 level. However, the exact pathophysiological role of MCM5 in AML cells has not been fully demonstrated in this study. Further investigation of the molecular mechanism of MCM5 in AML progression and more in-depth genomic studies are urgently needed.