Background

Sepsis is a life-threatening syndrome caused by a deregulated host response to bacterial, fungal, or viral infection, involving physiological, pathological, and biochemical abnormalities [1, 2]. It is generally believed that the symptoms of early sepsis are mild, manifested as non-specific symptoms such as fever, headache, and fatigue. However, physical examination may reveal inflammatory reactions and symptoms of redness and swelling at the site of local infection. Over time, the condition may further worsen, with severe symptoms such as high fever, chills, shortness of breath, and increased heart rate. Advanced sepsis typically manifests as multiple organ dysfunction syndrome (MODS), which involves multiple organ dysfunction and severe symptoms such as renal insufficiency, pneumonia, myocardial injury, and bleeding. At this time, the condition is already very critical and requires urgent treatment [3]. In advanced sepsis, the number and function of various peripheral immune cells, including CD4 + T cells, CD8 + T cells, B cells, and natural killer cell, may be affected, resulting in the inability of the immune system to effectively eliminate pathogens [4]. Therefore, analyzing the expression of peripheral immune cells in patients with advanced sepsis can provide detailed information on the immune response to sepsis and help to gain a deeper understanding of the specific molecular mechanisms underlying the high mortality rate caused by this disease.

N6 methyladenosine (m6A) is the most prevalent post-transcriptional chemical modification on RNA molecules, which participates in the regulation of various biological processes including immunity, metabolism, proliferation, and apoptosis, accounting for more than 60% of all RNA epigenetics [5]. The biological function of m6A modification is dynamically and reversibly mediated by methyltransferases (writers), demethylases (erasers), and m6A recognition proteins (readers), which is involved in the occurrence and development of various diseases and is also related to the high heterogeneity in advanced sepsis [6,7,8]. We speculate that m6A modification plays an important role in immune regulation in advanced sepsis. In the present study, bioinformatics methods were used to analyze the differential expression patterns of m6A methylation regulators and explore their relationship with immune cell infiltration, aiming to provide a theoretical basis for the individualized risk determination and treatment target selection of advanced sepsis.

Methods

Source of data

The dataset GSE175453 containing the transcriptome data of peripheral blood mononuclear cells (PBMCs) in the blood samples from 4 patients with advanced sepsis (14–21 days after sepsis) and 5 healthy subjects was obtained from the Gene Expression Omnibus (GEO) [9].

Screening of m6A methylation regulators

A total of 21 m6A-related genes were obtained from the literature to identify their different m6A modification modes: RNA binding motif protein 15B (RBM15B), insulin-like growth factor binding protein 3 (IGFBP3), IGFBP1, IGFBP2, insulin-like growth factor 2 mRNA binding protein 1 (IGF2BP1), zinc finger CCCH domain-containing protein 13 (ZC3H13), Casitas B-lineage lymphoma-transforming sequence-like protein 1 (CBLL1), YTH N6-methyladenosine RNA binding protein 1 (YTHDF1), YTHDF2, heterogeneous nuclear ribonucleoprotein C (HNRNPC), embryonic lethal-abnormal vision like protein 1 (ELAVL1), methyltransferase-like 3 (METTL3), RNA-binding motif protein X-linked (RBMX), leucine-rich pentatricopeptide repeat (PPR)-motif-containing protein (LRPPRC), METTL16, fat mass and obesity-associated gene (FTO), Wilms tumor 1-associated protein (WTAP), YTH domain containing 1 (YTHDC1), fragile X mental retardation type 1 (FMR1), RBM15, and Heterogeneous nuclear ribonucleoprotein A2B1 (HNRNPA2B1).

Analysis of sepsis characteristic genes based on random forest (RF) algorithm

The m6A methylation regulatory factors differentially expressed in peripheral blood immune cells between healthy subjects and advanced sepsis patients were analyzed using the “limma” R package, with the screening criteria of P < 0.05 [10]. Pearson correlation coefficient was used to analyze the correlation of the differentially expressed m6A methylation regulators (r > 0.3, P < 0.05). The upport vector machine (SVM) algorithm and RF algorithm were performed using the “randomForest” R package to construct a model for predicting the occurrence of advanced sepsis [11, 12]. The prediction accuracy of the model was evaluated using the residual boxplot, the residual reverse cumulative distribution, and the receiver operating characteristic (ROC) curve.

m6A clustering and immunocyte infiltration analysis

Using the “ConsensusClusterPlus” R package, the m6A clusters were identified by consensus clustering based on the m6A methylation regulatory factors [13]. Then, the abundance of 23 immune cells in advanced sepsis was evaluated using single-sample gene set enrichment analysis (ssGSEA) to further investigate the correlation [14,15,16,17].

Statistical analysis

Statistical analysis was performed using R (version 4.1.0). Linear regression analysis and Pearson correlation coefficient (r) were used to determine the correlation between gene expression patterns. Nonparametric one-way analysis of variance (ANOVA) was used to compare the variables between different groups. The comparison between two groups was performed using the t-test. A value of P < 0.05 was indicative of statistical significance.

Results

Differentially expressed m6A methylation regulators in peripheral immune cells in advanced sepsis

There were 21 differentially expressed m6A methylation regulators in peripheral immune cells between healthy subjects and advanced sepsis patients in the GSE175453 dataset. As shown in Fig. 1A, IGFBP1, IGFBP2, IGF2BP1, and WTAP were highly expressed in advanced sepsis patients, and the remaining 17 regulators were poorly expressed. Further analysis showed that there was a positive correlation between the expression of METTL3 (Fig. 1B), METTL16 (Fig. 1C), and FTO. These results suggested that m6A methylation might play an important role in sepsis.

Fig. 1
figure 1

Differential expression of m6A methylation regulatory factors in sepsis. (A) Heatmap analysis of differentially expressed m6A methylation regulators; (B) Correlation analysis of gene expression between METTL3 and FTO; (C) Correlation analysis of gene expression between METTL16 and FTO. Con: control patients; treat: sepsis patients

Identification of two m6A clusters

Consensus cluster analysis was performed using the “ConsensusClusterPlus” R package based on differential m6A methylation regulators, and two m6A clusters (m6A cluster A and m6A cluster B) were identified (Fig. 2A). The results of principal component analysis (PCA) showed that m6A RNA methylation regulators could be classified into two m6A clusters (Fig. 2B). Figure 2 C shows the heat map of the differentially expressed genes in two m6A clusters (Additional file 1: Table S1). IGFBP1, IGFBP2, and IGF2BP1 were highly expressed in cluster B (Additional file 2: Table S2).

Fig. 2
figure 2

Two different m6A clusters. A Consensus clustering matrix of sepsis samples, k = 2; B Principal component analysis of two different m6A clusters; C Heatmap of differentially expressed genes in two m6A clusters

Construction of sepsis prediction model based on RF algorithm

The performance of SVM algorithm and RF algorithm was compared using residual boxplots (Fig. 3A), residual reverse cumulative distribution (Fig. 3B), and ROC curve (Fig. 3C). The results showed that there was no significant difference between the two algorithms. Compared with SVM algorithm, RF algorithm had higher accuracy, so the subsequent screening of disease characteristic genes was based on RF algorithm.

Fig. 3
figure 3

The prediction model of sepsis constructed by random forest (RF). A Boxplots of residual distribution; B Residual inverse cumulative distribution; C Receiver operating characteristic curve

Immunocyte infiltration analysis

Due to the fact that a large number of immune cells are involved in sepsis, we further analyzed the difference in immune cell infiltration between the two m6A clusters. The activated CD4+T cells, activated CD8 + T cells, regulatory T cells, Th2 helper T cells, and Th17 helper T cells showed significant differences between the two m6A clusters (Fig. 4A). In addition, ssGSEA was used to determine the correlation between the expression of 21 m6A methylation regulatory factors and the infiltration of immune cells (Fig. 4B). Correlation heat map analysis showed that IGFBP1, IGFBP2, and IGF2BP1 had a significant positive correlation with Th17 helper T cells. Based on the RF model, the genetic importance of 21 m6A methylation regulators was ranked (Fig. 4C). Then we selected METTL16 gene to further observe its correlation with immune cell infiltration, and found that it was significantly positively correlated with activated CD4+T cells, activated CD8 + T cells, natural killer T cells, regulatory T cells, Th1 helper T cells, and Th2 helper T cells (Fig. 4D).

Fig. 4
figure 4

Immune cell infiltration analysis. A Boxplots showing the infiltration of immune cells in two m6A clusters; B Correlation heatmaps of 21 m6A methylation regulatory factors and immune cell infiltration by single sample gene cluster enrichment analysis (ssGSEA); C Importance of 21 m6A regulatory factors based on random forest (RF) model; D Positive correlation between METTL16 and immune cell infiltration

Discussion

Advanced sepsis is a serious life-threatening infectious disease, and its pathological and physiological mechanisms are still not fully understood [18, 19]. m6A regulatory factors are closely related to human diseases, and abnormal m6A methylation modification is believed to mediate various immune responses and the pathogenesis of autoimmune related diseases [20]. According to the analysis of m6A-single nucleotide polymorphism (SNP) and expression quantitative trait locus (eQTL) dataset, m6A is involved in regulating bacterial infection. Sun et al. identified 1321 genes as the sites of m6A cis-eQTL [21]. These genes are highly enriched in the pathways involving platelet degranulation and staphylococcus aureus infection, which are crucial to the pathophysiological process of sepsis. In addition, multiple studies have reported the typing role of m6A regulatory factors in sepsis and their correlation with immune cells [8, 22]. These findings provide convincing evidence for the correlation between m6A modification and sepsis episodes.

In recent years, the development of single-cell RNA seq technology has provided a novel approach for sepsis research [9, 23]. Compared with traditional gene expression analysis methods, single cell RNA seq can analyze the transcriptome of a single cell, find more cell types and subtypes, and analyze heterogeneous cells more accurately. Researchers have found unique transcriptome patterns of multiple circulating immune cell subtypes, including B- and CD4+, CD8+, activated CD4+, and activated CD8+T lymphocytes, as well as NK, NKT, and plasma cell like dendritic cells in patients with advanced sepsis [9]. However, the specific molecular mechanisms leading to these changes remain unclear, and the therapeutic targets have not been determined yet.

This study focused on analyzing the relationship between m6A methylation modification and multiple lymphocytes in patients with advanced sepsis, and explored potential epigenetic therapeutic targets. In this study, the ssGSEA method was used to evaluate the abundance of 23 immune cells in advanced sepsis. The results showed that IGFBP1, IGFBP2, IGF2BP1, and WTAP were highly expressed in patients with advanced sepsis and m6A cluster B; IGFBP1, IGFBP2, and IGF2BP1 showed a significant positive correlation with Th17 helper T cells. These results all suggested that m6A methylation played a crucial role in immune regulation in advanced sepsis.

It is well-known that RF is a widely used machine-learning algorithm in bioinformatics, which has been proven to be effective in identifying relevant features and classifying samples. RF is particularly suitable for analyzing high-dimensional and complex data sets such as gene expression data, because it can handle a large number of variables and avoid overfitting. In addition, RF can provide a ranking of feature importance, which is valuable for identifying the genes most relevant to the results of interest. Therefore, in our study, we chose to use RF for feature selection and identify the most important genes associated with sepsis [24,25,26,27,28]. Although other algorithms such as xgboost, GLM, and NB are also popular in similar studies, we chose RF because of its advantages in processing complex datasets and providing feature importance rankings. Based on RF, this study screened the characteristic gene METTL16 as the most important gene in advanced sepsis. Like METTL3, the METTL16 gene also belongs to a class I methyltransferase that contains Rossmann fold of class I methyltransferases and uses S-adenosylmethionine (SAM) as the methyl donor. Moreover, METTL16 also contains other regions outside its methyltransferase domain, and these regions also interact with RNA substrates and may provide specificity. Intriguingly, METTL16 is commonly captured in the identification of mRNA binding proteins separated with polyadenosylated mRNA and endogenous interaction proteins, while the components of METTL3/14 complex are not captured. Although many METTL16 RNA interactants have been identified, only two of them have been confirmed as methylation substrates [29], which may indicate that METTL16 has other functions besides catalytic activity. Our study found that METTL16 gene was significantly positively correlated with the proportion of multiple immune cells, suggesting that METTL16 may promote the infiltration of immune cells in the occurrence and development of sepsis.

Conclusions

In this study, the m6A-related genes differentially expressed in healthy subjects and advanced sepsis patients were obtained. Based on the RF model and correlation analysis, IGFBP1, IGFBP2, IGF2BP1, WTAP, and METTL16 were screened out to accelerate the occurrence and development of sepsis by regulating m6A methylation and promoting immune cell infiltration. These genes provide potential drug targets for the early detection, diagnosis, and treatment of sepsis.