An integrative bioinformatics analysis for identifying hub genes associated with infection of lung samples in patients infected with SARS-CoV-2

Xie, Tian-Ao; He, Zhi-Jian; Liang, Chuan; Dong, Hao-Neng; Zhou, Jie; Fan, Shu-Jin; Guo, Xu-Guang

doi:10.1186/s40001-021-00609-4

An integrative bioinformatics analysis for identifying hub genes associated with infection of lung samples in patients infected with SARS-CoV-2

Research
Open access
Published: 17 December 2021

Volume 26, article number 146, (2021)
Cite this article

Download PDF

You have full access to this open access article

European Journal of Medical Research Aims and scope Submit manuscript

An integrative bioinformatics analysis for identifying hub genes associated with infection of lung samples in patients infected with SARS-CoV-2

Download PDF

Tian-Ao Xie^1,2^na1,
Zhi-Jian He^1,2^na1,
Chuan Liang^1,2^na1,
Hao-Neng Dong^1,2,
Jie Zhou^1,2,
Shu-Jin Fan^1,2 &
…
Xu-Guang Guo ORCID: orcid.org/0000-0003-1302-5234^1,2,3,4

2486 Accesses
5 Citations
1 Altmetric
Explore all metrics

Abstract

Background

At the end of 2019, the world witnessed the emergence and ravages of a viral infection induced by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Also known as the coronavirus disease 2019 (COVID-19), it has been identified as a public health emergency of international concern (PHEIC) by the World Health Organization (WHO) because of its severity.

Methods

The gene data of 51 samples were extracted from the GSE150316 and GSE147507 data set and then processed by means of the programming language R, through which the differentially expressed genes (DEGs) that meet the standards were screened. The Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses were performed on the selected DEGs to understand the functions and approaches of DEGs. The online tool STRING was employed to construct a protein–protein interaction (PPI) network of DEGs and, in turn, to identify hub genes.

Results

A total of 52 intersection genes were obtained through DEG identification. Through the GO analysis, we realized that the biological processes (BPs) that have the deepest impact on the human body after SARS-CoV-2 infection are various immune responses. By using STRING to construct a PPI network, 10 hub genes were identified, including IFIH1, DDX58, ISG15, EGR1, OASL, SAMD9, SAMD9L, XAF1, IFITM1, and TNFSF10.

Conclusion

The results of this study will hopefully provide guidance for future studies on the pathophysiological mechanism of SARS-CoV-2 infection.

Exploration and validation of related hub gene expression during SARS-CoV-2 infection of human bronchial organoids

Article Open access 16 March 2021

Identification of potential mRNA panels for severe acute respiratory syndrome coronavirus 2 (COVID-19) diagnosis and treatment using microarray dataset and bioinformatics methods

Article 11 September 2020

Bioinformatics-based investigation on the genetic influence between SARS-CoV-2 infections and idiopathic pulmonary fibrosis (IPF) diseases, and drug repurposing

Article Open access 22 March 2023

Introduction

Currently, a new type of coronavirus, first named as Coronavirus 2019 (COVID-19), has spread rapidly in 212 countries. As of May 25, 2020, more than 5.5 million cases have been diagnosed and over 340,000 people have died of it. The results of genome sequencing have unveiled that this pneumonia is induced by a new type of coronavirus, namely severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) [1].

At the beginning of the outbreak, scientists believed that the disease was first spread from animals to humans, and then from symptomatic people to other humans until the first human-to-human transmission from asymptomatic carriers recorded in Germany [2,3,4]. It has been proved that the new CoV can spread from person to person through breathing droplets. It is worth noting that the respiratory tract may not be the only route of transmission. Specifically, the direct or even indirect contact with the mucous membranes of eyes, mouth, or nose can also spread SARS-CoV-2 [5, 6]. Besides, as documented by a research lately, it is possible that SARS-CoV-2 infection can be spread through the digestive tract [7].

As people in all age groups are vulnerable to SARS-CoV-2, those over the middle age prove to be the most susceptible. A majority of patients admitted to hospital for the diagnosis of COVID-19 average from 48 to 58 years [8]. After SARS-CoV-2 infects human body, it enters the alveolar epithelial cells to rapidly replicate and trigger a powerful immunological reaction, causing damage to lung tissues and the cytokine storm syndrome, which is an important cause of acute respiratory distress syndrome (ARDS) and multiple organ failure [9, 10]. At present, in clinical practice, the infection prevention and control methods are implemented, as well as the supportive nursing is provided for patients. However, no vaccine or specific treatment has been successfully developed for SARS-CoV-2 [11].

Since the emergence of COVID-19, it has been extensively documented by scholars and researchers from all over the world. Recent studies have shown that spike protein is of vital importance in inducing neutralizing antibodies. Vaccines can be developed to specifically identify the spike proteins of SARS and angiotensin-converting enzyme 2 (ACE2) receptors [12]. Although the SARS-CoV-2 and SARS spike protein sequences show some overlaps, viral genetic mutations and increased antibody dependence may affect the efficacy of the vaccine [13]. In order to reduce the occurrence of above situation, it is a good approach to screen out hub genes that are closely related to the pathological process of SARS-CoV-2 invasion into lung cells, and to understand the changes in host cell molecular level during this interaction.

With the technology of microarray and high-throughput sequencing evolving increasingly, genes related to the occurrence, development, diagnosis and treatment of diseases can be identified. In order to better understand the important genes of alveolar epithelial cells that SARS-CoV-2 acts on and to provide more information for vaccine development, this study employed integrated bioinformatics methods to conduct cross-platform research and large-sample survey [14].

Methods and materials

Study process

In this study, we mainly carried out DEGs screening, GO analysis, KEGG analysis, PPI network establishment and statistical verification for gene set data. The specific process is shown in Additional file 1: Figure S1.

Data sources

The Gene Expression Omnibus (GEO) is an open database that stores expression chip data, from which the GSE150316 and the GSE147507 gene expression profiles were obtained. The GSE150316 gene expression profile was obtained from the platform GPL18573 Illumina NextSeq 500 (Homo sapiens), and the GSE147507 gene expression profile was obtained from the analysis of platform GPL18573 Illumina NextSeq 500 (Homo sapiens) and platform GPL28369 Illumina NextSeq 500 (Mustela putorius furo). From the GSE150316 and GSE147507 gene expression profiles, the gene data of 47 samples and that of four samples were selected, respectively, both of which were included for research and analysis. Out of the gene data of 47 samples, 42 were obtained from the lung tissues of 11 COVID-19-positive patients, and five from normal human lung tissues. Out of the gene data of four samples, two were from lung tissue samples of the same COVID-19 positive patient (technical replication), and two were from lung tissues of two normal individuals (a male and a female).

Data processing

The system matrix files and other related files of GSE150316 and GSE147507 were downloaded from the GEO database. R was deployed to convert the counts of these genes into TPM files, and the limma package was used in R language to standardize the data of each group and select the DEGs that meet the standards [15]. The GSE150316 data set is based on |log₂ FC|≥ 1, p-value < 0.05 as the standard, and the standard of the GSE147507 data set is |log2 FC|≥ 2, p-value < 0.01—both meet the statistical standard. Therefore, this standard was adopted to distinguish genes with significant changes in expression fold change from other genes. All eligible DEGs were included in this study [16].

Identification of DEGs

The GO analysis and the KEGG enrichment analysis, which can perform functional enrichment analysis and pathway enrichment analysis on DEGs, respectively, were employed to delve into the biological functions of DEGs. GO is a biological information resource that stores computable knowledge of the functions of genes and gene products and describes the biological functions of genes and gene products in living bodies through annotations that have been confirmed by relevant studies, and it has become the main annotation method for high-throughput sequences [17, 18]. In the GO analysis, it is widely accepted that p < 0.05 is statistically significant. This step enables the identification of genes that have significant effects in the biological process (BP), cellular component (CC), and molecular function (MF) [19]. KEGG, which is composed of the PATHWAY database, LIGAND database, and GENES database, is a resource library with genomic sequences and related molecular data obtained from other high-throughput experimental analyses. KEGG boasts of powerful image tools that clearly display various biological metabolic pathways and the connections between them. In the KEGG analysis, p < 0.05 is considered to be of statistical significance [20].

Construction of the protein–protein interaction (PPI) network and identification of hub genes

The PPI network consists of individual proteins that interact with each other to be involved in a variety of life activities, such as the transmission of biological signals, regulation of gene expression, metabolism of energy and substance, as well as the regulation of cell cycle. Analyzing protein interaction networks allows us to better understand how proteins work and function in biological systems. Therefore, the online tool STRING (version 11.0) (http://string-db.org/) (medium confidence > 0.4), to which the DEGs were uploaded, was employed to construct the PPI network. Following this, the PPI network was visualized with the help of the software Cytoscape (version 7.3.2) (the default parameters). Hub genes were defined as those with a degree in the top 10.

Performing non-paired t-test for verification

To ensure that the 10 hub genes obtained in this study are worthy of further study, the data of GSE150728 were used to verify them. The data of the control group and the infection group of these 10 hub genes in GSE150728 were extracted and a non-paired t-test analysis was performed using GraphPad Prism (version 8.0.2). p-value < 0.05 was considered to be statistically significant.

Result

The DEGs among GSE150316 and GSE147507

The gene data of 51 samples were obtained from GSE150316 and GSE147507 gene expression profiles. Detailed information about the data sources of this research is described in Table 1. First, according to abs |log FC|> 2 and p-value < 0.01, 1107, DEGs were selected from the gene data of four samples of GSE147507. Among them, 384 genes were up-regulated and 723 genes were down-regulated. According to abs |log FC|> 1, p-value < 0.05, 643, DEGs were selected from the gene data of 47 samples of GSE150316. Among them, 449 genes were up-regulated and 194 genes were down-regulated. Additionally, there are 52 intersection genes between GSE147507 and GSE150316 (refer to Additional file 2: Table S1 and Fig. 1). Details of their function and fold change are presented in Additional file 3: Table S2 (The above information of gene functions comes from GeneCards) (http://www.genecards.org). Then, the volcano map and the heat map of the DEGs were plotted for GSE147507 and GSE150316 (refer to Fig. 2A-D).

Table 1 Details of the data sources for this study

Full size table

Enrichment analysis of the pathway and process in SARS-CoV-2 infection

The GO analysis demonstrated that in GSE147507, the parts that exert significant influence on the annotation of biological processes (BP) are the neutrophil activation involved in immune response and neutrophil activation. Moreover, in GSE150316, humoral immune response and complement activation have significant effects on the annotation of biological processes (BP). Those parts that are statistically significant in the three processes of GO analysis, BP, CC, and MF, are shown in Fig. 3A, B, respectively. Additional file 3: Table S2 shows a collection of the 20 most important GO enrichment items in the BP of GSE147507 and GSE150316.

Analysis of the KEGG pathway

In order to better identify the biological function of DEGs, an in-depth analysis of DEG was conducted by means of KEGG, where p-value < 0.05 was considered to be statistically significant. In system matrix files GSE147507, the results demonstrated that 20 meaningful approaches were analyzed in total. The cell signaling pathways significantly associated with the SARS-CoV-2 infection include, but are not limited to osteoclast differentiation, chemokine signaling pathway, Yersinia infection, NOD-like receptor signaling pathway, and C-type lectin receptor signaling pathway. The most abundant KEGG path information of the system matrix files GSE147507 and GSE150316 is shown in Additional file 4: Table S3. However, the KEGG path information of GSE150316 is not obvious, and it is not shown in Additional file 4: Table S3. All the specific enrichment pathways obtained from the analysis of the DEGs of the system matrix file GSE147507 are shown in Fig. 4A. In the system matrix file GSE150316, we did not obtain highly significant signal pathway results after analysis, but, as can be seen in Fig. 4B, it may be related to the circadian rhythm, hematopoietic cell lineage, p53 signaling pathway, viral protein interaction with cytokine and cytokine receptor, and vitamin digestion and absorption.

Hub genes identification with DEGs protein–protein interaction network (PPI)

From a physiological point of view, proteins rarely function on their own and usually interact with each other within a network. Therefore, this research constructed the PPI network on STRING (version 11.0). Finally, it was observed that the PPI network of GSE147507 has 371 nodes and 442 edges (refer to Fig. 5), the PPI network of GSE150316 has 75 nodes and 122 edges (refer to Fig. 6), and the PPI network of intersection has 26 nodes and 59 edges (Fig. 7). In addition, the PPI network was visualized using Cytoscape (version 7.3.2), and the 10 hub genes were identified as IFIH1, DDX58, ISG15, EGR1, OASL, SAMD9, SAMD9L, XAF1, IFITM1, and TNFSF10 (Fig. 8). Their functions are shown in Table 2.

Table 2 The function of 10 hub genes

Full size table

Verification of the hub gene through non-paired t-test

The results of the t-test are demonstrated in Fig. 9. There are six hub genes that have statistical significance: IFIH1, SAMD9L, ISG15, XAF1, OASL, and TNFSF10. The p-values of SAMD9, DDX58, EGR1, and IFITM1 were 0.5561, 0.3529, 0.5915, and 0.8628, respectively, which are all greater than 0.05. Thus, they were not statistically significant. This study will not conduct further research on these four hub genes.

Discussion

At the beginning of 2020, SARS-CoV-2 triggered a worldwide outbreak of COVID-19 pneumonia, rendering it imperative to acquire better knowledge about SARS-CoV-2 and create a vaccine to prevent its spread among the populace [21]. Therefore, uncovering the potential molecular mechanism of COVID-19 is of paramount importance. The gene expression profile of the GSE150316 and GSE147507 data sets were used to screen DEGs. As a result, 1107 DEGs were identified in the GSE147507, including 384 up-regulated DEGs and 723 down-regulated DEGs, and 643 DEGs were identified in the GSE150316, including 449 up-regulated DEGs and 194 down-regulated DEGs. The GO analysis was utilized to perform a functional enrichment analysis on the DEGs obtained, the results of which demonstrated the significantly enriched disease-related BP, CC, and MF. Meaningful enrichment was also reported through the KEGG analysis. Next, the PPI network construction, module analysis, and central gene identification were performed to screen a total of 10 important hub genes that may play a key regulatory role in the pathophysiology of COVID-19.

As shown in the graph describing the results of the GO analysis, it is generally believed that the smaller the p-value of the GO item, the more significant the enrichment of DEG in the GO item. For GSE147507, the functions of neutrophil activation involved in immune response, neutrophil activation and neutrophil-mediated immunity are the most significant parts of biological processes. The manner in which SARS-CoV-2 invades host cells is related to the angiotensin-converting enzyme 2 (ACE2), and SARS-CoV-2 invades human cells by binding to ACE2 [22]. The human body’s pattern recognition receptor recognizes the SARS-CoV-2 virus antigen and presents it to natural killer cells and CD8-positive cytotoxic T cells, which activate the body’s innate immunity and adaptive immunity and triggers a large number of pro-inflammatory cytokines and chemotaxis in the body factor generation. Some pro-inflammatory cytokines can activate neutrophils. With the development of COVID-19, the level of neutrophils in the blood continues to rise. Neutrophils participate in the immune response and fight against cells through the engulfment of microbes, the formation of reactive oxygen species, degranulation, the secretion of antimicrobials, and the formation of increased neutrophil extracellular traps [23, 24]. In terms of biological processes, the functions of the humoral immune response, complement activation, and protein activation cascade are the most significant for GSE150316. After the human body is infected with SARS-CoV-2, the humoral immune response and cellular immune response are activated. In the humoral immune response, plasma cells produce immunoglobulins that bind to antigens on the surface of SARS-CoV-2 to form an immunoglobulin complex. The immunoglobulin complex and the mannose-binding lectin that is involved in complement activation form the classical pathway and the lectin pathway, respectively. These two ways of activating complement are essentially involve cascade protein activation [25]. At the same time, immunoglobulin can also bind to the immunoglobulin receptor of the cells invaded by SARS-CoV-2 to perform phagocytosis and adhesion [22, 23, 26].

Based on the enrichment analysis of the KEGG pathway, it was found that certain cell signal transduction pathways are closely related to SARS-CoV-2 infection. When the human body is infected with the SARS-CoV-2 virus, the entry of SARS-CoV-2 into the alveolar epithelial cells causes the damage of lung tissue and leads to the uncontrolled production of pro-inflammatory cytokines [10, 11, 27]. SARS-CoV-2 mainly infects epithelial cells in the lung and results in the accumulation of white blood cells in injured or infected tissues. More importantly, the virus can enter macrophages and dendritic cells [28, 29]. The infection of these cells plays an important role in inducing pro-inflammatory cytokines that may cause disease [30]. In fact, many cytokines and chemokines are produced by macrophages and dendritic cells and their levels are elevated in the serum of patients infected by SARS-CoV-2 [31]. In response to injury or infection, the expression of the chemokine signaling pathways and the NOD-like receptor signaling pathways in the tissues is enhanced. In the signal transduction pathway of chemokines, the chemokines bind and activate chemokine receptors to embed the chemokine receptors in G protein-coupled receptors (GPCRs) in the cell membranes of leukocytes, thereby inducing leukocytes to change their adhesion and shape, adhere to the blood vessel wall, and penetrate the inflamed tissue along its chemotactic factor gradient [32]. The accumulated white blood cells remove pathogens and necrotic tissues through phagocytosis and proteolysis. In addition to their participation in leukocyte trafficking, chemokines can also produce a variety of other cells and participate in tissue responses, including proliferation, activation, differentiation, extracellular matrix remodeling, and angiogenesis, which may be related to tissue repair and reconstruction [33,34,35,36]. Moreover, some chemokines and receptors are constitutively expressed in specific tissues and cell types that can promote homeostasis, such as T cell development, stem cell migration, and lymphoid organogenesis [37]. When the SARS-CoV-2 virus invades human cells, it can also exert the body’s immune function through the processes of inflammasome assembly, signal transduction, transcription activation, and autophagy through the activated NOD-like receptor signal transduction pathway [38]. The NOD-like receptor signal transduction pathway and the Yersinia infection pathway can activate caspase-1 by the inflammasome of the multimeric protein complex. The activated caspase-1 leads to the processing and maturation of the pro-inflammatory cytokines, interleukin (IL)-1β and IL-18, which participate in the body’s immune response and lead to the death of specific inflammatory cells [39, 40]. Additionally, NOD1 and NOD2 can activate the serine/threonine kinase of NF-kB [41] and activate the mitogen-activated protein kinase (MAPK) signaling pathway, leading to the secretion of pro-inflammatory cytokines [42, 43]. It plays an important role in resisting SARS-CoV-2 infection and regulating host immune response. NOD2 can also sense the ssRNA of the virus and then activate it to produce interferon and antiviral defense.

The 10 pivotal genes screened through the PPI network were verified by the non-paired t-test, and four meaningless genes were found. The remaining six were all up-regulated genes that are closely related to the pathological process of SARS-CoV-2 infection in lung cells and may become new therapeutic targets or research directions. Over the past 15 years, ubiquitin-like protein ISG15 has been widely regarded as a major participant in the host antiviral response, and recent work has shown that it can directly inhibit viral replication and modulate host immunity [44]. OASL and SAMD9L have been shown to be important in viral infection and innate immunity, but the mechanism of their action is different from that of ISG15 [45, 46]. Studies have found that OASL makes the RNA detection system based on RIG-I more sensitive to viral RNA, which can be activated under viral infections that are relatively below the threshold level, and the SARS-Cov-2 virus in the body can be found faster [47]. The mechanism by which the expression of SAMD9L affects virus replication in varying degrees is not yet fully understood, but it has been reported that SAMD9L can inhibit West Nile virus replication [48, 49]. Similarly, the significant expression of IFIH1 and TSFSF10 is beneficial to the human body. Experiments have shown that the deficiency of IFIH1 can lead to primary immunodeficiency, which manifests as extreme susceptibility to common respiratory RNA viruses, such as human respiratory syncytial virus and rhinoviruses [50]. Moreover, the death receptor of TNFSF10 contributes to immune surveillance against viral infection by promoting apoptosis. It should be noted that we must pay close attention to determine whether SARS-CoV-2 escapes by regulating TNFSF10 receptor signal transduction [51]. Unlike the other five hub genes, the significant expression of XAF1 has a negative impact on the human body and may become a therapeutic target for COVID-19. From a previous study, we found that the expression of XAF1 was high in DENV2-infected VECs. After this expression of XAF1 was enhanced, as much as 28% of cells entered irreversible apoptosis 72 h after infection. Thus, it is very important to inhibit the expression of XAF1 during the treatment of COVID-19⁵².

There are certain limitations to this study. First, the conclusion is based on data collected from public databases, rather than from actual experiments. Enough clinical samples should be employed to ensure more accurate results. Second, the mechanism of several key genes in the pathological process of COVID-19 has not been fully understood. Therefore, further research and larger samples are needed.

Conclusion

Our results indicate that the ten hub genes IFIH1, DDX58, ISG15, EGR1, OASL, SAMD9, SAMD9L, XAF1, IFITM1 and TNFSF10 may play an important regulatory role in SARS-CoV-2 infection. It may has guiding significance for researchers to study the infection mechanism of SARS-CoV-2 in the future.

Availability of data and materials

Not applicable.

References

Xu X, Chen P, Wang J, et al. Evolution of the novel coronavirus from the ongoing Wuhan outbreak and modeling of its spike protein for risk of human transmission. Sci China Life Sci. 2020;63:457–60. https://doi.org/10.1007/s11427-020-1637-5.
Article CAS PubMed PubMed Central Google Scholar
Chan JF, Yuan S, Kok KH, et al. A familial cluster of pneumonia associated with the 2019 novel coronavirus indicating person-to-person transmission: a study of a family cluster. Lancet. 2020;395:514–23. https://doi.org/10.1016/s0140-6736(20)30154-9.
Article CAS PubMed PubMed Central Google Scholar
Rothe C, Schunk M, Sothmann P, et al. Transmission of 2019-nCoV Infection from an Asymptomatic Contact in Germany. N Engl J Med. 2020;382:970–1. https://doi.org/10.1056/NEJMc2001468.
Article PubMed PubMed Central Google Scholar
Chan JF, Kok KH, Zhu Z, et al. Genomic characterization of the 2019 novel human-pathogenic coronavirus isolated from a patient with atypical pneumonia after visiting Wuhan. Emerg Microbes Infect. 2020;9:221–36. https://doi.org/10.1080/22221751.2020.1719902.
Article CAS PubMed PubMed Central Google Scholar
Lu CW, Liu XF, Jia ZF. 2019-nCoV transmission through the ocular surface must not be ignored. Lancet. 2020;395: e39. https://doi.org/10.1016/s0140-6736(20)30313-5.
Article CAS PubMed PubMed Central Google Scholar
Xia J, Tong J, Liu M, Shen Y, Guo D. Evaluation of coronavirus in tears and conjunctival secretions of patients with SARS-CoV-2 infection. J Med Virol. 2020. https://doi.org/10.1002/jmv.25725.
Article PubMed PubMed Central Google Scholar
Zhang H, Kang Z, Gong H, et al. The digestive system is a potential route of 2019-nCov infection: a bioinformatics analysis based on single-cell transcriptomes. BioRxiv. 2020. https://doi.org/10.1101/2020.01.30.927806.
Article PubMed PubMed Central Google Scholar
Huang C, Wang Y, Li X, et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan. China Lancet. 2020;395:497–506. https://doi.org/10.1016/s0140-6736(20)30183-5.
Article CAS PubMed Google Scholar
Abd El-Aziz TM, Stockand JD. Recent progress and challenges in drug development against COVID-19 coronavirus (SARS-CoV-2) - an update on the status. Infect Genet Evol. 2020;83: 104327. https://doi.org/10.1016/j.meegid.2020.104327.
Article CAS PubMed PubMed Central Google Scholar
Villar J, Zhang H, Slutsky AS. Lung Repair and Regeneration in ARDS: Role of PECAM1 and Wnt Signaling. Chest. 2019;155:587–94. https://doi.org/10.1016/j.chest.2018.10.022.
Article CAS PubMed Google Scholar
Channappanavar R, Perlman S. Pathogenic human coronavirus infections: causes and consequences of cytokine storm and immunopathology. Semin Immunopathol. 2017;39:529–39. https://doi.org/10.1007/s00281-017-0629-x.
Article CAS PubMed PubMed Central Google Scholar
Du L, He Y, Zhou Y, Liu S, Zheng BJ, Jiang S. The spike protein of SARS-CoV–a target for vaccine and therapeutic development. Nat Rev Microbiol. 2009;7:226–36. https://doi.org/10.1038/nrmicro2090.
Article CAS PubMed PubMed Central Google Scholar
Peeples L. News Feature: Avoiding pitfalls in the pursuit of a COVID-19 vaccine. Proc Natl Acad Sci U S A. 2020;117:8218–21. https://doi.org/10.1073/pnas.2005456117.
Article CAS PubMed PubMed Central Google Scholar
Pollack JR. DNA microarray technology Introduction. Methods Mol Biol. 2009;556:1–6. https://doi.org/10.1007/978-1-60327-192-9_1.
Article CAS PubMed Google Scholar
Denny P, Feuermann M, Hill DP, Lovering RC, Plun-Favreau H, Roncaglia P. Exploring autophagy with Gene Ontology. Autophagy. 2018;14:419–36. https://doi.org/10.1080/15548627.2017.1415189.
Article CAS PubMed PubMed Central Google Scholar
Gautier L, Cope L, Bolstad BM, Irizarry RA. affy–analysis of Affymetrix GeneChip data at the probe level. Bioinformatics (Oxford, England). 2004;20:307–15. https://doi.org/10.1093/bioinformatics/btg405.
Article CAS Google Scholar
Paradis E, Claude J, Strimmer K. APE: Analyses of Phylogenetics and Evolution in R language. Bioinformatics (Oxford, England). 2004;20:289–90. https://doi.org/10.1093/bioinformatics/btg412.
Article CAS Google Scholar
Quan L, Wang Y, Liang J, Shi J, Zhang Y, Tao K. Identification of the interaction network of hub genes for melanoma treated with vemurafenib based on microarray data. Tumori. 2015;101:368–74. https://doi.org/10.5301/tj.5000316.
Article CAS PubMed Google Scholar
Kanehisa M, Sato Y, Kawashima M, Furumichi M, Tanabe M. KEGG as a reference resource for gene and protein annotation. Nucleic Acids Res. 2016;44:D457–62. https://doi.org/10.1093/nar/gkv1070.
Article CAS PubMed Google Scholar
The Gene Ontology Consortium. Expansion of the Gene Ontology knowledgebase and resources. Nucleic Acids Res. 2017;45:D331–8. https://doi.org/10.1093/nar/gkw1108.
Article CAS Google Scholar
Ahmed SF, Quadeer AA, McKay MR. Preliminary Identification of Potential Vaccine Targets for the COVID-19 Coronavirus (SARS-CoV-2) Based on SARS-CoV Immunological Studies. Viruses. 2020. https://doi.org/10.3390/v12030254.
Article PubMed PubMed Central Google Scholar
Tay MZ, Poh CM, Rénia L, MacAry PA, Ng LFP. The trinity of COVID-19: immunity, inflammation and intervention. Nat Rev Immunol. 2020;20:363–74. https://doi.org/10.1038/s41577-020-0311-8.
Article CAS PubMed Google Scholar
Wang J, Jiang M, Chen X, Montaner LJ. Cytokine storm and leukocyte changes in mild versus severe SARS-CoV-2 infection: Review of 3939 COVID-19 patients in China and emerging pathogenesis and therapy concepts. J Leukoc Biol. 2020;108:17–41. https://doi.org/10.1002/jlb.3covr0520-272r.
Article CAS PubMed Google Scholar
Soy M, Keser G, Atagündüz P, Tabak F, Atagündüz I, Kayhan S. Cytokine storm in COVID-19: pathogenesis and overview of anti-inflammatory agents used in treatment. Clin Rheumatol. 2020;39:2085–94. https://doi.org/10.1007/s10067-020-05190-5.
Article PubMed PubMed Central Google Scholar
Gralinski LE, Sheahan TP, Morrison TE, et al. Complement Activation Contributes to Severe Acute Respiratory Syndrome Coronavirus Pathogenesis. MBio. 2018. https://doi.org/10.1128/mBio.01753-18.
Article PubMed PubMed Central Google Scholar
Noris M, Benigni A, Remuzzi G. The case of complement activation in COVID-19 multiorgan impact. Kidney Int. 2020;98:314–22. https://doi.org/10.1016/j.kint.2020.05.013.
Article CAS PubMed PubMed Central Google Scholar
Wang H, Ma S. The cytokine storm and factors determining the sequence and severity of organ dysfunction in multiple organ dysfunction syndrome. Am J Emerg Med. 2008;26:711–5. https://doi.org/10.1016/j.ajem.2007.10.031.
Article PubMed Google Scholar
Peiris JS, Chu CM, Cheng VC, et al. Clinical progression and viral load in a community outbreak of coronavirus-associated SARS pneumonia: a prospective study. Lancet. 2003;361:1767–72. https://doi.org/10.1016/s0140-6736(03)13412-5.
Article CAS PubMed PubMed Central Google Scholar
Spiegel M, Schneider K, Weber F, Weidmann M, Hufert FT. Interaction of severe acute respiratory syndrome-associated coronavirus with dendritic cells. J Gen Virol. 2006;87:1953–60. https://doi.org/10.1099/vir.0.81624-0.
Article CAS PubMed Google Scholar
Law HK, Cheung CY, Ng HY, et al. Chemokine up-regulation in SARS-coronavirus-infected, monocyte-derived human dendritic cells. Blood. 2005;106:2366–74. https://doi.org/10.1182/blood-2004-10-4166.
Article CAS PubMed Google Scholar
Lau YL, Peiris JS. Pathogenesis of severe acute respiratory syndrome. Curr Opin Immunol. 2005;17:404–10. https://doi.org/10.1016/j.coi.2005.05.009.
Article CAS PubMed PubMed Central Google Scholar
Moser B, Wolf M, Walz A, Loetscher P. Chemokines: multiple levels of leukocyte migration control. Trends Immunol. 2004;25:75–84. https://doi.org/10.1016/j.it.2003.12.005.
Article CAS PubMed Google Scholar
Dimberg A. Chemokines in angiogenesis. Curr Top Microbiol Immunol. 2010;341:59–80. https://doi.org/10.1007/82_2010_21.
Article CAS PubMed Google Scholar
Speyer CL, Ward PA. Role of endothelial chemokines and their receptors during inflammation. J Invest Surg. 2011;24:18–27. https://doi.org/10.3109/08941939.2010.521232.
Article PubMed Google Scholar
Ben-Baruch A. The multifaceted roles of chemokines in malignancy. Cancer Metastasis Rev. 2006;25:357–71. https://doi.org/10.1007/s10555-006-9003-5.
Article CAS PubMed Google Scholar
Luther SA, Cyster JG. Chemokines as regulators of T cell differentiation. Nat Immunol. 2001;2:102–7. https://doi.org/10.1038/84205.
Article CAS PubMed Google Scholar
Zlotnik A, Burkhardt AM, Homey B. Homeostatic chemokine receptors and organ-specific metastasis. Nat Rev Immunol. 2011;11:597–606. https://doi.org/10.1038/nri3049.
Article CAS PubMed Google Scholar
Motta V, Soares F, Sun T, Philpott DJ. NOD-like receptors: versatile cytosolic sentinels. Physiol Rev. 2015;95:149–78. https://doi.org/10.1152/physrev.00009.2014.
Article PubMed Google Scholar
Davis BK, Wen H, Ting JP. The inflammasome NLRs in immunity, inflammation, and associated diseases. Annu Rev Immunol. 2011;29:707–35. https://doi.org/10.1146/annurev-immunol-031210-101405.
Article CAS PubMed PubMed Central Google Scholar
Jesus AA, Goldbach-Mansky R. IL-1 blockade in autoinflammatory syndromes. Annu Rev Med. 2014;65:223–44. https://doi.org/10.1146/annurev-med-061512-150641.
Article CAS PubMed PubMed Central Google Scholar
Kobayashi K, Inohara N, Hernandez LD, et al. RICK/Rip2/CARDIAK mediates signalling for receptors of the innate and adaptive immune systems. Nature. 2002;416:194–9. https://doi.org/10.1038/416194a.
Article CAS PubMed Google Scholar
Girardin SE, Tournebize R, Mavris M, et al. CARD4/Nod1 mediates NF-kappaB and JNK activation by invasive Shigella flexneri. EMBO Rep. 2001;2:736–42. https://doi.org/10.1093/embo-reports/kve155.
Article CAS PubMed PubMed Central Google Scholar
Inohara N, Ogura Y, Fontalba A, et al. Host recognition of bacterial muramyl dipeptide mediated through NOD2. Implications for Crohn’s disease. J Biol Chem. 2003;278:5509–12. https://doi.org/10.1074/jbc.C200673200.
Article CAS PubMed Google Scholar
Perng YC, Lenschow DJ. ISG15 in antiviral immunity and beyond. Nat Rev Microbiol. 2018;16:423–39. https://doi.org/10.1038/s41579-018-0020-5.
Article CAS PubMed PubMed Central Google Scholar
Zhu J, Ghosh A, Sarkar SN. OASL-a new player in controlling antiviral innate immunity. Curr Opin Virol. 2015;12:15–9. https://doi.org/10.1016/j.coviro.2015.01.010.
Article CAS PubMed PubMed Central Google Scholar
Liu J, Wennier S, Zhang L, McFadden G. M062 is a host range factor essential for myxoma virus pathogenesis and functions as an antagonist of host SAMD9 in human cells. J Virol. 2011;85:3270–82. https://doi.org/10.1128/jvi.02243-10.
Article CAS PubMed PubMed Central Google Scholar
Zhu J, Zhang Y, Ghosh A, et al. Antiviral activity of human OASL protein is mediated by enhancing signaling of the RIG-I RNA sensor. Immunity. 2014;40:936–48. https://doi.org/10.1016/j.immuni.2014.05.007.
Article CAS PubMed PubMed Central Google Scholar
Li J, Ding SC, Cho H, et al. A short hairpin RNA screen of interferon-stimulated genes identifies a novel negative regulator of the cellular antiviral response. MBio. 2013;4:e00385-e1313. https://doi.org/10.1128/mBio.00385-13.
Article CAS PubMed PubMed Central Google Scholar
Boon AC, Williams RW, Sinasac DS, Webby RJ. A novel genetic locus linked to pro-inflammatory cytokines after virulent H5N1 virus infection in mice. BMC Genomics. 2014;15:1017. https://doi.org/10.1186/1471-2164-15-1017.
Article CAS PubMed PubMed Central Google Scholar
Asgari S, Schlapbach LJ, Anchisi S, et al. Severe viral respiratory infections in children with IFIH1 loss-of-function mutations. Proc Natl Acad Sci U S A. 2017;114:8342–7. https://doi.org/10.1073/pnas.1704259114.
Article CAS PubMed PubMed Central Google Scholar
Shin GC, Kang HS, Lee AR, Kim KH. Hepatitis B virus-triggered autophagy targets TNFRSF10B/death receptor 5 for degradation to limit TNFSF10/TRAIL response. Autophagy. 2016;12:2451–66. https://doi.org/10.1080/15548627.2016.1239002.
Article CAS PubMed PubMed Central Google Scholar
Zhu C, Li B, Frontzek K, Liu Y, Aguzzi A. SARM1 deficiency up-regulates XAF1, promotes neuronal apoptosis, and accelerates prion disease. J Exp Med. 2019;216:743–56. https://doi.org/10.1084/jem.20171885.
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We sincerely thank the third clinical college of Guangzhou medical university for its technical support.

Funding

No funding.

Author information

Tian-Ao Xie, Zhi-Jian He and Chuan Liang contributed equally to this work

Authors and Affiliations

Department of Clinical Laboratory Medicine, The Third Affiliated Hospital of Guangzhou Medical University, Guangzhou, 510150, China
Tian-Ao Xie, Zhi-Jian He, Chuan Liang, Hao-Neng Dong, Jie Zhou, Shu-Jin Fan & Xu-Guang Guo
Department of Clinical Medicine, The Third Clinical School of Guangzhou Medical University, Guangzhou, 511436, China
Tian-Ao Xie, Zhi-Jian He, Chuan Liang, Hao-Neng Dong, Jie Zhou, Shu-Jin Fan & Xu-Guang Guo
Key Laboratory for Major Obstetric Diseases of Guangdong Province, The Third Affiliated Hospital of Guangzhou Medical University, Guangzhou, 510150, China
Xu-Guang Guo
Key Laboratory of Reproduction and Genetics of Guangdong Higher Education Institutes, The Third Affiliated Hospital of Guangzhou Medical University, Guangzhou, 510150, China
Xu-Guang Guo

Authors

Tian-Ao Xie
View author publications
You can also search for this author in PubMed Google Scholar
Zhi-Jian He
View author publications
You can also search for this author in PubMed Google Scholar
Chuan Liang
View author publications
You can also search for this author in PubMed Google Scholar
Hao-Neng Dong
View author publications
You can also search for this author in PubMed Google Scholar
Jie Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Shu-Jin Fan
View author publications
You can also search for this author in PubMed Google Scholar
Xu-Guang Guo
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

XGG, TAX conceived and designed the experiments. TAX, ZJH, CL, HND, JZ and SJF contributed to making tables and figures. All authors participated in the writing, reading, and revising of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Xu-Guang Guo.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Figure S1.

Flowchart of data preparation, processing, and analysis in this study.

Additional file 2: Table S1.

Up-regulated genes and down-regulated genes that meet the screening criteria.

Additional file 3: Table S2.

The function and fold change of 52 identified genes.

Additional file 4: Table S3.

BP pathways of GSE147507 and GSE150316 that are heavily enriched in GO analysis (ranked in the top 10 according to p value).

Additional file 5: Table S4.

Ten most significantly enriched KEGG pathways in the system matrix file GSE147507 and GSE150316.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Xie, TA., He, ZJ., Liang, C. et al. An integrative bioinformatics analysis for identifying hub genes associated with infection of lung samples in patients infected with SARS-CoV-2. Eur J Med Res 26, 146 (2021). https://doi.org/10.1186/s40001-021-00609-4

Download citation

Received: 18 August 2021
Accepted: 14 November 2021
Published: 17 December 2021
DOI: https://doi.org/10.1186/s40001-021-00609-4

An integrative bioinformatics analysis for identifying hub genes associated with infection of lung samples in patients infected with SARS-CoV-2

Abstract

Background

Methods

Results

Conclusion

Similar content being viewed by others

Introduction

Methods and materials

Study process

Data sources

Data processing

Identification of DEGs

Construction of the protein–protein interaction (PPI) network and identification of hub genes

Performing non-paired t-test for verification

Result

The DEGs among GSE150316 and GSE147507

Enrichment analysis of the pathway and process in SARS-CoV-2 infection

Analysis of the KEGG pathway

Hub genes identification with DEGs protein–protein interaction network (PPI)

Verification of the hub gene through non-paired t-test

Discussion

Conclusion

Availability of data and materials

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher's Note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation