Co-expression analysis to identify key modules and hub genes associated with COVID-19 in platelets

Alarabi, Ahmed B.; Mohsen, Attayeb; Mizuguchi, Kenji; Alshbool, Fatima Z.; Khasawneh, Fadi T.

doi:10.1186/s12920-022-01222-y

Co-expression analysis to identify key modules and hub genes associated with COVID-19 in platelets

Research
Open access
Published: 14 April 2022

Volume 15, article number 83, (2022)
Cite this article

Download PDF

You have full access to this open access article

BMC Medical Genomics Aims and scope Submit manuscript

Co-expression analysis to identify key modules and hub genes associated with COVID-19 in platelets

Download PDF

3213 Accesses
9 Citations
5 Altmetric
Explore all metrics

Abstract

Corona virus disease 2019 (COVID-19) increases the risk of cardiovascular occlusive/thrombotic events and is linked to poor outcomes. The underlying pathophysiological processes are complex, and remain poorly understood. To this end, platelets play important roles in regulating the cardiovascular system, including via contributions to coagulation and inflammation. There is ample evidence that circulating platelets are activated in COVID-19 patients, which is a primary driver of the observed thrombotic outcome. However, the comprehensive molecular basis of platelet activation in COVID-19 disease remains elusive, which warrants more investigation. Hence, we employed gene co-expression network analysis combined with pathways enrichment analysis to further investigate the aforementioned issues. Our study revealed three important gene clusters/modules that were closely related to COVID-19. These cluster of genes successfully identify COVID-19 cases, relative to healthy in a separate validation data set using machine learning, thereby validating our findings. Furthermore, enrichment analysis showed that these three modules were mostly related to platelet metabolism, protein translation, mitochondrial activity, and oxidative phosphorylation, as well as regulation of megakaryocyte differentiation, and apoptosis, suggesting a hyperactivation status of platelets in COVID-19. We identified the three hub genes from each of three key modules according to their intramodular connectivity value ranking, namely: COPE, CDC37, CAPNS1, AURKAIP1, LAMTOR2, GABARAP MT-ND1, MT-ND5, and MTRNR2L12. Collectively, our results offer a new and interesting insight into platelet involvement in COVID-19 disease at the molecular level, which might aid in defining new targets for treatment of COVID-19–induced thrombosis.

View this article's peer review reports

Transcriptional landscape of circulating platelets from patients with COVID-19 reveals key subnetworks and regulators underlying SARS-CoV-2 infection: implications for immunothrombosis

Article Open access 09 February 2022

Differentially expressed platelet activation-related genes in dogs with stage B2 myxomatous mitral valve disease

Article Open access 13 December 2023

Analysis and Regulatory Mechanisms of Platelet-Related Genes in Patients with Ischemic Stroke

Article 04 January 2024

Introduction

The coronavirus SARS-CoV-2 is a highly contagious infection that causes a severe respiratory disease known as COVID-19. This disease that has reached a pandemic level, is impacting tens of millions of people worldwide. In the United States, there are around 78 million reported cases, over 4 million hospital admissions, and 900 thousand deaths as of February 2022 [1]. It is now known that COVID-19-induced thrombosis increases the incidence of cardiovascular occlusive events in infected patients, a fact that has been reported in several studies [2,3,4], Indeed, abnormal hemostasis responses were observed in COVID-19 hospitalized patients, which was linked to poor prognosis [2, 5, 6] In addition, studies have shown that COVID-19 leads to increase in platelet activation through alterations of platelet transcriptome and proteome [7, 8]. In this connection, it is now well established that platelets play roles beyond vascular hemostasis, including innate immunity and tumor metastasis [9]. Moreover, platelets were shown to be activated in the septic state, and antiplatelet therapy has been used as a strategy to prevent organ damage in sepsis [10]. To this end, evidence has indicated that viral infections are associated with coagulation disorders, and thrombotic cardiovascular events [11, 12], which is consistent with the thrombotic phenotype seen in COVID-19 patients/SARS-CoV-2 viral infection. While there has been some progress, our understanding of the pathways that govern platelet participation in COVID-19–induced thrombosis remains limited, but clearly warrants investigation.

To obtain a comprehensive insight into the pathogenesis of specific disease states, several computational and research methods have been developed [13]. Some of these approaches were employed to examine the potential gene networks, which are very instrumental to guide understanding of diseases and their mechanistic pathways. Notably, co-expression analysis is one such approach, which clusters genes into coexpressed groups known as modules. These genes that belong to the same module are thought to share functional properties [14]. This approach relies on using graph theory concepts that allow researchers to understand in a systematic way the relations between the genes of a module and the phenotype based on the module eigingene [14]. In fact, co-expression using weighted correlation network analysis (WGCNA) has been used for analyzing a number of biological processes, including cancer [15, 16] and cognitive and mental disorders [17, 18]. In short, gene networks provide the utility to move beyond individual-gene comparisons and comprehensively identify biologically meaningful relationships between gene products and phenotypes.

At the same time, machine learning and artificial intelligence are getting extensively used in biology [19], especially for feature selection. “Feature selection” is used to select the minimum number of features to predict the biological phenomenon or correctly classify the biological samples. This approach facilitates understanding of the underlying disease mechanisms and other factors that reasonably could have affected the disease status. One particular approach for results validation is to build a classifier using the information derived from the identified set of biomarkers (e.g., gene expression) and test the performance of that classifier on totally different data set to examine its ability to classify two status (e.g. disease vs healthy). successful classifier gives strong evidence supporting the biomarkers validity [20, 21].

Previous studies on the mechanisms of thrombosis in COVID-19 disease have primarily concentrated on specific pathophysiological functions, with relatively fewer studies identifying comprehensive regulatory networks. Therefore, in the present study, WGCNA was used to determine gene networks associated with COVID-19 disease in platelets. PRJNA634489 data set- which contained a total of 15 samples from COVID-19 patients and health controls [7] was used in the present study. Three modules with the highest level of significance in correlation with COVID-19 disease were identified. Of note, the three aforementioned modules were validated as a predictor of COVID-19 phenotype using another set, and the three genes with the highest intramodular connectivity were selected as the hub genes in the respective modules for COVID-19. Gene enrichment analysis was also conducted to determine enrichments in the key modules. The results of this study may provide novel information/insights into the underlying mechanisms of COVID-19 disease and may assist in the identification of potential biomarkers for diagnosis and/or targets for treatment.

Methods

Data preprocessing and differentially expressed genes screening

RNAseq data is publicly available and were downloaded from BioProject accession #PRJNA634489 [7]. Data comprised of ten COVID-19 patients in addition to age- and sex-matched five healthy controls. Of note, while the original paper included a total 58 subjects divided as 41 COVID-19 patients and 17 healthy controls, samples from only 15 subjects were sequenced, and hence used in our analysis. The Kallisto program was employed for pseudoalignment of reads and quantification to obtain the counts and the transcript per million (TPM) [22]. Log2CPM (log transformed counts per million) was used for the differential expression analysis by employing Voom normalization [23] and Limma R package [24] TPM normalized and filtered to exclude low variance transcripts (\(\le\) 0.001) [25] was used for the weighted gene co-expression network analysis. All methods were performed in accordance with the relevant guidelines and regulations.

RNA seq data for validation was downloaded from the publicly NCBI SRA repository under accession: #PRJNA736410, analyzed and normalized by following the same steps as first data set.

Weighted gene coexpression network analysis

The weighted co-expression network was produced using R package “WCGNA” [14] as per the flowchart in Fig. 1. To weight highly correlated genes, the soft thresholding power (\(\beta\)) was set at 12, and the minimal module size was set at 30. To define clusters of genes in the data set, the adjacency matrix was used to calculate the topological overlap matrix (TOM), which shows the degree of overlap in shared neighbors between pairs of genes in the network. The resulting gene network was visualized as a heatmap.

Screening for key modules and hub genes

Correlation between module eigengenes and the COVID-19 status was calculated to identify key modules that have significant correlation. The correlation values were displayed within a heatmap. The modules that correlated with COVID-19 most significantly were considered as the key modules. Gene significance (GS) was defined as the correlation between gene expression and the COVID-19 status. Module membership (MM) was defined as the correlation between gene expression and each module’s eigengene, and intramodular connectivity (K.in), which measures how connected a given gene with respect to the genes of a particular module, was also calculated using WGCNA. Subsequently, the correlation between GS and MM as well as GS and k.in were examined to verify module-COVID-19 status associations. The correlation analyses in this study were performed using the Pearson correlation as described in the “WGCNA” package [14]. All module genes were ranked according to their intramodular connectivity, and only the top three genes were selected as hub genes.

Validation of key modules using machine learning

To validate the results of the above mentioned analysis, multiple classification models (Lasso, Naiive Bayes, Random forest, SVM and XGBboost) were trained using the key modules of the original data set. Those models were employed to classify the samples of a second data set [26] of platelets gene expression in COVID-19 patients and healthy subjects. The second data was totally isolated from the training process.

Functional enrichment analysis of key modules

The genes in each key modules were extracted from the network and enrichment analysis was performed to further explore the functions of the respective modules. Targetmine [27] which is a web-based integrative data analysis platform for target prioritisation and broad-based biological knowledge discovery- was used to perform Gene Ontology (GO) and Reactome pathway enrichment analysis. In this analysis, a benjamini hochberg adjusted P-value of 0.05 was set as the significance threshold to identify the most significant functional pathways/GO terms. Only top results of enriched terms are reported.

Statistical and visualization tools

We used R statistical programming language [28] version 4.1.0, with the following packages: “WGCNA” [14] for coexpression analysis; “Scikit-learn” [29] for machine learning building and evaluation; “circlize” [30] for chord diagram building; “ggplot2” [31] and “seaborn” [32] for visualization ; “Igraph” for network analysis [33] and “ggraph” [34] for network visualization.

Results

Construction of co-expression network

The transcript per million (TPM) gene expression data set were filtered based on variance, and 7119 genes in the 15 samples of ten COVID-19 patients and five healthy controls were used to construct the co-expression network. The results of cluster analysis of the samples are demonstrated in (Fig. 2A). To construct the network, a soft-threshold of 12 was used to obtain the approximate scale-free topology (Additional file 1: Fig. S1). Genes across the 15 samples were hierarchically clustered based on topological overlap (Fig. 2C, D). We identified 16 modules in which genes are coexpressed, random colors were assigned to the modules to distinguish between them. The size (number of genes/module) of each module is presented in (Fig. 2B). To demonstrate how these modules were relatively distinctive, we plotted the network heatmap of 400 randomly selected genes based on topological overlap matrix dissimilarity and their cluster dendrogram (Fig. 3A) indicating relative independence among these clusters.

Correlation between modules and COVID-19 disease status

To examine the relation of COVID-19 status with the emerged modules, we built the eigengene adjacency matrix by calculating the correlation of the eigengenes matrix after inserting COVID-19 status to the matrix. The heatmap (Fig. 3B) showed the modules’ relationship and the correlation between the modules namely black, cyan, yellow, blue, and magenta and COVID-19 status.

Identification of key modules in relationship to COVID-19 disease status

To further determine the closest modules to COVID-19 status, we re-clustered the eigengenes using single linkage method with absolute correlation as a distance function; the single linkage clustering algorithm looks for closest pair of modules to form a cluster, then cluster them with the next nearest module progressively until one cluster is formed [35]. As demonstrated in Fig. 3C, the closest three modules to COVID-19 status are magenta, yellow and black. Three essential measurements can help confirm the importance of the module to a specific trait, 1) Module membership (MM), which increases for a particular gene, when the module eigengene accurately represents this gene, 2) gene significance (GS) is measured by calculating the correlation of gene expression with the specific trait and 3) intramodular connectivity (K.in) for a gene within the module, reflecting the centrality of the gene to the module expression network. Based on WGCNA, if a gene is higher with GS, MM, and K.in, it is more meaningful to the clinical trait of interest [36, 37].

Explicitly, the higher the correlation between gene significance of genes in a module and their module membership, the higher its importance. Similarly, when the gene centrality in the network increases in parallel with gene significance, that also is strong evidence that key modules are essential in that trait. The correlations between gene significance and module membership as well as between gene significance and intramodular connectivity show that yellow, black, and magenta modules have the highest correlation values with a substantial difference to the next nearest module (Blue R = 0.61) (Additional file 1: Fig. S2). For those reasons, we selected yellow, black, and magenta modules for further investigation and will refer to them using the term key modules.

Key modules show high correlation to COVID-19 disease status

The module-trait relationship was determined by correlating module eigengenes with COVID-19 disease status to identify significant correlation. The yellow and the black modules exhibited the highest positive correlation (R=0.91; p-value=\(3 \times 10^{-6}\), and R=0.86; p-value= \(3 \times 10^{-5}\), respectively; Fig. 3D). On the other hand, the magenta module (R=-0.96; p-value=\(1 \times 10^{-8}\)) exhibited the highest negative correlation (Fig. 3D). Therefore, these three modules were identified as key modules for COVID-19 disease and its impact on platelets. The significant correlations between the different GS, MM, and K.in for COVID-19 are illustrated in (Fig. 4A, B). We also showed the GS, MM, and K.in of the green module that showed the low correlation to COVID-19 disease status (Fig. 4A, B).

In summery, although all samples were used to identify the co-expression modules, the top modules were selected based on meeting the following criteria: 1) high correlation between module eigengene and COVID-19 status, 2) close clustering with COVID-19 status using single linkage with absolute correlation distance, 3) high correlation between genes significance and module membership, and 4) high correlation between gene significance and intramodular connectivity. Together those measures confirm the importance of the key modules in COVID-19 status

Key modules’ genes can differentiate COVID-19 from normal subjects

The classification models trained using data from key modules genes showed high performance in terms of high balanced accuracy, sensitivity, specificity, Matthews correlation coefficient, as well as, area under the receiver operating characteristic curve (AUC) (Fig. 5), suggesting that the genes of these three modules are important in the pathology of COVID-19 disease. Furthermore, the accurate classification of the external validation set samples suggests that these results can be generalized and not limited to the analyzed data set.

Gene hub detection and visualization of module networks

Genes in the selected key modules were ranked according to the intramodular connectivity and the top 20 genes of each key modules were used to visualize the network of each specific module (Fig. 6). Subsequently, the top three genes of the yellow, black, and magenta modules were labeled as the hub genes in their modules that are important for COVID-19 disease. Thus, the protein coding genes COPE, CDC37 and CAPNS1 were selected as the hub genes in the yellow module, whereas AURKAIP1, LAMTOR2, and GABARAP protein coding genes were selected as the hub genes in the black module. Regarding the magenta module, MT-ND1, MT-ND5, and MTRNR2L12 were selected as hub genes. All of these hub genes exhibited a high intramodular connectivity, which established their network centrality and potentially vital roles in the COVID-19 disease. We also observed that not all of hub genes show differential gene expression (Table 1). A full list of genes and their modules can be found in the supplementary tables (Additional files 2, 3, 4, 5, 6).

Table 1 Differential expression of hub genes identified in the key modules

Full size table

Enrichment analysis of key modules

Gene ontology (GO) pathway enrichment analyses were performed on the yellow, black, and magenta modules using Targetmine platform, and the top relevant terms of each category are presented in (Fig. 7A). The pathway enrichment results demonstrated that the genes in both yellow and black modules were primarily enriched in pathways associated with metabolic process, protein translation, energy substance metabolism, mitochondrial activity, and oxidative phosphorylation. Genes in the magenta module were enriched in several pathways that are primarily associated with regulation of megakaryocyte differentiation and apoptosis, including the regulation of the execution phase of apoptosis. Reactome showed enriched pathways of metabolism, platelet degranulation, and response to elevated platelet cytosolic Ca²⁺ in the yellow module. The black module shows enrichment of respiratory electron transport, ATP synthesis by chemiosmotic coupling, heat production by uncoupling proteins, citric acid (TCA) cycle, and respiratory electron transport just to name a few (Fig. 7B) (More detailed results are shown in Additional file 1: Fig. S3 and cross check of hubgenes with Disgenet database is shown in Additional file 6: Table: S5 [38]).

Discussion

The underlying pathophysiological mechanisms of thrombosis in COVID-19 are extremely complicated [39], and hence clearly require more examination. Inspecting gene co-expression patterns is proven to be an effective method to analyze and uncover complicated genetic networks. To address the aforementioned issues, in the present study, gene co-expression analysis was performed on platelet RNAseq data set containing gene expression data from ten COVID-19 patients and five healthy controls. There were three modules that were identified as the key modules in COVID-19, with the highest level of significant association. The top three genes of each key module with the highest intramodular connectivity were identified as hub genes for COVID-19 in platelets. The results of the enrichment analysis suggest that the key modules and the pathological processes underlying the disease are associated with energy metabolism, mitochondrial processes, and apoptosis. Furthermore, we also saw enrichment of platelet secretion and activation pathways. These results provide- at least in part- an insight into the comprehensive platelet regulatory network in COVID-19, which should improve the current understanding of the mechanisms underlying immunothrombosis in COVID-19 patients. Ultimately, these findings might help in finding appropriate therapeutic targets.

The present study used the data in BioProject accession #PRJNA634489 [7] to perform the co-expression analysis using WGCNA. The data used in this study, which was generated by Manne et al. [7], revealed that COVID-19 disease leads to changes in platelet transcriptional profiles in comparison to control. Manne et al. showed that platelet differential gene expression in COVID-19 is associated with enrichment of protein ubiquitination, antigen presentation, and mitochondrial dysfunction. The major differences in the genes or modules obtained in the present study, compared with the results from other studies including the one by Manne et. al. [7] is that the present study used a more comprehensive method by employing WGCNA. Using this method, we were able to identify/pull-out co-expression modules of genes, namely the yellow and black modules, which represent important regulatory modules of platelet function in COVID-19. In addition, we were able to identify the magenta module, which represents genes that are negatively correlated with the COVID-19 disease state and show enrichment in megakaryocyte differentiation and apoptotic pathways. This systematic and in-depth analysis should complement results obtained from conventional DEGs analysis, and therefore allow for a better understanding of the pathophysiological mechanisms of COVID-19 disease.

Notably, the co-expression analysis revealed a total cluster of 16 modules with the yellow and black modules exhibiting the strongest positive correlation and the magenta exhibiting the strongest negative correlation to COVID-19 disease. These three modules were selected as key modules and their genes deemed important for the COVID-19 disease state. This result was validated when the genes of these three key modules/clusters were used to accurately classify the subjects from another recently published platelet data set (Barrett et. al 2021) to either COVID-19 or healthy using machine learning classifiers. The high accuracy of this classification underscores the importance of these platelet gene clusters in the pathogenesis of COVID-19 disease.

Enrichment analysis indicated that the genes in the yellow and black modules were primarily associated with platelet metabolism, energy, and oxidative phosphorylation. Furthermore, the analysis of the yellow module showed enrichment of a host of platelet functional responses/activities, such as platelet degranulation/secretion and increased platelet response to Ca²⁺. Indeed, other studies showed the COVID-19 disease to be associated with platelet activation and increased platelet alpha granule secretion, which are critical in the development of thrombosis seen in those patients [7, 8]. It is noteworthy that the platelet alpha granule secretion response is not only important for thrombus formation, but also in inflammation by releasing receptors that facilitate adhesion of platelets with other vascular cells as well as releasing a wide range of inflammatory chemokines [40].

The yellow and black modules show strong enrichment in platelet metabolic processes, which is in agreement with the increase in platelet activation. To this end, previous data have shown that platelet transition from inactive to active state requires alteration in ATP availability [41], and furthermore, substrate metabolism (e.g. glucose) was shown to be essential for platelet activation [42], and thrombosis [43]. This seems to suggest that altered platelet metabolism may play a critical role in the pathophysiology of thrombosis in COVID-19 patients. It is important to note that reports have suggested that a state of hypermetabolic demand is one of COVID-19 disease features, especially when sepsis develops [44]. Like other viruses that can impact cellular metabolism in human cells and utilize them to their advantage, SARS-CoV-2 virus appears to have the ability to localize proteins to mitochondria and hijack the host’s mitochondrial function [45]. This mechanism might explain the enrichment of platelet mitochondrial processes we observed in the yellow and black modules. This finding is in fact supported by a recent study that reported that SARS-CoV-2 impacts mitochondria in platelets, which affects their involvement in the pathophysiology of thrombosis in COVID-19 patients [46]. The enrichment of protein translation in the yellow and black modules suggests an alteration in protein synthesis and possible hijacking of the translation machinery of platelet by the virus. In line with this observation, one study suggested that the cells infected with SARS-CoV-2 might exhibit a faster protein synthesis rate, which implies a higher translation rate [47]. This notion requires further investigation to determine the exact mechanism underlying enhancement of translation in platelets of COVID-19 patient.

One particular characteristic of platelet apoptotic processes is phosphatidylserine (PS) exposure, which is essential for the generation of thrombin [48]. PS exposure is found to be downregulated in activated platelets from COVID-19 patients due to mitochondrial dysfunction [46]. This observation is supported by the negative regulation of apoptotic processes in platelet enrichment in the negatively correlated magenta module. On the contrary, another report showed that COVID-19 increases PS externalization, which is linked to thrombosis [49]. The impact of platelets mitochondrial damage on hemostasis seems to depend on its severity. Thus, it leads to bleeding by progressing toward apoptosis if it is severe; or toward platelet activation pathways and development of thrombosis risk in case of mild damage [50]. Based on this reasoning, COVID-19 disease-caused mitochondrial damage in platelets is probably mild; and hence the thrombotic phenotype still prevails in these patients. Based on these considerations, more investigation is needed to confirm these observations and to understand the underlying mechanisms.

Additionally, we identified hub genes in each of the key modules. For example, in the yellow module the COPE, CDC37, and CAPNS1, which are protein coding genes involved in vesicle-mediated transport, positive regulation of cellular processes, and regulation of interferons. Furthermore, some of these protein coding genes have also been investigated in platelets and shown to regulate important aspects of their function [51,52,53,54], Interestingly, although our co-expression analysis showed that CAPNS1 is an important hub gene in the yellow module, this gene was not differentially expressed in our differential gene expression analysis. Furthermore, CAPNS1 was found to play a significant role in regulating platelet activity and thrombosis under hypoxia [53], a condition commonly seen in severe COVID-19 patients [55]. This observation might indicate that some of the important genes in establishing thrombotic phenotype in COVID-19 may not necessarily be differentially expressed.

The hub genes of the black module, AURKAIP1, LAMTOR2, and GABARAP are linked to regulation of mitochondrial activity, regulation of signaling processes, and protein targeting. Data on the role of these genes in platelets is limited, thus, further investigation is warranted. It is noteworthy that LAMTOR2 is a known regulator of the MAPK/ERK and mTOR signaling pathways [56, 57], both of which were shown to be important in regulating platelet function [58, 59]. Moreover, the p14/LAMTOR2 deficiency- which is associated with one of the primary immunodeficiency diseases that also include “Hermansky–Pudlak syndrome type 2”- has been linked to platelet defects [60]. However, more needs to be done to examine the exact role of LAMTOR2 in platelets of COVID-19 patients.

In the magenta module, MT-ND1 [61], MT-ND5 [62], and MTRNR2L12 protein coding genes are related to NADH dehydrogenase activity and apoptotic processes. According to our analysis, all hub genes in the magenta module are differentially expressed and downregulated in COVID-19 patients in comparison to healthy controls. Down regulation of MT-ND1 and MT-ND5 protein coding genes might, at least in part, explain the mitochondrial dysfunction seen in platelets of COVID-19 patients. With respect to MTRNR2L12, it was observed that it is one of the differentially expressed genes in bronchoalveolar lavage fluid samples from patients with severe COVID-19 in comparison to control [63]. MTRNR2L12 is a paralog of the protein coding gene MTRNR2L8, and both are expressed in platelets [64]. It is of note that MTRNR2L12 was shown to be among the top 10 RNA with differential splice junctions in platelets of patients of multiple sclerosis [65].

In addition to the identified hub genes, a number of other canonical platelet genes in the yellow and black modules were also associated with platelet function. For example, SLEB and ITGA2B protein coding genes were present in the yellow module with high intramodular connectivity (ranked in the top 50) and both proteins are critical for platelet function [66]. Moreover, another canonical platelet gene that was also identified in the yellow module, namely ITGB3 was ranked 132 with regard to its intramodular connectivity, which is considered high in the yellow module of 681 genes. Furthermore, we also noticed that the protein coding gene IFITM3 shows high module membership (black module). The protein encoded by this gene is an interferon-induced membrane protein that was shown to be important in immunity against influenza A H1N1 virus, West Nile virus, and dengue virus [67,68,69]. Most recently, IFITM3 was also found to be upregulated protein in COVID-19 disease [7, 70], which importantly was also demonstrated/confirmed by Western blot [7].

The present study has certain limitations that should be noted. Firstly, the analysis focused on only one data set, due to limited access to platelet gene expression data that were collected from COVID-19 patients. Therefore, additional data sets should be analyzed, if available, to validate our findings and/or obtain more representative results. Also, the number of samples was 15, which may be associated with some noise, albeit it is the minimum number of samples recommended for co-expression analysis by WGCNA. Finally, any limitations in the original study, from which the data was obtained will also be reflected in the results of this study.

In conclusion, our co-expression analysis of a platelet RNAseq data set from COVID-19 patients and healthy controls revealed 16 modules, amongst which the yellow, black, and magenta were identified as the most critical in COVID-19 disease and validated using machine learning. Additionally, nine hub genes were determined to potentially serve key roles in the pathophysiological mechanisms of COVID-19 in the context of platelet biology. The positively associated yellow and black modules were identified to be involved in platelet degranulation, energy metabolism, and mitochondria. The negatively associated magenta module was associated with interactive pathways of apoptosis. These data should help expand our understanding of the underlying mechanisms of thrombosis in COVID-19 disease and help promote and guide future experimental studies to investigate the roles of the protein coding genes in the pathophysiology of this disease. Additionally, these genes may serve as novel therapeutic targets for treating patients.

Availability of data and materials

The data sets analysed during the current study are publicly available in the NCBI BioProject repositories, (PRJNA634489, PRJNA736410).

Abbreviations

WGCNA:: Whole genome correlation network analysis
SARS-CoV-2:: Severe acute respiratory syndrome corona virus 2
COVID-19:: Corona virus disease 2019
TPM:: Transcript per million
Log2CPM:: Log transformed counts per million
TOM:: Topological overlap matrix
K.in:: Intramodular connectivity
GO:: Gene ontology
MM:: Module membership
GS:: Gene significance
ATP:: Adenosine triphosphate
TCA:: Citric acid cycle
PS:: Phosphatidylserine

References

CDC. COVID Data Tracker Weekly Review. Visited 2022-02-18). 2022. https://www.cdc.gov/coronavirus/2019-ncov/covid-data/covidview/index.html Accessed 18 Feb 2022.
Tang N, Li D, Wang X, Sun Z. Abnormal coagulation parameters are associated with poor prognosis in patients with novel coronavirus pneumonia. J Thromb Haemostasis. 2020;18(4):844–7. https://doi.org/10.1111/jth.14768.
Article CAS Google Scholar
...Huang C, Wang Y, Li X, Ren L, Zhao J, Hu Y, Zhang L, Fan G, Xu J, Gu X, Cheng Z, Yu T, Xia J, Wei Y, Wu W, Xie X, Yin W, Li H, Liu M, Xiao Y, Gao H, Guo L, Xie J, Wang G, Jiang R, Gao Z, Jin Q, Wang J, Cao B. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. The Lancet. 2020;395(10223):497–506. https://doi.org/10.1016/s0140-6736(20)30183-5.
Article CAS Google Scholar
...Guan W-j, Ni Z-y, Hu Y, Liang W-h, Ou C-q, He J-x, Liu L, Shan H, Lei C-l, Hui DSC, Du B, Li L-j, Zeng G, Yuen K-Y, Chen R-c, Tang C-l, Wang T, Chen P-y, Xiang J, Li S-y, Wang J-l, Liang Z-j, Peng Y-x, Wei L, Liu Y, Hu Y-h, Peng P, Wang J-m, Liu J-y, Chen Z, Li G, Zheng Z-j, Qiu S-q, Luo J, Ye C-j, Zhu S-y, Zhong N-s. Clinical characteristics of coronavirus disease 2019 in China. N Engl J Med. 2020;382(18):1708–20. https://doi.org/10.1056/nejmoa2002032.
Article CAS PubMed Google Scholar
Zhou F, Yu T, Du R, Fan G, Liu Y, Liu Z, Xiang J, Wang Y, Song B, Gu X, Guan L, Wei Y, Li H, Wu X, Xu J, Tu S, Zhang Y, Chen H, Cao B. Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: a retrospective cohort study. The Lancet. 2020;395(10229):1054–62. https://doi.org/10.1016/s0140-6736(20)30566-3.
Article CAS Google Scholar
Bowles L, Platton S, Yartey N, Dave M, Lee K, Hart DP, MacDonald V, Green L, Sivapalaratnam S, Pasi KJ, MacCallum P. Lupus anticoagulant and abnormal coagulation tests in patients with Covid-19. N Engl J Med. 2020;383(3):288–90. https://doi.org/10.1056/nejmc2013656.
Article CAS PubMed Google Scholar
Manne BK, Denorme F, Middleton EA, Portier I, Rowley JW, Stubben C, Petrey AC, Tolley ND, Guo L, Cody M, Weyrich AS, Yost CC, Rondina MT, Campbell RA. Platelet gene expression and function in patients with COVID-19. Blood. 2020;136(11):1317–29. https://doi.org/10.1182/blood.2020007214.
Article CAS PubMed Google Scholar
Zaid Y, Puhm F, Allaeys I, Naya A, Oudghiri M, Khalki L, Limami Y, Zaid N, Sadki K, Haj RBE, Mahir W, Belayachi L, Belefquih B, Benouda A, Cheikh A, Langlois M-A, Cherrah Y, Flamand L, Guessous F, Boilard E. Platelets can associate with SARS-CoV-2 RNA and are hyperactivated in COVID-19. Circ Res. 2020;127(11):1404–18. https://doi.org/10.1161/circresaha.120.317703.
Article CAS PubMed Central Google Scholar
Smyth SS, McEver RP, Weyrich AS, Morrell CN, Hoffman MR, Arepally GM, French PA, Dauerman HL, Becker RC, Arepally GM, Becker RC, Bhatt DL, Cho J, Dauerman HL, Gretler DD, Hoffman MR, Horrow J, Kleiman NS, Kocharian R, Lincoff AM, Maya J, McEver RP, Morrell CN, Prats J, Rusconi CP, Smyth SS, Strony J, Sun H, Veltri EP, Weyrich AS, Wiviott SD, Wood JP. Platelet functions beyond hemostasis. J Thromb Haemost. 2009;7(11):1759–66.
Article CAS PubMed Google Scholar
Bashour TT, Myler RK, Andreae GE, Stertzer SH, Clark DA, Ryan CJ. Current concepts in unstable myocardial ischemia. Am Heart J. 1988;115(4):850–61.
Article CAS PubMed Google Scholar
Goeijenbier M, van Wissen M, van de Weg C, Jong E, Gerdes VEA, Meijers JCM, Brandjes DPM, van Gorp ECM. Review: viral infections and mechanisms of thrombosis and bleeding. J Med Virol. 2012;84(10):1680–96. https://doi.org/10.1002/jmv.23354.
Article CAS PubMed PubMed Central Google Scholar
Kwong JC, Schwartz KL, Campitelli MA, Chung H, Crowcroft NS, Karnauchow T, Katz K, Ko DT, McGeer AJ, McNally D, Richardson DC, Rosella LC, Simor A, Smieja M, Zahariadis G, Gubbay JB. Acute myocardial infarction after laboratory-confirmed influenza infection. N Engl J Med. 2018;378(4):345–53. https://doi.org/10.1056/nejmoa1702090.
Article PubMed Google Scholar
Cai Y, González JV, Liu Z, Huang T. Computational systems biology methods in molecular biology, chemistry biology, molecular biomedicine, and biopharmacy. BioMed Res Int. 2014;2014:1–2. https://doi.org/10.1155/2014/746814.
Article CAS Google Scholar
Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinfor. 2008;9(1):66. https://doi.org/10.1186/1471-2105-9-559.
Article CAS Google Scholar
Tian Z, He W, Tang J, Liao X, Yang Q, Wu Y, Wu G. Identification of important modules and biomarkers in breast cancer based on WGCNA. OncoTargets Therapy. 2020;13:6805–17. https://doi.org/10.2147/ott.s258439.
Article CAS PubMed PubMed Central Google Scholar
Bai K-H, He S-Y, Shu L-L, Wang W-D, Lin S-Y, Zhang Q-Y, Li L, Cheng L, Dai Y-J. Identification of cancer stem cell characteristics in liver hepatocellular carcinoma by WGCNA analysis of transcriptome stemness index. Cancer Med. 2020;9(12):4290–8. https://doi.org/10.1002/cam4.3047.
Article CAS PubMed PubMed Central Google Scholar
Liang J-W, Fang Z-Y, Huang Y, Liuyang Z-Y, Zhang X-L, Wang J-L, Wei H, Wang J-Z, Wang X-C, Zeng J, et al. Application of weighted gene co-expression network analysis to explore the key genes in Alzheimer’s disease. J Alzheimer’s Dis. 2018;65(4):1353–64. https://doi.org/10.3233/JAD-180400.
Article CAS Google Scholar
Zeng D, He S, Ma C, Wen Y, Song W, Xu Q, Zhao N, Wang Q, Yu Y, Shen Y, Huang J, Li H. Network-based approach to identify molecular signatures in the brains of depressed suicides. Psychiatry Res. 2020;294:113513. https://doi.org/10.1016/j.psychres.2020.113513.
Article CAS PubMed Google Scholar
Greener JG, Kandathil SM, Moffat L, Jones DT. A guide to machine learning for biologists. Nat Rev Mol Cell Biol. 2021;23(1):40–55. https://doi.org/10.1038/s41580-021-00407-0.
Article CAS PubMed Google Scholar
Saeys Y, Inza I, Larranaga P. A review of feature selection techniques in bioinformatics. Bioinformatics. 2007;23(19):2507–17. https://doi.org/10.1093/bioinformatics/btm344.
Article CAS PubMed Google Scholar
Xiao J, Wang R, Cai X, Ye Z. Coupling of co-expression network analysis and machine learning validation unearthed potential key genes involved in rheumatoid arthritis. Front Genet. 2021;66:12. https://doi.org/10.3389/fgene.2021.604714.
Article CAS Google Scholar
Bray NL, Pimentel H, Melsted P, Pachter L. Near-optimal probabilistic RNA-seq quantification. Nat Biotechnol. 2016;34(5):525–7. https://doi.org/10.1038/nbt.3519.
Article CAS PubMed Google Scholar
Law CW, Chen Y, Shi W, Smyth GK. voom: precision weights unlock linear model analysis tools for RNA-seq read counts. Genome Biol. 2014;15(2):29. https://doi.org/10.1186/gb-2014-15-2-r29.
Article CAS Google Scholar
Smyth GK. Linear models and empirical Bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol. 2004;3(1):1–25. https://doi.org/10.2202/1544-6115.1027.
Article Google Scholar
Kadarmideen HN, Watson-haigh NS. Building gene co-expression networks using transcriptomics data for systems biology investigations: comparison of methods using microarray data. Bioinformation. 2012;8(18):855–61. https://doi.org/10.6026/97320630008855.
Article PubMed PubMed Central Google Scholar
...Barrett TJ, Bilaloglu S, Cornwell M, Burgess HM, Virginio VW, Drenkova K, Ibrahim H, Yuriditsky E, Aphinyanaphongs Y, Lifshitz M, Liang FX, Alejo J, Smith G, Pittaluga S, Rapkiewicz AV, Wang J, Iancu-Rubin C, Mohr I, Ruggles K, Stapleford KA, Hochman J, Berger JS. Platelets contribute to disease severity in COVID-19. J Thromb Haemostasis. 2021;19(12):3139–53. https://doi.org/10.1111/jth.15534.
Article CAS Google Scholar
Chen Y-A, Tripathi LP, Fujiwara T, Kameyama T, Itoh MN, Mizuguchi K. The TargetMine data warehouse: enhancement and updates. Front Genet. 2019. https://doi.org/10.3389/fgene.2019.00934.
R Core Team. R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria; 2021. https://www.R-project.org/.
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E. Scikit-learn: machine learning in Python. J Mach Learn Res. 2011;12:2825–30.
Google Scholar
Gu Z, Gu L, Eils R, Schlesner M, Brors B. circlize implements and enhances circular visualization in R. Bioinformatics. 2014;30(19):2811–2. https://doi.org/10.1093/bioinformatics/btu393.
Article CAS PubMed Google Scholar
Wickham H. Ggplot2: elegant graphics for data analysis. Springer; 2016. https://ggplot2.tidyverse.org.
Waskom ML. seaborn: statistical data visualization. J Open Source Softw. 2021;6(60):3021. https://doi.org/10.21105/joss.03021.
Article Google Scholar
Csardi G, Nepusz T. The igraph software package for complex network research. InterJournal Complex Systems. 2006;1695.
Pedersen TL. Ggraph: an implementation of grammar of graphics for graphs and networks. 2021. R package version 2.0.5. https://CRAN.R-project.org/package=ggraph.
Bu J, Liu W, Pan Z, Ling K. Comparative study of hydrochemical classification based on different hierarchical cluster analysis methods. Int J Environ Res Public Health. 2020;17(24):9515. https://doi.org/10.3390/ijerph17249515.
Article CAS PubMed Central Google Scholar
Langfelder P, Mischel PS, Horvath S. When is hub gene selection better than standard meta-analysis? PLoS ONE. 2013;8(4):61505. https://doi.org/10.1371/journal.pone.0061505.
Article CAS Google Scholar
Lou Q, Chen L, Mei H, Xu K, Wei H, Feng F, Li T, Pang X, Shi C, Luo L, Zhong Y. Root transcriptomic analysis revealing the importance of energy metabolism to the development of deep roots in rice (Oryza sativa L.). Front Plant Sci. 2017. https://doi.org/10.3389/fpls.2017.01314.
Piñero J, Ramírez-Anguita JM, Saüch-Pitarch J, Ronzano F, Centeno E, Sanz F, Furlong LI. The DisGeNET knowledge platform for disease genomics: 2019 update. Nucleic Acids Res. 2019. https://doi.org/10.1093/nar/gkz1021.
Article PubMed Central Google Scholar
Loo J, Spittle DA, Newnham M. COVID-19, immunothrombosis and venous thromboembolism: biological mechanisms. Thorax. 2021;76(4):412–20. https://doi.org/10.1136/thoraxjnl-2020-216243.
Article PubMed Google Scholar
May AE, Seizer P, Gawaz M. Platelets: inflammatory firebugs of vascular walls. Arteriosc Thromb Vasc Biol. 2008;28(3):66. https://doi.org/10.1161/atvbaha.107.158915.
Article Google Scholar
Aibibula M, Naseem KM, Sturmey RG. Glucose metabolism and metabolic flexibility in blood platelets. J Thromb Haemostasis. 2018;16(11):2300–14. https://doi.org/10.1111/jth.14274.
Article CAS Google Scholar
Fidler TP, Campbell RA, Funari T, Dunne N, Angeles EB, Middleton EA, Chaudhuri D, Weyrich AS, Abel ED. Deletion of GLUT1 and GLUT3 reveals multiple roles for glucose metabolism in platelet and megakaryocyte function. Cell Rep. 2017;20(4):881–94. https://doi.org/10.1016/j.celrep.2017.06.083.
Article CAS PubMed PubMed Central Google Scholar
Fidler TP, Marti A, Gerth K, Middleton EA, Campbell RA, Rondina MT, Weyrich AS, Abel ED. Glucose metabolism is required for platelet hyperactivation in a murine model of type 1 diabetes. Diabetes. 2019;68(5):932–8. https://doi.org/10.2337/db18-0981.
Article CAS PubMed PubMed Central Google Scholar
Shenoy S. Coronavirus (covid-19) sepsis: revisiting mitochondrial dysfunction in pathogenesis, aging, inflammation, and mortality. Inflamm Res. 2020;69(11):1077–85. https://doi.org/10.1007/s00011-020-01389-z.
Article CAS PubMed PubMed Central Google Scholar
Ajaz S, McPhail MJ, Singh KK, Mujib S, Trovato FM, Napoli S, Agarwal K. Mitochondrial metabolic manipulation by SARS-CoV-2 in peripheral blood mononuclear cells of patients with COVID-19. Am J Physiol Cell Physiol. 2021;320(1):57–65. https://doi.org/10.1152/ajpcell.00426.2020.
Article CAS Google Scholar
Denorme F, Manne BK, Portier I, Petrey AC, Middleton EA, Kile BT, Rondina MT, Campbell RA. COVID-19 patients exhibit reduced procoagulant platelet responses. J Thromb Haemostasis. 2020;18(11):3067–73. https://doi.org/10.1111/jth.15107.
Article CAS Google Scholar
Dasari CM, Bhukya R. Comparative analysis of protein synthesis rate in COVID-19 with other human coronaviruses. Infect Genet Evol. 2020;85:104432. https://doi.org/10.1016/j.meegid.2020.104432.
Article CAS PubMed PubMed Central Google Scholar
Schoenwaelder SM, Yuan Y, Josefsson EC, White MJ, Yao Y, Mason KD, O’Reilly LA, Henley KJ, Ono A, Hsiao S, Willcox A, Roberts AW, Huang DCS, Salem HH, Kile BT, Jackson SP. Two distinct pathways regulate platelet phosphatidylserine exposure and procoagulant function. Blood. 2009;114(3):663–6. https://doi.org/10.1182/blood-2009-01-200345.
Article CAS PubMed Google Scholar
Althaus K, Marini I, Zlamal J, Pelzl L, Singh A, Häberle H, Mehrländer M, Hammer S, Schulze H, Bitzer M, Malek N, Rath D, Bösmüller H, Nieswandt B, Gawaz M, Bakchoul T, Rosenberger P. Antibody-induced procoagulant platelets in severe COVID-19 infection. Blood. 2021;137(8):1061–71. https://doi.org/10.1182/blood.2020008762.
Article CAS PubMed PubMed Central Google Scholar
Valentino ML, Barboni P, Ghelli A, Bucchi L, Rengo C, Achilli A, Torroni A, Lugaresi A, Lodi R, Barbiroli B, Dotti M, Federico A, Baruzzi A, Carelli V. The ND1 gene of complex i is a mutational hot spot for Leber’s hereditary optic neuropathy. Ann Neurol. 2004;56(5):631–41. https://doi.org/10.1002/ana.20236.
Article CAS PubMed Google Scholar
Burkhart JM, Vaudel M, Gambaryan S, Radau S, Walter U, Martens L, Geiger J, Sickmann A, Zahedi RP. The first comprehensive and quantitative analysis of human platelet protein composition allows the comparative analysis of structural and functional pathways. Blood. 2012;120(15):73–82. https://doi.org/10.1182/blood-2012-04-416594.
Article CAS Google Scholar
Chen G, Cao P, Goeddel DV. TNF-induced recruitment and activation of the IKK complex require cdc37 and hsp90. Mol Cell. 2002;9(2):401–10. https://doi.org/10.1016/s1097-2765(02)00450-1.
Article CAS PubMed Google Scholar
Pawlinski R. Inhibit the calpain to climb the mountain. Blood. 2014;123(8):1123–4. https://doi.org/10.1182/blood-2013-12-543397.
Article CAS PubMed PubMed Central Google Scholar
Azam M, Andrabi SS, Sahr KE, Kamath L, Kuliopulos A, Chishti AH. Disruption of the mouse μ-calpain gene reveals an essential role in platelet function. Mol Cell Biol. 2001;21(6):2213–20. https://doi.org/10.1128/mcb.21.6.2213-2220.2001.
Article CAS PubMed PubMed Central Google Scholar
Nitsure M, Sarangi B, Shankar GH, Reddy VS, Walimbe A, Sharma V. Mechanisms of hypoxia in COVID-19 patients: a pathophysiologic reflection. Indian J Crit Care Med. 2020;24(10):967–70. https://doi.org/10.5005/jp-journals-10071-23547.
Article CAS PubMed PubMed Central Google Scholar
Thauerer B, Voegele P, Hermann-Kleiter N, Thuille N, de Araujo MEG, Offterdinger M, Baier G, Huber LA, Baier-Bitterlich G. LAMTOR2-mediated modulation of NGF/MAPK activation kinetics during differentiation of PC12 cells. PLoS ONE. 2014;9(4):95863. https://doi.org/10.1371/journal.pone.0095863.
Article CAS Google Scholar
Sparber F, Scheffler JM, Amberg N, Tripp CH, Heib V, Hermann M, Zahner SP, Clausen BE, Reizis B, Huber LA, Stoitzner P, Romani N. The late endosomal adaptor molecule p14 (LAMTOR2) represents a novel regulator of langerhans cell homeostasis. Blood. 2014;123(2):217–27. https://doi.org/10.1182/blood-2013-08-518555.
Article CAS PubMed PubMed Central Google Scholar
Flevaris P, Li Z, Zhang G, Zheng Y, Liu J, Du X. Two distinct roles of mitogen-activated protein kinases in platelets and a novel rac1-MAPK-dependent integrin outside-in retractile signaling pathway. Blood. 2009;113(4):893–901. https://doi.org/10.1182/blood-2008-05-155978.
Article CAS PubMed PubMed Central Google Scholar
Aslan JE, McCarty OJT. Regulation of the mTOR-rac1 axis in platelet function. Small GTPases. 2012;3(1):67–70. https://doi.org/10.4161/sgtp.19137.
Article PubMed PubMed Central Google Scholar
Zamani R, Shahkarami S, Rezaei N. Primary immunodeficiency associated with hypopigmentation: a differential diagnosis approach. Allergologia et Immunopathologia. 2021;49(2):178–90. https://doi.org/10.15586/aei.v49i2.61.
Article PubMed Google Scholar
Lim SC, Hroudová J, Bergen NJV, Sanchez MIGL, Trounce IA, McKenzie M. Loss of mitochondrial DNA-encoded protein ND1 results in disruption of complex I biogenesis during early stages of assembly. FASEB J. 2016;30(6):2236–48. https://doi.org/10.1096/fj.201500137r.
Article CAS PubMed Google Scholar
van der Slikke EC, Star BS, van Meurs M, Henning RH, Moser J, Bouma HR. Sepsis is associated with mitochondrial DNA damage and a reduced mitochondrial mass in the kidney of patients with sepsis-AKI. Crit Care. 2021. https://doi.org/10.1186/s13054-020-03424-1.
Article PubMed PubMed Central Google Scholar
Chow RD, Majety M, Chen S. The aging transcriptome and cellular landscape of the human lung in relation to SARS-CoV-2. Nat Commun. 2021;12(1):66. https://doi.org/10.1038/s41467-020-20323-9.
Article CAS Google Scholar
Eicher JD, Wakabayashi Y, Vitseva O, Esa N, Yang Y, Zhu J, Freedman JE, McManus DD, Johnson AD. Characterization of the platelet transcriptome by RNA sequencing in patients with acute myocardial infarction. Platelets. 2015;27(3):230–9. https://doi.org/10.3109/09537104.2015.1083543.
Article CAS PubMed PubMed Central Google Scholar
Sol N, Leurs CE, Veld SGI, Strijbis EM, Vancura A, Schweiger MW, Teunissen CE, Mateen FJ, Tannous BA, Best MG, Wurdinger T, Killestein J. Blood platelet RNA enables the detection of multiple sclerosis. Multiple Sclerosis J Exp Transl Clin. 2020;6(3):205521732094678. https://doi.org/10.1177/2055217320946784.
Article Google Scholar
Rondina MT, Voora D, Simon LM, Schwertz H, Harper JF, Lee O, Bhatlekar SC, Li Q, Eustes AS, Montenont E, Campbell RA, Tolley ND, Kosaka Y, Weyrich AS, Bray PF, Rowley JW. Longitudinal RNA-seq analysis of the repeatability of gene expression and splicing in human platelets identifies a platelet SELP splice QTL. Circ Res. 2020;126(4):501–16. https://doi.org/10.1161/circresaha.119.315215.
Article CAS PubMed Google Scholar
Everitt AR, Clare S, Pertel T, John SP, Wash RS, Smith SE, Chin CR, Feeley EM, Sims JS, Adams DJ, Wise HM, Kane L, Goulding D, Digard P, Anttila V, Baillie JK, Walsh TS, Hume DA, Palotie A, Xue Y, Colonna V, Tyler-Smith C, Dunning J, Gordon SB, Smyth RL, Openshaw PJ, Dougan G, Brass AL, PK. IFITM3 restricts the morbidity and mortality associated with influenza. Nature. 2012;484(7395):519–23. https://doi.org/10.1038/nature10921.
Zhu X, He Z, Yuan J, Wen W, Huang X, Hu Y, Lin C, Pan J, Li R, Deng H, Liao S, Zhou R, Wu J, Li J, Li M. IFITM3-containing exosome as a novel mediator for anti-viral response in dengue virus infection. Cell Microbiol. 2014;17(1):105–18. https://doi.org/10.1111/cmi.12339.
Article CAS PubMed PubMed Central Google Scholar
Gorman MJ, Poddar S, Farzan M, Diamond MS. The interferon-stimulated gene ifitm3 restricts west nile virus infection and pathogenesis. J Virol. 2016;90(18):8212–25. https://doi.org/10.1128/jvi.00581-16.
Article CAS PubMed PubMed Central Google Scholar
Hachim MY, Heialy SA, Hachim IY, Halwani R, Senok AC, Maghazachi AA, Hamid Q. Interferon-induced transmembrane protein (IFITM3) is upregulated explicitly in SARS-CoV-2 infected lung epithelial cells. Front Immunol. 2020;66:11. https://doi.org/10.3389/fimmu.2020.01372.
Article CAS Google Scholar
Shimada BK, Boyman L, Zhu J, Yang Y, Huang W, Kane MA, Yadava N, Polster BM, Zou L, Lederer WJ, Chao W. Molecular remodeling of cardiac mitochondria in mice with sepsis-induced cardiomyopathy. 2021. https://doi.org/10.21203/rs.3.rs-149184/v1.
Maity S, Saha A. Therapeutic potential of exploiting autophagy cascade against coronavirus infection. Front Microbiol. 2021;66:12. https://doi.org/10.3389/fmicb.2021.675419.
Article Google Scholar

Download references

Acknowledgements

The authors acknowledge Matthew T. Rondina and Robert A. Campbell from the department of internal medicine, University of Utah, Salt Lake City, Utah, USA, for their contribution and suggestions that improved the work substantially. The statements made here are solely the responsibility of the authors.

Funding

No external funding was utilized for this work.

Author information

Authors and Affiliations

Department of Pharmacy Practice, Irma Lerma Rangel College of Pharmacy, Texas A&M University, Kingsville, TX, USA
Ahmed B. Alarabi & Fatima Z. Alshbool
Laboratory of Bioinformatics, Artificial Intelligence Center for Health and Biomedical Research (ArCHER), National Institutes of Biomedical Innovation, Health and Nutrition, 7-6-8 Saito-Asagi, Ibaraki, Osaka, 567-0085, Japan
Attayeb Mohsen & Kenji Mizuguchi
Institute for Protein Research, Osaka University, 3-2 Yamadaoka, Suita, Osaka, 565-0081, Japan
Kenji Mizuguchi
Department of Pharmaceutical Sciences, Irma Lerma Rangel College of Pharmacy, Texas A&M University, Kingsville, TX, USA
Fadi T. Khasawneh

Authors

Ahmed B. Alarabi
View author publications
You can also search for this author in PubMed Google Scholar
Attayeb Mohsen
View author publications
You can also search for this author in PubMed Google Scholar
Kenji Mizuguchi
View author publications
You can also search for this author in PubMed Google Scholar
Fatima Z. Alshbool
View author publications
You can also search for this author in PubMed Google Scholar
Fadi T. Khasawneh
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

AA, AM: conceptualization; AA, AM: analyzed results and made the figures; AA: drafted the manuscript; AA, AM, KM, FA, FK: edited the manuscript and approved the submission. AA and AM contributed equally to this manuscript. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Ahmed B. Alarabi or Fadi T. Khasawneh.

Ethics declarations

Ethics approval and consent to participate

(Not applicable) The data used are publicly available in NCBI BioProject repository, therefore no administrative permission is required to access the raw data. as well as, ethical approval and consent to participate are not applicable. The data is stored in NCBI Bioproject SRA (Sequence Read Archive), therefore, it is controlled by NIH GDS (National Institute of Health Genomic Data Sharing) policy which mandates that data should be anonymized prior to upload to the SRA repository. The original data were produced by experiments carried out in accordance with the Declaration of Helsinki according to the original published papers [7, 26].

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1. Figs. S1–S3

.

Additional file 2. Table S1

: WGCNA eigengenes.

Additional file 3. Table S2

: WGCNA modules.

Additional file 4. Table S3

: Gene significance.

Additional file 5. Table S4

: Intramodular connectivity.

Additional file 6. Table S5

: Hubgenes cross check with Disgenet database.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Alarabi, A.B., Mohsen, A., Mizuguchi, K. et al. Co-expression analysis to identify key modules and hub genes associated with COVID-19 in platelets. BMC Med Genomics 15, 83 (2022). https://doi.org/10.1186/s12920-022-01222-y

Download citation

Received: 18 November 2021
Accepted: 21 March 2022
Published: 14 April 2022
DOI: https://doi.org/10.1186/s12920-022-01222-y

Co-expression analysis to identify key modules and hub genes associated with COVID-19 in platelets

Abstract

Similar content being viewed by others

Introduction

Methods

Data preprocessing and differentially expressed genes screening

Weighted gene coexpression network analysis

Screening for key modules and hub genes

Validation of key modules using machine learning

Functional enrichment analysis of key modules

Statistical and visualization tools

Results

Construction of co-expression network

Correlation between modules and COVID-19 disease status

Identification of key modules in relationship to COVID-19 disease status

Key modules show high correlation to COVID-19 disease status

Key modules’ genes can differentiate COVID-19 from normal subjects

Gene hub detection and visualization of module networks

Enrichment analysis of key modules

Discussion

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher's Note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation