Skip to main content
Log in

Weighted Gene Co-expression Network Analysis and Machine Learning Validation for Identifying Major Genes Related to Sjogren’s Syndrome

  • Original Article
  • Published:
Biochemical Genetics Aims and scope Submit manuscript

Abstract

Sjogren’s syndrome (SS) is an autoimmune disorder characterized by dry mouth and dry eyes. Its pathogenic mechanism is currently unclear. This study aims to integrate weighted gene co-expression network analysis (WGCNA) and machine learning to identify key genes associated with SS. We downloaded 3 publicly available datasets from the GEO database comprising the gene expression data of 231 SS and 78 control cases, including GSE84844, GSE48378 and GSE51092, and carried out WGCNA to elucidate differences in the abundant genes. Candidate biomarkers for SS were then identified using a LASSO regression model. Totally 6 machine-learning models were subsequently utilized for validating the biological significance of major genes according to their expression. Finally, immune cell infiltration of the SS tissue was assessed using the CIBERSORT algorithm. A weighted gene co-expression network was built to divide genes into 10 modules. Among them, blue and red modules were most closely associated with SS, and showed significant enrichment in type I interferon signaling, cellular response to type I interferon and response to virus, etc. Combined machine learning identified 5 hub genes, including OAS1, EIF2AK2, IFITM3, TOP2A and STAT1. Immune cell infiltration analysis showed that SS was associated with CD8+ T cell, CD4+ T cell, gamma delta T cell, NK cell and dendritic cell activation. WGCNA was combined with machine learning to uncover genes that may be involved in SS pathogenesis, which can be utilized for developing SS biomarkers and appropriate therapeutic targets.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

Data Availability

The data used to support the findings of this study are included in GEO.

References

Download references

Acknowledgments

Not applicable.

Funding

Nature Science Foundation of Hainan (823QN254, China); Nature Science Foundation of Shanghai (21ZR1449200, China); The New Interdisciplinary Subjects of Pudong New District Health Committee, grant number (PWXx2020-04, China); Shanghai Pujiang Program (13PJC004); Innovation research team of high-level local universities in Shanghai (SHSWU-ZDCX20212801).

Author information

Authors and Affiliations

Authors

Contributions

DMX and CW conceived and designed the study. QL and KWW and HL prepared the manuscript, and had equal contribution to the study. HW prepared the tables and Figs. DMX, CW, QL and KWW reviewed and revised the manuscript. All authors contributed to the article and approved the submitted version.

Corresponding authors

Correspondence to Chen Wang or Demeng Xia.

Ethics declarations

Competing interests

The authors declare no competing interests.

Conflict of interest

The authors declare that they have no conflict of interest.

Ethics Approval and Consent to Participate

This study does not involve animal and/or human tissue/individual data/participants, resulting in no ethics-related issues. No permissions were required to collect any repository data involved in the present study.

Consent for Publication

All authors agree to publication of this article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary material 1 (DOCX 730.9 kb)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Luo, Q., Wu, K., Li, H. et al. Weighted Gene Co-expression Network Analysis and Machine Learning Validation for Identifying Major Genes Related to Sjogren’s Syndrome. Biochem Genet (2024). https://doi.org/10.1007/s10528-024-10750-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10528-024-10750-4

Keywords

Navigation