Pathway and Network Analysis of Differentially Expressed Genes in Transcriptomes
In recent years, transcriptome sequencing has become very popular, encompassing a wide variety of applications from simple mRNA profiling to discovery and analysis of the entire transcriptome. One of the most common aims of transcriptome sequencing is to identify genes that are differentially expressed (DE) between two or more biological conditions, and to infer associated pathways and gene networks from expression profiles. It can provide avenues for further systematic investigation into potential biologic mechanisms. Gene Set (GS) enrichment analysis is a popular approach to identify pathways or sets of genes that are significantly enriched in the context of differentially expressed genes. However, the approach considers a pathway as a simple gene collection disregarding knowledge of gene or protein interactions. In contrast, topology-based methods integrate the topological structure of a pathway and gene network into the analysis. To provide a panoramic view of such approaches, this chapter demonstrates several recent computational workflows, including gene set enrichment and topology-based methods, for analysis of the DE pathways and gene networks from transcriptome-wide sequencing data.
Key wordsTranscriptome RNA-Seq Microarray Pathway Network Topology Enrichment analysis
This work was supported by the Fundamental Research Funds for the Central Universities (Grant No. JZ2017YYPY0899). The authors are grateful to the editors and the anonymous reviewers for their valuable suggestions and comments facilitating the improvement of this chapter.
- 4.Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP (2005) Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci 102(43):15545–15550. https://doi.org/10.1073/pnas.0506580102 CrossRefPubMedPubMedCentralGoogle Scholar
- 9.Team RC (2014) R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing 14(3):279-293.Google Scholar
- 10.Charmpi K, Ycart B (2015) Weighted Kolmogorov Smirnov testing: an alternative for gene set enrichment analysis. Stat Appl Genet Mol Biol 14. https://doi.org/10.1515/sagmb-2014-0077
- 18.Jacob L, Neuvial P, Dudoit S (2010) Gains in power from structured two-sample tests of means on graphs. arXiv preprint arXiv:10095173Google Scholar
- 23.Davis S, Meltzer PS (2007) GEOquery: a bridge between the Gene Expression Omnibus (GEO) and BioConductor. Bioinformatics 23:1846–1847.Google Scholar
- 30.Lu TP, Tsai MH, Lee JM, Hsu CP, Chen PC, Lin CW, Shih JY, Yang PC, Hsiao CK, Lai LC, Chuang EY (2010) Identification of a novel biomarker, SEMA5A, for non-small cell lung carcinoma in nonsmoking women. Cancer Epidemiol Biomarkers Prevent 19(10):2590–2597. https://doi.org/10.1158/1055-9965.epi-10-0332 CrossRefGoogle Scholar
- 33.Leng N, Dawson JA, Thomson JA, Ruotti V, Rissman AI, Smits BMG, Haag JD, Gould MN, Stewart RM, Kendziorski C (2013) EBSeq: an empirical Bayes hierarchical model for inference in RNA-Seq experiments. Bioinformatics 29(8):1035–1043. https://doi.org/10.1093/bioinformatics/btt087 CrossRefPubMedPubMedCentralGoogle Scholar
- 37.Chatr-Aryamontri A, Breitkreutz BJ, Oughtred R, Boucher L, Heinicke S, Chen D, Stark C, Breitkreutz A, Kolas N, O'Donnell L, Reguly T, Nixon J, Ramage L, Winter A, Sellam A, Chang C, Hirschman J, Theesfeld C, Rust J, Livstone MS, Dolinski K, Tyers M (2015) The BioGRID interaction database: 2015 update. Nucleic Acids Res 43(Database issue):D470–D478. https://doi.org/10.1093/nar/gku1204 CrossRefPubMedGoogle Scholar
- 38.Sales G, Calura E, Romualdi C (2012) GRAPH interaction from pathway topological environment BMC Bioinformatics 2013
- 39.Caspi R, Altman T, Billington R, Dreher K, Foerster H, Fulcher CA, Holland TA, Keseler IM, Kothari A, Kubo A, Krummenacker M, Latendresse M, Mueller LA, Ong Q, Paley S, Subhraveti P, Weaver DS, Weerasinghe D, Zhang P, Karp PD (2014) The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of Pathway/Genome Databases. Nucleic Acids Res 42(Database issue):D459–D471. https://doi.org/10.1093/nar/gkt1103 CrossRefPubMedGoogle Scholar