Abstract
In this protocol we describe our workflow for analyzing complex, multi-condition quantitative proteomic experiments, with the aim to extract biological insights. The tool we use is an R package, PloGO2, contributed to Bioconductor, which we can optionally precede by running correlation network analysis with WGCNA. We describe the data required and the steps we take, including detailed code examples and outputs explanation. The package was designed to generate gene ontology or pathway summaries for many data subsets at the same time, visualize protein abundance summaries for each biological category examined, help determine enriched protein subsets by comparing them all to a reference set, and suggest key highly correlated hub proteins, if the optional network analysis is employed.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Zhang B, Horvath S (2005) A general framework for weighted gene co-expression network analysis. Stat Appl Genet Mol Biol 4(1). https://doi.org/10.2202/1544-6115.1128
Langfelder P, Horvath S (2008) WGCNA: An R package for weighted correlation network analysis. BMC Bioinformatics 9(1):559. https://doi.org/10.1186/1471-2105-9-559
Consortium GO (2004) The gene ontology (go) database and informatics resource. Nucleic Acids Res 32(suppl_1):D258–D261. https://doi.org/10.1093/nar/gkh036
Kanehisa M, Sato Y, Kawashima M, Furumichi M, Tanabe M (2016) Kegg as a reference resource for gene and protein annotation. Nucleic Acids Res 44(D1):D457–D462. https://doi.org/10.1093/nar/gkv1070
Schmidt A, Forne I, Imhof A (2014) Bioinformatic analysis of proteomics data. BMC Syst Biol 8(2):1–7. https://doi.org/10.1186/1752-0509-8-S2-S3
Carnielli CM, Winck FV, Leme AFP (2015) Functional annotation and biological interpretation of proteomics data. Biochim Biophys Acta (BBA) Protein Proteomics 1854(1):46–54. https://doi.org/10.1016/j.bbapap.2014.10.019
Khatri P, Sirota M, Butte AJ (2012) Ten years of pathway analysis: current approaches and outstanding challenges. PLoS Comput Biol 8(2):e1002375. https://doi.org/10.1371/journal.pcbi.1002375
Sherman BT, Lempicki RA, et al (2009) Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nature Protocols 4(1):44. https://doi.org/10.1038/nprot.2008.211
Szklarczyk D, Gable AL, Lyon D, Junge A, Wyder S, Huerta-Cepas J, Simonovic M, Doncheva NT, Morris JH, Bork P, Jensen LJ, Mering C (2018) STRING v11: protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res 47(D1):D607–D613. https://doi.org/10.1093/nar/gky1131
Jassal B, Matthews L, Viteri G, Gong C, Lorente P, Fabregat A, Sidiropoulos K, Cook J, Gillespie M, Haw R, et al (2020) The reactome pathway knowledgebase. Nucleic Acids Res 48(D1):D498–D503. https://doi.org/10.1093/nar/gkz1031
Ge SX, Jung D, Yao R (2020) Shinygo: a graphical gene-set enrichment tool for animals and plants. Bioinformatics 36(8):2628–2629. https://doi.org/10.1093/bioinformatics/btz931
Pascovici D, Keighley T, Mirzaei M, Haynes PA, Cooke B (2012) PloGO: Plotting gene ontology annotation and abundance in multi-condition proteomics experiments. Proteomics 12(3):406–410. https://doi.org/10.1002/pmic.201100445
Wu JX, Pascovici D, Wu Y, Walker A, Mirzaei M (2020) Workflow for rapidly extracting biological insights from complex, multicondition proteomics experiments with WGCNA and plogo2. J Proteome Res https://doi.org/10.1021/acs.jproteome.0c00198
Wu JX, Pascovici D (2020a) Bioconductor plogo2. http://www.bioconductor.org/packages/release/bioc/html/PloGO2.html
Wu JX, Pascovici D (2020b) Plogo2 R package. https://github.com/APAFbioinformatics/PloGO2_R_Package
Wu Y, Mirzaei M, Pascovici D, Chick JM, Atwell BJ, Haynes PA (2016) Quantitative proteomic analysis of two different rice varieties reveals that drought tolerance is correlated with reduced abundance of photosynthetic machinery and increased abundance of clpd1 protease. J Proteomics 143:73–82. https://doi.org/10.1016/j.jprot.2016.05.014
Huber W, Carey VJ, Gentleman R, Anders S, Carlson M, Carvalho BS, Bravo HC, Davis S, Gatto L, Girke T, Gottardo R, Hahne F, Hansen KD, Irizarry RA, Lawrence M, Love MI, MacDonald J, Obenchain V, Olés AK, Pagès H, Reyes A, Shannon P, Smyth GK, Tenenbaum D, Waldron L, Morgan M (2015) Orchestrating high-throughput genomic analysis with Bioconductor. Nature Methods 12(2):115–121. https://doi.org/10.1038/nmeth.3252
R Core Team (2020) R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/
Langfelder P, Horvath S (2014) Tutorials for the WGCNA package
Yuan L, Chen L, Qian K, Qian G, Wu CL, Wang X, Xiao Y (2017) Co-expression network analysis identified six hub genes in association with progression and prognosis in human clear cell renal cell carcinoma (ccrcc). Genomics Data 14:132–140. https://doi.org/10.1016/j.gdata.2017.10.006
Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc B (Methodol) 57(1):289–300. https://doi.org/10.2307/2346101
Friso G, Majeran W, Huang M, Sun Q, Van Wijk KJ (2010) Reconstruction of metabolic pathways, protein expression, and homeostasis machineries across maize bundle sheath and mesophyll chloroplasts: large-scale quantitative proteomics using the first maize genome assembly. Plant Physiology 152(3):1219–1250. https://doi.org/10.1104/pp.109.152694
Acknowledgements
This work was conducted at the Australian Proteome Analysis Facility supported by the Australian Government’s National Collaborative Research Infrastructure Scheme (NCRIS). Aspects of this work were supported by the Australian National Health and Medical Research Council (Project Grant GNT1124005 and RD Wright Career Development Fellowship GNT1140386 to AKW), the Ross Maclean Fellowship, and the Brazil Family Program for Neurology.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature
About this protocol
Cite this protocol
Wu, J.X., Pascovici, D., Wu, Y., Walker, A.K., Mirzaei, M. (2023). Application of WGCNA and PloGO2 in the Analysis of Complex Proteomic Data. In: Burger, T. (eds) Statistical Analysis of Proteomic Data. Methods in Molecular Biology, vol 2426. Humana, New York, NY. https://doi.org/10.1007/978-1-0716-1967-4_17
Download citation
DOI: https://doi.org/10.1007/978-1-0716-1967-4_17
Published:
Publisher Name: Humana, New York, NY
Print ISBN: 978-1-0716-1966-7
Online ISBN: 978-1-0716-1967-4
eBook Packages: Springer Protocols