Application of WGCNA and PloGO2 in the Analysis of Complex Proteomic Data

Wu, Jemma X.; Pascovici, Dana; Wu, Yunqi; Walker, Adam K.; Mirzaei, Mehdi

doi:10.1007/978-1-0716-1967-4_17

Jemma X. Wu³,
Dana Pascovici⁴,
Yunqi Wu³,
Adam K. Walker⁵ &
…
Mehdi Mirzaei³

Part of the book series: Methods in Molecular Biology ((MIMB,volume 2426))

1313 Accesses
1 Citations
1 Altmetric

Abstract

In this protocol we describe our workflow for analyzing complex, multi-condition quantitative proteomic experiments, with the aim to extract biological insights. The tool we use is an R package, PloGO2, contributed to Bioconductor, which we can optionally precede by running correlation network analysis with WGCNA. We describe the data required and the steps we take, including detailed code examples and outputs explanation. The package was designed to generate gene ontology or pathway summaries for many data subsets at the same time, visualize protein abundance summaries for each biological category examined, help determine enriched protein subsets by comparing them all to a reference set, and suggest key highly correlated hub proteins, if the optional network analysis is employed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Protocol: USD 49.95; Price excludes VAT (USA)

eBook: USD 109.00; Price excludes VAT (USA)

Softcover Book: USD 139.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Zhang B, Horvath S (2005) A general framework for weighted gene co-expression network analysis. Stat Appl Genet Mol Biol 4(1). https://doi.org/10.2202/1544-6115.1128
Langfelder P, Horvath S (2008) WGCNA: An R package for weighted correlation network analysis. BMC Bioinformatics 9(1):559. https://doi.org/10.1186/1471-2105-9-559
Consortium GO (2004) The gene ontology (go) database and informatics resource. Nucleic Acids Res 32(suppl_1):D258–D261. https://doi.org/10.1093/nar/gkh036
Kanehisa M, Sato Y, Kawashima M, Furumichi M, Tanabe M (2016) Kegg as a reference resource for gene and protein annotation. Nucleic Acids Res 44(D1):D457–D462. https://doi.org/10.1093/nar/gkv1070
Schmidt A, Forne I, Imhof A (2014) Bioinformatic analysis of proteomics data. BMC Syst Biol 8(2):1–7. https://doi.org/10.1186/1752-0509-8-S2-S3
Carnielli CM, Winck FV, Leme AFP (2015) Functional annotation and biological interpretation of proteomics data. Biochim Biophys Acta (BBA) Protein Proteomics 1854(1):46–54. https://doi.org/10.1016/j.bbapap.2014.10.019
Khatri P, Sirota M, Butte AJ (2012) Ten years of pathway analysis: current approaches and outstanding challenges. PLoS Comput Biol 8(2):e1002375. https://doi.org/10.1371/journal.pcbi.1002375
Sherman BT, Lempicki RA, et al (2009) Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nature Protocols 4(1):44. https://doi.org/10.1038/nprot.2008.211
Szklarczyk D, Gable AL, Lyon D, Junge A, Wyder S, Huerta-Cepas J, Simonovic M, Doncheva NT, Morris JH, Bork P, Jensen LJ, Mering C (2018) STRING v11: protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res 47(D1):D607–D613. https://doi.org/10.1093/nar/gky1131
Jassal B, Matthews L, Viteri G, Gong C, Lorente P, Fabregat A, Sidiropoulos K, Cook J, Gillespie M, Haw R, et al (2020) The reactome pathway knowledgebase. Nucleic Acids Res 48(D1):D498–D503. https://doi.org/10.1093/nar/gkz1031
Ge SX, Jung D, Yao R (2020) Shinygo: a graphical gene-set enrichment tool for animals and plants. Bioinformatics 36(8):2628–2629. https://doi.org/10.1093/bioinformatics/btz931
Pascovici D, Keighley T, Mirzaei M, Haynes PA, Cooke B (2012) PloGO: Plotting gene ontology annotation and abundance in multi-condition proteomics experiments. Proteomics 12(3):406–410. https://doi.org/10.1002/pmic.201100445
Wu JX, Pascovici D, Wu Y, Walker A, Mirzaei M (2020) Workflow for rapidly extracting biological insights from complex, multicondition proteomics experiments with WGCNA and plogo2. J Proteome Res https://doi.org/10.1021/acs.jproteome.0c00198
Wu JX, Pascovici D (2020a) Bioconductor plogo2. http://www.bioconductor.org/packages/release/bioc/html/PloGO2.html
Wu JX, Pascovici D (2020b) Plogo2 R package. https://github.com/APAFbioinformatics/PloGO2_R_Package
Wu Y, Mirzaei M, Pascovici D, Chick JM, Atwell BJ, Haynes PA (2016) Quantitative proteomic analysis of two different rice varieties reveals that drought tolerance is correlated with reduced abundance of photosynthetic machinery and increased abundance of clpd1 protease. J Proteomics 143:73–82. https://doi.org/10.1016/j.jprot.2016.05.014
Huber W, Carey VJ, Gentleman R, Anders S, Carlson M, Carvalho BS, Bravo HC, Davis S, Gatto L, Girke T, Gottardo R, Hahne F, Hansen KD, Irizarry RA, Lawrence M, Love MI, MacDonald J, Obenchain V, Olés AK, Pagès H, Reyes A, Shannon P, Smyth GK, Tenenbaum D, Waldron L, Morgan M (2015) Orchestrating high-throughput genomic analysis with Bioconductor. Nature Methods 12(2):115–121. https://doi.org/10.1038/nmeth.3252
R Core Team (2020) R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/
Langfelder P, Horvath S (2014) Tutorials for the WGCNA package
Google Scholar
Yuan L, Chen L, Qian K, Qian G, Wu CL, Wang X, Xiao Y (2017) Co-expression network analysis identified six hub genes in association with progression and prognosis in human clear cell renal cell carcinoma (ccrcc). Genomics Data 14:132–140. https://doi.org/10.1016/j.gdata.2017.10.006
Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc B (Methodol) 57(1):289–300. https://doi.org/10.2307/2346101
Friso G, Majeran W, Huang M, Sun Q, Van Wijk KJ (2010) Reconstruction of metabolic pathways, protein expression, and homeostasis machineries across maize bundle sheath and mesophyll chloroplasts: large-scale quantitative proteomics using the first maize genome assembly. Plant Physiology 152(3):1219–1250. https://doi.org/10.1104/pp.109.152694

Download references

Acknowledgements

This work was conducted at the Australian Proteome Analysis Facility supported by the Australian Government’s National Collaborative Research Infrastructure Scheme (NCRIS). Aspects of this work were supported by the Australian National Health and Medical Research Council (Project Grant GNT1124005 and RD Wright Career Development Fellowship GNT1140386 to AKW), the Ross Maclean Fellowship, and the Brazil Family Program for Neurology.

Author information

Authors and Affiliations

Australian Proteome Analysis Facility, Macquarie University, Sydney, NSW, Australia
Jemma X. Wu, Yunqi Wu & Mehdi Mirzaei
Insight Stats, Croydon Park, NSW, Australia
Dana Pascovici
Neurodegeneration Pathobiology Laboratory Queensland Brain Institute, The University of Queensland, St. Lucia, QLD, Australia
Adam K. Walker

Authors

Jemma X. Wu
View author publications
You can also search for this author in PubMed Google Scholar
Dana Pascovici
View author publications
You can also search for this author in PubMed Google Scholar
Yunqi Wu
View author publications
You can also search for this author in PubMed Google Scholar
Adam K. Walker
View author publications
You can also search for this author in PubMed Google Scholar
Mehdi Mirzaei
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jemma X. Wu .

Editor information

Editors and Affiliations

Exploring the Dynamics of Proteomes, Proteomics French Infrastructure, Université Grenoble Alpes, CNRS, CEA, INSERM, Grenoble, France
Thomas Burger

Rights and permissions

Reprints and permissions

Copyright information

About this protocol

Cite this protocol

Wu, J.X., Pascovici, D., Wu, Y., Walker, A.K., Mirzaei, M. (2023). Application of WGCNA and PloGO2 in the Analysis of Complex Proteomic Data. In: Burger, T. (eds) Statistical Analysis of Proteomic Data. Methods in Molecular Biology, vol 2426. Humana, New York, NY. https://doi.org/10.1007/978-1-0716-1967-4_17

Download citation

DOI: https://doi.org/10.1007/978-1-0716-1967-4_17
Published: 25 August 2021
Publisher Name: Humana, New York, NY
Print ISBN: 978-1-0716-1966-7
Online ISBN: 978-1-0716-1967-4
eBook Packages: Springer Protocols

Publish with us

Policies and ethics