Skip to main content
Log in

Unsupervised methods for finding protein complexes from PPI networks

  • Original Article
  • Published:
Network Modeling Analysis in Health Informatics and Bioinformatics Aims and scope Submit manuscript

Abstract

Complex biological systems are often represented as networks and studied computationally. In protein–protein interaction networks, interactions give rise to certain compounds known as protein complexes. Identifying functional protein complexes is an emerging field of study in system biology. Several machine learning methods have been proposed so far to detect functionally enriched protein complexes responsible for specific biological functions or diseases. We present an empirical study on the different unsupervised approaches towards the identification of such complexes. We report performance of seven popularly known methods against four benchmark datasets in terms of six evaluation parameters. We also highlight some issues and challenges prevailing in this field of research.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

  • Ali W, Deane C, Reinert G (2011) Protein interaction networks and their statistical analysis. Handbook of Statistical, Systems Biology. pp 200–234

  • Asur S, Ucar D, Parthasarathy S (2007) An ensemble framework for clustering protein-protein interaction networks. Bioinformatics 23(13):i29–i40

    Article  Google Scholar 

  • Boulton SJ, Anton G, Jérôme R, Philippe V, Nick D, Hill DE, Vidal M (2002) Combined functional genomic maps of the c. elegans dna damage response. Science 295(5552):127–131

    Article  Google Scholar 

  • Chen Y, Jacquemin T, Zhang S, Jiang R (2014 Prioritizing protein complexes implicated in human diseases by network optimization. BMC Syst Biol 8(S-1):S2

  • Fields S, Song O (1989) A novel genetic system to detect protein–protein interactions. Nature 340(6230):245–246

    Article  Google Scholar 

  • Gambette P, Guénoche A et al (2011) Bootstrap clustering for graph partitioning. RAIRO. Oper Res 45(4):339–352

    Article  MathSciNet  MATH  Google Scholar 

  • Ganegoda GU, Wang J, Wu FX, Li M (2014) Prediction of disease genes using tissue-specified gene-gene network. BMC Syst Biol 8(Suppl 3):S3

    Article  Google Scholar 

  • Gavin AC, Bösche M, Krause R, Grandi P, Marzioch M, Bauer A, Schultz J, Rick JM, Michon AM, Cruciat CM et al (2002) Functional organization of the yeast proteome by systematic analysis of protein complexes. Nature 415(6868):141–147

    Article  Google Scholar 

  • Gavin AC, Aloy P, Grandi P, Krause R, Boesche M, Marzioch M, Rau C, Jensen LJ, Bastuck S, Dümpelfeld B et al (2006) Proteome survey reveals modularity of the yeast cell machinery. Nature 440(7084):631–636

    Article  Google Scholar 

  • Grira N, Crucianu M, Boujemaa N (2004) Unsupervised and semi-supervised clustering: a brief survey. A review of machine learning techniques for processing multimedia content. In: Report of the MUSCLE European Network of Excellence (FP6)

  • Guanming W, Feng X, Stein L (2010) Research a human functional protein interaction network and its application to cancer data analysis. Genome Biol 11:R53

    Article  Google Scholar 

  • Guénoche A (2011) Consensus of partitions: a constructive approach. Adv Data Analysis Classif 5(3):215–229

    Article  MATH  Google Scholar 

  • Ho Y, Gruhler A, Heilbut A, Bader GD, Moore L, Adams SL, Millar A, Taylor P, Bennett K, Boutilier K et al (2002) Systematic identification of protein complexes in saccharomyces cerevisiae by mass spectrometry. Nature 415(6868):180–183

    Article  Google Scholar 

  • Krogan NJ, Cagney G, Haiyuan Y, Zhong G, Guo X, Ignatchenko A, Li J, Shuye P, Datta N, Tikuisis AP et al (2006) Global landscape of protein complexes in the yeast saccharomyces cerevisiae. Nature 440(7084):637–643

    Article  Google Scholar 

  • Li M, Chen J, Wang J, Bin H, Chen G (2008a) Modifying the dpclus algorithm for identifying protein complexes based on new topological structures. BMC Bioinform 9(1):398

    Article  Google Scholar 

  • Li M, Wang J, Chen J (2008b) A fast agglomerate algorithm for mining functional modules in protein interaction networks. In: BioMedical Engineering and Informatics. BMEI 2008. International Conference on, vol. 1. IEEE, pp 3–7

  • Li X, Min W, Kwoh CK, Ng SK (2010) Computational approaches for detecting protein complexes from protein interaction networks: a survey. BMC Genomics 11(Suppl 1):S3

    Article  Google Scholar 

  • Lin C, Cho Y, Hwang W, Pei P, Zhang A (2007) Clustering methods in protein-protein interaction network. Knowledge Discovery in Bioinformatics: techniques, methods and application. pp 1–35

  • Nepusz T, Haiyuan Y, Paccanaro A (2012) Detecting overlapping protein complexes in protein-protein interaction networks. Nature Methods 9(5):471–472

    Article  Google Scholar 

  • Pizzuti C, Rombo SE, Marchiori E (2012) Complex detection in protein-protein interaction networks: a compact overview for researchers and practitioners. In: Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics. Springer, pp 211–223

  • Rahman MS, Ngom A (2013) Fac-pin: Fast agglomerative clustering method for functional modules and protein complex identification in pins. In: Computational Advances in Bio and Medical Sciences (ICCABS), 2013 IEEE 3rd International Conference on. IEEE, pp 1–6

  • Ren J, Wang J, Li M, Wang L (2013) Identifying protein complexes based on density and modularity in protein-protein interaction network. BMC Syst Biol 7(4):1–15

    Google Scholar 

  • Ruan J, Zhang W (2008) Identifying network communities with a high resolution. Phys Rev E 77(1):016104

    Article  Google Scholar 

  • Tong AHY, Evangelista M, Parsons AB, Hong X, Bader GD, Pagé N, Robinson M, Raghibizadeh S, Hogue CWV, Bussey H et al (2001) Systematic genetic analysis with ordered arrays of yeast deletion mutants. Science 294(5550):2364–2368

    Article  Google Scholar 

  • Tong AHY, Lesage G, Bader GD, Ding H, Hong X, Xin X, Young J, Berriz GF, Brost RL, Chang M et al (2004) Global mapping of the yeast genetic interaction network. Science 303(5659):808–813

    Article  Google Scholar 

  • Uetz P, Giot L, Cagney G, Mansfield TA, Judson RS, Knight JR, Lockshon D, Narayan V, Srinivasan Maithreyan, Pochart Pascale et al (2000) A comprehensive analysis of protein-protein interactions in saccharomyces cerevisiae. Nature 403(6770):623–627

    Article  Google Scholar 

  • Von Mering C, Krause R, Snel B, Cornell M, Oliver SG, Fields S, Bork P (2002) Comparative assessment of large-scale data sets of protein–protein interactions. Nature 417:399–403

    Article  Google Scholar 

  • Wang J, Li M, Deng Y, Pan Y (2010) Recent advances in clustering methods for protein interaction networks. BMC Genomics 11(Suppl 3):S10

    Article  Google Scholar 

  • Wang J, Ren J, Li M, Wu FX (2012) Identification of hierarchical and overlapping functional modules in ppi networks

  • Wang X, Wang Z, Ye J (2011) Hkc: An algorithm to predict protein complexes in protein-protein interaction networks. J Biomed Biotechnol

  • Zhang XF, Dai DQ, Ou-Yang L, Yan H (2014) Detecting overlapping protein complexes based on a generative model with functional and topological properties. BMC Bioinformatics 15(1):186

    Article  Google Scholar 

  • Zhu H, Bilgin M, Bangham R, Hall D, Casamayor A, Bertone P, Lan N, Jansen R, Bidlingmaier S, Houfek T et al (2001) Global analysis of protein activities using proteome chips. Science 293(5537):2101–2105

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dhruba K. Bhattacharyya.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Sharma, P., Ahmed, H.A., Roy, S. et al. Unsupervised methods for finding protein complexes from PPI networks. Netw Model Anal Health Inform Bioinforma 4, 8 (2015). https://doi.org/10.1007/s13721-015-0080-7

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s13721-015-0080-7

Keywords

Navigation