Skip to main content
Log in

Improving the performance of evolutionary-based complex detection models in protein–protein interaction networks

  • Methodologies and Application
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

Detecting functional modules in protein–protein interaction (PPI) networks is an important and ongoing challenge in computational biology. To partition a PPI network into complexes, most existing state-of-the-art methods, including evolutionary algorithms (EAs), are based solely on either very general graphical properties or very specific biological semantics of PPI networks or both. Despite the strength and the additional predictive improvements introduced by EA-based complex detection methods and apart from the specific biological properties of PPI networks, more inherent topological properties of protein complexes are rarely explored in these approaches. Moreover, many of these EAs view and define complex detection problem as a single-objective optimization problem. In this paper, we introduce a new multi-objective optimization model for formulating complex detection problem. A heuristic perturbation operator, called protein complex attraction and repulsion, is also introduced to improve the quality of a given solution. To the best of our knowledge, this is the first attempt to exploit the topological structure of proteins interactions in both problem formulation and complex perturbation. To demonstrate the effectiveness of the proposed multi-objective evolutionary-based complex detection algorithm, two well-known PPI networks and two reference sets of benchmark complexes created from MIPS are used in the experiments. Moreover, new random networks are generated from the Saccharomyces cerevisiae (yeast) PPI network to investigate the impact of perturbing protein interactions on the final performance of the proposed algorithm against counterpart algorithms. In comparison with several state-of-the-art methods, experimental results reveal that the proposed algorithm achieves more predictive level of matched protein complexes. Moreover, the results demonstrate the positive impact of injecting the proposed heuristic perturbation operator in improving the quality of the detected complexes obtained by all state-of-the-art EA-based methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20
Fig. 21
Fig. 22
Fig. 23
Fig. 24
Fig. 25
Fig. 26
Fig. 27
Fig. 28
Fig. 29
Fig. 30
Fig. 31
Fig. 32
Fig. 33
Fig. 34
Fig. 35
Fig. 36
Fig. 37
Fig. 38
Fig. 39
Fig. 40
Fig. 41

Similar content being viewed by others

References

  • Adamcsek B, Palla G, Farkas IJ, Derényi I, Vicsek T (2006) CFinder: locating cliques and overlapping modules in biological networks. Bioinformatics 22(8):1021–1023

    Article  Google Scholar 

  • Ahn Y-Y, Bagrow JP, Lehmann S (2010) Link communities reveal multiscale complexity in networks. Nature 466:761–764

    Article  Google Scholar 

  • Altaf-Ul-Amin M, Shinbo Y, Mihara K, Kurokawa K, Kanaya S (2006) Development and implementation of an algorithm for detection of protein complexes in large interaction networks. BMC Bioinform 7(1):207

    Article  Google Scholar 

  • Arnau V, Mars S, Marín I (2005) Iterative cluster analysis of protein interaction data. Bioinformatics 21(3):364–378

    Article  Google Scholar 

  • Attea BA, Hariz WA, Abdulhalim MF (2015) Improving the performance of evolutionary multi-objective co-clustering models for community detection in complex social networks. Swarm Evol Comput 26(2016):137–156

    Google Scholar 

  • Attea BA, Khoder HS (2016) A new multi-objective evolutionary framework for community mining in dynamic social networks. Swarm Evol Comput 31:90–109

    Article  Google Scholar 

  • Bader GD, Hogue CW (2003) An automated method for finding molecular complexes in large protein interaction networks. BMC Bioinform 4(1):1

    Article  Google Scholar 

  • Bandyopadhyay S, Ray S, Mukhopadhyay A, Maulik U (2015) A multiobjective approach for identifying protein complexes and studying their association in multiple disorders. Algorithms Mol Biol 10(1):1

    Article  Google Scholar 

  • Barabasi AL, Oltvai ZN (2004) Network biology: understanding the cell’s functional organization. Nat Rev Genet 5(2):101–113

    Article  Google Scholar 

  • Becker E, Robisson B, Chapple CE, Guénoche A, Brun C (2012) Multifunctional proteins revealed by overlapping clustering in protein interaction network. Bioinformatics 28:84–90

    Article  Google Scholar 

  • Brandes U, Delling D, Gaertler M, Görke R, Hoefer M, Nikoloski Z, Wagner D (2008) On modularity clustering. IEEE Trans Knowl Data Eng 20(2):172–188

    Article  MATH  Google Scholar 

  • Brohée S, van Helden J (2006) Evaluation of clustering algorithms for protein–protein interaction networks. BMC Bioinform 7(1):488

    Article  Google Scholar 

  • Cho YR, Hwang W, Ramanathan M, Zhang A (2007) Semantic integration to identify overlapping functional modules in protein interaction networks. BMC Bioinform 8(1):1

    Article  Google Scholar 

  • Coello CAC, Van Veldhuizen DA, Lamont GB (2002) Evolutionary algorithms for solving multi-objective problems, vol 242. Kluwer Academic, New York

    Book  MATH  Google Scholar 

  • Deb K (2001) Multi-objective optimization using evolutionary algorithms, vol 16. Wiley, New York

    MATH  Google Scholar 

  • Ding Z, Zhang X, Sun D, Luo B (2016) Overlapping community detection based on network decomposition. Sci Rep 6:24115. doi:10.1038/srep24115

  • Enright AJ, Van Dongen S, Ouzounis CA (2002) An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res 30(7):1575–1584

    Article  Google Scholar 

  • Folino F, Pizzuti C (2010) A multiobjective and evolutionary clustering method for dynamic networks. In: 2010 International Conference on advances in social networks analysis and mining (ASONAM), IEEE, pp 256–263

  • Fortunato S (2010) Community detection in graphs. Phys Rep 486(3):75–174

    Article  MathSciNet  Google Scholar 

  • Garey MR, Johnson DS (1979) Computers and intractability. A guide to the theory of NP-completeness. W. H. Freeman and Company, New York

    MATH  Google Scholar 

  • Gavin AC, Aloy P, Grandi P, Krause R, Boesche M, Marzioch M, Edelmann A (2006) Proteome survey reveals modularity of the yeast cell machinery. Nature 440(7084):631–636

    Article  Google Scholar 

  • Georgii E, Dietmann S, Uno T, Pagel P, Tsuda K (2009) Enumeration of condition-dependent dense modules in protein interaction networks. Bioinformatics 25(7):933–940

    Article  Google Scholar 

  • Huang L, Wang G, Wang Y, Blanzieri E, Su C (2013) Link clustering with extended link similarity and EQ evaluation division. PLos One 8:e66005. doi:10.1371/journal.pone.0066005

    Article  Google Scholar 

  • King AD, Pržulj N, Jurisica I (2004) Protein complex prediction via cost-based clustering. Bioinformatics 20(17):3013–3020

    Article  Google Scholar 

  • Lubovac Z, Gamalielsson J, Olsson B (2006) Combining functional and topological properties to identify core modules in protein interaction networks. Proteins Struct Funct Bioinform 64:948–959

    Article  Google Scholar 

  • Mewes HW, Frishman D, Güldener U, Mannhaupt G, Mayer K, Mokrejs M, Weil B (2002) MIPS: a database for genomes and protein sequences. Nucleic Acids Res 30(1):31–34

    Article  Google Scholar 

  • Newman ME, Girvan M (2004) Finding and evaluating community structure in networks. Phys Rev E 69(2):026113

    Article  Google Scholar 

  • Noack A (2007) Energy models for graph clustering. J. Graph Algorithms Appl. 11(2):453–480

    Article  MathSciNet  MATH  Google Scholar 

  • Oti M, Snel B, Huynen MA, Brunner HG (2006) Predicting disease genes using protein–protein interactions. J Med Genet 43(8):691–698

    Article  Google Scholar 

  • Palla G, Derényi I, Farkas I, Vicsek T (2005) Uncovering the overlapping community structure of complex networks in nature and society. Nature 435(7043):814–818

    Article  Google Scholar 

  • Pei, P, Zhang A (2005) A two-step approach for clustering proteins based on protein interaction profile. In: Proceedings/IEEE computational systems bioinformatics conference, CSB. IEEE computational systems bioinformatics conference, vol 2005, No. 1544467, NIH Public Access, p 201

  • Pizzuti C, Rombo S (2012a) Experimental evaluation of topological-based fitness functions to detect complexes in PPI networks. In: Proceedings of the 14th annual conference on Genetic and evolutionary computation, ACM, pp 193–200

  • Pizzuti C, Rombo SE (2012b) A coclustering approach for mining large protein–protein interaction networks. IEEE/ACM Trans Comput Biol Bioinform (TCBB) 9(3):717–730

    Article  Google Scholar 

  • Pizzuti C, Rombo SE (2007) PINCOC: a co-clustering based approach to analyze protein–protein interaction networks. In: Intelligent data engineering and automated learning-IDEAL 2007, Springer, Berlin, pp 821–830

  • Pizzuti C, Rombo SE (2008) Discovering meaningful protein–protein interaction modules by a co-clustering based approach. In: SEBD, pp 294–301

  • Pizzuti C, Rombo SE (2014a) An evolutionary restricted neighborhood search clustering approach for PPI networks. Neurocomputing 145:53–61

    Article  Google Scholar 

  • Pizzuti C, Rombo SE (2014b) Algorithms and tools for protein–protein interaction networks clustering, with a special focus on population-based stochastic methods. Bioinformatics 30(10):1343–1352

    Article  Google Scholar 

  • Radicchi F, Castellano C, Cecconi F, Loreto V, Parisi D (2004) Defining and identifying communities in networks. Proc Natl Acad Sci USA 101(9):2658–2663

    Article  Google Scholar 

  • Ray S, Hossain A, Maulik U (2016) Disease associated protein complex detection: a multi-objective evolutionary approach. In: 2016 International conference on microelectronics, computing and communications (MicroCom), IEEE, pp 1–6

  • Schaeffer SE (2007) Graph clustering. Comput Sci Rev 1(1):27–64

    Article  MATH  Google Scholar 

  • Shi J, Malik J (2000) Normalized cuts and image segmentation. IEEE Trans Pattern Anal Mach Intell 22(8):888–905

    Article  Google Scholar 

  • Spirin V, Mirny LA (2003) Protein complexes and functional modules in molecular networks. PNAS 100:12123–12128

    Article  Google Scholar 

  • Srihari S, Leong HW (2013) A survey of computational methods for protein complex prediction from protein interaction networks. J Bioinform Comput Biol 11(02):1230002

    Article  Google Scholar 

  • von Mering C, Krause R, Snel B, Cornell M, Oliver SG, Fields S, Bork P (2002) Comparative assessment of large-scale data sets of protein–protein interactions. Nature 417(6887):399–403

    Article  Google Scholar 

  • Wang H, Azuaje F, Bodenreider O, Dopazo J (2004) Gene expression correlation and gene ontology-based similarity: an assessment of quantitative relationships. In: Proceedings of the 2004 IEEE symposium on computational intelligence in bioinformatics and computational biology, 2004, CIBCB’04, IEEE, pp 25–31

  • Zaki N, Berengueres J, Efimov D (2012) Detection of protein complexes using a protein ranking algorithm. Proteins Struct Funct Bioinform 80(10):2459–2468

    Article  Google Scholar 

  • Zhang N, Bilsland E (2011) Contributions of Saccharomyces cerevisiae to understanding mammalian gene function and therapy. Yeast Syst Biol Methods Protoc 501–523. doi:10.1007/978-1-61779-173-4_28

  • Zhang Q, Li H (2007) MOEA/D: a multi-objective evolutionary algorithm based on decomposition. IEEE Trans Evol Comput 11(6):712–731

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Bara’a A. Attea.

Ethics declarations

Conflict of interest

Authors declare that they have no conflict of interest.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Additional information

Communicated by V. Loia.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Attea, B.A., Abdullah, Q.Z. Improving the performance of evolutionary-based complex detection models in protein–protein interaction networks. Soft Comput 22, 3721–3744 (2018). https://doi.org/10.1007/s00500-017-2593-8

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-017-2593-8

Keywords

Navigation