ProtRet: A Web Server for Retrieving Proteins in a Functional Complex

  • Nazar ZakiEmail author
  • Maryam Al Yammahi
  • Tetiana Habuza
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 1005)


The identification of protein complexes is becoming increasingly important to our understanding of cellular functionality. However, if a biologist wishes to investigate a certain protein, currently no method exists to assist him/her to accurately retrieve the possible protein partners that are expected to be in the same functional complex. Here, we introduce ProtRet, a web server that functions as an interface for an improved Pigeonhole approach to identify protein complexes in protein-protein interaction networks. The approach provides high-quality protein comparison that is particularly valuable because of its accurate statistical estimates based on fuzzy criterion and Hamming distance. The proposed method was tested on two high-throughput experimental protein-protein interaction data sets and two gold standard data sets and was able to retrieve more correct protein members than all existing methods. The web server is accessible from the link


Protein complexes Protein-protein interaction Pigeonhole method Information retrieval 



The authors acknowledge financial support from the ICT Fund (Grant # G00001472) and the UAEU (Grant # G00002659).


  1. 1.
    Zaki, N., Deris, S., Alashwal, A.: Protein-protein interaction detection based on substring sensitivity measure. Int. J. Biomed. Sci. 1(2) (2006). ISSN 1306–1216Google Scholar
  2. 2.
    Farutin, V., Robinson, K., Lightcap, E., Dancik, V., Ruttenberg, A., Letovsky, S., Pradines, J.: Edge-count probabilities for the identification of local protein communities and their organization. Proteins: Struct. Funct. Bioinform. 62, 800–818 (2006)CrossRefGoogle Scholar
  3. 3.
    Hanna, E.M., Zaki, N.: Detecting protein complexes in protein interaction networks using a ranking algorithm with a refined merging procedure. BMC Bioinform. 15, 204 (2014)CrossRefGoogle Scholar
  4. 4.
    Liu, H., Liu, J.: Clustering protein interaction data through chaotic genetic algorithm. In: Simulated Evolution and Learning. Lecture Notes in Computer Science, vol. 4247, pp. 858–864 (2006)Google Scholar
  5. 5.
    Bader, G., Hogue, H.: An automated method for finding molecular com-plexes in large protein-protein interaction networks. BMC Bioinform. 4, 2 (2003)CrossRefGoogle Scholar
  6. 6.
    Adamcsek, B., Palla, G., Farkas, I.J., Derényi, I., Vicsek, T.: CFinder: locating cliques and overlapping modules in biological networks. Bioinformatics 22, 1021–1023 (2006)CrossRefGoogle Scholar
  7. 7.
    Zaki, N., Berengueres, J., Efimov, D.: Detection of protein complexes using a protein ranking algorithm. Proteins: Struct. Funct. Bioinform. 80(10), 2459–2468 (2012)CrossRefGoogle Scholar
  8. 8.
    Guimei, L., Wong, L., Chua, H.N.: Complex discovery from weighted PPI networks. Bioinformatics 25, 1891–1897 (2009)CrossRefGoogle Scholar
  9. 9.
    Nepusz, T., Yu, H., Paccanaro, A.: Detecting overlapping protein complexes in protein-protein interaction networks. Nat. Methods 9, 471–472 (2012)CrossRefGoogle Scholar
  10. 10.
    Zaki, N., Dmitry, D., Berengueres, J.: Protein complex detection using interaction reliability assessment and weighted clustering coefficient. BMC Bioinform. 14, 1163 (2013)CrossRefGoogle Scholar
  11. 11.
    King, A.D., Przulj, N., Jurisica, I.: Protein complex prediction via cost-based clustering. Bioinformatics 17, 3013–3020 (2004)CrossRefGoogle Scholar
  12. 12.
    Enright, A.J., Dongen, S.V., Ouzounis, C.A.: An efficient algorithm for largescale detection of protein families. Nucl. Acids Res. 30, 1575–1584 (2002)CrossRefGoogle Scholar
  13. 13.
    Macropol, K., Can, T., Singh, A.: RRW: repeated random walks on genome-scale protein networks for local cluster discovery. BMC Bioinform. 10, 283 (2009)CrossRefGoogle Scholar
  14. 14.
    Herstein, I.N.: Topics In Algebra. Blaisdell Publishing Company, Waltham (1964)zbMATHGoogle Scholar
  15. 15.
    Gavin, A.C., et al.: Functional organization of the yeast proteome by system-atic analysis of protein complexes. Nature 415, 141–147 (2002)CrossRefGoogle Scholar
  16. 16.
    Gavin, A.C., et al.: Proteome survey reveals modularity of the yeast cell machinery. Nature 440, 631–636 (2006)CrossRefGoogle Scholar
  17. 17.
    Leung, H., Xiang, Q., Yiu, S.M., Chin, F.: Predicting protein complexes from PPI data: a core-attachment approach. J. Comput. Biol. 16, 133–139 (2009)MathSciNetCrossRefGoogle Scholar
  18. 18.
    Mewes, H.W., et al.: MIPS: analysis and annotation of proteins from whole genomes. Nucl. Acids Res. 32, D41–D44 (2004)CrossRefGoogle Scholar
  19. 19.
    Pearson, W.R.: Flexible sequence similarity searching with the FASTA3 program package. Methods Mol. Biol. 132, 185–219 (2000)Google Scholar
  20. 20.
    Altschul, S.F., Madden, T.L., Schäffer, A.A., Zhang, J., Zhang, Z., Miller, W., Lipman, D.J.: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucl. Acids Res. 25, 3389–3402 (1997)CrossRefGoogle Scholar
  21. 21.
    Aaron J. Mackey, AJ., Haystead, TJ., Pearson, WR.: Algorithms for rapid protein identification with multiple short peptide sequences. Mol. Cell. Proteomics 1, 139-147 (2002)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2020

Authors and Affiliations

  • Nazar Zaki
    • 1
    Email author
  • Maryam Al Yammahi
    • 2
  • Tetiana Habuza
    • 1
  1. 1.Department of Computer Science and Software Engineering, College of Information TechnologyUnited Arab Emirates University (UAEU)Al AinUnited Arab Emirates
  2. 2.Department of Computer and Network Engineering, College of Information TechnologyUnited Arab Emirates University (UAEU)Al AinUnited Arab Emirates

Personalised recommendations