Collaboration-Based Function Prediction in Protein-Protein Interaction Networks

  • Hossein Rahmani
  • Hendrik Blockeel
  • Andreas Bender
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7014)


The cellular metabolism of a living organism is among the most complex systems that man is currently trying to understand. Part of it is described by so-called protein-protein interaction (PPI) networks, and much effort is spent on analyzing these networks. In particular, there has been much interest in predicting certain properties of nodes in the network (in this case, proteins) from the other information in the network. In this paper, we are concerned with predicting a protein’s functions. Many approaches to this problem exist. Among the approaches that predict a protein’s functions purely from its environment in the network, many are based on the assumption that neighboring proteins tend to have the same functions. In this work we generalize this assumption: we assume that certain neighboring proteins tend to have “collaborative”, but not necessarily the same, functions. We propose a few methods that work under this new assumption. These methods yield better results than those previously considered, with improvements in F-measure ranging from 3% to 17%. This shows that the commonly made assumption of homophily in the network (or “guilt by association”), while useful, is not necessarily the best one can make. The assumption of collaborativeness is a useful generalization of it; it is operational (one can easily define methods that rely on it) and can lead to better results.


Majority Rule Function Level Candidate Function Functional Cluster Winner Neuron 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Brun, C., Herrmann, C., Guénoche, A.: Clustering proteins from interaction networks for the prediction of cellular functions. BMC Bioinformatics 5, 95 (2004)CrossRefGoogle Scholar
  2. 2.
    Chua, H.N., Sung, W., Wong, L.: Exploiting indirect neighbours and topological weight to predict protein function from protein-protein interactions. Bioinformatics 22(13), 1623–1630 (2006)CrossRefGoogle Scholar
  3. 3.
    Deane, C.M., Salwiński, L., Xenarios, I., Eisenberg, D.: Protein interactions: two methods for assessment of the reliability of high throughput observations. Molecular & Cellular Proteomics: MCP 1(5), 349–356 (2002)CrossRefGoogle Scholar
  4. 4.
    Guldener, U., Munsterkotter, M., Kastenmuller, G., Strack, N., van Helden, J., Lemer, C., et al.: Cygd: the comprehensive yeast genome database. Nucleic Acids Research 33(supplement. 1), D364+ (January 2005)Google Scholar
  5. 5.
    King, A.D., Przulj, N., Jurisica, I.: Protein complex prediction via cost-based clustering. Bioinformatics 20(17), 3013–3020 (2004)CrossRefGoogle Scholar
  6. 6.
    Krogan, N.J., Cagney, G., Yu, H., Zhong, G., Guo, X., Ignatchenko, A., et al.: Global landscape of protein complexes in the yeast saccharomyces cerevisiae. Nature 440(7084), 637–643 (2006)CrossRefGoogle Scholar
  7. 7.
    Ma’ayan, A., Jenkins, S.L., Goldfarb, J., Iyengar, R.: Network analysis of fda approved drugs and their targets. The Mount Sinai Journal of Medicine 74(1), 27–32 (2007)CrossRefGoogle Scholar
  8. 8.
    Milenkovic, T., Przulj, N.: Uncovering biological network function via graphlet degree signatures. Cancer Informatics 6, 257–273 (2008)Google Scholar
  9. 9.
    Rahmani, H., Blockeel, H., Bender, A.: Predicting the functions of proteins in PPI networks from global information. JMLR Proceeding 8, 82–97 (2010)Google Scholar
  10. 10.
    Schwikowski, B., Uetz, P., Fields, S.: A network of protein-protein interactions in yeast. Nat. Biotechnol. 18(12), 1257–1261 (2000)CrossRefGoogle Scholar
  11. 11.
    Stelzl, U., Worm, U., Lalowski, M., Haenig, C., Brembeck, F.H., Goehler, H., et al.: A human protein-protein interaction network: a resource for annotating the proteome. Cell 122(6), 957–968 (2005)CrossRefGoogle Scholar
  12. 12.
    Sun, S., Zhao, Y., Jiao, Y., Yin, Y., Cai, L., Zhang, Y., et al.: Faster and more accurate global protein function assignment from protein interaction networks using the mfgo algorithm. FEBS Lett. 580(7), 1891–1896 (2006)CrossRefGoogle Scholar
  13. 13.
    Vazquez, A., Flammini, A., Maritan, A., Vespignani, A.: Global protein function prediction from protein-protein interaction networks. Nat. Biotechnol. 21(6), 697–700 (2003)CrossRefGoogle Scholar
  14. 14.
    von Mering, C., Krause, R., Snel, B., Cornell, M., Oliver, S.G., Fields, S., et al.: Comparative assessment of large-scale data sets of protein-protein interactions. Nature 417(6887), 399–403 (2002)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Hossein Rahmani
    • 1
  • Hendrik Blockeel
    • 1
    • 2
  • Andreas Bender
    • 3
  1. 1.Leiden Institute of Advanced Computer ScienceUniversiteit LeidenThe Netherlands
  2. 2.Department of Computer ScienceKatholieke Universiteit LeuvenThe Netherlands
  3. 3.Unilever Centre for Molecular Science Informatics, Department of ChemistryUniversity of CambridgeUK

Personalised recommendations