Estimating Protein Function Using Protein-Protein Relationships

  • Protocol
Gene Function Analysis

Part of the book series: Methods in Molecular Biology™ (MIMB, volume 408)

Abstract

Many newly identified gene products from completely sequenced genomes are difficult to characterize in the absence of sequence homology to known proteins. In such a scenario, the context of the proteins’ functional associations can be used for annotation; overrepresented functional linkages with a certain class of proteins or members of a pathway allow putative function assignments based on the “guilt-by-association” principle. Two computational functional genomics methods, phylogenetic profiling and identification of Rosetta stone linkages, are described in this chapter, which allow assessment of functional linkages between proteins, consequently facilitating annotation. Phylogenetic profiling involves measuring similarity between profiles that describe the presence or absence of a protein in a set of reference genomes, whereas Rosetta stone fusion sequences help link two or more independently transcribed and translated proteins. Both methods can be applied to investigate functional associations between individual proteins, and can also be extended to reconstruct the genomewide network of functional linkages by querying the entire protein complement of an organism.
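
The two linkage tests summarized above can be illustrated with a minimal Python sketch. It is not the chapter's protocol. It assumes that profiles are binary presence/absence vectors over the same ordered set of reference genomes, that profile similarity is scored with mutual information (one common choice among several), and that alignment hits against candidate fusion proteins have already been computed and reduced to coordinate spans. The function names, identifiers, and data layout are hypothetical.

```python
import math
from itertools import combinations


def mutual_information(profile_a, profile_b):
    """Mutual information (bits) between two presence/absence profiles.

    Each profile is a sequence of 0/1 values, one per reference genome,
    in the same genome order (an assumption of this sketch).
    """
    if len(profile_a) != len(profile_b) or len(profile_a) == 0:
        raise ValueError("profiles must be non-empty and of equal length")
    n = len(profile_a)
    joint, count_a, count_b = {}, {}, {}
    for a, b in zip(profile_a, profile_b):
        joint[(a, b)] = joint.get((a, b), 0) + 1
        count_a[a] = count_a.get(a, 0) + 1
        count_b[b] = count_b.get(b, 0) + 1
    mi = 0.0
    for (a, b), c in joint.items():
        # p(a,b) * log2( p(a,b) / (p(a) * p(b)) ), written with raw counts
        mi += (c / n) * math.log2(c * n / (count_a[a] * count_b[b]))
    return mi


def rosetta_stone_links(hits):
    """Pairs of distinct query proteins that align to non-overlapping
    regions of the same candidate fusion protein.

    hits: dict mapping fusion protein id -> list of
          (query id, start, end) spans on the fusion protein.
    Simplification: a real analysis would also require that the two
    query proteins are not homologous to each other.
    """
    links = set()
    for fusion, spans in hits.items():
        for (qa, sa, ea), (qb, sb, eb) in combinations(spans, 2):
            if qa != qb and (ea < sb or eb < sa):
                links.add((tuple(sorted((qa, qb))), fusion))
    return links


# Hypothetical toy data: four reference genomes, three query proteins.
identical = mutual_information([1, 1, 0, 0], [1, 1, 0, 0])
unrelated = mutual_information([1, 1, 0, 0], [1, 0, 1, 0])
links = rosetta_stone_links(
    {"fusC": [("protA", 1, 150), ("protB", 160, 320), ("protD", 100, 200)]}
)
```

Mutual information scores perfectly anti-correlated profiles as highly as identical ones, so a sketch like this would normally be paired with a check on the direction of the association and with a significance threshold chosen for the reference genome set.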

Copyright information

© 2007 Humana Press Inc.

About this protocol

Cite this protocol

Date, S.V. (2007). Estimating Protein Function Using Protein-Protein Relationships. In: Ochs, M.F. (eds) Gene Function Analysis. Methods in Molecular Biology™, vol 408. Humana Press. https://doi.org/10.1007/978-1-59745-547-3_7

  • DOI: https://doi.org/10.1007/978-1-59745-547-3_7

  • Publisher Name: Humana Press

  • Print ISBN: 978-1-58829-734-1

  • Online ISBN: 978-1-59745-547-3

  • eBook Packages: Springer Protocols
