Skip to main content

FAUST: An Algorithm for Extracting Functionally Relevant Templates from Protein Structures

  • Conference paper
  • First Online:
Algorithms in Bioinformatics (WABI 2002)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2452))

Included in the following conference series:

  • 1078 Accesses

Abstract

FAUST(Functional Annotations Using Structural Templates) is an algorithm for: extraction of functionally relevant templates from protein structures and using such templates to annotate novel structures. Proteins and structural templates are represented as colored, undirected graphs with atoms as nodes and interatomic distances as edge weights. Node colors are based on chemical identities of atoms. Edge labels are equivalent if interatomic distances for corresponding nodes (atoms) differ less than a threshold value. We define FAUST structural template as a common subgraph of a set of graphs corresponding to two or more functionally related proteins. Pairs of functionally related protein structures are searched for sets of chemically equivalent atoms whose interatomic distances are conserved in both structures. Structural templates resulting from such pair wise searches are then combined to maximize classification performance on a training set of irredundant protein structures. The resulting structural template provides new language for description of structure—function relationship in proteins. These templates are used for active and binding site identification in protein structures. We are demonstrating here structural template extraction results for the highly divergent family of serine proteases. We compare FAUST templates to the standard description of the serine proteases active site pattern conservation and demonstrate depth of information captured in such description. Also, we present preliminary results of the high-throughput protein structure database annotations with a comprehensive library of FAUST templates.

to whom correspondence should be addressed

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. N.N. Alexandrov and N. Go, “Biological meaning, statistical significance, and classification of local spatial similarities in nonhomologous proteins,” Protein Science 3:866, 1994.

    Google Scholar 

  2. P.J. Artymiuk, A.R. Poirette, H.M. Grindley, D.W. Rice, and P. Willet, “A graphtheoretic approach to identification of the three-dimensional patterns of amino-acid side chains in protein structures,” J. Mol. Biol. 243:327, 1994.

    Google Scholar 

  3. A. Bairoch, “The ENZYME database in 2000.” Nucleic Acids Res. 28:304, 2000.

    Google Scholar 

  4. H.M. Berman, J. Westbrook, Z. Feng, G. Gilliland, T.N. Bhat, H. Weissig, I.N. Shindyalov, and P.E. Bourne, “The protein data bank.” Nucleic Acids Research, 28:235, 2000.

    Google Scholar 

  5. C. Chotia, “One thousand families for the molecular biologist,” Nature 357:543, 1992.

    Google Scholar 

  6. J.S. Fetrow, A. Godzik, and J. Skolnick, “Functional analysis of the E. Coli genome using the sequence-to-structure-to-function paradigm: Identification of proteins exhibiting the glutaredoxin/thioredoxin disulfide oxidoreductase activity,” J. Mol. Biol. 1998.

    Google Scholar 

  7. J.S. Fetrow and J. Skolnick, “Method for prediction of protein function from sequence using sequence-to-structure-to-function paradigm with application to glutaredoxins/thioredoxins and T 1 ribonucleases,” J. Mol. Biol. 1998.

    Google Scholar 

  8. D. Fischer, H. Wolfson, S.L. Lin, and R. Nussinov, “Three-dimensional, sequenceorder independent structural comparison of a serine protease against the crystallographic database reveals active site similarities,” Protein Sci. 3:769, 1994.

    Google Scholar 

  9. M. Fujinaga and M.N.G. James, “Rat submaxillary gland serine protease, tonin, structure solution and refinement at 1.18Å resolution,” J. Mol. Biol. 195:373, 1987.

    Google Scholar 

  10. D.H. Kitson, A. Badredtinov, Z.-Y. Zhu, M. Velikanov, D.J. Edwards, K. Olszewski, S. Szalma, and L. Yan, “Functional annotation of proteomic sequences based on consensus of sequence and structural analysis,” Briefings in Bioinformatics, 3:32, 2002.

    Google Scholar 

  11. R.B. Russel, “Detection of protein three-dimensional side chain pattern.” J. Mol. Biol. 279:1211, 1998.

    Google Scholar 

  12. A.E. Todd, C.A. Orengo, and J.M. Thornton, “Evolution of function in protein superfamilies from a structural perspective, ” J. Mol. Biol. 307:1113, 2001.

    Google Scholar 

  13. A.C. Wallace, N. Borkakoti, and J.M. Thornton, “TESS, a geometric hashing algorithm for deriving 3D coordinate templates for searching structural databases: Application to enzyme active sites. ” Protein Sci. 6:2308, 1997.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Milik, M., Szalma, S., Olszewski, K.A. (2002). FAUST: An Algorithm for Extracting Functionally Relevant Templates from Protein Structures. In: GuigĂł, R., Gusfield, D. (eds) Algorithms in Bioinformatics. WABI 2002. Lecture Notes in Computer Science, vol 2452. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45784-4_13

Download citation

  • DOI: https://doi.org/10.1007/3-540-45784-4_13

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44211-0

  • Online ISBN: 978-3-540-45784-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics