Abstract
Over the last twenty years the number of known protein structures has risen exponentially. However, there are still at least ten times as many proteins whose sequences are known but whose structures have not yet been determined. For these proteins, information about sequence/structure relationships is used to predict a probable structure. This information can include residue preferences for specific secondary structure conformations or residue contact potentials. It is critically important, though, that any statistically based study aiming to derive such information should use a non-degenerate dataset, that is, a set which contains a single representative from each of the protein fold families.
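As an illustration of what a non-degenerate dataset means in practice, the following minimal sketch keeps one entry per fold family. The entry identifiers, the family labels, and the choice of crystallographic resolution as the selection criterion are all assumptions made for this example; the abstract does not state how a representative is chosen.

```python
from collections import defaultdict


def select_representatives(entries):
    """Return one entry per fold family.

    Assumption for illustration only: the representative is the entry with
    the lowest resolution value (ties broken by identifier).
    """
    by_family = defaultdict(list)
    for entry in entries:
        by_family[entry["family"]].append(entry)
    return {
        family: min(members, key=lambda e: (e["resolution"], e["id"]))
        for family, members in by_family.items()
    }


# Hypothetical entries; these are not real databank identifiers.
entries = [
    {"id": "prot_a", "family": "fold_1", "resolution": 2.0},
    {"id": "prot_b", "family": "fold_1", "resolution": 1.5},
    {"id": "prot_c", "family": "fold_2", "resolution": 2.8},
]

representatives = select_representatives(entries)
for family, entry in sorted(representatives.items()):
    print(family, entry["id"])
```

Statistics such as residue preferences would then be computed over `representatives` only, so that a heavily populated family does not dominate the counts.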
Copyright information
© 1994 Springer Science+Business Media New York
About this chapter
Cite this chapter
Orengo, C.A., Flores, T.P., Taylor, W.R., Thornton, J.M. (1994). Protein Fold Families and Structural Motifs. In: Doniach, S. (eds) Statistical Mechanics, Protein Structure, and Protein Substrate Interactions. NATO ASI Series, vol 325. Springer, Boston, MA. https://doi.org/10.1007/978-1-4899-1349-4_23
DOI: https://doi.org/10.1007/978-1-4899-1349-4_23
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4899-1351-7
Online ISBN: 978-1-4899-1349-4
eBook Packages: Springer Book Archive