Abstract
Associations between short amino acid sequence patterns and protein secondary structure classes can be found by searching a data base of known protein structures. Analysis of these associations suggests that secondary structure of proteins can be determined locally by sequence motifs of high predictive value, but at present our ability to find these motifs is limited by the size of the available data bases.
Similar content being viewed by others
References
Chou, P. Y. & Fasman, G. D. Biochemistry 13, 222–245 (1974).
Garnier, J., Osguthorpe, D. J. & Robson, B. J. molec. Biol. 120, 97–120 (1978).
Gibrat, J.-F., Garnier, J. & Robson, B. J. molec. Biol. 198, 425–443 (1987).
Finkelstein, A. V. & Ptitsyn, O. B. J. molec. Biol. 62, 613–624 (1971).
Lim, V. I. J. molec. Biol. 88, 873–894 (1974).
Kabsch, W. & Sander, C. FEBS Lett. 155, 179–182 (1983).
Sibanda, B. L. & Thornton, J. M. Nature 316, 170–174 (1985).
Edwards, M. S., Sternberg, M. J. E. & Thornton, J. M. Protein Engineering 1, 173–181 (1987).
Wierenga, R. K., De Maeyer, M. C. H. & Hol, W. G. J. Biochemistry 24, 1346–1357 (1985).
Taylor, W. R. & Thornton, J. M. J. molec. Biol. 173, 487–514 (1984).
Cohen, F. E., Abarbanel, R. M., Kuntz, I. D. & Fletterick, R. J. Biochemistry 25, 266–275 (1986).
Stone, M. J. R. Statist. Soc. Series B. 36, 111–147 (1974).
Rescher, N. & Urquart A. Temporal Logic (Springer, Berlin, 1971).
Bernstein, F. C. et al. J. molec. Biol. 112, 535–542 (1977).
Kabsch, W. & Sander, C. Biopolymers 22, 2577–2637 (1983).
Crawford, J. L., Lipscomb, M. N. & Schellman, C. G. Proc. natn. Acad. Sci. U.S.A. 70, 538–542 (1973).
Cohen, F. E., Abarbanel, R. M., Kuntz, I. D. & Fletterick, R. J. Biochemistry 22, 4894–4904 (1983).
Kabsch, W. & Sander, C. Proc. natn. Acad. Sci. U.S.A. 81, 1075–1078 (1984).
Blundell, T. L., Sibanda, B. L., Sternberg, M. J. E. & Thornton, J. M. Nature 326, 347–352 (1987).
Jones, T. A. & Thirup, T. EMBO J. 5, 819–822 (1986).
Kraulis, P. J., & Jones, T. A. Proteins 2, 188–201 (1987).
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Rooman, M., Wodak, S. Identification of predictive sequence motifs limited by protein structure data base size. Nature 335, 45–49 (1988). https://doi.org/10.1038/335045a0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/335045a0
- Springer Nature Limited
This article is cited by
-
Machine discovery of protein motifs
Machine Learning (1995)
-
Modelling of peptide and protein structures
Amino Acids (1994)
-
The antidote and autoregulatory functions of the F plasmid CcdA protein: a genetic and biochemical survey
Molecular and General Genetics MGG (1994)
-
Analysis of peptides from known proteins: Clusterization in sequence space
Journal of Molecular Evolution (1994)