In Search of Protein Folds

  • Manfred J. Sippl
  • Sabine Weitckus
  • Hannes Flöckner


The protein folding problem is exciting as well as frustrating. A large body of knowledge on protein folds has accumulated over past decades, but it is still impossible to calculate native structures from amino acid sequences. Our duty is to present the reasoning behind an approach that attempts to derive an energy model of protein solvent systems from the information contained in experimentally determined protein structures. The result of this approach is called knowledge-based mean field. The goal is to employ this model in the search for the native fold of amino acid sequences of unknown structure. We present an overview of the principles involved in this approach, assess the current state of the art, and present several applications in protein structure theory.


Force Field Fold Recognition Constant Domain Swissprot Database Native Fold 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. Anfinsen CB (1973): Principles that govern the folding of protein chains. Science 181:223–230PubMedCrossRefGoogle Scholar
  2. Bairoch A, Boeckmann B (1991): The SWISS-PROT protein sequence data bank. Nucl Ac Res 19:2247–2248Google Scholar
  3. Bernstein FC, Koetzle TF, Williams GJB, Meyer EF Jr, Brice MD, Rodgers JR, Kennard O, Shimanouchi T, Tasumi M (1977): The protein data bank: A computer based archival file of macromolecular structures. J Mol Biol 112:535–542PubMedCrossRefGoogle Scholar
  4. Billeter M, Qian Y, Otting G, Mueller M, Gehring WJ, Wuethrich KJ (1990): Determination of the three-dimensional structure of the antennapedia homeodomain from Drosophila in solution by 1H nuclear magnetic resonance spectroscopy. J Mol Biol 214:183–197PubMedCrossRefGoogle Scholar
  5. Blundell T, Carney D, Gardner SP, Hayes F, Howlin B, Hubbard T, Overington J, Singh DA, Sibanda BL, Sutcliffe M (1988): Knowledge based protein modelling and design. Eur J Biochem 172:513–520PubMedCrossRefGoogle Scholar
  6. Bode W, Huber R (1991): Ligand-binding: Proteinase and Proteinase inhibitor interactions. Cur Opin Struct Biol 1:45–52CrossRefGoogle Scholar
  7. Bode W, Huber R (1992): Natural protein proteinase inhibitors and their interaction with proteinases. Eur J Biochem 204:433–451PubMedCrossRefGoogle Scholar
  8. Bowie JU, Clarke ND, Pabo CO, Sauer RT (1990): Identification of protein folds: Matching hydrophobicity patterns of sequence sets with solvent accessibility patterns of known structures. Proteins 7:257–264PubMedCrossRefGoogle Scholar
  9. Bowie JU, Luethy R, Eisenberg D (1991): A method to identify protein sequences that fold into a known three-dimensional structure. Science 253:164–170PubMedCrossRefGoogle Scholar
  10. Braenden CI, Jones A (1990): Between objectivity and subjectivity. Nature 343: 687–689CrossRefGoogle Scholar
  11. Brayer GD, McPherson A (1983): Refined structure of the gene 5 DNA binding protein from bacteriophage fd. J Mol Biol 108:565–596CrossRefGoogle Scholar
  12. Brooks CL, Karplus M, Pettitt BM (1988): Proteins: A Theoretical Perspective of Dynamics, Structure, and Thermodynamics. New York: John Wiley and SonsGoogle Scholar
  13. Burmeister WP, Ruigrok RWH, Cusack S (1992): The 2.2 Å resolution crystal structure of influenza B neuraminidase and its complex with sialic acid. EMBO J 11:49–56PubMedGoogle Scholar
  14. Casari G, Sippl MJ (1992): Structure-derived hydrophobic potential. Hydrophobic potential derived from X-ray structures of globular proteins is able to identify native folds. J Mol Biol 224: 725–732PubMedCrossRefGoogle Scholar
  15. Chothia C (1992): One thousand families for the molecular biologist. Nature 357: 543–544PubMedCrossRefGoogle Scholar
  16. Cohen FE, Richmond TJ, Richards FM (1979): Protein folding: Evaluation of some simple rules for the assembly of helices into tertiary structures with myoglobin as an example. J Mol Biol 132:275–288PubMedCrossRefGoogle Scholar
  17. Cohen FE, Sternberg MJ, Taylor WR (1982): Analysis and prediction of the packing of α-helices against a β-sheet in the tertiary structure of globular proteins. J Mol Biol 156:821–862PubMedCrossRefGoogle Scholar
  18. Crawford IR, Niemann T, Kirschner K (1987): Prediction of secondary structure by evolutionary comparison: Application to the alpha subunit of tryptophan synthase. Proteins 2:118–129PubMedCrossRefGoogle Scholar
  19. DeGrado WF, Raleigh DP, Handel T (1991): De novo protein design: What are we learning? Cur Opin Struct Bio 1:984–993CrossRefGoogle Scholar
  20. Dill KA (1993): Folding proteins: Finding a needle in a haystack. Cur Opin Struct Biol 3:99–103CrossRefGoogle Scholar
  21. Dyson HJ, Wright PE (1993): Peptide conformation and protein folding. Cur Opin Struct Biol 3:60–65CrossRefGoogle Scholar
  22. Fasman GD, ed. (1989): Prediction of Protein Structure and the Principles of Protein Conformation. New York and London: Plenum PressGoogle Scholar
  23. Fersht A (1985): Enzyme Structure and Mechanism. New York: WH FreemanGoogle Scholar
  24. Fersht AR, Serrano L (1993): Principles of protein stability derived from protein engineering experiments. Cur Opin Struct Biol 3:75–83CrossRefGoogle Scholar
  25. Folkers PJ, van Duynhoven PM, Jonker AJ, Harmsen BJ, Konings RN, Hilbers CW (1991): Sequence-specific 1H-NMR assignment and secondary structure of the Tyr41-His mutant of the single-stranded DNA binding protein, gene V protein, encoded by the filamentous bacteriophage M13. Eur J Biochem 202:349–360PubMedCrossRefGoogle Scholar
  26. Gregoret LM, Cohen FE (1990): Novel method for the rapid evaluation of packing in protein structures. J Mol Biol 211:959–974PubMedCrossRefGoogle Scholar
  27. Hendlich M, Lackner P, Weitckus S, Floeckner H, Froschauer R, Gottsbacher K, Casari G, Sippl MJ (1990): Identification of native protein folds amongst a large number of incorrect models. J Mol Biol 216:167–180PubMedCrossRefGoogle Scholar
  28. Holm L, Sander C (1992): Evaluation of protein models by atomic solvation preference. J Mol Biol 225:93–105PubMedCrossRefGoogle Scholar
  29. Janin J (1990): Errors in three dimensions. Biochimie 72:705–709PubMedCrossRefGoogle Scholar
  30. Jernigan RL (1992): Protein folds. Cur Opin Struct Biol 2:248–256CrossRefGoogle Scholar
  31. Jones DT, Taylor WR, Thornton JM (1992): A new approach to protein fold recognition. Nature 358:86–89PubMedCrossRefGoogle Scholar
  32. Kabsch W, Sander C (1984): On the use of sequence homologies to predict protein structure: Identical pentapeptides can have completely different conformations. Proc Natl Acad Sci 81:1075–1078PubMedCrossRefGoogle Scholar
  33. Karplus M, Petsko GA (1990): Molecular dynamics simulations in biology. Nature 347:631–639PubMedCrossRefGoogle Scholar
  34. Kelly JA, Knox JR, Moews PC, Hite GJ, Bartolone JB, Zhao H (1985): 2.8 Å structure of penicillin-sensitive D-alanyl carboxypeptidase-transpeptidase from Streptomyces R61 and complexes with β-lactams. J Biol Chem 260:6449–6458PubMedGoogle Scholar
  35. Kendrew JC, Dickerson RE, Strandberg BE, Hart RJ, Davies DR, Phillips DC, Shore VC (1960): Structure of myoglobin: A three-dimensional Fourier synthesis at 2 Å resolution. Nature 185:422–427PubMedCrossRefGoogle Scholar
  36. Kuma K, Iwabe N, Miyata T (1991): The immunoglobulin family. Cur Opin Struct Biol 1:384–393CrossRefGoogle Scholar
  37. Luethy R, Bowie JU, Eisenberg D (1992): Assessment of protein models with 3D profiles. Nature 356:83–85CrossRefGoogle Scholar
  38. Marquait M, Deisenhofer J, Huber R, Palm W (1980): Crystallographic refinement and atomic models of the intact immunoglobulin molecule KOL and its antigenbinding fragment at 3.0 Å and 1.9 Å resolution. J Mol Biol 141:369CrossRefGoogle Scholar
  39. Murzin AG, Chothia C (1992): Protein architecture: New superfamilies. Cur Opin Struct Biol 2:895–903CrossRefGoogle Scholar
  40. Needleman SB, Wunsch CD (1970): A general method applicable to the search for similarities in the amino acid sequences of two proteins. J Mol Biol 48:443–453PubMedCrossRefGoogle Scholar
  41. Nishikawa K, Taniguchi K, Torres A, Hoshino Y, Green K, Kapikian AZ, Chanock RM, Gorziglia M (1988): Comparative analysis of the VP3 gene of divergent strains of the rotaviruses simian SA 11 and bovine Nebraska calf diarrhea virus. J Virol 62:4022–4026PubMedGoogle Scholar
  42. Novotny J, Brucceroli RE, Karplus M (1984): An analysis of incorrectly folded protein models. Implications for structure predictions. J Mol Biol 177:787–818PubMedCrossRefGoogle Scholar
  43. Novotny J, Rashin AA, Bruccoleri RE (1988): Criteria that discriminate between native proteins and incorrectly folded models. Proteins 4:19–30PubMedCrossRefGoogle Scholar
  44. Ogata CM, Gordon PF, de Vos AM, Kim SH (1992): Crystal structure of a sweet tasting protein Thaumatin I, at 1.65 Å resolution. J Mol Biol 228:893–908PubMedCrossRefGoogle Scholar
  45. Overington JP (1992): Comparison of the three-dimensional structures of homologous proteins. Cur Opin Struct Biol 2:394–401CrossRefGoogle Scholar
  46. Perutz MF, Rossmann MG, Cuuis AF, Muirhead G, Will G, North AT (1960): Structure of haemoglobin: A three-dimensional fourier synthesis at 5.5 Å resolution, obtained by X-ray analysis. Nature 185:416–422PubMedCrossRefGoogle Scholar
  47. Perutz MF (1992): Protein Structure. New Approaches to Disease and Therapy. New York: W.H. FreemanGoogle Scholar
  48. Ouzounis C, Sander C, Scharf M, Schneider R (1993): Prediction of protein structure by evaluation of sequence-structure fitness. Aligning sequences to contact profiles derived from 3D structures. J Mol Biol 232:805–825PubMedCrossRefGoogle Scholar
  49. Richardson JS, Richardson DC (1988): Amino acid preferences for specific locations at the ends of alpha-helices. Science 240:1648–1652PubMedCrossRefGoogle Scholar
  50. Rooman MJ, Kocher J-P, Wodak SJ (1991): Prediction of protein backbone conformation based on seven structure assignments: Influence of local interactions. J Mol Biol 221:961–979PubMedCrossRefGoogle Scholar
  51. Rost B, Sander C (1992): Jury returns on structure prediction. Nature 360:540PubMedCrossRefGoogle Scholar
  52. Rost B, Sander C (1993a): Prediction of protein secondary structure at better than 70% accuracy. J Mol Biol 232:584–599PubMedCrossRefGoogle Scholar
  53. Rost B, Sander C (1993b): Improved prediction of protein secondary structure by use of sequence profiles and neural networks. Proc Nat Acad Sci 90:7558–7562PubMedCrossRefGoogle Scholar
  54. Sander C, Schneider R (1991): Database of homology-derived protein structures and the structural meaning of sequence alignment. Proteins 9:56–68PubMedCrossRefGoogle Scholar
  55. Sippl MJ (1990): Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins. J Mol Biol 213:859–883PubMedCrossRefGoogle Scholar
  56. Sippl MJ (1993): Boltzmann’s principle, knowledge based mean fields and protein folding. J Comp Aid Mol Des 7:473–501CrossRefGoogle Scholar
  57. Sippl MJ, Hendlich M, Lackner P (1992): Assembly of Polypeptide and protein backbone conformations from low energy ensembles of short fragments: Development of strategies and construction of models for myoglobin, lysozyme, and thymosin β4. Protein Science 1:625–640PubMedCrossRefGoogle Scholar
  58. Sippl MJ, Jaritz M (1994): Predictive power of mean force pair potentials. In Protein Structure by Distance Analysis. Bohr H, Brunak S, eds. Amsterdam: IOS PressGoogle Scholar
  59. Sippl MJ, Weitckus S (1992): Detection of native-like models for amino acid sequences of unknown three-dimensional structure in a data base of known protein conformations. Proteins 13:258–271PubMedCrossRefGoogle Scholar
  60. Sternberg MJE (1992): Secondary structure prediction. Cur Opin Struct Biol 2: 237–241CrossRefGoogle Scholar
  61. Thornton JM (1992): Lessons from analyzing protein structures. Cur Opin Struct Biol 2:888–894CrossRefGoogle Scholar
  62. van Gunsteren WF (1988): The role of computer simulation techniques in protein engineering. Prot Eng 2:5–13CrossRefGoogle Scholar
  63. Weis W, Brown JH, Cusack S, Paulson JC, Skehel JJ, Wiley DC (1988): Structure of the influenza virus haemagglutinin complexed with its receptor, sialic acid. Nature 333:426–431PubMedCrossRefGoogle Scholar
  64. Wilmanns M, Eisenberg D (1993): 3D profiles from residue-pair preferences: Identification of sequences with β/α-barrel fold. Proc Natl Acad Sci 90:1379–1383PubMedCrossRefGoogle Scholar

Copyright information

© Birkhäuser Boston 1994

Authors and Affiliations

  • Manfred J. Sippl
  • Sabine Weitckus
  • Hannes Flöckner

There are no affiliations available

Personalised recommendations