Model of protein fragments and statistical potentials

Research Articles


We discuss a model of protein conformations where the conformations are combinations of short fragments from some small set. For these fragments we consider a distribution of frequencies of occurrence of pairs (sequence of amino acids, conformation), averaged over some balls in the spaces of sequences and conformations. These frequencies can be estimated due to smallness of the e-entropy of the set of conformations of protein fragments.We consider statistical potentials for protein fragments which describe the mentioned frequencies of occurrence and discuss model of free energy of a protein where the free energy is equal to a sum of statistical potentials of the fragments. We also consider contribution of contacts of fragments to the energy of protein conformation, and contribution from statistical potentials of some hierarchical set of larger protein fragments. This set of fragments is constructed using the distribution of frequencies of occurrence of short fragments. We discuss applications of this model to problem of prediction of the native conformation of a protein from its primary structure and to description of dynamics of a protein. Modification of structural alignment taking into account statistical potentials for protein fragments is considered and application to threading procedure for proteins is discussed.

Key words

protein fragments statistical potentials hierarchical models of proteins 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    M. Mezard, G. Parisi and M. Virasoro, Spin-Glass Theory and Beyond (World Scientific, Singapore, 1987).MATHGoogle Scholar
  2. 2.
    A. V. Finkelstein and O. B. Ptitsyn, Protein Physics. A Course of Lectures (Academic Press, Elsevier Science, 2002).Google Scholar
  3. 3.
    D. J. Rigden (Ed.), From Protein Structure to Function with Bioinformatics (Springer, 2009).CrossRefGoogle Scholar
  4. 4.
    A. Yu. Grosberg and A. R. Khokhlov, Giant Molecules: Here, There, and Everywhere. 2nd ed. (World Sci. Publ. Company, 2010).CrossRefGoogle Scholar
  5. 5.
    R. Unger, D. Harel, S. Wherland and J. L. Sussman, “A 3D building blocks approach to analyzing and predicting structure of proteins,” Proteins: Struc. Func.Genet. 5, 355–373 (1989).CrossRefGoogle Scholar
  6. 6.
    C. Micheletti, F. Seno and A. Maritan, “Recurrent oligomers in proteins: an optimal scheme reconciling accurate and concise backbone representations in automated folding and design studies,” Proteins: Struc. Func. Genet. 40, 662–674 (2000).CrossRefGoogle Scholar
  7. 7.
    A. G. de Brevern, C. Etchebest and S. Hazout, “Bayesian probabilistic approach for predicting backbone structures in terms of protein blocks,” Proteins: Struc. Func. Genet. 41, 271–287 (2000).CrossRefGoogle Scholar
  8. 8.
    R. Karchin, M. Cline, Y. Mandel-Gutfreund and K. Karplus, “HiddenMarkovmodels that use predicted local structure for fold recognition: alphabets of backbone geometry,” Proteins: Struc. Func. Genet. 51, 504–514 (2003).CrossRefGoogle Scholar
  9. 9.
    R. Kolodny, P. Koehl, L. Guibas and M. Levitt, “Small libraries of protein fragments model native protein structures accurately,” J.Mol. Biol. 323, 297–307 (2002).CrossRefGoogle Scholar
  10. 10.
    A. T. Jones and S. Thirup, “Using known substructures in protein model building and crystallography,” EMBO J. 5, 819–822 (1986).Google Scholar
  11. 11.
    M. J. Rooman, J. I. Kocher and S. J. Wodak, “Extracting information on folding from the amino acid sequence: accurate predictions for protein regions with preferred conformation in the absence of tertiary interactions,” Biochemistry 31, 10226–10238 (1992).CrossRefGoogle Scholar
  12. 12.
    K. T. Simons, Ch. Kooperberg, E. Huang and D. Baker, “Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions,” J. Mol. Biol. 268, 209–225 (1997).CrossRefGoogle Scholar
  13. 13.
    A. N. Nekrasov, “Entropy of protein sequences: an integral approach,” J. Biom. Struc. Dyn. 20 (1), 87–92 (2002).CrossRefGoogle Scholar
  14. 14.
    A. N. Nekrasov, “Analysis of the information structure of protein sequences: a new method for analyzing the domain organization of proteins,” J. Biom. Struc. Dyn. 21 (5), 615–624, (2004).CrossRefGoogle Scholar
  15. 15.
    A. N. Nekrasov, A. A. Anashkina and A. I. Zinchenko, “A new paradigm of protein structural organization,” in Proceedings of the 2-nd International Conference Theoretical Approaches to Bioinformatic Systems (TABIS.2013), 1–23 (Belgrade, Serbia, Sept. 17–22, 2013).Google Scholar
  16. 16.
    S. V. Kozyrev and I. V. Volovich, “Quinary lattice model of secondary structures of polymers,” Physica A 393, 86–95 (2014); [arXiv:1206.4424].MathSciNetCrossRefGoogle Scholar
  17. 17.
    A. Yu. Grosberg, S. K. Nechaev and E. I. Shakhnovich, “The role of topological constraints in the kinetics of collapse of macromolecules,” J Phys. France 49, 2095–2100 (1988).CrossRefGoogle Scholar
  18. 18.
    P. A. Pevzner, Computational Molecular Biology: An Algorithmic Approach (The MIT Press, Cambridge, Massachusets, 2001).MATHGoogle Scholar
  19. 19.
    A. N. Kolmogorov and V. M. Tikhomirov, “e-entropy and e-capacity of sets in function spaces,” Uspekhi Math. (Russian Math. Surveys) 14:2 (86), 3–86 (1959).MathSciNetMATHGoogle Scholar
  20. 20.
    S. Miyazawa and R. Jernigan, “Estimation of effective interresidue contact energies from protein crystal structures: quasi-chemical approximation,” Macromolecules 18, 534–552 (1985).CrossRefGoogle Scholar
  21. 21.
    S. V. Kozyrev, “Dynamics on rugged landscapes of energy and ultrametric diffusion,” p-Adic Numbers Ultrametric Anal. Appl. 2 (2), 122–132 (2010).MathSciNetCrossRefMATHGoogle Scholar
  22. 22.
    G. E. Hinton and R. R. Salakhutdinov, “Reducing the dimensionality of data with neural networks,” Science 313, 504–507 (2006).MathSciNetCrossRefMATHGoogle Scholar
  23. 23.
    Y. Bengio, “Learning Deep Architectures for AI,” Foundations and Trends inMachine Learning, 2 (1), (2009).Google Scholar
  24. 24.
    V. A. Avetisov, V. A. Ivanov, D. A.Meshkov, and S. K. Nechaev, “Fractal globule as a molecular machine,” JETP Lett. 98 (4), 242–246 (2013).CrossRefGoogle Scholar
  25. 25.
    V.A. Avetisov, V. A. Ivanov, D. A. Meshkov and S. K. Nechaev, “Fractal globules: a new approach to artificial molecular machines,” Biophys. J. 107, 2361–2368 (2014).CrossRefGoogle Scholar
  26. 26.
    J. Smrek and A. Y. Grosberg, “A novel family of space-filling curves in their relation to chromosome conformation in eukaryotes,” Physica A 392, 6375–6388 (2013).MathSciNetCrossRefGoogle Scholar
  27. 27.
    Y. Togashi and A. S. Mikhailov, “Nonlinear relaxation dynamics in elastic networks and design principles of molecular machines,” Proc. Natl. Acad. Sci. USA 104, 8697–8702 (2007).CrossRefGoogle Scholar

Copyright information

© Pleiades Publishing, Ltd. 2016

Authors and Affiliations

  1. 1.Steklov Mathematical InstituteRussian Academy of SciencesMoscowRussia

Personalised recommendations