Fold Recognition

Kelley, Lawrence A.

doi:10.1007/978-94-024-1069-3_3

Lawrence A. Kelley²

Abstract

Fold recognition is concerned with the prediction of protein three-dimensional structure from amino acid sequence by the detection of extremely remote homologous or analogous relationships to known structures. As such it lies midway between ab initio protein folding and close homology modelling. This chapter surveys both the history of the field and the current state-of-the art, focussing on approaches recently shown to be successful in international blind trials.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 259.00; Price excludes VAT (USA)

Softcover Book: USD 329.99; Price excludes VAT (USA)

Hardcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Protein Fold Recognition Exploited by Computational and Functional Approaches: Recent Insights

Machine Learning Methods for the Protein Fold Recognition Problem

Impact of structure space continuity on protein fold classification

Article Open access 23 March 2016

References

Altschul SF, Madden TL, Schaffer AA et al (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25(17):3389–3402
Article CAS PubMed PubMed Central Google Scholar
Benkert P, Tosatto SC, Schwede T (2009) Global and local model quality estimation at CASP8 using the scoring functions QMEAN and QMEANclust. Proteins 77(Suppl 9):173–180. doi:10.1002/prot.22532
Article CAS PubMed Google Scholar
Bennett-Lovsey RM, Herbert AD, Sternberg MJ et al (2008) Exploring the extremes of sequence/structure space with ensemble fold recognition in the program Phyre. Proteins 70(3):611–625. doi:10.1002/prot.21688
Article CAS PubMed Google Scholar
Berman HM, Westbrook J, Feng Z et al (2000) The protein data bank. Nucleic Acids Res 28(1):235–242
Article CAS PubMed PubMed Central Google Scholar
Bowie JU, Luthy R, Eisenberg D (1991) A method to identify protein sequences that fold into a known three-dimensional structure. Science 253(5016):164–170
Article CAS PubMed Google Scholar
Cao R, Wang Z, Cheng J (2014) Designing and evaluating the MULTICOM protein local and global model quality prediction methods in the CASP10 experiment. BMC Struct Biol 14:13. doi:10.1186/1472-6807-14-13
Article PubMed PubMed Central Google Scholar
Chivian D, Kim DE, Malmstrom L et al (2005) Prediction of CASP6 structures using automated Robetta protocols. Proteins 61(Suppl 7):157–166. doi:10.1002/prot.20733
Article CAS PubMed Google Scholar
Chubb D, Jefferys BR, Sternberg MJ et al (2010) Sequencing delivers diminishing returns for homology detection: implications for mapping the protein universe. Bioinformatics 26(21):2664–2671. doi:10.1093/bioinformatics/btq527
Article CAS PubMed Google Scholar
Di Lena P, Fariselli P, Margara L et al (2010) Fast overlapping of protein contact maps by alignment of eigenvectors. Bioinformatics 26(18):2250–2258. doi:10.1093/bioinformatics/btq402
Article PubMed Google Scholar
Ginalski K, Elofsson A, Fischer D et al (2003) 3D-Jury: a simple approach to improve protein structure predictions. Bioinformatics 19(8):1015–1018
Article CAS PubMed Google Scholar
Jefferys BR, Kelley LA, Sternberg MJ (2010) Protein folding requires crowd control in a simulated cell. J Mol Biol 397(5):1329–1338. doi:10.1016/j.jmb.2010.01.074
Article CAS PubMed PubMed Central Google Scholar
Jones DT (1999) Protein secondary structure prediction based on position-specific scoring matrices. J Mol Biol 292(2):195–202. doi:10.1006/jmbi.1999.3091
Article CAS PubMed Google Scholar
Kabsch W, Sander C (1983) Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22(12):2577–2637. doi:10.1002/bip.360221211
Article CAS PubMed Google Scholar
Kallberg M, Margaryan G, Wang S et al (2014) RaptorX server: a resource for template-based protein structure modeling. Methods Mol Biol 1137:17–27. doi:10.1007/978-1-4939-0366-5_2
Article CAS PubMed Google Scholar
Kamisetty H, Ovchinnikov S, Baker D (2013) Assessing the utility of coevolution-based residue-residue contact predictions in a sequence- and structure-rich era. Proc Natl Acad Sci U S A 110(39):15674–15679. doi:10.1073/pnas.1314045110
Article CAS PubMed PubMed Central Google Scholar
Kelley LA, Mezulis S, Yates CM et al (2015) The Phyre2 web portal for protein modeling, prediction and analysis. Nat Protoc 10(6):845–858. doi:10.1038/nprot.2015.053
Article CAS PubMed PubMed Central Google Scholar
Kim H, Park H (2004) Prediction of protein relative solvent accessibility with support vector machines and long-range interaction 3D local descriptor. Proteins 54(3):557–562. doi:10.1002/prot.10602
Article CAS PubMed Google Scholar
Kryshtafovych A, Barbato A, Fidelis K et al (2014) Assessment of the assessment: evaluation of the model quality estimates in CASP10. Proteins 82(Suppl 2):112–126. doi:10.1002/prot.24347
Article CAS PubMed Google Scholar
Kumar M, Bhasin M, Natt NK et al (2005) BhairPred: prediction of beta-hairpins in a protein from multiple alignment information using ANN and SVM techniques. Nucleic Acids Res 33 (Web Server issue):W154–W159
Google Scholar
Larsson P, Skwark MJ, Wallner B et al (2011) Improved predictions by Pcons.net using multiple templates. Bioinformatics 27 (3):426–427. doi:10.1093/bioinformatics/btq664
Lewis TE, Sillitoe I, Andreeva A et al (2013) Genome3D: a UK collaborative project to annotate genomic sequences with predicted 3D structures based on SCOP and CATH domains. Nucleic Acids Res 41 (Database issue):D499–D507. doi:10.1093/nar/gks1266
Magner A, Szpankowski W, Kihara D (2015) On the origin of protein superfamilies and superfolds. Sci Rep 5:8166. doi:10.1038/srep08166
Article CAS PubMed PubMed Central Google Scholar
Marsden RL, Lee D, Maibaum M et al (2006) Comprehensive genome analysis of 203 genomes provides structural genomics with new insights into protein family space. Nucleic Acids Res 34(3):1066–1080. doi:10.1093/nar/gkj494
Article CAS PubMed PubMed Central Google Scholar
McGuffin LJ (2009) Prediction of global and local model quality in CASP8 using the ModFOLD server. Proteins 77(Suppl 9):185–190. doi:10.1002/prot.22491
Article CAS PubMed Google Scholar
Meier A, Soding J (2015a) Automatic prediction of protein 3D structures by probabilistic multi-template homology modeling. PLoS Comput Biol 11(10):e1004343. doi:10.1371/journal.pcbi.1004343
Article PubMed PubMed Central Google Scholar
Meier A, Soding J (2015b) Context similarity scoring improves protein sequence alignments in the midnight zone. Bioinformatics 31(5):674–681. doi:10.1093/bioinformatics/btu697
Article CAS PubMed Google Scholar
Miyazawa S, Jernigan RL (1996) Residue-residue potentials with a favorable contact pair term and an unfavorable high packing density term, for simulation and threading. J Mol Biol 256(3):623–644. doi:10.1006/jmbi.1996.0114
Article CAS PubMed Google Scholar
Moult J, Fidelis K, Kryshtafovych A et al (2014) Critical assessment of methods of protein structure prediction (CASP)–round x. Proteins 82(Suppl 2):1–6. doi:10.1002/prot.24452
Article CAS PubMed Google Scholar
Park J, Teichmann SA, Hubbard T et al (1997) Intermediate sequences increase the detection of homology between sequences. J Mol Biol 273(1):349–354. doi:10.1006/jmbi.1997.1288
Article CAS PubMed Google Scholar
Peng J, Xu J (2010) Low-homology protein threading. Bioinformatics 26(12):i294–i300. doi:10.1093/bioinformatics/btq192
Article CAS PubMed PubMed Central Google Scholar
Perdigao N, Heinrich J, Stolte C et al (2015) Unexpected features of the dark proteome. Proc Natl Acad Sci U S A 112(52):15898–15903. doi:10.1073/pnas.1508380112
Article CAS PubMed PubMed Central Google Scholar
Remmert M, Biegert A, Hauser A et al (2012) HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment. Nat Methods 9(2):173–175. doi:10.1038/nmeth.1818
Article CAS Google Scholar
Richmond TJ (1984) Solvent accessible surface area and excluded volume in proteins. Analytical equations for overlapping spheres and implications for the hydrophobic effect. J Mol Biol 178(1):63–89
Article CAS PubMed Google Scholar
Rohl CA, Strauss CE, Misura KM et al (2004) Protein structure prediction using Rosetta. Methods Enzymol 383:66–93. doi:10.1016/S0076-6879(04)83004-0
Article CAS PubMed Google Scholar
Rychlewski L, Jaroszewski L, Li W et al (2000) Comparison of sequence profiles. Strategies for structural predictions using sequence information. Protein Sci 9(2):232–241. doi:10.1110/ps.9.2.232
Article CAS PubMed PubMed Central Google Scholar
Sali A, Blundell TL (1993) Comparative protein modelling by satisfaction of spatial restraints. J Mol Biol 234(3):779–815. doi:10.1006/jmbi.1993.1626
Article CAS PubMed Google Scholar
Seringhaus M, Gerstein M (2007) Chemistry nobel rich in structure. Science 315(5808):40–41. doi:10.1126/science.315.5808.40
Article CAS PubMed Google Scholar
Shen MY, Sali A (2006) Statistical potential for assessment and prediction of protein structures. Protein Sci 15(11):2507–2524. doi:10.1110/ps.062416606
Article CAS PubMed PubMed Central Google Scholar
Siew N, Elofsson A, Rychlewski L et al (2000) MaxSub: an automated measure for the assessment of protein structure prediction quality. Bioinformatics 16(9):776–785
Article CAS PubMed Google Scholar
Sillitoe I, Dawson N, Thornton J et al (2015) The history of the CATH structural classification of protein domains. Biochimie 119:209–217. doi:10.1016/j.biochi.2015.08.004
Article CAS PubMed PubMed Central Google Scholar
Sippl MJ (1990) Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins. J Mol Biol 213(4):859–883
Article CAS PubMed Google Scholar
Soding J (2005) Protein homology detection by HMM-HMM comparison. Bioinformatics 21(7):951–960. doi:10.1093/bioinformatics/bti125
Article PubMed Google Scholar
Soding J, Biegert A, Lupas AN (2005) The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res 33 (Web Server issue):W244–W248. doi:10.1093/nar/gki408
Sutcliffe MJ, Haneef I, Carney D et al (1987) Knowledge based modelling of homologous proteins, part I: three-dimensional frameworks derived from the simultaneous superposition of multiple structures. Protein Eng 1(5):377–384
Article CAS PubMed Google Scholar
Tanaka S, Scheraga HA (1976) Medium- and long-range interaction parameters between amino acids for predicting three-dimensional structures of proteins. Macromolecules 9(6):945–950
Article CAS PubMed Google Scholar
Van Noorden R, Maher B, Nuzzo R (2014) The top 100 papers. Nature 514(7524):550–553. doi:10.1038/514550a
Article PubMed Google Scholar
Wallner B, Fang H, Elofsson A (2003) Automatic consensus-based fold recognition using Pcons, ProQ, and Pmodeller. Proteins 53(Suppl 6):534–541. doi:10.1002/prot.10536
Article CAS PubMed Google Scholar
Wu S, Zhang Y (2007) LOMETS: a local meta-threading-server for protein structure prediction. Nucleic Acids Res 35(10):3375–3382. doi:10.1093/nar/gkm251
Article CAS PubMed PubMed Central Google Scholar
Xu D, Zhang Y (2011) Improving the physical realism and structural accuracy of protein models by a two-step atomic-level energy minimization. Biophys J 101(10):2525–2534. doi:10.1016/j.bpj.2011.10.024
Article CAS PubMed PubMed Central Google Scholar
Yang J, Yan R, Roy A et al (2015) The I-TASSER suite: protein structure and function prediction. Nat Methods 12(1):7–8. doi:10.1038/nmeth.3213
Article CAS PubMed PubMed Central Google Scholar
Zhang C, Liu S, Zhou Y (2004) Accurate and efficient loop selections by the DFIRE-based all-atom statistical potential. Protein Sci 13(2):391–399. doi:10.1110/ps.03411904
Article CAS PubMed PubMed Central Google Scholar
Zhang J, Zhang Y (2010) A novel side-chain orientation dependent potential derived from random-walk reference state for protein fold selection and structure prediction. PLoS ONE 5(10):e15386. doi:10.1371/journal.pone.0015386
Article PubMed PubMed Central Google Scholar
Zhang Y, Skolnick J (2004) SPICKER: a clustering approach to identify near-native protein folds. J Comput Chem 25(6):865–871. doi:10.1002/jcc.20011
Article CAS PubMed Google Scholar
Zhou H, Skolnick J (2011) GOAP: a generalized orientation-dependent, all-atom statistical potential for protein structure prediction. Biophys J 101(8):2043–2052. doi:10.1016/j.bpj.2011.09.012
Article CAS PubMed PubMed Central Google Scholar
Zhou H, Zhou Y (2004) Single-body residue-level knowledge-based energy score combined with sequence-profile and secondary structure information for fold recognition. Proteins 55(4):1005–1013. doi:10.1002/prot.20007
Article CAS PubMed Google Scholar

Download references

Author information

Authors and Affiliations

Structural Bioinformatics Group, Imperial College London, London, UK
Lawrence A. Kelley

Authors

Lawrence A. Kelley
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Lawrence A. Kelley .

Editor information

Editors and Affiliations

Institute of Integrative Biology, University of Liverpool, Liverpool, United Kingdom
Daniel J. Rigden

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Kelley, L.A. (2017). Fold Recognition. In: J. Rigden, D. (eds) From Protein Structure to Function with Bioinformatics. Springer, Dordrecht. https://doi.org/10.1007/978-94-024-1069-3_3

Download citation

DOI: https://doi.org/10.1007/978-94-024-1069-3_3
Published: 13 April 2017
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-024-1067-9
Online ISBN: 978-94-024-1069-3
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)

Publish with us

Policies and ethics

Fold Recognition

Abstract

Access this chapter

Similar content being viewed by others

Protein Fold Recognition Exploited by Computational and Functional Approaches: Recent Insights

Machine Learning Methods for the Protein Fold Recognition Problem

Impact of structure space continuity on protein fold classification

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Navigation

Fold Recognition

Abstract

Access this chapter

Similar content being viewed by others

Protein Fold Recognition Exploited by Computational and Functional Approaches: Recent Insights

Machine Learning Methods for the Protein Fold Recognition Problem

Impact of structure space continuity on protein fold classification

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation