Modeling of Protein Tertiary and Quaternary Structures Based on Evolutionary Information

  • Gabriel Studer
  • Gerardo Tauriello
  • Stefan Bienert
  • Andrew Mark Waterhouse
  • Martino Bertoni
  • Lorenza Bordoli
  • Torsten Schwede
  • Rosalba LeporeEmail author
Part of the Methods in Molecular Biology book series (MIMB, volume 1851)


Proteins are subject to evolutionary forces that shape their three-dimensional structure to meet specific functional demands. The knowledge of the structure of a protein is therefore instrumental to gain information about the molecular basis of its function. However, experimental structure determination is inherently time consuming and expensive, making it impossible to follow the explosion of sequence data deriving from genome-scale projects. As a consequence, computational structural modeling techniques have received much attention and established themselves as a valuable complement to experimental structural biology efforts. Among these, comparative modeling remains the method of choice to model the three-dimensional structure of a protein when homology to a protein of known structure can be detected.

The general strategy consists of using experimentally determined structures of proteins as templates for the generation of three-dimensional models of related family members (targets) of which the structure is unknown. This chapter provides a description of the individual steps needed to obtain a comparative model using SWISS-MODEL, one of the most widely used automated servers for protein structure homology modeling.

Key words

Homology modeling Oligomeric proteins Quaternary structure Protein structure prediction Model quality assessment Model quality estimates SWISS-MODEL 


  1. 1.
    Guex N, Peitsch MC, Schwede T (2009) Automated comparative protein structure modeling with SWISS-MODEL and Swiss-PdbViewer: a historical perspective. Electrophoresis 30 Suppl 1:S162–S173CrossRefGoogle Scholar
  2. 2.
    Sali A, Blundell TL (1993) Comparative protein modelling by satisfaction of spatial restraints. J Mol Biol 234:779–815CrossRefGoogle Scholar
  3. 3.
    Chothia C, Lesk AM (1986) The relation between the divergence of sequence and structure in proteins. EMBO J 5:823–826CrossRefGoogle Scholar
  4. 4.
    Arnold K, Bordoli L, Kopp J et al (2006) The SWISS-MODEL workspace: a web-based environment for protein structure homology modelling. Bioinformatics 22:195–201CrossRefGoogle Scholar
  5. 5.
    Biasini M, Bienert S, Waterhouse A et al (2014) SWISS-MODEL: modelling protein tertiary and quaternary structure using evolutionary information. Nucleic Acids Res 42:W252–W258CrossRefGoogle Scholar
  6. 6.
    Kiefer F, Arnold K, Kunzli M et al (2009) The SWISS-MODEL repository and associated resources. Nucleic Acids Res 37:D387–D392CrossRefGoogle Scholar
  7. 7.
    Waterhouse A, Bertoni M, Bienert S et al (2018) SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Research Res 46(W1):W296–W303CrossRefGoogle Scholar
  8. 8.
    Kryshtafovych A, Venclovas C, Fidelis K et al (2005) Progress over the first decade of CASP experiments. Proteins 61(Suppl 7):225–236CrossRefGoogle Scholar
  9. 9.
    Berman H, Henrick K, Nakamura H et al (2007) The worldwide protein data Bank (wwPDB): ensuring a single, uniform archive of PDB data. Nucleic Acids Res 35:D301–D303CrossRefGoogle Scholar
  10. 10.
    Altschul SF, Madden TL, Schaffer AA et al (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25:3389–3402CrossRefGoogle Scholar
  11. 11.
    Remmert M, Biegert A, Hauser A et al (2011) HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment. Nat Methods 9:173–175CrossRefGoogle Scholar
  12. 12.
    Jones DT (1999) Protein secondary structure prediction based on position-specific scoring matrices. J Mol Biol 292:195–202CrossRefGoogle Scholar
  13. 13.
    Sillitoe I, Cuff AL, Dessailly BH et al (2013) New functional families (FunFams) in CATH to improve the mapping of conserved functional sites to 3D structures. Nucleic Acids Res 41:D490–D498CrossRefGoogle Scholar
  14. 14.
    Aloy P, Ceulemans H, Stark A et al (2003) The relationship between sequence and interaction divergence in proteins. J Mol Biol 332:989–998CrossRefGoogle Scholar
  15. 15.
    Bertoni M, Kiefer F, Biasini M et al (2017) Modeling protein quaternary structure of homo- and hetero-oligomers beyond binary interactions by homology. Sci Rep 7:10480CrossRefGoogle Scholar
  16. 16.
    Marcatili P, Olimpieri PP, Chailyan A et al (2014) Antibody modeling using the prediction of immunoglobulin structure (PIGS) web server [corrected]. Nat Protoc 9:2771–2783CrossRefGoogle Scholar
  17. 17.
    Lepore R, Olimpieri PP, Messih MA et al (2017) PIGSPro: prediction of immunoGlobulin structures v2. Nucleic Acids Res 45:W17CrossRefGoogle Scholar
  18. 18.
    Biasini M, Schmidt T, Bienert S et al (2013) OpenStructure: an integrated software framework for computational structural biology. Acta Crystallogr D Biol Crystallogr 69:701–709CrossRefGoogle Scholar
  19. 19.
    Fiser A (2010) Template-based protein structure modeling. Methods Mol Biol 673:73–94CrossRefGoogle Scholar
  20. 20.
    Choi Y, Deane CM (2010) FREAD revisited: accurate loop structure prediction using a database search algorithm. Proteins 78:1431–1440PubMedGoogle Scholar
  21. 21.
    Liang S, Zhang C, Zhou Y (2014) LEAP: highly accurate prediction of protein loop conformations by integrating coarse-grained sampling and optimized energy scores with all-atom refinement of backbone and side chains. J Comput Chem 35:335–341CrossRefGoogle Scholar
  22. 22.
    Messih MA, Lepore R, Tramontano A (2015) LoopIng: a template-based tool for predicting the structure of protein loops. Bioinformatics 31:3767–3772PubMedPubMedCentralGoogle Scholar
  23. 23.
    Canutescu AA, Dunbrack RL Jr (2003) Cyclic coordinate descent: a robotics algorithm for protein loop closure. Protein science: a publication of the protein. Society 12:963–972Google Scholar
  24. 24.
    Sippl MJ (1990) Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins. J Mol Biol 213:859–883CrossRefGoogle Scholar
  25. 25.
    Shapovalov MV, Dunbrack RL Jr (2011) A smoothed backbone-dependent rotamer library for proteins derived from adaptive kernel density estimates and regressions. Structure 19:844–858CrossRefGoogle Scholar
  26. 26.
    Krivov GG, Shapovalov MV, Dunbrack RL Jr (2009) Improved prediction of protein side-chain conformations with SCWRL4. Proteins 77:778–795CrossRefGoogle Scholar
  27. 27.
    Xu J (2005) Rapid protein side-chain packing via tree decomposition. In: Miyano S, Mesirov J, Kasif S, Istrail S, Pevzner PA, Waterman M (eds) Research in computational molecular biology: 9th Annual International Conference, RECOMB 2005, Cambridge, MA, USA, May 14–18, 2005. Proceedings. Springer Berlin, Heidelberg, pp 423–439CrossRefGoogle Scholar
  28. 28.
    Mackerell AD Jr, Feig M, Brooks CL 3rd (2004) Extending the treatment of backbone energetics in protein force fields: limitations of gas-phase quantum mechanics in reproducing protein conformational distributions in molecular dynamics simulations. J Comput Chem 25:1400–1415CrossRefGoogle Scholar
  29. 29.
    Eastman P, Swails J, Chodera JD et al (2017) OpenMM 7: rapid development of high performance algorithms for molecular dynamics. PLoS Comput Biol 13:e1005659CrossRefGoogle Scholar
  30. 30.
    Baker D, Sali A (2001) Protein structure prediction and structural genomics. Science 294:93–96CrossRefGoogle Scholar
  31. 31.
    Schwede T, Sali A, Honig B et al (2009) Outcome of a workshop on applications of protein models in biomedical research. Structure 17:151–159CrossRefGoogle Scholar
  32. 32.
    Read RJ, Adams PD, Arendall WB 3rd et al (2011) A new generation of crystallographic validation tools for the protein data bank. Structure 19:1395–1412CrossRefGoogle Scholar
  33. 33.
    Benkert P, Biasini M, Schwede T (2011) Toward the estimation of the absolute quality of individual protein structure models. Bioinformatics 27:343–350CrossRefGoogle Scholar
  34. 34.
    Benkert P, Kunzli M, Schwede T (2009) QMEAN server for protein model quality estimation. Nucleic Acids Res 37:W510–W514CrossRefGoogle Scholar
  35. 35.
    Haas J, Roth S, Arnold K et al (2013) The protein model portal--a comprehensive resource for protein structure and model information. Database 2013:bat031CrossRefGoogle Scholar
  36. 36.
    Teh AH, Kanamasa S, Kajiwara S et al (2008) Structure of cu/Zn superoxide dismutase from the heavy-metal-tolerant yeast Cryptococcus liquefaciens strain N6. Biochem Biophys Res Commun 374:475–478CrossRefGoogle Scholar
  37. 37.
    Benkert P, Tosatto SC, Schomburg D (2008) QMEAN: a comprehensive scoring function for model quality assessment. Proteins 71:261–277CrossRefGoogle Scholar
  38. 38.
    Chothia C, Lesk AM (1987) Canonical structures for the hypervariable regions of immunoglobulins. J Mol Biol 196:901–917CrossRefGoogle Scholar
  39. 39.
    Morea V, Tramontano A, Rustici M et al (1998) Conformations of the third hypervariable region in the VH domain of immunoglobulins. J Mol Biol 275:269–294CrossRefGoogle Scholar
  40. 40.
    Tramontano A, Chothia C, Lesk AM (1990) Framework residue 71 is a major determinant of the position and conformation of the second hypervariable region in the VH domains of immunoglobulins. J Mol Biol 215:175–182CrossRefGoogle Scholar
  41. 41.
    Messih MA, Lepore R, Marcatili P et al (2014) Improving the accuracy of the structure prediction of the third hypervariable loop of the heavy chains of antibodies. Bioinformatics 30:2733–2740CrossRefGoogle Scholar
  42. 42.
    Almagro JC, Teplyakov A, Luo J et al (2014) Second antibody modeling assessment (AMA-II). Proteins 82:1553–1562CrossRefGoogle Scholar
  43. 43.
    Moult J (2005) A decade of CASP: progress, bottlenecks and prognosis in protein structure prediction. Curr Opin Struct Biol 15:285–289CrossRefGoogle Scholar
  44. 44.
    Tai CH, Bai H, Taylor TJ et al (2014) Assessment of template-free modeling in CASP10 and ROLL. Proteins 82(Suppl 2):57–83CrossRefGoogle Scholar
  45. 45.
    Meier A, Soding J (2015) Automatic prediction of protein 3D structures by probabilistic multi-template homology modeling. PLoS Comput Biol 11:e1004343CrossRefGoogle Scholar
  46. 46.
    Larsson P, Wallner B, Lindahl E et al (2008) Using multiple templates to improve quality of homology models in automated homology modeling. Protein Sci 17:990–1002CrossRefGoogle Scholar
  47. 47.
    Cheng J (2008) A multi-template combination algorithm for protein comparative modeling. BMC Struct Biol 8:18CrossRefGoogle Scholar
  48. 48.
    Webb B, Sali A (2014) Comparative protein structure modeling using MODELLER. Curr Protoc Bioinformatics 47:5.6.1–5.6.32CrossRefGoogle Scholar
  49. 49.
    Grosdidier A, Zoete V, Michielin O (2011) Fast docking using the CHARMM force field with EADock DSS. J Comput Chem 32:2149–2159CrossRefGoogle Scholar
  50. 50.
    Grosdidier A, Zoete V, Michielin O (2011) SwissDock, a protein-small molecule docking web service based on EADock DSS. Nucleic Acids Res 39:W270–W277CrossRefGoogle Scholar
  51. 51.
    Lensink MF, Velankar S, Wodak SJ (2017) Modeling protein-protein and protein-peptide complexes: CAPRI 6th edition. Proteins 85:359–377CrossRefGoogle Scholar
  52. 52.
    Esquivel-Rodriguez J, Filos-Gonzalez V, Li B et al (2014) Pairwise and multimeric protein-protein docking using the LZerD program suite. Methods Mol Biol 1137:209–234CrossRefGoogle Scholar
  53. 53.
    Pierce B, Tong W, Weng Z (2005) M-ZDOCK: a grid-based approach for Cn symmetric multimer docking. Bioinformatics 21:1472–1478CrossRefGoogle Scholar
  54. 54.
    De Vries SJ, Van Dijk M, Bonvin AM (2010) The HADDOCK web server for data-driven biomolecular docking. Nat Protoc 5:883–897CrossRefGoogle Scholar
  55. 55.
    Leaver-Fay A, Tyka M, Lewis SM et al (2011) ROSETTA3: an object-oriented software suite for the simulation and design of macromolecules. Methods Enzymol 487:545–574CrossRefGoogle Scholar
  56. 56.
    Russel D, Lasker K, Webb B et al (2012) Putting the pieces together: integrative modeling platform software for structure determination of macromolecular assemblies. PLoS Biol 10:e1001244CrossRefGoogle Scholar
  57. 57.
    Simons KT, Kooperberg C, Huang E et al (1997) Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. J Mol Biol 268:209–225CrossRefGoogle Scholar
  58. 58.
    Yang J, Yan R, Roy A et al (2015) The I-TASSER suite: protein structure and function prediction. Nat Methods 12:7–8CrossRefGoogle Scholar
  59. 59.
    Maghrabi AHA, Mcguffin LJ (2017) ModFOLD6: an accurate web server for the global and local quality estimation of 3D protein models. Nucleic Acids Res 45(W1):W416–W421CrossRefGoogle Scholar
  60. 60.
    Heo L, Feig M (2018) What makes it difficult to refine protein models further via molecular dynamics simulations? Proteins 86(Suppl 1):177–188CrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2019

Authors and Affiliations

  • Gabriel Studer
    • 1
  • Gerardo Tauriello
    • 1
  • Stefan Bienert
    • 1
  • Andrew Mark Waterhouse
    • 1
  • Martino Bertoni
    • 1
  • Lorenza Bordoli
    • 1
  • Torsten Schwede
    • 1
  • Rosalba Lepore
    • 1
    Email author
  1. 1.Biozentrum, University of Basel and SIB Swiss Institute of BioinformaticsBaselSwitzerland

Personalised recommendations