Automated Family-Wide Annotation of Secondary Structure Elements

  • Adam MidlikEmail author
  • Ivana Hutařová Vařeková
  • Jan Hutař
  • Taraka Ramji Moturu
  • Veronika Navrátilová
  • Jaroslav Koča
  • Karel Berka
  • Radka Svobodová Vařeková
Part of the Methods in Molecular Biology book series (MIMB, volume 1958)


Secondary structure elements (SSEs) are inherent parts of protein structures, and their arrangement is characteristic for each protein family. Therefore, annotation of SSEs can facilitate orientation in the vast number of homologous structures which is now available for many protein families. It also provides a way to identify and annotate the key regions, like active sites and channels, and subsequently answer the key research questions, such as understanding of molecular function and its variability.

This chapter introduces the concept of SSE annotation and describes the workflow for obtaining SSE annotation for the members of a selected protein family using program SecStrAnnotator.

Key words

Annotation Secondary structure Secondary structure elements Protein family Protein domain SecStrAnnotator Structural alignment Secondary structure assignment 



This work was supported by ELIXIR CZ research infrastructure project (MEYS) [LM2015047 to A.M., I.H.V., J.H., K.B., and R.S.V.]; Ministry of Education, Youth and Sports of the Czech Republic under the project CEITEC 2020 [LQ1601 to A.M., R.S.V., and J.K.]; ELIXIR-EXCELERATE project, which received funding from the European Union’s Horizon 2020 research and innovation program [676559]; ELIXIR-CZ: Budování kapacit [CZ.02.1.01/0.0/0.0/16_013/0001777]; Ministry of Education, Youth and Sports of the Czech Republic [project CZ.02.1.01/0.0/0.0/16_019/0000754 to V.N. and K.B.]; and Palacky University Olomouc [IGA_PrF_2018_032 to V.N.]. A.M. is a “Brno Ph.D. Talent” scholarship holder funded by Brno City Municipality.


  1. 1.
    Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H et al (2000) The protein data bank. Nucleic Acids Res 28(1):235–242. Scholar
  2. 2.
    Sillitoe I, Lewis TE, Cuff A, Das S, Ashford P, Dawson NL et al (2015) CATH: comprehensive structural and functional annotations for genome sequences. Nucleic Acids Res 43(D1):D376–D381. Scholar
  3. 3.
    Fox NK, Brenner SE, Chandonia JM (2014) SCOPe: structural classification of proteins-extended, integrating SCOP and ASTRAL data and classification of new structures. Nucleic Acids Res 42(D1):D304–D309. Scholar
  4. 4.
    Poulos TL, Finzel BC, Howard AJ (1987) High-resolution crystal structure of cytochrome P450cam. J Mol Biol 195(3):687–700. Scholar
  5. 5.
    Rowland P, Blaney FE, Smyth MG, Jones JJ, Leydon VR, Oxbrow AK et al (2006) Crystal structure of human cytochrome P450 2D6. J Biol Chem 281(11):7614–7622. Scholar
  6. 6.
    Cojocaru V, Winn PJ, Wade RC (2007) The ins and outs of cytochrome P450s. Biochim Biophys Acta 1770(3):390–401. Scholar
  7. 7.
    Hutchinson EG, Thornton JM (1990) HERA—a program to draw schematic diagrams of protein secondary structures. Proteins 8(3):203–212. Scholar
  8. 8.
    Hutchinson EG, Thornton JM (1996) PROMOTIF—a program to identify and analyze structural motifs in proteins. Protein Sci 5(2):212–220. Scholar
  9. 9.
    Stivala A, Wybrow M, Wirth A, Whisstock JC, Stuckey PJ (2011) Automatic generation of protein structure cartoons with Pro-origami. Bioinformatics 27(23):3315–3316. Scholar
  10. 10.
    Svobodova Varekova R, Midlik A, Hutarova Varekova I, Hutar J, Navratilova V, Koca J et al (2018) Secondary structure elements—annotations and schematic 2D visualizations stable for individual protein families. Biophys J 114(3):46a–47a. Scholar
  11. 11.
    Berman H, Henrick K, Nakamura H (2003) Announcing the worldwide Protein Data Bank. Nat Struct Biol 10(12):980–980. Scholar
  12. 12.
    Velankar S, Dana JM, Jacobsen J, van Ginkel G, Gane PJ, Luo J et al (2013) SIFTS: structure integration with function, taxonomy and sequences resource. Nucleic Acids Res 41(D1):D483–D489. Scholar
  13. 13.
    Finn RD, Coggill P, Eberhardt RY, Eddy SR, Mistry J, Mitchell AL et al (2016) The Pfam protein families database: towards a more sustainable future. Nucleic Acids Res 44(D1):D279–D285. Scholar
  14. 14.
    The PyMOL Molecular Graphics System, Version 2.0 Schrodinger, LLCGoogle Scholar
  15. 15.
    Kabsch W, Sander C (1983) Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22(12):2577–2637. Scholar
  16. 16.
    Gore S, Sanz Garcia E, Hendrickx PMS, Gutmanas A, Westbrook JD, Yang H et al (2017) Validation of structures in the Protein Data Bank. Structure 25(12):1916–1927. Scholar
  17. 17.
    Shindyalov IN, Bourne PE (1998) Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. Protein Eng 11(9):739–747. Scholar
  18. 18.
    Mitchell EM, Artymiuk PJ, Rice DW, Willett P (1990) Use of techniques derived from graph theory to compare secondary structure motifs in proteins. J Mol Biol 212(1):151–166. Scholar
  19. 19.
    Eddy SR (2004) What is dynamic programming? Nat Biotechnol 22:909. Scholar
  20. 20.
    Needleman SB, Wunsch CD (1970) A general method applicable to the search for similarities in the amino acid sequence of two proteins. J Mol Biol 48(3):443–453. Scholar
  21. 21.
    Bron C, Kerbosch J (1973) Algorithm 457: finding all cliques of an undirected graph. Commun ACM 16(9):575–577. Scholar
  22. 22.
    Anderson CA, Rost B (2009) Secondary structure assignment. In: Gu J, Bourne PE (eds) Structural bioinformatics, 2nd edn. Wiley, HobokenGoogle Scholar
  23. 23.
    Cao C, Xu ST, Wang LC (2015) An algorithm for protein helix assignment using helix geometry. PLoS One 10(7):20. Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2019

Authors and Affiliations

  • Adam Midlik
    • 1
    • 2
    Email author
  • Ivana Hutařová Vařeková
    • 1
    • 2
    • 3
  • Jan Hutař
    • 1
    • 2
  • Taraka Ramji Moturu
    • 1
  • Veronika Navrátilová
    • 4
  • Jaroslav Koča
    • 1
    • 2
  • Karel Berka
    • 4
  • Radka Svobodová Vařeková
    • 1
    • 2
  1. 1.CEITEC—Central European Institute of TechnologyMasaryk UniversityBrnoCzech Republic
  2. 2.Faculty of Science, National Centre for Biomolecular ResearchMasaryk UniversityBrnoCzech Republic
  3. 3.Faculty of InformaticsMasaryk UniversityBrnoCzech Republic
  4. 4.Faculty of Science, Department of Physical Chemistry, Regional Centre of Advanced Technologies and MaterialsPalacký UniversityOlomoucCzech Republic

Personalised recommendations