Protein Secondary Structure Assignments and Their Usefulness for Dihedral Angle Prediction

  • Eshel FaraggiEmail author
  • Andrzej Kloczkowski
Part of the Springer Series on Bio- and Neurosystems book series (SSBN, volume 8)


We present and compare different protein secondary structure assignment methods and the effect of their use in dihedral angle prediction. It is found that consensus reassignment of secondary structure tends to improve the accuracy of secondary structure prediction. However, it is less useful for the prediction of the dihedral angles than a better defined reassignment method based on angle values. Considering reassigned residues, we find them to be hard to predict. We find the most significant improvement for reassigned residues is due to our new reassignment method. This method also reassigns a smaller number of residues as compared to consensus methods. We additionally find that improvements to the accuracy of dihedral angle prediction is due both to single residue and local-neighborhood effects.


Dihedral angle prediction Secondary structure prediction Secondary structure assignment Protein structure Machine learning 



We gratefully acknowledge support from National Science Foundation grant DBI 1661391, and Bridge funds provided by The Research Institute at Nationwide Children’s Hospital.


  1. 1.
    Garnier, J., Osguthorpe, D.J., Robson, B.: Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins. J. Mol. Biol. 120(1), 97–120 (1978)CrossRefGoogle Scholar
  2. 2.
    Gibrat, J.-F., Garnier, J., Robson, B.: Further developments of protein secondary structure prediction using information theory: new parameters and consideration of residue pairs. J. Mol. Biol. 198(3), 425–443 (1987)CrossRefGoogle Scholar
  3. 3.
    Howard, L.: Holley and Martin Karplus. Protein secondary structure prediction with a neural network. Proc. National Acad. Sci. 86(1), 152–156 (1989)CrossRefGoogle Scholar
  4. 4.
    Kneller, D.G., Cohen, F.E., Langridge, R.: Improvements in protein secondary structure prediction by an enhanced neural network. J. Mol. Biol. 214(1), 171–182 (1990)CrossRefGoogle Scholar
  5. 5.
    Sikorski, A.: Prediction of protein secondary structure by neural networks: Encoding short and long range patterns of amino acid packing. Acta. Biochim. Pol., 39(4), (1992)Google Scholar
  6. 6.
    Rost, B., Sander, C.: Prediction of protein secondary structure at better than 70% accuracy. J. Mol. Biol. 232(2), 584–599 (1993)CrossRefGoogle Scholar
  7. 7.
    Rost, B., Sander, C., Schneider, R.: Phd-an automatic mail server for protein secondary structure prediction. Comput. Appl. Biosci.: CABIOS 10(1), 53–60 (1994)Google Scholar
  8. 8.
    Garnier, J., Gibrat, J.-F., Robson, B.: Gor method for predicting protein secondary structure from amino acid sequence. Methods Enzymol. 266, 540 (1996)CrossRefGoogle Scholar
  9. 9.
    Frishman, D., Argos, P.: Seventy-five percent accuracy in protein secondary structure prediction. Proteins-Struct. Funct. Genet. 27(3), 329–335 (1997)CrossRefGoogle Scholar
  10. 10.
    Cuff, J.A., Clamp, M.E., Siddiqui, A.S., Finlay, M., Barton, G.J.: Jpred: a consensus secondary structure prediction server. Bioinformatics 14(10), 892–893 (1998)CrossRefGoogle Scholar
  11. 11.
    Jones, D.T.: Protein secondary structure prediction based on position-specific scoring matrices. J. Mol. Biol. 292, 195–202 (1999)CrossRefGoogle Scholar
  12. 12.
    James, A.: Cuff and Geoffrey J Barton. Application of multiple sequence alignment profiles to improve protein secondary structure prediction. Proteins: Struct. Funct. Bioinformatics 40(3), 502–511 (2000)MathSciNetCrossRefGoogle Scholar
  13. 13.
    Hua, S., Sun, Z.: A novel method of protein secondary structure prediction with high segment overlap measure: support vector machine approach. J. Mol. Biol. 308(2), 397–408 (2001)CrossRefGoogle Scholar
  14. 14.
    Kloczkowski, A., Ting, K.-L., Jernigan, R.L., Garnier, J.: Protein secondary structure prediction based on the gor algorithm with multiple sequence alignments. Polymer 43, 441–449 (2002)CrossRefGoogle Scholar
  15. 15.
    Kloczkowski, A., Ting, K.-L., Jernigan, R.L., Garnier, J.: Combining the gor v algorithm with evolutionary information for protein secondary structure prediction from amino acid sequence. Proteins: Struct. Funct. Gen. 49, 154–166 (2002)CrossRefGoogle Scholar
  16. 16.
    Kolinski, A.: Protein modeling and structure prediction with a reduced representation. Acta Biochim. Pol.-English Edition- 51, 349–372 (2004)Google Scholar
  17. 17.
    Cheng, H., Sen, T.Z., Kloczkowski, A., Margaritis, D., Jernigan, R.L.: Prediction of protein secondary structure by mining fragments database. Polymer 46, 4314–4321 (2005)CrossRefGoogle Scholar
  18. 18.
    Dor, O., Zhou, Y.: Achieving 80% ten-fold cross-validated accuracy for secondary structure prediction by large-scale training. Proteins 66, 838–845 (2007)CrossRefGoogle Scholar
  19. 19.
    Homaeian, L., Kurgan, L.A., Ruan, J., Cios, K.J., Chen, K.: Prediction of protein secondary structure content for the twilight zone sequences. Proteins: Struct. Funct. Bioinformatics 69(3), 486–498 (2007)CrossRefGoogle Scholar
  20. 20.
    Kurgan, L., Cios, K., Zhang, H., Zhang, T., Chen, K., Shen, S., Ruan, J.: Sequence-based methods for real value predictions of protein structure. Curr. Bioinformatics 3(3), 183–196 (2008)CrossRefGoogle Scholar
  21. 21.
    Kurgan, L., Cios, K., Chen, K.: Scpred: accurate prediction of protein structural class for sequences of twilight-zone similarity with predicting sequences. BMC Bioinformatics 9(1), 226 (2008)CrossRefGoogle Scholar
  22. 22.
    Cole, C., Barber, J.D., Barton, G.J.: The jpred 3 secondary structure prediction server. Nucleic Acids Res. 36, W197–W201 (2008)CrossRefGoogle Scholar
  23. 23.
    Kountouris, P., Hirst, J.D.: Prediction of backbone dihedral angles and protein secondary structure using support vector machines. BMC Bioinformatics 10(1), 437 (2009)CrossRefGoogle Scholar
  24. 24.
    Faraggi, E., Zhang, T., Yang, Y., Kurgan, L., Zhou, Y.: Spine X: improving protein secondary structure prediction by multistep learning coupled with prediction of solvent accessible surface area and backbone torsion angles. J. Comp. Chem. 33, 259–267 (2012)CrossRefGoogle Scholar
  25. 25.
    Sen, T.Z., Jernigan, R.L., Garnier, J., Kloczkowski, A.: Gor v server for protein secondary structure prediction. Bioinformatics 21(11), 2787–2788 (2005)CrossRefGoogle Scholar
  26. 26.
    Kouza, M., Faraggi, E., Kolinski, A., Kloczkowski, A.: The gor method of protein secondary structure prediction and its application as a protein aggregation prediction tool. In: Prediction of Protein Secondary Structure, pp. 7–24. Springer (2017 )Google Scholar
  27. 27.
    Rost, B.: TOPITS: threading one-dimensional predictions into three-dimensional structures. In: Third international conference on intelligent systems for molecular biology, pp. 314–321. AAAI Press (1995)Google Scholar
  28. 28.
    Rost, B., Sander, C.: Protein fold recognition by prediction-based threading. J. Mol. Biol. 270, 471–480 (1997)CrossRefGoogle Scholar
  29. 29.
    Kihara, D., Hui, L., Kolinski, Aj, Skolnick, J.: Touchstone: an ab initio protein structure prediction method that uses threading-based tertiary restraints. Proc. National Acad. Sci. 98(18), 10125–10130 (2001)CrossRefGoogle Scholar
  30. 30.
    Przybylski, D., Rost, B.: Improving fold recognition without folds. J. Mol. Biol. 341, 255–269 (2004)CrossRefGoogle Scholar
  31. 31.
    Cheng, J., Baldi, P.: A machine learning information retrieval approach to protein fold recognition. Bioinformatics 22, 1456–1463 (2006)CrossRefGoogle Scholar
  32. 32.
    Qiu, J., Elber, R.: SSALN: an alignment algorithm using structure-dependent substitution matrices and gap penalties learned from structurally aligned protein pairs. Proteins 62, 881–891 (2006)CrossRefGoogle Scholar
  33. 33.
    Liu, S., Zhang, C., Liang, S., Zhou, Y.: Fold recognition by concurrent use of solvent accessibility and residue depth. Proteins 68, 636–645 (2007)CrossRefGoogle Scholar
  34. 34.
    Blaszczyk, M., Kurcinski, M., Kouza, M., Wieteska, L., Debinski, A., Kolinski, A., Kmiecik, S.: Modeling of protein-peptide interactions using the cabs-dock web server for binding site search and flexible docking. Methods 93, 72–83 (2016)CrossRefGoogle Scholar
  35. 35.
    Faraggi, E., Yang, Y., Zhang, S., Zhou, Y.: Predicting continuous local structure and the effect of its substitution for secondary structure in fragment-free protein structure prediction. Structure 17, 1515–1527 (2009)CrossRefGoogle Scholar
  36. 36.
    Kabsch, W., Sander, C.: Dictionary of protein secondary structure: Pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22, 2577–2637 (1983)CrossRefGoogle Scholar
  37. 37.
    Heinig, M., Frishman, D.: Stride: a web server for secondary structure assignment from known atomic coordinates of proteins. Nucleic Acids Res. 32(suppl\(\_\)2), W500–W502 (2004)CrossRefGoogle Scholar
  38. 38.
    Zhang, W., Dunker, A.K., Zhou, Y.: Assessing secondary-structure assignment of protein structures by using pairwise sequence-alignment benchmarks. Proteins 71, 61–67 (2008)CrossRefGoogle Scholar
  39. 39.
    Roger, A.: Sayle and E James Milner-White. Rasmol: biomolecular graphics for all. Trends in biochemical sciences 20(9), 374–376 (1995)CrossRefGoogle Scholar
  40. 40.
    Faraggi, E., Xue, B., Zhou, Y.: Improving the prediction accuracy of residue solvent accessibility and real-value backbone torsion angles of proteins by fast guided-learning through a two-layer neural network. Proteins 74, 857–871 (2009)CrossRefGoogle Scholar
  41. 41.
    Altschul, S.F., Madden, T.L., Schäffer, A.A., Zhang, J., Zhang, Z., Miller, W., Lipman, D.J.: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucl. Aci. Res. 25, 3389–3402 (1997)CrossRefGoogle Scholar
  42. 42.
    Meiler, J., Muller, M., Zeidler, A., Schmaschke, F.: Generation and evaluation of dimension reduced amino acid parameter representations by artificial neural networks. J. Mol. Model. 7, 360–369 (2001)CrossRefGoogle Scholar
  43. 43.
    Dor, O., Zhou, Y.: Real-SPINE: an integrated system of neural networks for real-value prediction of protein structural properties. Proteins 68, 76–81 (2007)CrossRefGoogle Scholar
  44. 44.
    Xue, B., Dor, O., Faraggi, E., Zhou, Y.: Real value prediction of backbone torsion angles. Proteins 72, 427–433 (2008)CrossRefGoogle Scholar
  45. 45.
    Wootton, J.C.: Statistic of local complexity in amino acid sequences and sequence databases. Comput. Chem. 17, 149–163 (1993)CrossRefzbMATHGoogle Scholar

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  1. 1.Department of PhysicsIndiana University Purdue University IndianapolisIndianapolisUSA
  2. 2.Department of PhysicsButler UniversityIndianapolisUSA
  3. 3.Battelle Center for Mathematical MedicineThe Research Institute at Nationwide Children’s HospitalColumbusUSA
  4. 4.Physics Division, Research and Information SystemsLLCIndianapolisUSA
  5. 5.Department of PediatricsThe Ohio State UniversityColumbusUSA

Personalised recommendations