Skip to main content

Tailoring Contact Based Scoring Functions for Protein Structure Prediction

  • Conference paper
  • First Online:
AI 2021: Advances in Artificial Intelligence (AI 2022)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13151))

Included in the following conference series:

  • 1797 Accesses

Abstract

Protein structure prediction (PSP) is a challenge in Bioinformatics. Given a protein’s amino acid sequence, PSP involves finding its three dimensional native structure having the minimum free energy. Unfortunately, the search space is astronomical and the energy function is not known. Many PSP search algorithms develop their own proxy energy functions known as scoring functions using predicted contacts between amino acid residue pairs where two residues are said to be in contact if their distance in the native structure is within a given threshold. Scoring functions are crucial for search guidance since they allow evaluation of the generated structures. Unfortunately, existing contact based scoring functions have not been directly compared and which one among them is the best is not known. In this paper, our goal is to evaluate a number of existing contact based scoring functions within the same PSP search framework on the same set of benchmark proteins. Moreover, we also propose a number of contact based scoring function variants. Our proposed contact based scoring functions help our search algorithm to significantly outperform existing state-of-the-art PSP search algorithm, CGLFOLD that uses contact based scoring functions. We get \(0.77\AA \) average RMSD and 0.01 average GDT values improvement than CGLFOLD.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Adhikari, B., Cheng, J.: CONFOLD2: improved contact-driven ab initio protein structure modeling. BMC Bioinf. 19(1), 1–5 (2018)

    Article  Google Scholar 

  2. Bhattacharya, D., Cao, R., Cheng, J.: UniCon3D: de novo protein structure prediction using united-residue conformational search via stepwise, probabilistic sampling. Bioinformatics 32(18), 2791–2799 (2016)

    Article  Google Scholar 

  3. Brooks, B.R., et al.: CHARMM: the biomolecular simulation program. J. Comput. Chem. 30(10), 1545–1614 (2009)

    Article  Google Scholar 

  4. Chen, X., Song, S., Ji, J., Tang, Z., Todo, Y.: Incorporating a multiobjective knowledge-based energy function into differential evolution for protein structure prediction. Inf. Sci. 540, 69–88 (2020)

    Article  MathSciNet  Google Scholar 

  5. Hanson, J., Paliwal, K., Litfin, T., Yang, Y., Zhou, Y.: Accurate prediction of protein contact maps by coupling residual two-dimensional bidirectional long short-term memory with convolutional neural networks. Bioinformatics 34(23), 4039–4045 (2018)

    Google Scholar 

  6. Hanson, J., Paliwal, K., Litfin, T., Yang, Y., Zhou, Y.: Improving prediction of protein secondary structure, backbone angles, solvent accessibility and contact numbers by using predicted contact maps and an ensemble of recurrent and residual convolutional neural networks. Bioinformatics 35(14), 2403–2410 (2018)

    Article  Google Scholar 

  7. Hou, J., Wu, T., Cao, R., Cheng, J.: Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13. Proteins Struct. Funct. Bioinf. 87(12), 1165–1178 (2019)

    Article  Google Scholar 

  8. Leaver-Fay, A., et al.: Rosetta3. In: Computer Methods, Part C, pp. 545–574. Elsevier (2011)

    Chapter  Google Scholar 

  9. Li, Y., Zhang, C., Bell, E.W., Yu, D.-J., Zhang, Y.: Ensembling multiple raw coevolutionary features with deep residual neural networks for contact-map prediction in CASP13. Proteins Struct. Funct. Bioinf. 87(12), 1082–1091 (2019)

    Article  Google Scholar 

  10. Li, Y., et al.: Deducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks. PLoS Comput. Biol. 17, 1–19 (2021)

    Google Scholar 

  11. Liu, J., Zhou, X.G., Zhang, Y., Zhang, G.J.: CGLFold: a contact-assisted de novo protein structure prediction using global exploration and loop perturbation sampling algorithm. Bioinformatics 36(8), 2443–2450 (2020)

    Article  Google Scholar 

  12. Mabrouk, M., Werner, T., Schneider, T., Putz, I., Brock, O.: Analysis of free modelling predictions by RBO aleph in CASP11. Proteins 84, 87–104 (2015)

    Article  Google Scholar 

  13. Magnan, C.N., Baldi, P.: SSpro/ACCpro 5: almost perfect prediction of protein secondary structure and relative solvent accessibility using profiles, machine learning and structural similarity. Bioinformatics 30(18), 2592–2597 (2014)

    Article  Google Scholar 

  14. Mataeimoghadam, F., et al.: Enhancing protein backbone angle prediction by using simpler models of deep neural networks. Sci. Rep. 10(1), 1–12 (2020)

    Article  Google Scholar 

  15. Newton, M.A.H., Pham, D.N., Sattar, A., Maher, M.: Kangaroo: an efficient constraint-based local search system using lazy propagation. In: Lee, J. (ed.) CP 2011. LNCS, vol. 6876, pp. 645–659. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-23786-7_49

    Chapter  Google Scholar 

  16. Skwark, M.J., Abdel-Rehim, A., Elofsson, A.: PconsC: combination of direct information methods and alignments improves contact prediction. Bioinformatics 29(14), 1815–1816 (2013)

    Article  Google Scholar 

  17. Xu, D., Zhang, Y.: Ab initio protein structure assembly using continuous structure fragments and optimized knowledge-based force field. Proteins Struct. Funct. Bioinf. 80(7), 1715–1735 (2012)

    Article  Google Scholar 

  18. Xu, G., Wang, Q., Ma, J.: OPUS-TASS: a protein backbone torsion angles and secondary structure predictor based on ensemble neural networks. Bioinformatics 36(20), 5021–5026 (2020)

    Article  Google Scholar 

Download references

Acknowledgements

This research is partially supported by Australian Research Council Discovery Grant DP180102727.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Rianon Zaman or M. A. Hakim Newton .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zaman, R., Newton, M.A.H., Mataeimoghadam, F., Sattar, A. (2022). Tailoring Contact Based Scoring Functions for Protein Structure Prediction. In: Long, G., Yu, X., Wang, S. (eds) AI 2021: Advances in Artificial Intelligence. AI 2022. Lecture Notes in Computer Science(), vol 13151. Springer, Cham. https://doi.org/10.1007/978-3-030-97546-3_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-97546-3_13

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-97545-6

  • Online ISBN: 978-3-030-97546-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics