Abstract
Protein structure prediction (PSP) is a challenge in Bioinformatics. Given a protein’s amino acid sequence, PSP involves finding its three dimensional native structure having the minimum free energy. Unfortunately, the search space is astronomical and the energy function is not known. Many PSP search algorithms develop their own proxy energy functions known as scoring functions using predicted contacts between amino acid residue pairs where two residues are said to be in contact if their distance in the native structure is within a given threshold. Scoring functions are crucial for search guidance since they allow evaluation of the generated structures. Unfortunately, existing contact based scoring functions have not been directly compared and which one among them is the best is not known. In this paper, our goal is to evaluate a number of existing contact based scoring functions within the same PSP search framework on the same set of benchmark proteins. Moreover, we also propose a number of contact based scoring function variants. Our proposed contact based scoring functions help our search algorithm to significantly outperform existing state-of-the-art PSP search algorithm, CGLFOLD that uses contact based scoring functions. We get \(0.77\AA \) average RMSD and 0.01 average GDT values improvement than CGLFOLD.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Adhikari, B., Cheng, J.: CONFOLD2: improved contact-driven ab initio protein structure modeling. BMC Bioinf. 19(1), 1–5 (2018)
Bhattacharya, D., Cao, R., Cheng, J.: UniCon3D: de novo protein structure prediction using united-residue conformational search via stepwise, probabilistic sampling. Bioinformatics 32(18), 2791–2799 (2016)
Brooks, B.R., et al.: CHARMM: the biomolecular simulation program. J. Comput. Chem. 30(10), 1545–1614 (2009)
Chen, X., Song, S., Ji, J., Tang, Z., Todo, Y.: Incorporating a multiobjective knowledge-based energy function into differential evolution for protein structure prediction. Inf. Sci. 540, 69–88 (2020)
Hanson, J., Paliwal, K., Litfin, T., Yang, Y., Zhou, Y.: Accurate prediction of protein contact maps by coupling residual two-dimensional bidirectional long short-term memory with convolutional neural networks. Bioinformatics 34(23), 4039–4045 (2018)
Hanson, J., Paliwal, K., Litfin, T., Yang, Y., Zhou, Y.: Improving prediction of protein secondary structure, backbone angles, solvent accessibility and contact numbers by using predicted contact maps and an ensemble of recurrent and residual convolutional neural networks. Bioinformatics 35(14), 2403–2410 (2018)
Hou, J., Wu, T., Cao, R., Cheng, J.: Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13. Proteins Struct. Funct. Bioinf. 87(12), 1165–1178 (2019)
Leaver-Fay, A., et al.: Rosetta3. In: Computer Methods, Part C, pp. 545–574. Elsevier (2011)
Li, Y., Zhang, C., Bell, E.W., Yu, D.-J., Zhang, Y.: Ensembling multiple raw coevolutionary features with deep residual neural networks for contact-map prediction in CASP13. Proteins Struct. Funct. Bioinf. 87(12), 1082–1091 (2019)
Li, Y., et al.: Deducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks. PLoS Comput. Biol. 17, 1–19 (2021)
Liu, J., Zhou, X.G., Zhang, Y., Zhang, G.J.: CGLFold: a contact-assisted de novo protein structure prediction using global exploration and loop perturbation sampling algorithm. Bioinformatics 36(8), 2443–2450 (2020)
Mabrouk, M., Werner, T., Schneider, T., Putz, I., Brock, O.: Analysis of free modelling predictions by RBO aleph in CASP11. Proteins 84, 87–104 (2015)
Magnan, C.N., Baldi, P.: SSpro/ACCpro 5: almost perfect prediction of protein secondary structure and relative solvent accessibility using profiles, machine learning and structural similarity. Bioinformatics 30(18), 2592–2597 (2014)
Mataeimoghadam, F., et al.: Enhancing protein backbone angle prediction by using simpler models of deep neural networks. Sci. Rep. 10(1), 1–12 (2020)
Newton, M.A.H., Pham, D.N., Sattar, A., Maher, M.: Kangaroo: an efficient constraint-based local search system using lazy propagation. In: Lee, J. (ed.) CP 2011. LNCS, vol. 6876, pp. 645–659. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-23786-7_49
Skwark, M.J., Abdel-Rehim, A., Elofsson, A.: PconsC: combination of direct information methods and alignments improves contact prediction. Bioinformatics 29(14), 1815–1816 (2013)
Xu, D., Zhang, Y.: Ab initio protein structure assembly using continuous structure fragments and optimized knowledge-based force field. Proteins Struct. Funct. Bioinf. 80(7), 1715–1735 (2012)
Xu, G., Wang, Q., Ma, J.: OPUS-TASS: a protein backbone torsion angles and secondary structure predictor based on ensemble neural networks. Bioinformatics 36(20), 5021–5026 (2020)
Acknowledgements
This research is partially supported by Australian Research Council Discovery Grant DP180102727.
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Zaman, R., Newton, M.A.H., Mataeimoghadam, F., Sattar, A. (2022). Tailoring Contact Based Scoring Functions for Protein Structure Prediction. In: Long, G., Yu, X., Wang, S. (eds) AI 2021: Advances in Artificial Intelligence. AI 2022. Lecture Notes in Computer Science(), vol 13151. Springer, Cham. https://doi.org/10.1007/978-3-030-97546-3_13
Download citation
DOI: https://doi.org/10.1007/978-3-030-97546-3_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-97545-6
Online ISBN: 978-3-030-97546-3
eBook Packages: Computer ScienceComputer Science (R0)