Abstract
In this paper, we focus on protein contact map prediction, one of the most important intermediate steps of the protein folding problem. The objective of this research is to know how short-range interactions can contribute to a system based on decision trees to learn about the correlation among the covalent structures of a protein residues. We propose a solution to predict protein contact maps that combines the use of decision trees with a new input codification for short-range interactions. The method’s performance was very satisfactory, improving the accuracy instead using all information of the protein sequence. For a globulin data set the method can predict contacts with a maximal accuracy of 43%. The presented predictive model illustrates that short-range interactions play the predominant role in determining protein structure.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Ouzounis, C.A., Valencia, A.: Early bioinformatics: the birth of a discipline a personal view. Bioinformatics 19(17), 2176–2190 (2003)
Quan, Z.H., Zhang, G.-Z., Huang, D.S.: Combining a binary input encoding scheme with RBFNN for globulin protein inter-residue contact map prediction. Pattern Recognition Letters 26, 1543–1553 (2005)
Glasgow, J., Kuo, T., Davies, J.: Protein structure from contact maps: A case-based reasoning approach. Inf. Sys. Front 8, 29–36 (2006)
Ramanathan, A.: Using Tensor Analysis to characterize Contact-map Dynamics of Proteins. PhD thesis, Carnegie Mellon University Pittsburgh, PA (2008)
Zhou, J., Arndt, D., Wishart, D.S., Lin, G., Shi, Y., Zhou, J., Arndt, D., Wishart, D.S., Lin, G.: Protein contact order prediction from primari sequences. BMC Bioinformatics 9(255), 1–21 (2008)
Fariselli, P., Olmea, O., Valencia, A., Casadio, R.: Prediction of contact maps with neural networks and correlated mutations. Protein Engineering 14(11), 835–843 (2001)
Pollastri, G., Baldi, P.: Prediction of contact maps by GIOHMMs and recurrent neural networks using lateral propagation from all four cardinal corners. Bioinformatics 18, 1–9 (2002)
Kim, H.: Computational analysis of hydrogen bonds in protein-RNA complexes for interaction patterns. FEBS Letters 552, 231–239 (2003)
Martin, A.J.M., Walsh, I., Bau, D.: Ab initio and template-based prediction of multi-class distance maps by two-dimensional recursive neural networks. BMC Structural Biology 9(5), 1–38 (2009)
Ahmad, M., Mathkour, H.: An integrated approach for protein structure prediction using artificial neural network. In: 2010 Second International Conference on Computer Engineering and Applications, pp. 484–488. IEEE (2010)
Sinha, S., Durga Bhavani, S., Suvarnavani, K.: Mining of protein contact maps for protein fold prediction. WIREs Data Mining Knowl. Discov. 1(4), 362–368 (2011)
Saraee, M., Korbekandi, H., Habibi, N.: Protein contact map prediction using committee machine approach. International Journal of Data Mining and Bioinformatics 2, 205–209 (2011)
Hossein, M., Narjes, S., Habibi, K.: Protein contact map prediction based on an ensemble learning method. In: 2009 International Conference on Computer Engineering and Technology 2009, vol. 2, pp. 205–209. IEEE (2009)
Min, H., Yoon, S., Kim, J., Kim, H.: Constructing accurate contact maps for hydroxyl-radical-cleavage-based high-throughput rna structure inference. IEEE Transactions on Biomedical Engineering 58(5), 1347–1355 (2011)
Shao, Y., Bystroff, C., Zaki, M.J., Hu, J., Shen, X.: Mining Protein Contact Maps. In: BIOKDD 2002: Workshop on Data Mining in Bioinformatics (with SIGKDD 2002 Conference), pp. 3–10 (2002)
Toca, C.E.S., Márquez Chamorro, A.E., Asencio Cortes, G., Aguilar Ruiz, J.S.: A Decision Tree-Based Method for Protein Contact Map Prediction. In: Giacobini, M. (ed.) EvoBIO 2011. LNCS, vol. 6623, pp. 153–158. Springer, Heidelberg (2011)
Santiesteban-Toca, C.E., Aguilar-Ruiz, J.S.: DTP: Decision Tree-Based Predictor of Protein Contact Map. In: Mehrotra, K.G., Mohan, C.K., Oh, J.C., Varshney, P.K., Ali, M. (eds.) IEA/AIE 2011, Part II. LNCS, vol. 6704, pp. 367–375. Springer, Heidelberg (2011)
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann (1993)
Valencia, A., Olmea, O.: Improving contact predictions by the combination of correlated mutations and other sources of sequence information. Protein Engineering 2, S25–S32 (1997)
Casadio, R., Fariselli, P.: A neural network based predictor of residue contacts in proteins. Protein Engineering 12(1), 15–21 (1999)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Santiesteban-Toca, C.E., Asencio-Cortés, G., Márquez-Chamorro, A.E., Aguilar-Ruiz, J.S. (2012). Short-Range Interactions and Decision Tree-Based Protein Contact Map Predictor. In: Giacobini, M., Vanneschi, L., Bush, W.S. (eds) Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics. EvoBIO 2012. Lecture Notes in Computer Science, vol 7246. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29066-4_20
Download citation
DOI: https://doi.org/10.1007/978-3-642-29066-4_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-29065-7
Online ISBN: 978-3-642-29066-4
eBook Packages: Computer ScienceComputer Science (R0)