Support Vector Regression for Predicting Binding Affinity in Spinocerebellar Ataxia
Spinocerebellar ataxia (SCA) is an inherited disorder. It arises mainly from gene mutations that affect gray matter in the brain and cause neurodegeneration. Certain types of SCA are caused by repeat mutations in the gene, which alter the resulting protein sequence and structure. Binding affinity indicates how tightly a ligand binds to a protein. In this work, a binding affinity prediction model is built using machine learning. To build the model, predictor variables such as binding energy, IC50, torsional energy, and surface area for both ligand and protein are extracted from each protein-ligand complex using AutoDock, AutoDock Vina, and PyMOL. A total of 17 structures and 18 drugs were used to train a support vector regression (SVR) model. Experimental results show that the SVR-based affinity prediction model performs better than the other regression models it was compared with.
Keywords: Binding affinity, Docking, Ligand, Machine learning, Prediction, Protein, Protein structure
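To make the regression step in the abstract concrete, the following is a minimal sketch of a linear SVR trained by subgradient descent on the epsilon-insensitive loss. It is not the authors' pipeline: the two feature columns (standing in for standardised descriptors such as binding energy and torsional energy), the synthetic target, and the hyperparameters `epsilon`, `C`, `lr`, and `epochs` are all assumptions chosen for illustration. No docking data is used, and a real study would more likely rely on an established kernel SVR implementation.

```python
import random


def predict(w, b, x):
    """Linear prediction w.x + b."""
    return sum(wj * xj for wj, xj in zip(w, x)) + b


def train_linear_svr(X, y, epsilon=0.1, C=1.0, lr=0.001, epochs=3000):
    """Minimise 0.5*||w||^2 + C * sum(max(0, |y_i - f(x_i)| - epsilon))
    by plain subgradient descent (illustrative, not a tuned solver)."""
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        grad_w = list(w)  # gradient of the 0.5*||w||^2 regulariser
        grad_b = 0.0
        for xi, yi in zip(X, y):
            residual = yi - predict(w, b, xi)
            if abs(residual) <= epsilon:
                continue  # inside the epsilon tube: no loss, no gradient
            sign = -1.0 if residual > 0 else 1.0
            for j, xij in enumerate(xi):
                grad_w[j] += C * sign * xij
            grad_b += C * sign
        w = [wj - lr * gj for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b


def mean_abs_error(y_true, y_pred):
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


# Synthetic stand-in data (assumption): two standardised descriptors per
# complex and a made-up linear "affinity" target. Not real measurements.
rng = random.Random(0)
X = [[rng.uniform(-1, 1), rng.uniform(-1, 1)] for _ in range(30)]
y = [2.0 * x1 - 1.0 * x2 + 0.5 for x1, x2 in X]

w, b = train_linear_svr(X, y)
svr_mae = mean_abs_error(y, [predict(w, b, x) for x in X])

# Baseline for comparison: always predict the mean target value.
mean_y = sum(y) / len(y)
baseline_mae = mean_abs_error(y, [mean_y] * len(y))

print("SVR MAE:", svr_mae)
print("Mean-baseline MAE:", baseline_mae)
```

The comparison against a mean predictor mirrors, in a very reduced form, the idea of evaluating SVR against other regressors. With only a few dozen samples, as in the study, cross-validation rather than training-set error would be the more appropriate measure.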