Identification of Hotspots in Protein-Protein Interactions Based on Recursive Feature Elimination
The study of protein-protein interactions and protein structure through computational methods is critical to understand protein function. Hot spot residues play an important role in bioinformatics to reveal life activities. However, conventional hot spots prediction methods may face great challenges. This paper proposes a hot spot prediction method based on feature selection method SVM-RFE to improve the training performance. SMOTE based oversampling is used to adds new samples to avoid an overfitting classifier. SVM-RFE is then invoked to obtained optimal feature subset. Finally, a feature-based SVM is created to predict the hot spots. Experimental results indicate that the performance of hot spots prediction has been significantly improved compared with the previous methods.
KeywordsProtein-Protein interactions Hot spots SVM-RFE Classification
The authors thank the members of Machine Learning and Artificial Intelligence Laboratory, School of Computer Science and Technology, Wuhan University of Science and Technology, for their helpful discussion within seminars. This work was supported in part by National Natural Science Foundation of China (No. 61502356, 61273225), by Hubei Province Natural Science Foundation of China (No. 2018CFB526).
- 27.Zhang, S.H., Zhang, X.L.: Prediction of hot spots at protein-protein interface. Acta Biophysica Sinica 29(2), 1–12 (2013)Google Scholar