Advertisement

Computational Prediction of Influenza Neuraminidase Inhibitors Using Machine Learning Algorithms and Recursive Feature Elimination Method

  • Li Zhang
  • Haixin Ai
  • Qi Zhao
  • Junfeng Zhu
  • Wen Chen
  • Xuewei Wu
  • Liangchao Huang
  • Zimo Yin
  • Jian Zhao
  • Hongsheng LiuEmail author
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10330)

Abstract

Recent outbreaks of highly pathogenic influenza have highlighted the need to develop novel anti-influenza therapeutics. Neuraminidase has become the most important target for the treatment of influenza virus. In this study, classification models were developed from a large training dataset containing 457 neuraminidase inhibitors and 358 non-inhibitors using random forest and support vector machine algorithms. Recursive feature elimination (RFE) method was used to improve the accuracy of the models by selecting the most relevant molecular descriptors. The performances of the models were evaluated by five-fold cross-validation and independent validation. The accuracies of all the models are over 86% in both validation methods. This work suggests machine learning algorithms combined with RFE method can be used to build useful models for predicting influenza neuraminidase inhibitors.

Keywords

Machine learning Neuraminidase inhibitor Feature selection 

Notes

Acknowledgments

This work was supported by the National Natural Science Foundation of China (No: 31570160), Innovation Team Project (No: LT2015011) from Education Department of Liaoning Province, Large-scale Equipment Shared Services Project (No: F15165400) and Applied Basic Research Project (No: F16205151) from Science and Technology Bureau of Shenyang. This project was supported by Engineering Laboratory for Molecular Simulation and Designing of Drug Molecules of Liaoning.

References

  1. 1.
    Hurt, A.C.: The epidemiology and spread of drug resistant human influenza viruses. Curr. Opin. Virol. 8, 22–29 (2014)CrossRefGoogle Scholar
  2. 2.
    Gao, R., Cao, B., Hu, Y., Feng, Z., Wang, D., Hu, W., Chen, J., Jie, Z., Qiu, H., Xu, K.: Human infection with a novel avian-origin influenza A (H7N9) virus. N. Engl. J. Med. 368, 1888–1897 (2013)CrossRefGoogle Scholar
  3. 3.
    Matrosovich, M.N., Matrosovich, T.Y., Gray, T., Roberts, N.A., Klenk, H.-D.: Neuraminidase is important for the initiation of influenza virus infection in human airway epithelium. J. Virol. 78, 12665–12667 (2004)CrossRefGoogle Scholar
  4. 4.
    Ai, H., Zhang, L., Chang, A.K., Wei, H., Che, Y., Liu, H.: Virtual screening of potential inhibitors from TCM for the CPSF30 binding site on the NS1A protein of influenza A virus. J. Mol. Model. 20, 2142 (2014)CrossRefGoogle Scholar
  5. 5.
    Ai, H., Zheng, F., Deng, F., Zhu, C., Gu, Y., Zhang, L., Li, X., Chang, A.K., Zhao, J., Zhu, J.: Structure-based virtual screening for potential inhibitors of influenza A virus RNA polymerase PA subunit. Int. J. Pept. Res. Ther. 21, 149–156 (2015)CrossRefGoogle Scholar
  6. 6.
    Batool, S., Mushtaq, G., Kamal, W., Kamal, M.A.: Pharmacophore-based virtual screening for identification of novel neuraminidase inhibitors and verification of inhibitory activity by molecular docking. Med. Chem. 12, 63–73 (2016)CrossRefGoogle Scholar
  7. 7.
    Cong, Y., Li, B.-K., Yang, X.-G., Xue, Y., Chen, Y.-Z., Zeng, Y.: Quantitative structure–activity relationship study of influenza virus neuraminidase A/PR/8/34 (H1N1) inhibitors by genetic algorithm feature selection and support vector regression. Chemometr. Intell. Lab. 127, 35–42 (2013)CrossRefGoogle Scholar
  8. 8.
    Lian, W., Fang, J., Li, C., Pang, X., Liu, A.-L., Du, G.-H.: Discovery of Influenza A virus neuraminidase inhibitors using support vector machine and Naïve Bayesian models. Mol. Divers. 20, 439–451 (2016)CrossRefGoogle Scholar
  9. 9.
    Li, Y., Kong, Y., Zhang, M., Yan, A., Liu, Z.: Using support vector machine (SVM) for classification of selectivity of H1N1 neuraminidase inhibitors. Mol. Inform. 35, 116–124 (2016)CrossRefGoogle Scholar
  10. 10.
    Tao, P., Liu, T., Li, X., Chen, L.: Prediction of protein structural class using tri-gram probabilities of position-specific scoring matrix and recursive feature elimination. Amino Acids 47, 461–468 (2015)CrossRefGoogle Scholar
  11. 11.
    Gilson, M.K., Liu, T., Baitaluk, M., Nicola, G., Hwang, L., Chong, J.: BindingDB in 2015: a public database for medicinal chemistry, computational chemistry and systems pharmacology. Nucleic Acids Res. 44, D1045–D1053 (2016)CrossRefGoogle Scholar
  12. 12.
    Li, B.-K., Cong, Y., Yang, X.-G., Xue, Y., Chen, Y.-Z.: In silico prediction of spleen tyrosine kinase inhibitors using machine learning approaches and an optimized molecular descriptor subset generated by recursive feature elimination method. Comput. Biol. Med. 43, 395–404 (2013)CrossRefGoogle Scholar
  13. 13.
    Yap, C.W.: PaDEL-descriptor: an open source software to calculate molecular descriptors and fingerprints. J. Comput. Chem. 32, 1466–1474 (2011)CrossRefGoogle Scholar
  14. 14.
    Kuhn, M.: Caret package. J. Stat. Softw. 28, 1–26 (2008)CrossRefGoogle Scholar
  15. 15.
    Chen, X., Yan, C.C., Zhang, X., Zhang, X., Dai, F., Yin, J., Zhang, Y.: Drug–target interaction prediction: databases, web servers and computational models. Brief. Bioinform. 17, 696–712 (2016)CrossRefGoogle Scholar
  16. 16.
    Chen, X., Ren, B., Chen, M., Wang, Q., Zhang, L., Yan, G.: NLLSS: predicting synergistic drug combinations based on semi-supervised learning. PLoS Comput. Biol. 12, e1004975 (2016)CrossRefGoogle Scholar
  17. 17.
    Chen, X., Yan, C.C., Zhang, X., You, Z.-H.: Long non-coding RNAs and complex diseases: from experimental results to computational models. Brief. Bioinform. bbw060 (2016). doi: 10.1093/bib/bbw060
  18. 18.
    Chen, W., Feng, P., Yang, H., Ding, H., Lin, H., Chou, K.-C.: iRNA-AI: identifying the adenosine to inosine editing sites in RNA sequences. Oncotarget 8, 4208–4217 (2017)Google Scholar
  19. 19.
    Chen, W., Tang, H., Ye, J., Lin, H., Chou, K.-C.: iRNA-PseU: identifying RNA pseudouridine sites. Mol. Ther. Nucleic Acids 5, e332 (2016)Google Scholar
  20. 20.
    Chen, X., Huang, Y.-A., You, Z.-H., Yan, G.-Y., Wang, X.-S.: A novel approach based on KATZ measure to predict associations of human microbiota with non-infectious diseases. Bioinformatics 33, 733–739 (2017)Google Scholar
  21. 21.
    Huang, Z.-A., Chen, X., Zhu, Z., Liu, H., Yan, G.-Y., You, Z.-H., Wen, Z.: PBHMDA: Path-based human microbe-disease association prediction. Front. Microbiol. 8, 233 (2017)Google Scholar
  22. 22.
    Chen, X., Huang, Y.-A., Wang, X.-S., You, Z.-H., Chan, K.: FMLNCSIM: fuzzy measure-based lncRNA functional similarity calculation model. Oncotarget 7, 45948–45958 (2016)Google Scholar
  23. 23.
    Chen, X., You, Z., Yan, G., Gong, D.: IRWRLDA: improved random walk with restart for lncRNA-disease association prediction. Oncotarget 7, 57919–57931 (2016)Google Scholar
  24. 24.
    Chen, W., Ding, H., Feng, P., Lin, H., Chou, K.-C.: iACP: a sequence-based tool for identifying anticancer peptides. Oncotarget 7, 16895 (2016)Google Scholar

Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  • Li Zhang
    • 1
    • 2
  • Haixin Ai
    • 1
    • 2
    • 3
  • Qi Zhao
    • 4
  • Junfeng Zhu
    • 1
  • Wen Chen
    • 5
  • Xuewei Wu
    • 1
  • Liangchao Huang
    • 5
  • Zimo Yin
    • 5
  • Jian Zhao
    • 1
  • Hongsheng Liu
    • 1
    • 2
    • 3
    Email author
  1. 1.School of Life ScienceLiaoning UniversityShenyangChina
  2. 2.Research Center for Computer Simulating and Information Processing of Bio-macromolecules of Liaoning ProvinceShenyangChina
  3. 3.Engineering Laboratory for Molecular Simulation and Designing of Drug Molecules of LiaoningShenyangChina
  4. 4.School of MathematicsLiaoning UniversityShenyangChina
  5. 5.School of InformationLiaoning UniversityShenyangChina

Personalised recommendations