Quantitative prediction of drug side effects based on drug-related features

Original Research Article

Abstract

Motivation

Unexpected side effects of drugs are great concern in the drug development, and the identification of side effects is an important task. Recently, machine learning methods are proposed to predict the presence or absence of interested side effects for drugs, but it is difficult to make the accurate prediction for all of them.

Methods

In this paper, we transform side effect profiles of drugs as their quantitative scores, by summing up their side effects with weights. The quantitative scores may measure the dangers of drugs, and thus help to compare the risk of different drugs. Here, we attempt to predict quantitative scores of drugs, namely the quantitative prediction. Specifically, we explore a variety of drug-related features and evaluate their discriminative powers for the quantitative prediction. Then, we consider several feature combination strategies (direct combination, average scoring ensemble combination) to integrate three informative features: chemical substructures, targets, and treatment indications. Finally, the average scoring ensemble model which produces the better performances is used as the final quantitative prediction model.

Results

Since weights for side effects are empirical values, we randomly generate different weights in the simulation experiments. The experimental results show that the quantitative method is robust to different weights, and produces satisfying results. Although other state-of-the-art methods cannot make the quantitative prediction directly, the prediction results can be transformed as the quantitative scores. By indirect comparison, the proposed method produces much better results than benchmark methods in the quantitative prediction. In conclusion, the proposed method is promising for the quantitative prediction of side effects, which may work cooperatively with existing state-of-the-art methods to reveal dangers of drugs.

Keywords

Drugs Side effects Quantitative prediction Features Ensemble learning 

Notes

Acknowledgement

This work is supported by the National Science Foundation of China (11226267, 61103126,61572368) and the Fundamental Research Funds for the Central Universities (2042017kf0219).

References

  1. 1.
    Giacomini KM, Krauss RM, Roden DM, Eichelbaum M, Hayden MR, Nakamura Y (2007) When good drugs go bad. Nature 446(7139):975–977CrossRefPubMedGoogle Scholar
  2. 2.
    Whitebread S, Hamon J, Bojanic D, Urban L (2005) Keynote review:safety pharmacology profiling: an essential tool for successful drug development. Drug Discov Today 10(21):1421–1433CrossRefPubMedGoogle Scholar
  3. 3.
    Campillos M, Kuhn M, Gavin A-C, Jensen LJ, Bork P (2008) Drug target identification using side-effect similarity. Science 321(5886):263–266CrossRefPubMedGoogle Scholar
  4. 4.
    Scheiber J, Chen B, Milik M, Sukuru SCK, Bender A, Mikhailov D, Whitebread S, Hamon J, Azzaoui K, Urban L (2009) Gaining insight into off-target mediated effects of drug candidates with a comprehensive systems chemical biology analysis. J Chem Inf Model 49(2):308–317CrossRefPubMedGoogle Scholar
  5. 5.
    Tatonetti NP, Liu T, Altman RB (2009) Predicting drug side-effects by chemical systems biology. Genome Biol 10(9):238CrossRefPubMedPubMedCentralGoogle Scholar
  6. 6.
    Xie L, Li J, Xie L, Bourne PE (2009) Drug discovery using chemical systems biology: identification of the protein-ligand binding network to explain the side effects of CETP inhibitors. PLoS Comput Biol 5(5):e1000387CrossRefPubMedPubMedCentralGoogle Scholar
  7. 7.
    Fliri AF, Loging WT, Thadeio PF, Volkmann RA (2005) Analysis of drug-induced effect patterns to link structure and side effects of medicines. Nat Chem Biol 1(7):389–397CrossRefPubMedGoogle Scholar
  8. 8.
    Fukuzaki M, Seki M, Kashima H, Sese J: Side effect prediction using cooperative pathways. In: Bioinformatics and Biomedicine, 2009 BIBM’09 IEEE International Conference on: 2009. IEEE: 142-147Google Scholar
  9. 9.
    Hammann F, Gutmann H, Vogt N, Helma C, Drewe J (2010) Prediction of adverse drug reactions using decision tree modeling. Clin Pharmacol Ther 88(1):52–59CrossRefPubMedGoogle Scholar
  10. 10.
    Bender A, Scheiber J, Glick M, Davies JW, Azzaoui K, Hamon J, Urban L, Whitebread S, Jenkins JL (2007) Analysis of pharmacology data and the prediction of adverse drug reactions and off-target effects from chemical structure. ChemMedChem 2(6):861–873CrossRefPubMedGoogle Scholar
  11. 11.
    Huang LC, Wu X, Chen JY (2011) Predicting adverse side effects of drugs. BMC genom 12(Suppl 5):S11CrossRefGoogle Scholar
  12. 12.
    Pauwels E, Stoven V, Yamanishi Y (2011) Predicting drug side-effect profiles: a chemical fragment-based approach. BMC Bioinform 12:169CrossRefGoogle Scholar
  13. 13.
    Mizutani S, Pauwels E, Stoven V, Goto S, Yamanishi Y (2012) Relating drug-protein interaction network with drug side effects. Bioinformatics 28(18):i522–i528CrossRefPubMedPubMedCentralGoogle Scholar
  14. 14.
    Yamanishi Y, Pauwels E, Kotera M (2012) Drug side-effect prediction based on the integration of chemical and biological spaces. J Chem Inf Model 52(12):3284–3292CrossRefPubMedGoogle Scholar
  15. 15.
    Liu M, Wu YH, Chen YK, Sun JC, Zhao ZM, Chen XW, Matheny ME, Xu H (2012) Large-scale prediction of adverse drug reactions using chemical, biological, and phenotypic properties of drugs. J Am Med Inform Assoc 19(E1):E28–E35CrossRefPubMedPubMedCentralGoogle Scholar
  16. 16.
    Bresso E, Grisoni R, Marchetti G, Karaboga AS, Souchet M, Devignes MD, Smail-Tabbone M (2013) Integrative relational machine-learning approach for understanding drug side-effect profiles. BMC Bioinform 14(1):207CrossRefGoogle Scholar
  17. 17.
    Cheng F, Li W, Wang X, Zhou Y, Wu Z, Shen J, Tang Y (2013) Adverse drug events: database construction and in silico prediction. J Chem Inf Model 53(4):744–752CrossRefPubMedGoogle Scholar
  18. 18.
    Huang LC, Wu X, Chen JY (2013) Predicting adverse drug reaction profiles by integrating protein interaction networks with drug structures. Proteomics 13(2):313–324CrossRefPubMedGoogle Scholar
  19. 19.
    Zhang W, Liu F, Luo L, Zhang J (2015) Predicting drug side effects by multi-label learning and ensemble learning. BMC Bioinform 16:365CrossRefGoogle Scholar
  20. 20.
    Zhang W, Zou H, Luo L, Liu Q, Wu W, Xiao W (2016) Predicting potential side effects of drugs by recommender methods and ensemble learning. Neurocomputing 173:979–987CrossRefGoogle Scholar
  21. 21.
    Zhang W, Chen Y, Tu S, Liu F, Qu Q: Drug side effect prediction through linear neighborhoods and multiple data source integration. In: 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM): 2016. 427–434Google Scholar
  22. 22.
    Kuhn M, Campillos M, Letunic I, Jensen LJ, Bork P (2010) A side effect resource to capture phenotypic effects of drugs. Mol Syst Biol 6:343CrossRefPubMedPubMedCentralGoogle Scholar
  23. 23.
    Wang Y, Xiao J, Suzek TO, Zhang J, Wang J, Bryant SH (2009) PubChem: a public information system for analyzing bioactivities of small molecules. Nucleic Acids Res 37(Web Server issue):W623–W633CrossRefPubMedPubMedCentralGoogle Scholar
  24. 24.
    Li Q, Cheng T, Wang Y, Bryant SH (2010) PubChem as a public resource for drug discovery. Drug Discov Today 15(23–24):1052–1057CrossRefPubMedPubMedCentralGoogle Scholar
  25. 25.
    Wishart DS, Knox C, Guo AC, Shrivastava S, Hassanali M, Stothard P, Chang Z, Woolsey J (2006) DrugBank: a comprehensive resource for in silico drug discovery and exploration. Nucleic Acids Res 34(Database issue):D668–D672CrossRefPubMedGoogle Scholar
  26. 26.
    Wishart DS, Knox C, Guo AC, Cheng D, Shrivastava S, Tzur D, Gautam B, Hassanali M (2008) DrugBank: a knowledgebase for drugs, drug actions and drug targets. Nucleic Acids Res 36(Database issue):D901–D906CrossRefPubMedGoogle Scholar
  27. 27.
    Knox C, Law V, Jewison T, Liu P, Ly S, Frolkis A, Pon A, Banco K, Mak C, Neveu V et al (2011) DrugBank 3.0: a comprehensive resource for ‘omics’ research on drugs. Nucleic Acids Res 39(Database issue):D1035–D1041CrossRefPubMedGoogle Scholar
  28. 28.
    Law V, Knox C, Djoumbou Y, Jewison T, Guo AC, Liu Y, Maciejewski A, Arndt D, Wilson M, Neveu V et al (2014) DrugBank 4.0: shedding new light on drug metabolism. Nucleic Acids Res 42(Database issue):D1091–D1097CrossRefPubMedGoogle Scholar
  29. 29.
    Kanehisa M, Goto S, Furumichi M, Tanabe M, Hirakawa M (2010) KEGG for representation and analysis of molecular networks involving diseases and drugs. Nucleic Acids Res 38(Database issue):D355–D360CrossRefPubMedGoogle Scholar
  30. 30.
    Breiman L (2001) Random forests. Mach Learn 45(1):5–32CrossRefGoogle Scholar
  31. 31.
    Zhang W, Xiong Y, Zhao M, Zou H, Ye X, Liu J (2011) Prediction of conformational B-cell epitopes from 3D structures by random forests with a distance-based feature. BMC Bioinform 12:341CrossRefGoogle Scholar
  32. 32.
    Singh H, Ansari HR, Raghava GP (2013) Improved method for linear B-cell epitope prediction using antigen’s primary sequence. PLoS ONE 8(5):e62216CrossRefPubMedPubMedCentralGoogle Scholar
  33. 33.
    Zhang W, Niu Y, Xiong Y, Zhao M, Yu R, Liu J (2012) Computational prediction of conformational B-cell epitopes from antigen primary structures by ensemble learning. PLoS ONE 7(8):e43575CrossRefPubMedPubMedCentralGoogle Scholar
  34. 34.
    Zhang W, Liu J, Xiong Y, Ke M, Zhang K: Predicting immunogenic T-cell epitopes by combining various sequence-derived features. In: IEEE International Conference on Bioinformatics and Biomedicine: December 18-21 2013; Shanghai. IEEE Computer Society: 4–9Google Scholar
  35. 35.
    Li D, Luo L, Zhang W, Liu F, Luo F (2016) A genetic algorithm-based weighted ensemble method for predicting transposon-derived piRNAs. BMC Bioinform 17(1):329CrossRefGoogle Scholar
  36. 36.
    Luo L, Li D, Zhang W, Tu S, Zhu X, Tian G (2016) Accurate prediction of transposon-derived piRNAs by integrating various sequential and physicochemical features. PloS ONE 11(4):e0153268CrossRefPubMedPubMedCentralGoogle Scholar
  37. 37.
    Zhang W, Chen Y, Liu F, Luo F, Tian G, Li X (2017) Predicting potential drug-drug interactions by integrating chemical, biological, phenotypic and network data. BMC Bioinform 18(1):18CrossRefGoogle Scholar
  38. 38.
    Zhang W, Zhu X, Fu Y, Tsuji J, Weng Z: The prediction of human splicing branchpoints by multi-label learning. In: IEEE International Conference on Bioinformatics and Biomedicine: 2016. 254–259Google Scholar
  39. 39.
    Yugandhar K, Gromiha MM (2014) Feature selection and classification of protein-protein complexes based on their binding affinities using machine learning approaches. Proteins 82:2088–2096CrossRefPubMedGoogle Scholar
  40. 40.
    Zhang Z, Dong J, Luo X, Choi KS, Wu X (2014) Heartbeat classification using disease-specific feature selection. Comput Biol Med 46:79–89CrossRefPubMedGoogle Scholar
  41. 41.
    Zhou W, Dickerson JA (2014) A novel class dependent feature selection method for cancer biomarker discovery. Comput Biol Med 47:66–75CrossRefPubMedGoogle Scholar
  42. 42.
    Qianqian Xie, Dingfang Li, Wen Zhang (2015) Two novel tree structure-based methods for gene selection (in Chinese). Comput Sci 42(7):250–253Google Scholar
  43. 43.
    Tibshirani R (2011) Regression shrinkage and selection via the lasso: a retrospective. J Royal Stat Soc 73(3):273–282CrossRefGoogle Scholar
  44. 44.
    Ding C, Peng H (2005) Minimum redundancy feature selection from microarray gene expression data. J Bioinform Comput Biol 3(2):185–205CrossRefPubMedGoogle Scholar
  45. 45.
    Zou Q, Zeng J, Cao L, Ji R (2016) A novel features ranking metric with application to scalable visual and bioinformatics data classification. Neurocomputing 173:346–354CrossRefGoogle Scholar

Copyright information

© Springer-Verlag 2017

Authors and Affiliations

  1. 1.School of Mathematics and StatisticsSouth-central University for NationalitiesWuhanChina
  2. 2.School of ComputerWuhan UniversityWuhanChina

Personalised recommendations