Quantitative prediction of drug side effects based on drug-related features
- 199 Downloads
- 2 Citations
Abstract
Motivation
Unexpected side effects of drugs are great concern in the drug development, and the identification of side effects is an important task. Recently, machine learning methods are proposed to predict the presence or absence of interested side effects for drugs, but it is difficult to make the accurate prediction for all of them.
Methods
In this paper, we transform side effect profiles of drugs as their quantitative scores, by summing up their side effects with weights. The quantitative scores may measure the dangers of drugs, and thus help to compare the risk of different drugs. Here, we attempt to predict quantitative scores of drugs, namely the quantitative prediction. Specifically, we explore a variety of drug-related features and evaluate their discriminative powers for the quantitative prediction. Then, we consider several feature combination strategies (direct combination, average scoring ensemble combination) to integrate three informative features: chemical substructures, targets, and treatment indications. Finally, the average scoring ensemble model which produces the better performances is used as the final quantitative prediction model.
Results
Since weights for side effects are empirical values, we randomly generate different weights in the simulation experiments. The experimental results show that the quantitative method is robust to different weights, and produces satisfying results. Although other state-of-the-art methods cannot make the quantitative prediction directly, the prediction results can be transformed as the quantitative scores. By indirect comparison, the proposed method produces much better results than benchmark methods in the quantitative prediction. In conclusion, the proposed method is promising for the quantitative prediction of side effects, which may work cooperatively with existing state-of-the-art methods to reveal dangers of drugs.
Keywords
Drugs Side effects Quantitative prediction Features Ensemble learningNotes
Acknowledgement
This work is supported by the National Science Foundation of China (11226267, 61103126,61572368) and the Fundamental Research Funds for the Central Universities (2042017kf0219).
References
- 1.Giacomini KM, Krauss RM, Roden DM, Eichelbaum M, Hayden MR, Nakamura Y (2007) When good drugs go bad. Nature 446(7139):975–977CrossRefPubMedGoogle Scholar
- 2.Whitebread S, Hamon J, Bojanic D, Urban L (2005) Keynote review:safety pharmacology profiling: an essential tool for successful drug development. Drug Discov Today 10(21):1421–1433CrossRefPubMedGoogle Scholar
- 3.Campillos M, Kuhn M, Gavin A-C, Jensen LJ, Bork P (2008) Drug target identification using side-effect similarity. Science 321(5886):263–266CrossRefPubMedGoogle Scholar
- 4.Scheiber J, Chen B, Milik M, Sukuru SCK, Bender A, Mikhailov D, Whitebread S, Hamon J, Azzaoui K, Urban L (2009) Gaining insight into off-target mediated effects of drug candidates with a comprehensive systems chemical biology analysis. J Chem Inf Model 49(2):308–317CrossRefPubMedGoogle Scholar
- 5.Tatonetti NP, Liu T, Altman RB (2009) Predicting drug side-effects by chemical systems biology. Genome Biol 10(9):238CrossRefPubMedPubMedCentralGoogle Scholar
- 6.Xie L, Li J, Xie L, Bourne PE (2009) Drug discovery using chemical systems biology: identification of the protein-ligand binding network to explain the side effects of CETP inhibitors. PLoS Comput Biol 5(5):e1000387CrossRefPubMedPubMedCentralGoogle Scholar
- 7.Fliri AF, Loging WT, Thadeio PF, Volkmann RA (2005) Analysis of drug-induced effect patterns to link structure and side effects of medicines. Nat Chem Biol 1(7):389–397CrossRefPubMedGoogle Scholar
- 8.Fukuzaki M, Seki M, Kashima H, Sese J: Side effect prediction using cooperative pathways. In: Bioinformatics and Biomedicine, 2009 BIBM’09 IEEE International Conference on: 2009. IEEE: 142-147Google Scholar
- 9.Hammann F, Gutmann H, Vogt N, Helma C, Drewe J (2010) Prediction of adverse drug reactions using decision tree modeling. Clin Pharmacol Ther 88(1):52–59CrossRefPubMedGoogle Scholar
- 10.Bender A, Scheiber J, Glick M, Davies JW, Azzaoui K, Hamon J, Urban L, Whitebread S, Jenkins JL (2007) Analysis of pharmacology data and the prediction of adverse drug reactions and off-target effects from chemical structure. ChemMedChem 2(6):861–873CrossRefPubMedGoogle Scholar
- 11.Huang LC, Wu X, Chen JY (2011) Predicting adverse side effects of drugs. BMC genom 12(Suppl 5):S11CrossRefGoogle Scholar
- 12.Pauwels E, Stoven V, Yamanishi Y (2011) Predicting drug side-effect profiles: a chemical fragment-based approach. BMC Bioinform 12:169CrossRefGoogle Scholar
- 13.Mizutani S, Pauwels E, Stoven V, Goto S, Yamanishi Y (2012) Relating drug-protein interaction network with drug side effects. Bioinformatics 28(18):i522–i528CrossRefPubMedPubMedCentralGoogle Scholar
- 14.Yamanishi Y, Pauwels E, Kotera M (2012) Drug side-effect prediction based on the integration of chemical and biological spaces. J Chem Inf Model 52(12):3284–3292CrossRefPubMedGoogle Scholar
- 15.Liu M, Wu YH, Chen YK, Sun JC, Zhao ZM, Chen XW, Matheny ME, Xu H (2012) Large-scale prediction of adverse drug reactions using chemical, biological, and phenotypic properties of drugs. J Am Med Inform Assoc 19(E1):E28–E35CrossRefPubMedPubMedCentralGoogle Scholar
- 16.Bresso E, Grisoni R, Marchetti G, Karaboga AS, Souchet M, Devignes MD, Smail-Tabbone M (2013) Integrative relational machine-learning approach for understanding drug side-effect profiles. BMC Bioinform 14(1):207CrossRefGoogle Scholar
- 17.Cheng F, Li W, Wang X, Zhou Y, Wu Z, Shen J, Tang Y (2013) Adverse drug events: database construction and in silico prediction. J Chem Inf Model 53(4):744–752CrossRefPubMedGoogle Scholar
- 18.Huang LC, Wu X, Chen JY (2013) Predicting adverse drug reaction profiles by integrating protein interaction networks with drug structures. Proteomics 13(2):313–324CrossRefPubMedGoogle Scholar
- 19.Zhang W, Liu F, Luo L, Zhang J (2015) Predicting drug side effects by multi-label learning and ensemble learning. BMC Bioinform 16:365CrossRefGoogle Scholar
- 20.Zhang W, Zou H, Luo L, Liu Q, Wu W, Xiao W (2016) Predicting potential side effects of drugs by recommender methods and ensemble learning. Neurocomputing 173:979–987CrossRefGoogle Scholar
- 21.Zhang W, Chen Y, Tu S, Liu F, Qu Q: Drug side effect prediction through linear neighborhoods and multiple data source integration. In: 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM): 2016. 427–434Google Scholar
- 22.Kuhn M, Campillos M, Letunic I, Jensen LJ, Bork P (2010) A side effect resource to capture phenotypic effects of drugs. Mol Syst Biol 6:343CrossRefPubMedPubMedCentralGoogle Scholar
- 23.Wang Y, Xiao J, Suzek TO, Zhang J, Wang J, Bryant SH (2009) PubChem: a public information system for analyzing bioactivities of small molecules. Nucleic Acids Res 37(Web Server issue):W623–W633CrossRefPubMedPubMedCentralGoogle Scholar
- 24.Li Q, Cheng T, Wang Y, Bryant SH (2010) PubChem as a public resource for drug discovery. Drug Discov Today 15(23–24):1052–1057CrossRefPubMedPubMedCentralGoogle Scholar
- 25.Wishart DS, Knox C, Guo AC, Shrivastava S, Hassanali M, Stothard P, Chang Z, Woolsey J (2006) DrugBank: a comprehensive resource for in silico drug discovery and exploration. Nucleic Acids Res 34(Database issue):D668–D672CrossRefPubMedGoogle Scholar
- 26.Wishart DS, Knox C, Guo AC, Cheng D, Shrivastava S, Tzur D, Gautam B, Hassanali M (2008) DrugBank: a knowledgebase for drugs, drug actions and drug targets. Nucleic Acids Res 36(Database issue):D901–D906CrossRefPubMedGoogle Scholar
- 27.Knox C, Law V, Jewison T, Liu P, Ly S, Frolkis A, Pon A, Banco K, Mak C, Neveu V et al (2011) DrugBank 3.0: a comprehensive resource for ‘omics’ research on drugs. Nucleic Acids Res 39(Database issue):D1035–D1041CrossRefPubMedGoogle Scholar
- 28.Law V, Knox C, Djoumbou Y, Jewison T, Guo AC, Liu Y, Maciejewski A, Arndt D, Wilson M, Neveu V et al (2014) DrugBank 4.0: shedding new light on drug metabolism. Nucleic Acids Res 42(Database issue):D1091–D1097CrossRefPubMedGoogle Scholar
- 29.Kanehisa M, Goto S, Furumichi M, Tanabe M, Hirakawa M (2010) KEGG for representation and analysis of molecular networks involving diseases and drugs. Nucleic Acids Res 38(Database issue):D355–D360CrossRefPubMedGoogle Scholar
- 30.Breiman L (2001) Random forests. Mach Learn 45(1):5–32CrossRefGoogle Scholar
- 31.Zhang W, Xiong Y, Zhao M, Zou H, Ye X, Liu J (2011) Prediction of conformational B-cell epitopes from 3D structures by random forests with a distance-based feature. BMC Bioinform 12:341CrossRefGoogle Scholar
- 32.Singh H, Ansari HR, Raghava GP (2013) Improved method for linear B-cell epitope prediction using antigen’s primary sequence. PLoS ONE 8(5):e62216CrossRefPubMedPubMedCentralGoogle Scholar
- 33.Zhang W, Niu Y, Xiong Y, Zhao M, Yu R, Liu J (2012) Computational prediction of conformational B-cell epitopes from antigen primary structures by ensemble learning. PLoS ONE 7(8):e43575CrossRefPubMedPubMedCentralGoogle Scholar
- 34.Zhang W, Liu J, Xiong Y, Ke M, Zhang K: Predicting immunogenic T-cell epitopes by combining various sequence-derived features. In: IEEE International Conference on Bioinformatics and Biomedicine: December 18-21 2013; Shanghai. IEEE Computer Society: 4–9Google Scholar
- 35.Li D, Luo L, Zhang W, Liu F, Luo F (2016) A genetic algorithm-based weighted ensemble method for predicting transposon-derived piRNAs. BMC Bioinform 17(1):329CrossRefGoogle Scholar
- 36.Luo L, Li D, Zhang W, Tu S, Zhu X, Tian G (2016) Accurate prediction of transposon-derived piRNAs by integrating various sequential and physicochemical features. PloS ONE 11(4):e0153268CrossRefPubMedPubMedCentralGoogle Scholar
- 37.Zhang W, Chen Y, Liu F, Luo F, Tian G, Li X (2017) Predicting potential drug-drug interactions by integrating chemical, biological, phenotypic and network data. BMC Bioinform 18(1):18CrossRefGoogle Scholar
- 38.Zhang W, Zhu X, Fu Y, Tsuji J, Weng Z: The prediction of human splicing branchpoints by multi-label learning. In: IEEE International Conference on Bioinformatics and Biomedicine: 2016. 254–259Google Scholar
- 39.Yugandhar K, Gromiha MM (2014) Feature selection and classification of protein-protein complexes based on their binding affinities using machine learning approaches. Proteins 82:2088–2096CrossRefPubMedGoogle Scholar
- 40.Zhang Z, Dong J, Luo X, Choi KS, Wu X (2014) Heartbeat classification using disease-specific feature selection. Comput Biol Med 46:79–89CrossRefPubMedGoogle Scholar
- 41.Zhou W, Dickerson JA (2014) A novel class dependent feature selection method for cancer biomarker discovery. Comput Biol Med 47:66–75CrossRefPubMedGoogle Scholar
- 42.Qianqian Xie, Dingfang Li, Wen Zhang (2015) Two novel tree structure-based methods for gene selection (in Chinese). Comput Sci 42(7):250–253Google Scholar
- 43.Tibshirani R (2011) Regression shrinkage and selection via the lasso: a retrospective. J Royal Stat Soc 73(3):273–282CrossRefGoogle Scholar
- 44.Ding C, Peng H (2005) Minimum redundancy feature selection from microarray gene expression data. J Bioinform Comput Biol 3(2):185–205CrossRefPubMedGoogle Scholar
- 45.Zou Q, Zeng J, Cao L, Ji R (2016) A novel features ranking metric with application to scalable visual and bioinformatics data classification. Neurocomputing 173:346–354CrossRefGoogle Scholar