Abstract
The discovery and development of drugs is a long and expensive process with a high attrition rate. Computational drug discovery contributes to ligand discovery and optimization, by using models that describe the properties of ligands and their interactions with biological targets. In recent years, artificial intelligence (AI) has made remarkable modeling progress, driven by new algorithms and by the increase in computing power and storage capacities, which allow the processing of large amounts of data in a short time. This review provides the current state of the art of AI methods applied to drug discovery, with a focus on structure- and ligand-based virtual screening, library design and high-throughput analysis, drug repurposing and drug sensitivity, de novo design, chemical reactions and synthetic accessibility, ADMET, and quantum mechanics.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Abbreviations
- ADMET:
-
absorption, distribution, metabolism, excretion, toxicity
- AI:
-
artificial intelligence
- ANN :
-
artificial neural network
- AUC:
-
area under the curve
- AUROC:
-
area under the receiver operating characteristic curve
- BA:
-
balanced accuracy
- BKD:
-
binary kernel discrimination
- CCLE:
-
Cancer Cell Line Encyclopedia
- CCLP:
-
COSMIC Cell Lines Project
- CNN :
-
convolutional neural network
- CV:
-
cross-validation
- DFT :
-
density functional theory
- DILI :
-
drug-induced liver injury
- DL:
-
deep learning
- DMTA :
-
design–make–test–analyze
- DNN :
-
deep neural network
- DRD2:
-
dopamine receptor D2
- DTNN:
-
deep tensor neural network
- ECFP4:
-
extended connectivity fingerprint of diameter 4
- FDA:
-
Food and Drug Administration
- FNN:
-
feed-forward neural network
- GCNN:
-
graph convolutional neural network
- GDB-7:
-
generic database with up to 7 heavy atoms
- GDSC:
-
genomics in drug sensitivity in cancer
- GENTRL :
-
generative tensorial reinforcement learning
- GPU :
-
graphics processing unit
- GSE:
-
general solubility equation
- hERG :
-
human Ether-à-go-go-Related Gene
- HTS :
-
high-throughput screening
- JAK:
-
Janus kinase
- KNN:
-
k-nearest neighbor
- LBVS :
-
ligand-based virtual screening
- LINCS:
-
library of integrated network-based cellular signatures
- LSTM :
-
long short-term memory
- MCC:
-
Matthews correlation coefficient
- MeSH:
-
medical subject headings
- ML:
-
machine learning
- MT:
-
multitasks
- MTDL:
-
multitask deep learning
- MTNN:
-
multitask neural network
- NCI-60:
-
National Cancer Institute 60 human cancer cell lines
- PAINS :
-
pan-assay interference
- PPAR:
-
peroxisome proliferator-activated receptors
- PPI:
-
protein–protein interaction
- QM :
-
quantum mechanics
- QSAR :
-
quantitative structure–activity relationship
- QSPR :
-
quantitative structure–property relationship
- RF :
-
random forest
- RL :
-
reinforcement learning
- RNN :
-
recurrent neural network
- ROC:
-
receiver operating characteristic
- RXR:
-
retinoid X receptors
- SA:
-
synthetic accessibility
- SBVS :
-
structure-based virtual screening
- SEA:
-
similarity ensemble approach
- SMILES :
-
simplified molecular input line entry specification,
- SOM :
-
site of metabolism
- SVM :
-
support vector machine
- SVR:
-
support vector regression
- VS:
-
virtual screening
References
Vamathevan J, Clark D, Czodrowski P et al (2019) Applications of machine learning in drug discovery and development. Nat Rev Drug Discov 18:463–477
Lecun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521:436–444
Liu Z, Su M, Han L et al (2017) Forging the basis for developing protein-ligand interaction scoring functions. Acc Chem Res 50:302–309
Ain QU, Aleksandrova A, Roessler FD et al (2015) Machine-learning scoring functions to improve structure-based binding affinity prediction and virtual screening. Wiley Interdiscip Rev Comput Mol Sci 5:405–424
Shen C, Ding J, Wang Z et al (2020) From machine learning to deep learning: advances in scoring functions for protein–ligand docking. WIREs Comput Mol Sci 10:e1429
Ashtawy HM, Mahapatra NR (2015) A comparative assessment of predictive accuracies of conventional and machine learning scoring functions for protein-ligand binding affinity prediction. IEEE/ACM Trans Comput Biol Bioinforma 12:335–347
Wang C, Zhang Y (2017) Improving scoring-docking-screening powers of protein–ligand scoring functions using random forest. J Comput Chem 38:169–177
Ragoza M, Hochuli J, Idrobo E et al (2017) Protein-ligand scoring with convolutional neural networks. J Chem Inf Model 57:942–957
Pereira JC, Caffarena ER, Dos Santos CN (2016) Boosting docking-based virtual screening with deep learning. J Chem Inf Model 56:2495–2506
Gomes J, Ramsundar B, Feinberg EN, et al (2017) Atomic convolutional networks for predicting protein-ligand binding. arXiv e-prints 1703.10603
Chen L, Cruz A, Ramsey S et al (2019) Hidden bias in the DUD-E dataset leads to misleading performance of deep learning in structure-based virtual screening. PLoS One 14:e0220113
Yang J, Shen C, Huang N (2020) Predicting or pretending: artificial intelligence for protein-ligand interactions lack of sufficiently large and unbiased datasets. Front Pharmacol 11:69
Sieg J, Flachsenberg F, Rarey M (2019) In need of bias control: evaluating chemical data for machine learning in structure-based virtual screening. J Chem Inf Model 59:947–961
Scantlebury J, Brown N, Von Delft F et al (2020) Data set augmentation allows deep learning-based virtual screening to better generalize to unseen target classes and highlight important binding interactions. J Chem Inf Model 60:3722–3730
Gentile F, Agrawal V, Hsing M et al (2020) Deep docking: a deep learning platform for augmentation of structure based drug discovery. ACS Cent Sci 6:939–949
Ton AT, Gentile F, Hsing M et al (2020) Rapid identification of potential inhibitors of SARS-CoV-2 Main protease by deep docking of 1.3 billion compounds. Mol Inform 39:e2000028
Dahl GE, Jaitly N, and Salakhutdinov R (2014) Multi-task Neural Networks for QSAR Predictions. arXiv 1406.1231
Rodríguez-Pérez R, Bajorath J (2019) Multitask machine learning for classifying highly and weakly potent kinase inhibitors. ACS Omega 4:4367–4375
Keshavarzi Arshadi A, Salem M, Collins J et al (2020) DeepMalaria: artificial intelligence driven discovery of potent Antiplasmodials. Front Pharmacol 10:1526
Miljković F, Rodríguez-Pérez R, Bajorath J (2020) Machine learning models for accurate prediction of kinase inhibitors with different binding modes. J Med Chem 63:8738–8748
Aldrich C, Bertozzi C, Georg GI et al (2017) The ecstasy and agony of assay interference compounds. J Chem Inf Model 57:387–390
Yang Z-Y, He J-H, Lu A-P et al (2020) Frequent hitters: nuisance artifacts in high-throughput screening. Drug Discov Today 25:657–667
Stork C, Chen Y, Šícho M et al (2019) Hit Dexter 2.0: machine-learning models for the prediction of frequent hitters. J Chem Inf Model 59:1030–1043
Blaschke T, Miljković F, Bajorath J (2019) Prediction of different classes of promiscuous and nonpromiscuous compounds using machine learning and nearest neighbor analysis. ACS Omega 4:6883–6890
Borrel A, Huang R, Sakamuru S et al (2020) High-throughput screening to predict chemical-assay interference. Sci Rep 10:3986
Borrel A, Mansouri K, Nolte S et al (2020) InterPred: a webtool to predict chemical autofluorescence and luminescence interference. Nucleic Acids Res 48:W586–W590
Lipinski CA, Lombardo F, Dominy BW et al (2001) Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings1PII of original article: S0169-409X(96)00423-1. The article was originally published in advanced drug delivery reviews 23 (1997) 3. Adv Drug Deliv Rev 46:3–26
Zhang X, Betzi S, Morelli X et al (2014) Focused chemical libraries--design and enrichment: an example of protein-protein interaction chemical space. Future Med Chem 6:1291–1307
Villoutreix BO, Labbe CM, Lagorce D et al (2012) A leap into the chemical space of protein-protein interaction inhibitors. Curr Pharm Des 18:4648–4667
Bosc N, Muller C, Hoffer L et al (2020) Fr-PPIChem: an academic compound library dedicated to protein-protein interactions. ACS Chem Biol 15:1566–1574
Nidhi GM, Davies JW et al (2006) Prediction of biological targets for compounds using multiple-category bayesian models trained on chemogenomics databases. J Chem Inf Model 46:1124–1133
Zhang P, Wang F, Hu J (2014) Towards drug repositioning: a unified computational framework for integrating multiple aspects of drug similarity and disease similarity. AMIA Annu Symp Proc 2014:1258–1267
Napolitano F, Zhao Y, Moreira VM et al (2013) Drug repositioning: a machine-learning approach through data integration. J Cheminform 5:30
Jarada TN, Rokne JG, Alhajj R (2020) A review of computational drug repositioning: strategies, approaches, opportunities, challenges, and directions. J Cheminform 12:46
Unterthiner T, Mayr A, Klambauer G et al (2014) Deep learning as an opportunity in virtual screening. In: Conference: Workshop on Deep Learning and Representation Learning (NIPS2014)
Allen BK, Ayad NG, and Schürer SC (2019) Kinome-wide activity classification of small molecules by deep learning. bioRxiv
Rifaioglu AS, Nalbat E, Atalay V et al (2020) DEEPScreen: high performance drug-target interaction prediction with convolutional neural networks using 2-D structural compound representations. Chem Sci 11:2531–2557
Stokes JM, Yang K, Swanson K et al (2020) A deep learning approach to antibiotic discovery. Cell 180:688–702.e13
Hu S, Zhang C, Chen P et al (2019) Predicting drug-target interactions from drug structure and protein sequence using novel convolutional neural networks. BMC Bioinformatics 20:689
Aliper A, Plis S, Artemov A et al (2016) Deep learning applications for predicting pharmacological properties of drugs and drug repurposing using transcriptomic data. Mol Pharm 13:2524–2530
Meyer JG, Liu S, Miller IJ et al (2019) Learning drug functions from chemical structures with convolutional neural networks and random forests. J Chem Inf Model 59:4438–4449
Yang W, Soares J, Greninger P et al (2013) Genomics of drug sensitivity in cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells. Nucleic Acids Res 41:D955–D961
Tate JG, Bamford S, Jubb HC et al (2019) COSMIC: the catalogue of somatic mutations in cancer. Nucleic Acids Res 47:D941–D947
Barretina J, Caponigro G, Stransky N et al (2012) The cancer cell line encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature 483:603–607
Shoemaker RH (2006) The NCI60 human tumour cell line anticancer drug screen. Nat Rev Cancer 6:813–823
Garnett MJ, Edelman EJ, Heidorn SJ et al (2012) Systematic identification of genomic markers of drug sensitivity in cancer cells. Nature 483:570–575
Iorio F, Knijnenburg TA, Vis DJ et al (2016) A landscape of Pharmacogenomic interactions in cancer. Cell 166:740–754
Rahman R, Matlock K, Ghosh S et al (2017) Heterogeneity aware random forest for drug sensitivity prediction. Sci Rep 7:11347
Costello JC, Heiser LM, Georgii E et al (2014) A community effort to assess and improve drug sensitivity prediction algorithms. Nat Biotechnol 32:1202–1212
Menden MP, Iorio F, Garnett M et al (2013) Machine learning prediction of cancer cell sensitivity to drugs based on genomic and chemical properties. PLoS One 8:e61318
Cortés-Ciriano I, Van Westen GJP, Bouvier G et al (2016) Improved large-scale prediction of growth inhibition patterns using the NCI60 cancer cell line panel. Bioinformatics 32:85–95
Chang Y, Park H, Yang HJ et al (2018) Cancer drug response profile scan (CDRscan): a deep learning model that predicts drug effectiveness from cancer genomic signature. Sci Rep 8:8857
Liu P, Li H, Li S et al (2019) Improving prediction of phenotypic drug response on cancer cell lines using deep convolutional network. BMC Bioinformatics 20:408
Garcia-Alonso L, Iorio F, Matchan A et al (2018) Transcription factor activities enhance markers of drug sensitivity in cancer. Cancer Res 78:769–780
Besnard J, Ruda GF, Setola V et al (2012) Automated design of ligands to polypharmacological profiles. Nature 492:215–220
Hartenfeller M, Zettl H, Walter M et al (2012) Dogs: reaction-driven de novo design of bioactive compounds. PLoS Comput Biol 8:e1002380
Zhavoronkov A, Ivanenkov YA, Aliper A et al (2019) Deep learning enables rapid identification of potent DDR1 kinase inhibitors. Nat Biotechnol 37:1038–1040
Walters WP, Murcko M (2020) Assessing the impact of generative AI on medicinal chemistry. Nat Biotechnol 38:143–145
Elton DC, Boukouvalas Z, Fuge MD et al (2019) Deep learning for molecular design - a review of the state of the art. Mol Syst Des Eng 4:828–849
Bian Y and Xie X-Q (2020) Generative chemistry: drug discovery with deep learning generative models arXiv 2008.09000
Gómez-Bombarelli R, Wei JN, Duvenaud D et al (2018) Automatic chemical design using a data-driven continuous representation of molecules. ACS Cent Sci 4:268–276
Segler MHS, Kogej T, Tyrchan C et al (2018) Generating focused molecule libraries for drug discovery with recurrent neural networks. ACS Cent Sci 4:120–131
Merk D, Friedrich L, Grisoni F et al (2018) De novo design of bioactive small molecules by artificial intelligence. Mol Inform 37:1700153–1700154
Olivecrona M, Blaschke T, Engkvist O et al (2017) Molecular de-novo design through deep reinforcement learning. J Cheminform 9:48
Blaschke T, Arús-Pous J, Chen H et al (2020) REINVENT 2.0: an AI tool for De novo drug design. J Chem Inf Model 60:5918–5922
Cao N de and Kipf T (2018) MolGAN: An implicit generative model for small molecular graphs. arXiv 1805.11973
Zhou Z, Kearnes S, Li L et al (2019) Optimization of molecules via deep reinforcement learning. Sci Rep 9:10752
Méndez-Lucio O, Baillif B, Clevert DA et al (2020) De novo generation of hit-like molecules from gene expression signatures using artificial intelligence. Nat Commun 11:10
Benhenda M (2017) ChemGAN challenge for drug discovery: can AI reproduce natural chemical diversity? arXiv 1708.08227
Brown N, Fiscato M, Segler MHS et al (2019) GuacaMol: benchmarking models for de novo molecular design. J Chem Inf Model 59:1096–1108
Gottipati SK, Sattarov B, Niu S, et al (2020) Learning To Navigate The Synthetically Accessible Chemical Space Using Reinforcement Learning. arXiv 2004.12485
Corey EJ, Wipke WT (1969) Computer-assisted design of complex organic syntheses. Science 166:178–192
Ertl P, Schuffenhauer A (2009) Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions. J Cheminform 1:8
Fukunishi Y, Kurosawa T, Mikami Y et al (2014) Prediction of synthetic accessibility based on commercially available compound databases. J Chem Inf Model 54:3259–3267
Sheridan RP, Zorn N, Sherer EC et al (2014) Modeling a crowdsourced definition of molecular complexity. J Chem Inf Model 54:1604–1616
Coley CW, Rogers L, Green WH et al (2018) SCScore: synthetic complexity learned from a reaction corpus. J Chem Inf Model 58:252–261
Segler MHS, Waller MP (2017) Neural-symbolic machine learning for retrosynthesis and reaction prediction. Chemistry 23:5966–5971
Segler MHS, Preuss M, Waller MP (2018) Planning chemical syntheses with deep neural networks and symbolic AI. Nature 555:604–610
Fooshee D, Mood A, Gutman E et al (2018) Deep learning for chemical reaction prediction. Mol Syst Des Eng 3:442–452
Schwaller P, Laino T, Gaudin T et al (2019) Molecular transformer: a model for uncertainty-calibrated chemical reaction prediction. ACS Cent Sci 5:1572–1583
Segler MHS, Waller MP (2017) Modelling chemical reasoning to predict and invent reactions. Chemistry 23:6118–6128
Ahneman DT, Estrada JG, Lin S et al (2018) Predicting reaction performance in C–N cross-coupling using machine learning. Science 360:186 LP–190 LP
Sandfort F, Strieth-Kalthoff F, Kühnemund M et al (2020) A structure-based platform for predicting chemical reactivity. Chem 6:1379–1390
Reker D, Bernardes G, and Rodrigues T (2018) Evolving and Nano data enabled machine intelligence for chemical reaction optimization. ChemRxiv
Gao H, Struble TJ, Coley CW et al (2018) Using machine learning to predict suitable conditions for organic reactions. ACS Cent Sci 4:1465–1476
Zhou Z, Li X, Zare RN (2017) Optimizing chemical reactions with deep reinforcement learning. ACS Cent Sci 3:1337–1344
Gao W, Coley CW (2020) The synthesizability of molecules proposed by generative models. J Chem Inf Model 60:5714–5723
Korovina K, Xu S, Kandasamy K, et al (2019) ChemBO: Bayesian Optimization of Small Organic Molecules with Synthesizable Recommendations arXiv 1908.01425
Zubatyuk R, Smith J, Nebgen B, et al (2020) Teaching a neural network to attach and detach electrons from molecules. ChemRxiv
Genheden S, Thakkar A, Chadimova V, et al (2020) AiZynthFinder: A Fast Robust and Flexible Open-Source Software for Retrosynthetic Planning. ChemRxiv
Thakkar A, Selmi N, Reymond J-L et al (2020) “Ring breaker”: neural network driven synthesis prediction of the ring system chemical space. J Med Chem 63:8791–8808
Gale EM, Durand DJ (2020) Improving reaction prediction. Nat Chem 12:509–510
Irmann F (1965) Eine einfache Korrelation zwischen Wasserlöslichkeit und Struktur von Kohlenwasserstoffen und Halogenkohlenwasserstoffen. Chemie Ing Tech 37:789–798
Hansch C, Quinlan JE, Lawrence GL (1968) Linear free-energy relationship between partition coefficients and the aqueous solubility of organic liquids. J Org Chem 33:347–350
Ran Y, Yalkowsky SH (2001) Prediction of drug solubility by the general solubility equation (GSE). J Chem Inf Comput Sci 41:354–357
Llinàs A, Glen RC, Goodman JM (2008) Solubility challenge: can you predict Solubilities of 32 molecules using a database of 100 reliable measurements? J Chem Inf Model 48:1289–1303
Llinas A, Avdeef A (2019) Solubility challenge revisited after ten years, with multilab shake-flask data, using tight (SD ∼ 0.17 log) and loose (SD ∼ 0.62 log) test sets. J Chem Inf Model 59:3036–3040
Korotcov A, Tkachenko V, Russo DP et al (2017) Comparison of deep learning with multiple machine learning methods and metrics using diverse drug discovery data sets. Mol Pharm 14:4462–4475
Wu K, Zhao Z, Wang R et al (2018) TopP–S: persistent homology-based multi-task deep neural networks for simultaneous predictions of partition coefficient and aqueous solubility. J Comput Chem 39:1444–1454
Korolev V, Mitrofanov A, Korotcov A et al (2020) Graph convolutional neural networks as “general-purpose” property predictors: the universality and limits of applicability. J Chem Inf Model 60:22–28
Cui Q, Lu S, Ni B et al (2020) Improved prediction of aqueous solubility of novel compounds by going deeper with deep learning. Front Oncol 10:121
Montanari F, Kuhnke L, Ter Laak A et al (2020) Modeling Physico-chemical ADMET endpoints with multitask graph convolutional networks. Molecules 25:44
Avdeef A (2020) Prediction of aqueous intrinsic solubility of druglike molecules using random Forest regression trained with wiki-pS0 database. ADMET DMPK 8:29
Khurana S, Rawi R, Kunji K et al (2018) DeepSol: a deep learning framework for sequence-based protein solubility prediction. Bioinformatics 34:2605–2613
Rawi R, Mall R, Kunji K et al (2018) PaRSnIP: sequence-based protein solubility prediction using gradient boosting machine. Bioinformatics 34:1092–1098
Li X, Fourches D (2020) Inductive transfer learning for molecular activity prediction: next-gen QSAR models with MolPMoFiT. J Cheminform 12:27
Fuchs J-A, Grisoni F, Kossenjans M et al (2018) Lipophilicity prediction of peptides and peptide derivatives by consensus machine learning. Med Chem Commun 9:1538–1546
Wenzel J, Matter H, Schmidt F (2019) Predictive multitask deep neural network models for ADME-Tox properties: learning from large data sets. J Chem Inf Model 59:1253–1268
Hunt PA, Segall MD, Tyzack JD (2018) WhichP450: a multi-class categorical model to predict the major metabolising CYP450 isoform for a compound. J Comput Aided Mol Des 32:537–546
Xiong Y, Qiao Y, Kihara D et al (2019) Survey of machine learning techniques for prediction of the isoform specificity of cytochrome P450 substrates. Curr Drug Metab 20:229–235
Rydberg P, Gloriam DE, Olsen L (2010) The SMARTCyp cytochrome P450 metabolism prediction server. Bioinformatics 26:2988–2989
Rudik A, Bezhentsev V, Dmitriev A et al (2018) Metatox - web application for generation of metabolic pathways and toxicity estimation. J Bioinforma Comput Biol 17:1940001
Madzhidov TI, Khakimova AA, Nugmanov RI et al (2018) Prediction of aromatic hydroxylation sites for human CYP1A2 substrates using condensed graph of reactions. Bionanoscience 8:384–389
Matlock MK, Hughes TB, Swamidass SJ (2015) XenoSite server: a web-available site of metabolism prediction tool. Bioinformatics 31:1136–1137
Rudik AV, Dmitriev AV, Lagunin AA et al (2014) Metabolism site prediction based on xenobiotic structural formulas and PASS prediction algorithm. J Chem Inf Model 54:498–507
Finkelmann AR, Goldmann D, Schneider G et al (2018) MetScore: site of metabolism prediction beyond cytochrome P450 enzymes. ChemMedChem 13:2281–2289
Šícho M, Stork C, Mazzolari A et al (2019) FAME 3: predicting the sites of metabolism in synthetic compounds and natural products for phase 1 and phase 2 metabolic enzymes. J Chem Inf Model 59:3400–3412
Flynn NR, Le Dang N, Ward MD et al (2020) XenoNet: inference and likelihood of intermediate metabolite formation. J Chem Inf Model 60:3431–3449
Djoumbou-Feunang Y, Fiamoncini J, Gil-de-la-Fuente A et al (2019) BioTransformer: a comprehensive computational tool for small molecule metabolism prediction and metabolite identification. J Cheminform 11:2
Marchant CA, Briggs KA, Long A (2008) In silico tools for sharing data and knowledge on toxicity and metabolism: derek for windows, meteor, and vitic. Toxicol Mech Methods 18:177–187
de Bruyn Kops C, Stork C, Šícho M et al (2019) GLORY: generator of the structures of likely cytochrome P450 metabolites based on predicted sites of metabolism. Front Chem 7:402
Šícho M, de Bruyn Kops C, Stork C et al (2017) FAME 2: simple and effective machine learning model of cytochrome P450 Regioselectivity. J Chem Inf Model 57:1832–1846
Hartung T (2019) Predicting toxicity of chemicals: software beats animal testing. EFSA J 17:e170710
Lee H-M, Yu M-S, Kazmi SR et al (2019) Computational determination of hERG-related cardiotoxicity of drug candidates. BMC Bioinformatics 20:250
Ogura K, Sato T, Yuki H et al (2019) Support vector machine model for hERG inhibitory activities based on the integrated hERG database using descriptor selection by NSGA-II. Sci Rep 9:12220
Zhang Y, Zhao J, Wang Y et al (2019) Prediction of hERG K+ channel blockage using deep neural networks. Chem Biol Drug Des 94:1973–1985
Fourches D, Barnes JC, Day NC et al (2010) Cheminformatics analysis of assertions mined from literature that describe drug-induced liver injury in different species. Chem Res Toxicol 23:171–183
Kim E, Nam H (2017) Prediction models for drug-induced hepatotoxicity by using weighted molecular fingerprints. BMC Bioinformatics 18:227
Low Y, Uehara T, Minowa Y et al (2011) Predicting drug-induced hepatotoxicity using QSAR and toxicogenomics approaches. Chem Res Toxicol 24:1251–1262
Muller C, Pekthong D, Alexandre E et al (2015) Prediction of drug induced liver injury using molecular and biological descriptors. Comb Chem High Throughput Screen 18:315–322
Wang H, Liu R, Schyman P et al (2019) Deep neural network models for predicting chemically induced liver toxicity endpoints from transcriptomic responses. Front Pharmacol 10:42
Nguyen-Vo T-H, Nguyen L, Do N et al (2020) Predicting drug-induced liver injury using convolutional neural network and molecular fingerprint-embedded features. ACS Omega 5:25432–25439
Lei T, Li Y, Song Y et al (2016) ADMET evaluation in drug discovery: 15. Accurate prediction of rat oral acute toxicity using relevance vector machine and consensus modeling. J Cheminform 8:6
Fan T, Sun G, Zhao L et al (2018) QSAR and classification study on prediction of acute Oral toxicity of N-Nitroso compounds. Int J Mol Sci 19:3015
García-Jacas CR, Marrero-Ponce Y, Cortés-Guzmán F et al (2019) Enhancing acute Oral toxicity predictions by using consensus modeling and algebraic form-based 0D-to-2D molecular encodes. Chem Res Toxicol 32:1178–1192
Lunghini F, Marcou G, Azam P et al (2019) Consensus models to predict oral rat acute toxicity and validation on a dataset coming from the industrial context. SAR QSAR Environ Res 30:879–897
Wu K, Wei G-W (2018) Quantitative toxicity prediction using topology based multitask deep neural networks. J Chem Inf Model 58:520–531
Xu Y, Pei J, Lai L (2017) Deep learning based regression and multiclass models for acute Oral toxicity prediction with automatic chemical feature extraction. J Chem Inf Model 57:2672–2685
Sosnin S, Karlov D, Tetko IV et al (2019) Comparative study of multitask toxicity modeling on a broad chemical space. J Chem Inf Model 59:1062–1072
Carnesecchi E, Raitano G, Gamba A et al (2020) Evaluation of non-commercial models for genotoxicity and carcinogenicity in the assessment of EFSA’s databases. SAR QSAR Environ Res 31:33–48
Honma M, Kitazawa A, Cayley A et al (2019) Improvement of quantitative structure-activity relationship (QSAR) tools for predicting Ames mutagenicity: outcomes of the Ames/QSAR international challenge project. Mutagenesis 34:3–16
Verheyen GR, Braeken E, Van Deun K et al (2017) Evaluation of existing (Q)SAR models for skin and eye irritation and corrosion to use for REACH registration. Toxicol Lett 265:47–52
Piir G, Sild S, Maran U (2021) Binary and multi-class classification for androgen receptor agonists, antagonists and binders. Chemosphere 262:128313
Mazzolari A, Vistoli G, Testa B et al (2018) Prediction of the formation of reactive metabolites by a novel classifier approach based on enrichment factor optimization (EFO) as implemented in the VEGA program. Molecules 23:2955
Yuan Q, Wei Z, Guan X et al (2019) Toxicity prediction method based on Multi-Channel convolutional neural network. Molecules 24:3383
Watanabe R, Ohashi R, Esaki T et al (2019) Development of an in silico prediction system of human renal excretion and clearance from chemical structure information incorporating fraction unbound in plasma as a descriptor. Sci Rep 9:18782
Sun L, Yang H, Li J et al (2018) In silico prediction of compounds binding to human plasma proteins by QSAR models. ChemMedChem 13:572–581
Esposito C, Wang S, Lange UEW et al (2020) Combining machine learning and molecular dynamics to predict P-glycoprotein substrates. J Chem Inf Model 60:4730–4749
Shin M, Jang D, Nam H et al (2018) Predicting the absorption potential of chemical compounds through a deep learning approach. IEEE/ACM Trans Comput Biol Bioinforma 15:432–440
Guan L, Yang H, Cai Y et al (2019) ADMET-score – a comprehensive scoring function for evaluation of chemical drug-likeness. Med Chem Commun 10:148–157
Kar S, Leszczynski J (2020) Open access in silico tools to predict the ADMET profiling of drug candidates. Expert Opin Drug Discov 15:1473–1487
Feinberg EN, Joshi E, Pande VS et al (2020) Improvement in ADMET prediction with multitask deep Featurization. J Med Chem 63:8835–8848
Zhou Y, Cahya S, Combs SA et al (2019) Exploring tunable Hyperparameters for deep neural networks with industrial ADME data sets. J Chem Inf Model 59:1005–1016
Schütt KT, Arbabzadah F, Chmiela S et al (2017) Quantum-chemical insights from deep tensor neural networks. Nat Commun 8:13890
Blum LC, Reymond J-L (2009) 970 million Druglike small molecules for virtual screening in the chemical universe database GDB-13. J Am Chem Soc 131:8732–8733
Reymond J-L (2015) The chemical space project. Acc Chem Res 48:722–730
Ramakrishnan R, Dral PO, Rupp M et al (2014) Quantum chemistry structures and properties of 134 kilo molecules. Sci Data 1:140022
Smith JS, Isayev O, Roitberg AE (2017) ANI-1: an extensible neural network potential with DFT accuracy at force field computational cost. Chem Sci 8:3192–3203
Fink T, Reymond J-L (2007) Virtual exploration of the chemical universe up to 11 atoms of C, N, O, F: assembly of 26.4 million structures (110.9 million stereoisomers) and analysis for new ring systems, stereochemistry, physicochemical properties, compound classes, and drug Discov. J Chem Inf Model 47:342–353
Gebauer NWA, Gastegger M, and Schütt KT (2018) Generating equilibrium molecules with deep neural networks arXiv 1810.11347
Schütt KT, Sauceda HE, Kindermans P-J et al (2018) SchNet - a deep learning architecture for molecules and materials. J Chem Phys 148:241722
Bleiziffer P, Schaller K, Riniker S (2018) Machine learning of partial charges derived from high-quality quantum-mechanical calculations. J Chem Inf Model 58:579–590
Irwin JJ, Shoichet BK (2005) ZINC—a free database of commercially available compounds for virtual screening. J Chem Inf Model 45:177–182
Gaulton A, Bellis LJ, Bento AP et al (2012) ChEMBL: a large-scale bioactivity database for drug discovery. Nucleic Acids Res 40:D1100–D1107
Callaway E (2020), It will change everything’: DeepMind’s AI makes gigantic leap in solving protein structures. https://www.nature.com/articles/d41586-020-03348-4
Acknowledgments
The authors thank Laurianne David and Martin Kotev (Evotec (France) SAS, Toulouse, France) for their contribution to the manuscript.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature
About this protocol
Cite this protocol
Muller, C., Rabal, O., Diaz Gonzalez, C. (2022). Artificial Intelligence, Machine Learning, and Deep Learning in Real-Life Drug Design Cases. In: Heifetz, A. (eds) Artificial Intelligence in Drug Design. Methods in Molecular Biology, vol 2390. Humana, New York, NY. https://doi.org/10.1007/978-1-0716-1787-8_16
Download citation
DOI: https://doi.org/10.1007/978-1-0716-1787-8_16
Published:
Publisher Name: Humana, New York, NY
Print ISBN: 978-1-0716-1786-1
Online ISBN: 978-1-0716-1787-8
eBook Packages: Springer Protocols