Artificial Intelligence, Machine Learning, and Deep Learning in Real-Life Drug Design Cases

Muller, Christophe; Rabal, Obdulia; Diaz Gonzalez, Constantino

doi:10.1007/978-1-0716-1787-8_16

Artificial Intelligence, Machine Learning, and Deep Learning in Real-Life Drug Design Cases

Christophe Muller³,
Obdulia Rabal³ &
Constantino Diaz Gonzalez³

Protocol
First Online: 04 November 2021

6427 Accesses
12 Citations
10 Altmetric

Part of the book series: Methods in Molecular Biology ((MIMB,volume 2390))

Abstract

The discovery and development of drugs is a long and expensive process with a high attrition rate. Computational drug discovery contributes to ligand discovery and optimization, by using models that describe the properties of ligands and their interactions with biological targets. In recent years, artificial intelligence (AI) has made remarkable modeling progress, driven by new algorithms and by the increase in computing power and storage capacities, which allow the processing of large amounts of data in a short time. This review provides the current state of the art of AI methods applied to drug discovery, with a focus on structure- and ligand-based virtual screening, library design and high-throughput analysis, drug repurposing and drug sensitivity, de novo design, chemical reactions and synthetic accessibility, ADMET, and quantum mechanics.

This is a preview of subscription content, log in via an institution.

Protocol: USD 49.95; Price excludes VAT (USA)

eBook: USD 139.00; Price excludes VAT (USA)

Softcover Book: USD 179.99; Price excludes VAT (USA)

Hardcover Book: USD 249.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Springer Nature is developing a new tool to find and evaluate Protocols. Learn more

Abbreviations

ADMET:: absorption, distribution, metabolism, excretion, toxicity
AI:: artificial intelligence
ANN :: artificial neural network
AUC:: area under the curve
AUROC:: area under the receiver operating characteristic curve
BA:: balanced accuracy
BKD:: binary kernel discrimination
CCLE:: Cancer Cell Line Encyclopedia
CCLP:: COSMIC Cell Lines Project
CNN :: convolutional neural network
CV:: cross-validation
DFT :: density functional theory
DILI :: drug-induced liver injury
DL:: deep learning
DMTA :: design–make–test–analyze
DNN :: deep neural network
DRD2:: dopamine receptor D2
DTNN:: deep tensor neural network
ECFP4:: extended connectivity fingerprint of diameter 4
FDA:: Food and Drug Administration
FNN:: feed-forward neural network
GCNN:: graph convolutional neural network
GDB-7:: generic database with up to 7 heavy atoms
GDSC:: genomics in drug sensitivity in cancer
GENTRL :: generative tensorial reinforcement learning
GPU :: graphics processing unit
GSE:: general solubility equation
hERG :: human Ether-à-go-go-Related Gene
HTS :: high-throughput screening
JAK:: Janus kinase
KNN:: k-nearest neighbor
LBVS :: ligand-based virtual screening
LINCS:: library of integrated network-based cellular signatures
LSTM :: long short-term memory
MCC:: Matthews correlation coefficient
MeSH:: medical subject headings
ML:: machine learning
MT:: multitasks
MTDL:: multitask deep learning
MTNN:: multitask neural network
NCI-60:: National Cancer Institute 60 human cancer cell lines
PAINS :: pan-assay interference
PPAR:: peroxisome proliferator-activated receptors
PPI:: protein–protein interaction
QM :: quantum mechanics
QSAR :: quantitative structure–activity relationship
QSPR :: quantitative structure–property relationship
RF :: random forest
RL :: reinforcement learning
RNN :: recurrent neural network
ROC:: receiver operating characteristic
RXR:: retinoid X receptors
SA:: synthetic accessibility
SBVS :: structure-based virtual screening
SEA:: similarity ensemble approach
SMILES :: simplified molecular input line entry specification,
SOM :: site of metabolism
SVM :: support vector machine
SVR:: support vector regression
VS:: virtual screening

References

Vamathevan J, Clark D, Czodrowski P et al (2019) Applications of machine learning in drug discovery and development. Nat Rev Drug Discov 18:463–477
Article CAS PubMed PubMed Central Google Scholar
Lecun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521:436–444
Article CAS PubMed Google Scholar
Liu Z, Su M, Han L et al (2017) Forging the basis for developing protein-ligand interaction scoring functions. Acc Chem Res 50:302–309
Article CAS PubMed Google Scholar
Ain QU, Aleksandrova A, Roessler FD et al (2015) Machine-learning scoring functions to improve structure-based binding affinity prediction and virtual screening. Wiley Interdiscip Rev Comput Mol Sci 5:405–424
Article CAS PubMed PubMed Central Google Scholar
Shen C, Ding J, Wang Z et al (2020) From machine learning to deep learning: advances in scoring functions for protein–ligand docking. WIREs Comput Mol Sci 10:e1429
Article CAS Google Scholar
Ashtawy HM, Mahapatra NR (2015) A comparative assessment of predictive accuracies of conventional and machine learning scoring functions for protein-ligand binding affinity prediction. IEEE/ACM Trans Comput Biol Bioinforma 12:335–347
Article CAS Google Scholar
Wang C, Zhang Y (2017) Improving scoring-docking-screening powers of protein–ligand scoring functions using random forest. J Comput Chem 38:169–177
Article PubMed CAS Google Scholar
Ragoza M, Hochuli J, Idrobo E et al (2017) Protein-ligand scoring with convolutional neural networks. J Chem Inf Model 57:942–957
Article CAS PubMed PubMed Central Google Scholar
Pereira JC, Caffarena ER, Dos Santos CN (2016) Boosting docking-based virtual screening with deep learning. J Chem Inf Model 56:2495–2506
Article CAS PubMed Google Scholar
Gomes J, Ramsundar B, Feinberg EN, et al (2017) Atomic convolutional networks for predicting protein-ligand binding. arXiv e-prints 1703.10603
Google Scholar
Chen L, Cruz A, Ramsey S et al (2019) Hidden bias in the DUD-E dataset leads to misleading performance of deep learning in structure-based virtual screening. PLoS One 14:e0220113
Article CAS PubMed PubMed Central Google Scholar
Yang J, Shen C, Huang N (2020) Predicting or pretending: artificial intelligence for protein-ligand interactions lack of sufficiently large and unbiased datasets. Front Pharmacol 11:69
Article CAS PubMed PubMed Central Google Scholar
Sieg J, Flachsenberg F, Rarey M (2019) In need of bias control: evaluating chemical data for machine learning in structure-based virtual screening. J Chem Inf Model 59:947–961
Article CAS PubMed Google Scholar
Scantlebury J, Brown N, Von Delft F et al (2020) Data set augmentation allows deep learning-based virtual screening to better generalize to unseen target classes and highlight important binding interactions. J Chem Inf Model 60:3722–3730
Article CAS PubMed PubMed Central Google Scholar
Gentile F, Agrawal V, Hsing M et al (2020) Deep docking: a deep learning platform for augmentation of structure based drug discovery. ACS Cent Sci 6:939–949
Article CAS PubMed PubMed Central Google Scholar
Ton AT, Gentile F, Hsing M et al (2020) Rapid identification of potential inhibitors of SARS-CoV-2 Main protease by deep docking of 1.3 billion compounds. Mol Inform 39:e2000028
Article PubMed CAS Google Scholar
Dahl GE, Jaitly N, and Salakhutdinov R (2014) Multi-task Neural Networks for QSAR Predictions. arXiv 1406.1231
Google Scholar
Rodríguez-Pérez R, Bajorath J (2019) Multitask machine learning for classifying highly and weakly potent kinase inhibitors. ACS Omega 4:4367–4375
Article CAS Google Scholar
Keshavarzi Arshadi A, Salem M, Collins J et al (2020) DeepMalaria: artificial intelligence driven discovery of potent Antiplasmodials. Front Pharmacol 10:1526
Article PubMed PubMed Central CAS Google Scholar
Miljković F, Rodríguez-Pérez R, Bajorath J (2020) Machine learning models for accurate prediction of kinase inhibitors with different binding modes. J Med Chem 63:8738–8748
Article PubMed CAS Google Scholar
Aldrich C, Bertozzi C, Georg GI et al (2017) The ecstasy and agony of assay interference compounds. J Chem Inf Model 57:387–390
Article CAS PubMed Google Scholar
Yang Z-Y, He J-H, Lu A-P et al (2020) Frequent hitters: nuisance artifacts in high-throughput screening. Drug Discov Today 25:657–667
Article CAS PubMed Google Scholar
Stork C, Chen Y, Šícho M et al (2019) Hit Dexter 2.0: machine-learning models for the prediction of frequent hitters. J Chem Inf Model 59:1030–1043
Article CAS PubMed Google Scholar
Blaschke T, Miljković F, Bajorath J (2019) Prediction of different classes of promiscuous and nonpromiscuous compounds using machine learning and nearest neighbor analysis. ACS Omega 4:6883–6890
Article CAS Google Scholar
Borrel A, Huang R, Sakamuru S et al (2020) High-throughput screening to predict chemical-assay interference. Sci Rep 10:3986
Article CAS PubMed PubMed Central Google Scholar
Borrel A, Mansouri K, Nolte S et al (2020) InterPred: a webtool to predict chemical autofluorescence and luminescence interference. Nucleic Acids Res 48:W586–W590
Article CAS PubMed PubMed Central Google Scholar
Lipinski CA, Lombardo F, Dominy BW et al (2001) Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings1PII of original article: S0169-409X(96)00423-1. The article was originally published in advanced drug delivery reviews 23 (1997) 3. Adv Drug Deliv Rev 46:3–26
Article CAS PubMed Google Scholar
Zhang X, Betzi S, Morelli X et al (2014) Focused chemical libraries--design and enrichment: an example of protein-protein interaction chemical space. Future Med Chem 6:1291–1307
Article CAS PubMed Google Scholar
Villoutreix BO, Labbe CM, Lagorce D et al (2012) A leap into the chemical space of protein-protein interaction inhibitors. Curr Pharm Des 18:4648–4667
Article CAS PubMed PubMed Central Google Scholar
Bosc N, Muller C, Hoffer L et al (2020) Fr-PPIChem: an academic compound library dedicated to protein-protein interactions. ACS Chem Biol 15:1566–1574
Article CAS PubMed PubMed Central Google Scholar
Nidhi GM, Davies JW et al (2006) Prediction of biological targets for compounds using multiple-category bayesian models trained on chemogenomics databases. J Chem Inf Model 46:1124–1133
Article CAS PubMed Google Scholar
Zhang P, Wang F, Hu J (2014) Towards drug repositioning: a unified computational framework for integrating multiple aspects of drug similarity and disease similarity. AMIA Annu Symp Proc 2014:1258–1267
PubMed PubMed Central Google Scholar
Napolitano F, Zhao Y, Moreira VM et al (2013) Drug repositioning: a machine-learning approach through data integration. J Cheminform 5:30
Article CAS PubMed PubMed Central Google Scholar
Jarada TN, Rokne JG, Alhajj R (2020) A review of computational drug repositioning: strategies, approaches, opportunities, challenges, and directions. J Cheminform 12:46
Article CAS PubMed PubMed Central Google Scholar
Unterthiner T, Mayr A, Klambauer G et al (2014) Deep learning as an opportunity in virtual screening. In: Conference: Workshop on Deep Learning and Representation Learning (NIPS2014)
Google Scholar
Allen BK, Ayad NG, and Schürer SC (2019) Kinome-wide activity classification of small molecules by deep learning. bioRxiv
Google Scholar
Rifaioglu AS, Nalbat E, Atalay V et al (2020) DEEPScreen: high performance drug-target interaction prediction with convolutional neural networks using 2-D structural compound representations. Chem Sci 11:2531–2557
Article CAS PubMed PubMed Central Google Scholar
Stokes JM, Yang K, Swanson K et al (2020) A deep learning approach to antibiotic discovery. Cell 180:688–702.e13
Article CAS PubMed PubMed Central Google Scholar
Hu S, Zhang C, Chen P et al (2019) Predicting drug-target interactions from drug structure and protein sequence using novel convolutional neural networks. BMC Bioinformatics 20:689
Article CAS PubMed PubMed Central Google Scholar
Aliper A, Plis S, Artemov A et al (2016) Deep learning applications for predicting pharmacological properties of drugs and drug repurposing using transcriptomic data. Mol Pharm 13:2524–2530
Article CAS PubMed PubMed Central Google Scholar
Meyer JG, Liu S, Miller IJ et al (2019) Learning drug functions from chemical structures with convolutional neural networks and random forests. J Chem Inf Model 59:4438–4449
Article CAS PubMed PubMed Central Google Scholar
Yang W, Soares J, Greninger P et al (2013) Genomics of drug sensitivity in cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells. Nucleic Acids Res 41:D955–D961
Article CAS PubMed Google Scholar
Tate JG, Bamford S, Jubb HC et al (2019) COSMIC: the catalogue of somatic mutations in cancer. Nucleic Acids Res 47:D941–D947
Article CAS PubMed Google Scholar
Barretina J, Caponigro G, Stransky N et al (2012) The cancer cell line encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature 483:603–607
Article CAS PubMed PubMed Central Google Scholar
Shoemaker RH (2006) The NCI60 human tumour cell line anticancer drug screen. Nat Rev Cancer 6:813–823
Article CAS PubMed Google Scholar
Garnett MJ, Edelman EJ, Heidorn SJ et al (2012) Systematic identification of genomic markers of drug sensitivity in cancer cells. Nature 483:570–575
Article CAS PubMed PubMed Central Google Scholar
Iorio F, Knijnenburg TA, Vis DJ et al (2016) A landscape of Pharmacogenomic interactions in cancer. Cell 166:740–754
Article CAS PubMed PubMed Central Google Scholar
Rahman R, Matlock K, Ghosh S et al (2017) Heterogeneity aware random forest for drug sensitivity prediction. Sci Rep 7:11347
Article PubMed PubMed Central CAS Google Scholar
Costello JC, Heiser LM, Georgii E et al (2014) A community effort to assess and improve drug sensitivity prediction algorithms. Nat Biotechnol 32:1202–1212
Article CAS PubMed PubMed Central Google Scholar
Menden MP, Iorio F, Garnett M et al (2013) Machine learning prediction of cancer cell sensitivity to drugs based on genomic and chemical properties. PLoS One 8:e61318
Article CAS PubMed PubMed Central Google Scholar
Cortés-Ciriano I, Van Westen GJP, Bouvier G et al (2016) Improved large-scale prediction of growth inhibition patterns using the NCI60 cancer cell line panel. Bioinformatics 32:85–95
PubMed Google Scholar
Chang Y, Park H, Yang HJ et al (2018) Cancer drug response profile scan (CDRscan): a deep learning model that predicts drug effectiveness from cancer genomic signature. Sci Rep 8:8857
Article PubMed PubMed Central CAS Google Scholar
Liu P, Li H, Li S et al (2019) Improving prediction of phenotypic drug response on cancer cell lines using deep convolutional network. BMC Bioinformatics 20:408
Article PubMed PubMed Central CAS Google Scholar
Garcia-Alonso L, Iorio F, Matchan A et al (2018) Transcription factor activities enhance markers of drug sensitivity in cancer. Cancer Res 78:769–780
Article CAS PubMed Google Scholar
Besnard J, Ruda GF, Setola V et al (2012) Automated design of ligands to polypharmacological profiles. Nature 492:215–220
Article CAS PubMed Google Scholar
Hartenfeller M, Zettl H, Walter M et al (2012) Dogs: reaction-driven de novo design of bioactive compounds. PLoS Comput Biol 8:e1002380
Article CAS PubMed PubMed Central Google Scholar
Zhavoronkov A, Ivanenkov YA, Aliper A et al (2019) Deep learning enables rapid identification of potent DDR1 kinase inhibitors. Nat Biotechnol 37:1038–1040
Article CAS PubMed Google Scholar
Walters WP, Murcko M (2020) Assessing the impact of generative AI on medicinal chemistry. Nat Biotechnol 38:143–145
Article CAS PubMed Google Scholar
Elton DC, Boukouvalas Z, Fuge MD et al (2019) Deep learning for molecular design - a review of the state of the art. Mol Syst Des Eng 4:828–849
Article CAS Google Scholar
Bian Y and Xie X-Q (2020) Generative chemistry: drug discovery with deep learning generative models arXiv 2008.09000
Google Scholar
Gómez-Bombarelli R, Wei JN, Duvenaud D et al (2018) Automatic chemical design using a data-driven continuous representation of molecules. ACS Cent Sci 4:268–276
Article PubMed PubMed Central CAS Google Scholar
Segler MHS, Kogej T, Tyrchan C et al (2018) Generating focused molecule libraries for drug discovery with recurrent neural networks. ACS Cent Sci 4:120–131
Article CAS PubMed Google Scholar
Merk D, Friedrich L, Grisoni F et al (2018) De novo design of bioactive small molecules by artificial intelligence. Mol Inform 37:1700153–1700154
Article PubMed Central CAS Google Scholar
Olivecrona M, Blaschke T, Engkvist O et al (2017) Molecular de-novo design through deep reinforcement learning. J Cheminform 9:48
Article PubMed PubMed Central Google Scholar
Blaschke T, Arús-Pous J, Chen H et al (2020) REINVENT 2.0: an AI tool for De novo drug design. J Chem Inf Model 60:5918–5922
Article CAS PubMed Google Scholar
Cao N de and Kipf T (2018) MolGAN: An implicit generative model for small molecular graphs. arXiv 1805.11973
Google Scholar
Zhou Z, Kearnes S, Li L et al (2019) Optimization of molecules via deep reinforcement learning. Sci Rep 9:10752
Article PubMed PubMed Central CAS Google Scholar
Méndez-Lucio O, Baillif B, Clevert DA et al (2020) De novo generation of hit-like molecules from gene expression signatures using artificial intelligence. Nat Commun 11:10
Article PubMed PubMed Central CAS Google Scholar
Benhenda M (2017) ChemGAN challenge for drug discovery: can AI reproduce natural chemical diversity? arXiv 1708.08227
Google Scholar
Brown N, Fiscato M, Segler MHS et al (2019) GuacaMol: benchmarking models for de novo molecular design. J Chem Inf Model 59:1096–1108
Article CAS PubMed Google Scholar
Gottipati SK, Sattarov B, Niu S, et al (2020) Learning To Navigate The Synthetically Accessible Chemical Space Using Reinforcement Learning. arXiv 2004.12485
Google Scholar
Corey EJ, Wipke WT (1969) Computer-assisted design of complex organic syntheses. Science 166:178–192
Article CAS PubMed Google Scholar
Ertl P, Schuffenhauer A (2009) Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions. J Cheminform 1:8
Article PubMed PubMed Central CAS Google Scholar
Fukunishi Y, Kurosawa T, Mikami Y et al (2014) Prediction of synthetic accessibility based on commercially available compound databases. J Chem Inf Model 54:3259–3267
Article CAS PubMed Google Scholar
Sheridan RP, Zorn N, Sherer EC et al (2014) Modeling a crowdsourced definition of molecular complexity. J Chem Inf Model 54:1604–1616
Article CAS PubMed Google Scholar
Coley CW, Rogers L, Green WH et al (2018) SCScore: synthetic complexity learned from a reaction corpus. J Chem Inf Model 58:252–261
Article CAS PubMed Google Scholar
Segler MHS, Waller MP (2017) Neural-symbolic machine learning for retrosynthesis and reaction prediction. Chemistry 23:5966–5971
Article CAS PubMed Google Scholar
Segler MHS, Preuss M, Waller MP (2018) Planning chemical syntheses with deep neural networks and symbolic AI. Nature 555:604–610
Article CAS PubMed Google Scholar
Fooshee D, Mood A, Gutman E et al (2018) Deep learning for chemical reaction prediction. Mol Syst Des Eng 3:442–452
Article CAS Google Scholar
Schwaller P, Laino T, Gaudin T et al (2019) Molecular transformer: a model for uncertainty-calibrated chemical reaction prediction. ACS Cent Sci 5:1572–1583
Article CAS PubMed PubMed Central Google Scholar
Segler MHS, Waller MP (2017) Modelling chemical reasoning to predict and invent reactions. Chemistry 23:6118–6128
Article CAS PubMed Google Scholar
Ahneman DT, Estrada JG, Lin S et al (2018) Predicting reaction performance in C–N cross-coupling using machine learning. Science 360:186 LP–190 LP
Article CAS Google Scholar
Sandfort F, Strieth-Kalthoff F, Kühnemund M et al (2020) A structure-based platform for predicting chemical reactivity. Chem 6:1379–1390
Article CAS Google Scholar
Reker D, Bernardes G, and Rodrigues T (2018) Evolving and Nano data enabled machine intelligence for chemical reaction optimization. ChemRxiv
Google Scholar
Gao H, Struble TJ, Coley CW et al (2018) Using machine learning to predict suitable conditions for organic reactions. ACS Cent Sci 4:1465–1476
Article CAS PubMed PubMed Central Google Scholar
Zhou Z, Li X, Zare RN (2017) Optimizing chemical reactions with deep reinforcement learning. ACS Cent Sci 3:1337–1344
Article CAS PubMed PubMed Central Google Scholar
Gao W, Coley CW (2020) The synthesizability of molecules proposed by generative models. J Chem Inf Model 60:5714–5723
Article CAS PubMed Google Scholar
Korovina K, Xu S, Kandasamy K, et al (2019) ChemBO: Bayesian Optimization of Small Organic Molecules with Synthesizable Recommendations arXiv 1908.01425
Google Scholar
Zubatyuk R, Smith J, Nebgen B, et al (2020) Teaching a neural network to attach and detach electrons from molecules. ChemRxiv
Google Scholar
Genheden S, Thakkar A, Chadimova V, et al (2020) AiZynthFinder: A Fast Robust and Flexible Open-Source Software for Retrosynthetic Planning. ChemRxiv
Google Scholar
Thakkar A, Selmi N, Reymond J-L et al (2020) “Ring breaker”: neural network driven synthesis prediction of the ring system chemical space. J Med Chem 63:8791–8808
Article CAS PubMed Google Scholar
Gale EM, Durand DJ (2020) Improving reaction prediction. Nat Chem 12:509–510
Article CAS PubMed Google Scholar
Irmann F (1965) Eine einfache Korrelation zwischen Wasserlöslichkeit und Struktur von Kohlenwasserstoffen und Halogenkohlenwasserstoffen. Chemie Ing Tech 37:789–798
Article CAS Google Scholar
Hansch C, Quinlan JE, Lawrence GL (1968) Linear free-energy relationship between partition coefficients and the aqueous solubility of organic liquids. J Org Chem 33:347–350
Article CAS Google Scholar
Ran Y, Yalkowsky SH (2001) Prediction of drug solubility by the general solubility equation (GSE). J Chem Inf Comput Sci 41:354–357
Article CAS PubMed Google Scholar
Llinàs A, Glen RC, Goodman JM (2008) Solubility challenge: can you predict Solubilities of 32 molecules using a database of 100 reliable measurements? J Chem Inf Model 48:1289–1303
Article PubMed CAS Google Scholar
Llinas A, Avdeef A (2019) Solubility challenge revisited after ten years, with multilab shake-flask data, using tight (SD ∼ 0.17 log) and loose (SD ∼ 0.62 log) test sets. J Chem Inf Model 59:3036–3040
Article CAS PubMed Google Scholar
Korotcov A, Tkachenko V, Russo DP et al (2017) Comparison of deep learning with multiple machine learning methods and metrics using diverse drug discovery data sets. Mol Pharm 14:4462–4475
Article CAS PubMed PubMed Central Google Scholar
Wu K, Zhao Z, Wang R et al (2018) TopP–S: persistent homology-based multi-task deep neural networks for simultaneous predictions of partition coefficient and aqueous solubility. J Comput Chem 39:1444–1454
Article CAS PubMed Google Scholar
Korolev V, Mitrofanov A, Korotcov A et al (2020) Graph convolutional neural networks as “general-purpose” property predictors: the universality and limits of applicability. J Chem Inf Model 60:22–28
Article CAS PubMed Google Scholar
Cui Q, Lu S, Ni B et al (2020) Improved prediction of aqueous solubility of novel compounds by going deeper with deep learning. Front Oncol 10:121
Article PubMed PubMed Central Google Scholar
Montanari F, Kuhnke L, Ter Laak A et al (2020) Modeling Physico-chemical ADMET endpoints with multitask graph convolutional networks. Molecules 25:44
Article CAS Google Scholar
Avdeef A (2020) Prediction of aqueous intrinsic solubility of druglike molecules using random Forest regression trained with wiki-pS0 database. ADMET DMPK 8:29
Article PubMed PubMed Central Google Scholar
Khurana S, Rawi R, Kunji K et al (2018) DeepSol: a deep learning framework for sequence-based protein solubility prediction. Bioinformatics 34:2605–2613
Article CAS PubMed PubMed Central Google Scholar
Rawi R, Mall R, Kunji K et al (2018) PaRSnIP: sequence-based protein solubility prediction using gradient boosting machine. Bioinformatics 34:1092–1098
Article CAS PubMed Google Scholar
Li X, Fourches D (2020) Inductive transfer learning for molecular activity prediction: next-gen QSAR models with MolPMoFiT. J Cheminform 12:27
Article CAS PubMed PubMed Central Google Scholar
Fuchs J-A, Grisoni F, Kossenjans M et al (2018) Lipophilicity prediction of peptides and peptide derivatives by consensus machine learning. Med Chem Commun 9:1538–1546
Article CAS Google Scholar
Wenzel J, Matter H, Schmidt F (2019) Predictive multitask deep neural network models for ADME-Tox properties: learning from large data sets. J Chem Inf Model 59:1253–1268
Article CAS PubMed Google Scholar
Hunt PA, Segall MD, Tyzack JD (2018) WhichP450: a multi-class categorical model to predict the major metabolising CYP450 isoform for a compound. J Comput Aided Mol Des 32:537–546
Article CAS PubMed Google Scholar
Xiong Y, Qiao Y, Kihara D et al (2019) Survey of machine learning techniques for prediction of the isoform specificity of cytochrome P450 substrates. Curr Drug Metab 20:229–235
Article CAS PubMed Google Scholar
Rydberg P, Gloriam DE, Olsen L (2010) The SMARTCyp cytochrome P450 metabolism prediction server. Bioinformatics 26:2988–2989
Article CAS PubMed Google Scholar
Rudik A, Bezhentsev V, Dmitriev A et al (2018) Metatox - web application for generation of metabolic pathways and toxicity estimation. J Bioinforma Comput Biol 17:1940001
Article CAS Google Scholar
Madzhidov TI, Khakimova AA, Nugmanov RI et al (2018) Prediction of aromatic hydroxylation sites for human CYP1A2 substrates using condensed graph of reactions. Bionanoscience 8:384–389
Article Google Scholar
Matlock MK, Hughes TB, Swamidass SJ (2015) XenoSite server: a web-available site of metabolism prediction tool. Bioinformatics 31:1136–1137
Article CAS PubMed Google Scholar
Rudik AV, Dmitriev AV, Lagunin AA et al (2014) Metabolism site prediction based on xenobiotic structural formulas and PASS prediction algorithm. J Chem Inf Model 54:498–507
Article CAS PubMed Google Scholar
Finkelmann AR, Goldmann D, Schneider G et al (2018) MetScore: site of metabolism prediction beyond cytochrome P450 enzymes. ChemMedChem 13:2281–2289
Article CAS PubMed Google Scholar
Šícho M, Stork C, Mazzolari A et al (2019) FAME 3: predicting the sites of metabolism in synthetic compounds and natural products for phase 1 and phase 2 metabolic enzymes. J Chem Inf Model 59:3400–3412
Article PubMed CAS Google Scholar
Flynn NR, Le Dang N, Ward MD et al (2020) XenoNet: inference and likelihood of intermediate metabolite formation. J Chem Inf Model 60:3431–3449
Article CAS PubMed PubMed Central Google Scholar
Djoumbou-Feunang Y, Fiamoncini J, Gil-de-la-Fuente A et al (2019) BioTransformer: a comprehensive computational tool for small molecule metabolism prediction and metabolite identification. J Cheminform 11:2
Article PubMed PubMed Central Google Scholar
Marchant CA, Briggs KA, Long A (2008) In silico tools for sharing data and knowledge on toxicity and metabolism: derek for windows, meteor, and vitic. Toxicol Mech Methods 18:177–187
Article CAS PubMed Google Scholar
de Bruyn Kops C, Stork C, Šícho M et al (2019) GLORY: generator of the structures of likely cytochrome P450 metabolites based on predicted sites of metabolism. Front Chem 7:402
Article PubMed PubMed Central CAS Google Scholar
Šícho M, de Bruyn Kops C, Stork C et al (2017) FAME 2: simple and effective machine learning model of cytochrome P450 Regioselectivity. J Chem Inf Model 57:1832–1846
Article PubMed CAS Google Scholar
Hartung T (2019) Predicting toxicity of chemicals: software beats animal testing. EFSA J 17:e170710
Article PubMed PubMed Central Google Scholar
Lee H-M, Yu M-S, Kazmi SR et al (2019) Computational determination of hERG-related cardiotoxicity of drug candidates. BMC Bioinformatics 20:250
Article PubMed PubMed Central CAS Google Scholar
Ogura K, Sato T, Yuki H et al (2019) Support vector machine model for hERG inhibitory activities based on the integrated hERG database using descriptor selection by NSGA-II. Sci Rep 9:12220
Article PubMed PubMed Central CAS Google Scholar
Zhang Y, Zhao J, Wang Y et al (2019) Prediction of hERG K+ channel blockage using deep neural networks. Chem Biol Drug Des 94:1973–1985
Article CAS PubMed Google Scholar
Fourches D, Barnes JC, Day NC et al (2010) Cheminformatics analysis of assertions mined from literature that describe drug-induced liver injury in different species. Chem Res Toxicol 23:171–183
Article CAS PubMed PubMed Central Google Scholar
Kim E, Nam H (2017) Prediction models for drug-induced hepatotoxicity by using weighted molecular fingerprints. BMC Bioinformatics 18:227
Article PubMed PubMed Central CAS Google Scholar
Low Y, Uehara T, Minowa Y et al (2011) Predicting drug-induced hepatotoxicity using QSAR and toxicogenomics approaches. Chem Res Toxicol 24:1251–1262
Article CAS PubMed PubMed Central Google Scholar
Muller C, Pekthong D, Alexandre E et al (2015) Prediction of drug induced liver injury using molecular and biological descriptors. Comb Chem High Throughput Screen 18:315–322
Article CAS PubMed Google Scholar
Wang H, Liu R, Schyman P et al (2019) Deep neural network models for predicting chemically induced liver toxicity endpoints from transcriptomic responses. Front Pharmacol 10:42
Article PubMed PubMed Central CAS Google Scholar
Nguyen-Vo T-H, Nguyen L, Do N et al (2020) Predicting drug-induced liver injury using convolutional neural network and molecular fingerprint-embedded features. ACS Omega 5:25432–25439
Article CAS PubMed PubMed Central Google Scholar
Lei T, Li Y, Song Y et al (2016) ADMET evaluation in drug discovery: 15. Accurate prediction of rat oral acute toxicity using relevance vector machine and consensus modeling. J Cheminform 8:6
Article PubMed PubMed Central CAS Google Scholar
Fan T, Sun G, Zhao L et al (2018) QSAR and classification study on prediction of acute Oral toxicity of N-Nitroso compounds. Int J Mol Sci 19:3015
Article PubMed Central CAS Google Scholar
García-Jacas CR, Marrero-Ponce Y, Cortés-Guzmán F et al (2019) Enhancing acute Oral toxicity predictions by using consensus modeling and algebraic form-based 0D-to-2D molecular encodes. Chem Res Toxicol 32:1178–1192
Article PubMed CAS Google Scholar
Lunghini F, Marcou G, Azam P et al (2019) Consensus models to predict oral rat acute toxicity and validation on a dataset coming from the industrial context. SAR QSAR Environ Res 30:879–897
Article CAS PubMed Google Scholar
Wu K, Wei G-W (2018) Quantitative toxicity prediction using topology based multitask deep neural networks. J Chem Inf Model 58:520–531
Article CAS PubMed Google Scholar
Xu Y, Pei J, Lai L (2017) Deep learning based regression and multiclass models for acute Oral toxicity prediction with automatic chemical feature extraction. J Chem Inf Model 57:2672–2685
Article CAS PubMed Google Scholar
Sosnin S, Karlov D, Tetko IV et al (2019) Comparative study of multitask toxicity modeling on a broad chemical space. J Chem Inf Model 59:1062–1072
Article CAS PubMed Google Scholar
Carnesecchi E, Raitano G, Gamba A et al (2020) Evaluation of non-commercial models for genotoxicity and carcinogenicity in the assessment of EFSA’s databases. SAR QSAR Environ Res 31:33–48
Article CAS PubMed Google Scholar
Honma M, Kitazawa A, Cayley A et al (2019) Improvement of quantitative structure-activity relationship (QSAR) tools for predicting Ames mutagenicity: outcomes of the Ames/QSAR international challenge project. Mutagenesis 34:3–16
Article CAS PubMed Google Scholar
Verheyen GR, Braeken E, Van Deun K et al (2017) Evaluation of existing (Q)SAR models for skin and eye irritation and corrosion to use for REACH registration. Toxicol Lett 265:47–52
Article CAS PubMed Google Scholar
Piir G, Sild S, Maran U (2021) Binary and multi-class classification for androgen receptor agonists, antagonists and binders. Chemosphere 262:128313
Article CAS PubMed Google Scholar
Mazzolari A, Vistoli G, Testa B et al (2018) Prediction of the formation of reactive metabolites by a novel classifier approach based on enrichment factor optimization (EFO) as implemented in the VEGA program. Molecules 23:2955
Article PubMed Central CAS Google Scholar
Yuan Q, Wei Z, Guan X et al (2019) Toxicity prediction method based on Multi-Channel convolutional neural network. Molecules 24:3383
Article CAS PubMed Central Google Scholar
Watanabe R, Ohashi R, Esaki T et al (2019) Development of an in silico prediction system of human renal excretion and clearance from chemical structure information incorporating fraction unbound in plasma as a descriptor. Sci Rep 9:18782
Article CAS PubMed PubMed Central Google Scholar
Sun L, Yang H, Li J et al (2018) In silico prediction of compounds binding to human plasma proteins by QSAR models. ChemMedChem 13:572–581
Article CAS PubMed Google Scholar
Esposito C, Wang S, Lange UEW et al (2020) Combining machine learning and molecular dynamics to predict P-glycoprotein substrates. J Chem Inf Model 60:4730–4749
Article CAS PubMed Google Scholar
Shin M, Jang D, Nam H et al (2018) Predicting the absorption potential of chemical compounds through a deep learning approach. IEEE/ACM Trans Comput Biol Bioinforma 15:432–440
Article CAS Google Scholar
Guan L, Yang H, Cai Y et al (2019) ADMET-score – a comprehensive scoring function for evaluation of chemical drug-likeness. Med Chem Commun 10:148–157
Article CAS Google Scholar
Kar S, Leszczynski J (2020) Open access in silico tools to predict the ADMET profiling of drug candidates. Expert Opin Drug Discov 15:1473–1487
Article CAS PubMed Google Scholar
Feinberg EN, Joshi E, Pande VS et al (2020) Improvement in ADMET prediction with multitask deep Featurization. J Med Chem 63:8835–8848
Article CAS PubMed Google Scholar
Zhou Y, Cahya S, Combs SA et al (2019) Exploring tunable Hyperparameters for deep neural networks with industrial ADME data sets. J Chem Inf Model 59:1005–1016
Article CAS PubMed Google Scholar
Schütt KT, Arbabzadah F, Chmiela S et al (2017) Quantum-chemical insights from deep tensor neural networks. Nat Commun 8:13890
Article PubMed PubMed Central CAS Google Scholar
Blum LC, Reymond J-L (2009) 970 million Druglike small molecules for virtual screening in the chemical universe database GDB-13. J Am Chem Soc 131:8732–8733
Article CAS PubMed Google Scholar
Reymond J-L (2015) The chemical space project. Acc Chem Res 48:722–730
Article CAS PubMed Google Scholar
Ramakrishnan R, Dral PO, Rupp M et al (2014) Quantum chemistry structures and properties of 134 kilo molecules. Sci Data 1:140022
Article CAS PubMed PubMed Central Google Scholar
Smith JS, Isayev O, Roitberg AE (2017) ANI-1: an extensible neural network potential with DFT accuracy at force field computational cost. Chem Sci 8:3192–3203
Article CAS PubMed PubMed Central Google Scholar
Fink T, Reymond J-L (2007) Virtual exploration of the chemical universe up to 11 atoms of C, N, O, F: assembly of 26.4 million structures (110.9 million stereoisomers) and analysis for new ring systems, stereochemistry, physicochemical properties, compound classes, and drug Discov. J Chem Inf Model 47:342–353
Article CAS PubMed Google Scholar
Gebauer NWA, Gastegger M, and Schütt KT (2018) Generating equilibrium molecules with deep neural networks arXiv 1810.11347
Google Scholar
Schütt KT, Sauceda HE, Kindermans P-J et al (2018) SchNet - a deep learning architecture for molecules and materials. J Chem Phys 148:241722
Article PubMed CAS Google Scholar
Bleiziffer P, Schaller K, Riniker S (2018) Machine learning of partial charges derived from high-quality quantum-mechanical calculations. J Chem Inf Model 58:579–590
Article CAS PubMed Google Scholar
Irwin JJ, Shoichet BK (2005) ZINC—a free database of commercially available compounds for virtual screening. J Chem Inf Model 45:177–182
Article CAS PubMed PubMed Central Google Scholar
Gaulton A, Bellis LJ, Bento AP et al (2012) ChEMBL: a large-scale bioactivity database for drug discovery. Nucleic Acids Res 40:D1100–D1107
Article CAS PubMed Google Scholar
Callaway E (2020), It will change everything’: DeepMind’s AI makes gigantic leap in solving protein structures. https://www.nature.com/articles/d41586-020-03348-4

Download references

Acknowledgments

The authors thank Laurianne David and Martin Kotev (Evotec (France) SAS, Toulouse, France) for their contribution to the manuscript.

Author information

Authors and Affiliations

Evotec (France) SAS, Computational Drug Discovery, Integrated Drug Discovery, Toulouse, France
Christophe Muller, Obdulia Rabal & Constantino Diaz Gonzalez

Authors

Christophe Muller
View author publications
You can also search for this author in PubMed Google Scholar
Obdulia Rabal
View author publications
You can also search for this author in PubMed Google Scholar
Constantino Diaz Gonzalez
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Constantino Diaz Gonzalez .

Editor information

Editors and Affiliations

Computational Drug Discovery, Evotec (UK) Ltd., Abingdon, Oxfordshire, UK
Alexander Heifetz

Rights and permissions

Reprints and permissions

Copyright information

About this protocol

Cite this protocol

Muller, C., Rabal, O., Diaz Gonzalez, C. (2022). Artificial Intelligence, Machine Learning, and Deep Learning in Real-Life Drug Design Cases. In: Heifetz, A. (eds) Artificial Intelligence in Drug Design. Methods in Molecular Biology, vol 2390. Humana, New York, NY. https://doi.org/10.1007/978-1-0716-1787-8_16

Download citation

DOI: https://doi.org/10.1007/978-1-0716-1787-8_16
Published: 04 November 2021
Publisher Name: Humana, New York, NY
Print ISBN: 978-1-0716-1786-1
Online ISBN: 978-1-0716-1787-8
eBook Packages: Springer Protocols

Publish with us

Policies and ethics