Cheminformatics Approaches in Modern Drug Discovery

Jamal, Salma; Grover, Abhinav

doi:10.1007/978-981-10-5187-6_9

Salma Jamal^2,3 &
Abhinav Grover²

1446 Accesses
4 Citations

Abstract

The large amount of costs, time, effort and failures involved in the process of drug discovery and development made it difficult for the researchers to discover drugs and prompted the need for methods which could improve the productivity and efficiency of drug design. Cheminformatics is an emerging field which acts as an interface between chemistry and computers and helps in processing, managing and analysis of large chemical information using computer methods. In this chapter, we have outlined the applications of cheminformatics in the field of drug discovery, such as identification of lead compounds, virtual library generation, high throughput screening and data mining, prediction of biological activities of compounds and in silico ADMET prediction. Various cheminformatics approaches that include data mining, representation of chemical compounds via descriptors, similarity and substructures searching and classification algorithms have also been discussed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 109.00; Price excludes VAT (USA)

Softcover Book: USD 139.99; Price excludes VAT (USA)

Hardcover Book: USD 139.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Xu J, Hagler A (2002) Chemoinformatics and drug discovery. Molecules 7:566–600
Article CAS Google Scholar
Hecht P (2002) High-throughput screening: beating the odds with informatics-driven chemistry. Curr Drug Discov:21–24
Google Scholar
Gallop MA, Barrett RW, Dower WJ, Fodor SP, Gordon EM (1994) Applications of combinatorial technologies to drug discovery. 1. Background and peptide combinatorial libraries. J Med Chem 37:1233–1251
Article CAS PubMed Google Scholar
Brown FK (1998) Chemoinformatics: what is it and how does it impact drug discovery. Annu Rep Med Chem 33:375–384
Article CAS Google Scholar
Engel T (2006) Basic overview of chemoinformatics. J Chem Inf Model 46:2267–2277
Article CAS PubMed Google Scholar
Hann M, Green R (1999) Chemoinformatics—a new name for an old problem? Curr Opin Chem Biol 3:379–383
Article CAS PubMed Google Scholar
Gasteiger J, Engel T (2006) Chemoinformatics: a textbook. Wiley
Google Scholar
James CA Cheminformatics 101. An introduction to the computer science and chemistry of chemical information systems. eMolecules Inc., Del Mar
Google Scholar
Todeschini R, Consonni V (2008) Handbook of molecular descriptors, vol 11. Wiley, NewYork
Google Scholar
Valla A, Giraud M, Dore JC (1993) Descriptive modeling of the chemical structure-biological activity relations of a group of malonic polyethylenic acids as shown by different pharmacotoxicologic tests. Pharmazie 48:295–301
CAS PubMed Google Scholar
Liu K, Feng J, Young SS (2005) Power MV: a software environment for molecular viewing, descriptor generation, data analysis and hit evaluation. J Chem Inf Model 45:515–522
Article CAS PubMed Google Scholar
Yap CW (2011) PaDEL-descriptor: an open source software to calculate molecular descriptors and fingerprints. J Comput Chem 32:1466–1474
Article CAS PubMed Google Scholar
Mitchell JB (2014) Machine learning methods in chemoinformatics. Wiley Interdiscip Rev Comput Mol Sci 4:468–481
Article CAS PubMed PubMed Central Google Scholar
Alpaydin E (2014) Introduction to machine learning. MIT Press, Cambridge
Google Scholar
Daumé H (2012) A course in machine learning (ciml.Info), p. 189
Brown RD, Martin YC (1996) Use of structure−activity data to compare structure-based clustering methods and descriptors for use in compound selection. J Chem Inf Comput Sci 36:572–584
Article CAS Google Scholar
Mitchell TM (1997) Machine learning. McGraw-Hill Science/Engineering/Math, Maidenhead, p. 432
Google Scholar
Simon P (2013) Too big to ignore: the business case for big data. Wiley, Hoboken, p. 89
Google Scholar
Mitchell JBO (2014) Machine learning methods in chemoinformatics. Wiley Interdiscip Rev Comput Mol Sci 4:468–481
Article CAS PubMed PubMed Central Google Scholar
So S-S, Karplus M (1997) Three-dimensional quantitative structure− activity relationships from molecular similarity matrices and genetic neural networks. 1. Method and validations. J Med Chem 40:4347–4359
Article CAS PubMed Google Scholar
Li H et al (2006) Prediction of estrogen receptor agonists and characterization of associated molecular descriptors by statistical learning methods. J Mol Graph Model 25:313–323
Article CAS PubMed Google Scholar
Briem H, Günther J (2005) Classifying “kinase inhibitor-likeness” by using machine-learning methods. Chembiochem 6:558–566
Article CAS PubMed Google Scholar
Jehad Ali RK, Ahmad N, Maqsood I (2012) Random forests and decision trees. Int J Comput Sci Issues 9
Google Scholar
Marchese Robinson RL, Glen RC, Mitchell JB (2011) Development and comparison of hERG blocker classifiers: assessment on different datasets yields markedly different results. Mol Informat 30:443–458
Article CAS Google Scholar
Kuz'min VE, Polishchuk PG, Artemenko AG, Andronati SA (2011) Interpretation of QSAR models based on random forest methods. Mol Informat 30:593–603
Article Google Scholar
Li S, Fedorowicz A, Singh H, Soderholm SC (2005) Application of the random forest method in studies of local lymph node assay based skin sensitization data. J Chem Inf Model 45:952–964
Article CAS PubMed Google Scholar
Friedman N, Geiger D, Goldszmidt M (1997) Bayesian network classifiers. Mach Learn 29:131–163
Article Google Scholar
Koutsoukas A et al (2013) In silico target predictions: defining a benchmarking data set and comparison of performance of the multiclass Naïve Bayes and Parzen-Rosenblatt window. J Chem Inf Model 53:1957–1966
Article CAS PubMed Google Scholar
Cannon EO et al (2007) Support vector inductive logic programming outperforms the naive Bayes classifier and inductive logic programming for the classification of bioactive chemical compounds. J Comput Aided Mol Des 21:269–280
Article CAS PubMed Google Scholar
von Korff M, Sander T (2006) Toxicity-indicating structural patterns. J Chem Inf Model 46:536–544
Article Google Scholar
Platt JCSequential minimal optimization. A fast algorithm for training support vector machines. Report no. MSR-TR-98-14, 21 (Microsoft Research), 1998)
Google Scholar
Liao Q, Yao J, Yuan S (2007) Prediction of mutagenic toxicity by combination of recursive partitioning and support vector machines. Mol Divers 11:59–72
Article CAS PubMed Google Scholar
Kinnings SL et al (2011) A machine learning-based method to improve docking scoring functions and its application to drug repurposing. J Chem Inf Model 51:408–419
Article CAS PubMed PubMed Central Google Scholar
Altman NS (1992) An introduction to kernel and nearest-neighbor nonparametric regression. Am Stat 46:175–185
Google Scholar
Ajmani S, Jadhav K, Kulkarni SA (2006) Three-dimensional QSAR using the k-nearest neighbor method and its interpretation. J Chem Inf Model 46:24–31
Article CAS PubMed Google Scholar
Honório KM, da Silva AB (2005) A study on the influence of molecular properties in the psychoactivity of cannabinoid compounds. J Mol Model 11:200–209
Article PubMed Google Scholar
Basak SC, Grunwald GD (1995) Predicting mutagenicity of chemicals using topological and quantum chemical parameters: a similarity based study. Chemosphere 31:2529–2546
Article CAS PubMed Google Scholar
Begam BF, Kumar JS (2012) A study on cheminformatics and its applications on modern drug discovery. Proced Eng 38:1264–1275
Article CAS Google Scholar
Aktar MW, Murmu S (2008) Chemoinformatics: principles and applications. 1 Pesticide Residue Laboratory, Department of Agricultural Chemicals, 2 Department of Agricultural Chemistry and Soil Science, Bidhan Chandra Krishi Viswavidyalaya, Mohanpur-741252, Nadia, West Bengal, India.
Google Scholar
Nantasenamat C, Isarankura-Na-Ayudhya C, Naenna T, Prachayasittikul V (2009) A practical overview of quantitative structure-activity relationship. EXCLI J 8:74–88
Google Scholar
Walters WP, Stahl MT, Murcko MA (1998) Virtual screening—an overview. Drug Discov Today 3:160–178
Article CAS Google Scholar
Diller DJ, Merz KM (2001) High throughput docking for library design and library prioritization. Proteins 43:113–124
Article CAS PubMed Google Scholar
Willett P (2000) Chemoinformatics–similarity and diversity in chemical libraries. Curr Opin Biotechnol 11:85–88
Article CAS PubMed Google Scholar
Gedeck P, Willett P (2001) Visual and computational analysis of structure–activity relationships in high-throughput screening data. Curr Opin Chem Biol 5:389–395
Article CAS PubMed Google Scholar
Halford B (2014) Reflections on CHEMDRAW. Chem Eng News 92:26–27
Google Scholar
Park J et al (2009) Automated extraction of chemical structure information from digital raster images. Chem Cent J 3:4
Article PubMed PubMed Central Google Scholar
Hunter AD (1997) ACD/ChemSketch 1.0 (freeware); ACD/ChemSketch 2.0 and its Tautomers, Dictionary, and 3D Plug-ins; ACD/HNMR 2.0; ACD/CNMR 2.0. ACS Publications.
Google Scholar
Steinbeck C et al (2003) The chemistry development kit (CDK): an open-source Java library for chemo-and bioinformatics. J Chem Inf Comput Sci 43:493–500
Article CAS PubMed PubMed Central Google Scholar
Cao Y, Charisi A, Cheng L-C, Jiang T, Girke T (2008) Chemmine R: a compound mining framework for R. Bioinformatics 24:1733–1734
Article CAS PubMed PubMed Central Google Scholar
Ertl P (2010) Molecular structure input on the web. J Cheminform 2(1)
Google Scholar
O'Boyle NM et al (2011) Open babel: an open chemical toolbox. J Chem 3:33
Article Google Scholar
Wang Y et al (2009) PubChem: a public information system for analyzing bioactivities of small molecules. Nucleic Acids Res 37:W623–W633
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

AG is thankful to Jawaharlal Nehru University for usage of all computational facilities. AG is grateful to University Grants Commission, India for the Faculty Recharge Position. Salma Jamal acknowledges a Senior Research Fellowship from Indian Council of Medical Research (ICMR), New Delhi.

Competing Interests

The authors declare that they have no competing interests.

Author information

Authors and Affiliations

School of Biotechnology, Jawaharlal Nehru University, New Delhi, 110067, India
Salma Jamal & Abhinav Grover
Department of Bioscience and Biotechnology, Banasthali University, Tonk, Rajasthan, 304022, India
Salma Jamal

Authors

Salma Jamal
View author publications
You can also search for this author in PubMed Google Scholar
Abhinav Grover
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Abhinav Grover .

Editor information

Editors and Affiliations

School of Biotechnology, Jawaharlal Nehru University, New Delhi, India
Abhinav Grover

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Jamal, S., Grover, A. (2017). Cheminformatics Approaches in Modern Drug Discovery. In: Grover, A. (eds) Drug Design: Principles and Applications. Springer, Singapore. https://doi.org/10.1007/978-981-10-5187-6_9

Download citation

DOI: https://doi.org/10.1007/978-981-10-5187-6_9
Published: 05 June 2017
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-5186-9
Online ISBN: 978-981-10-5187-6
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)

Publish with us

Policies and ethics