Inflation of correlation in the pursuit of drug-likeness
Drug-likeness is a frequently invoked, although not always precisely defined, concept in drug discovery. Opinions on drug-likeness are shaped to a large extent by the relationships observed between surrogate measures of drug-likeness (e.g. aqueous solubility, permeability, pharmacological promiscuity) and fundamental physicochemical properties (e.g. lipophilicity, molecular size). This article draws on examples from the literature to highlight approaches to data analysis that exaggerate trends in data, and it introduces the term "correlation inflation" in the context of drug discovery. Averaging groups of data points prior to analysis is a common cause of correlation inflation, and results from the analysis of binned continuous data should always be treated with caution.
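The effect of averaging before analysis can be illustrated with a small simulation. The sketch below is not taken from the article: the data are synthetic, and the slope, noise level, bin count and function names (`pearson_r`, `bin_means`) are all assumptions chosen for illustration. It compares the Pearson correlation coefficient computed on individual data points with the one computed on bin averages of the same data.

```python
import random
from statistics import mean


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5


def bin_means(xs, ys, n_bins):
    """Split the x range into equal-width bins; return mean x and mean y per non-empty bin."""
    lo, hi = min(xs), max(xs)
    width = (hi - lo) / n_bins
    bins = [[] for _ in range(n_bins)]
    for x, y in zip(xs, ys):
        idx = min(int((x - lo) / width), n_bins - 1)  # keep the maximum x in the last bin
        bins[idx].append((x, y))
    filled = [b for b in bins if b]
    mean_x = [mean(x for x, _ in b) for b in filled]
    mean_y = [mean(y for _, y in b) for b in filled]
    return mean_x, mean_y


# Synthetic data (assumed): a weak linear trend buried in substantial scatter.
rng = random.Random(0)
xs = [rng.uniform(0, 10) for _ in range(2000)]
ys = [0.3 * x + rng.gauss(0, 2) for x in xs]

r_raw = pearson_r(xs, ys)          # correlation across individual data points
bx, by = bin_means(xs, ys, 5)
r_binned = pearson_r(bx, by)       # correlation across the five bin averages

print(f"r (individual points) = {r_raw:.2f}")
print(f"r (bin means)         = {r_binned:.2f}")
```

Averaging removes most of the within-bin scatter, so the bin means fall close to a line even though individual values are only weakly predicted by x. The correlation of the bin means therefore overstates how well the descriptor predicts the property for a single compound.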
Keywords: ADMET, Correlation, Drug-likeness, Lipophilicity, Solubility, Promiscuity
We thank Anthony Nicholls for valuable advice and the reviewers of the manuscript for their helpful and constructive feedback. We are grateful to Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP) and Conselho Nacional de Pesquisa (CNPq) for financial support and OpenEye Scientific Software for an academic software license.