Skip to main content
Log in

Lead-like, drug-like or “Pub-like”: how different are they?

  • Original Paper
  • Published:
Journal of Computer-Aided Molecular Design Aims and scope Submit manuscript

Abstract

Academic and industrial research continues to be focused on discovering new classes of compounds based on HTS. Post-HTS analyses need to prioritize compounds that are progressed to chemical probe or lead status. We report trends in probe, lead and drug discovery by examining the following categories of compounds: 385 leads and the 541 drugs that emerged from them; “active” (152) and “inactive” (1488) compounds from the Molecular Libraries Initiative Small Molecule Repository (MLSMR) tested by HTS; “active” (46) and “inactive” (72) compounds from Nature Chemical Biology (NCB) tested by HTS; compounds in the drug development phase (I, II, III and launched), as indexed in MDDR; and medicinal chemistry compounds from WOMBAT, separated into high-activity (5,784 compounds with nanomolar activity or better) and low-activity (30,690 with micromolar activity or less). We examined Molecular weight (MW), molecular complexity, flexibility, the number of hydrogen bond donors and acceptors, LogP—the octanol/water partition coefficient estimated by ClogP and ALOGPS), LogSw (intrinsic water solubility, estimated by ALOGPS) and the number of Rule of five (Ro5) criteria violations. Based on the 50% and 90% distribution moments of the above properties, there were no significant difference between leads of known drugs and “actives” from MLSMR or NCB (chemical probes). “Inactives” from NCB and MLSMR were also found to exhibit similar properties. From these combined sets, we conclude that “Actives” (569 compounds) are less complex, less flexible, and more soluble than drugs (1,651 drugs), and significantly smaller, less complex, less hydrophobic and more soluble than the 5,784 high-activity WOMBAT compounds. These trends indicate that chemical probes are similar to leads with respect to some properties, e.g., complexity, solubility, and hydrophobicity.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Abbreviations

ALOGPS:

Program available from vcclab.org, Germany

ClogP:

LogP calculated with the Biobyte program

HAC:

Number of H-bond acceptors

HDO:

Number of H-bond donors

LogP:

The logarithm of the octanol-water partition coefficient

MDDR:

MDL Drug Data Report

MLI:

Molecular Libraries and Imaging initiative

MLSCN:

The MLI Screening Centers Network

MLSMR:

The MLI Small Molecule Repository

MW:

Molecular weight

NCB:

Nature Chemical Biology

NIH:

National Institutes of Health

RNG:

Number of rings

Ro5:

Lipinski’s Rule of Five

RTB:

Number of non-terminal flexible bonds

SMCM:

Simple Molecular Complexity Metric

SMILES:

Simplified Molecular Input Line Entry Specification

SumNO:

Sum of nitrogen and oxygen atoms

TlogP:

Tetko’s LogP, calculated with ALOGPS

TlogSw:

Tetko’s logarithm of the (molar) aqueous solubility, calculated with ALOGPS

WOMBAT/WB:

WOrld of Molecular BioAcTivity database

References

  1. Austin CP, Brady LS, Insel TR, Collins FS (2004) Science 306:1138

    Article  CAS  Google Scholar 

  2. The PubChem database is available online at the National Center for Biotechnology Information, Retrieved from http://pubchem.ncbi.nlm.nih.gov/ (09/01/07)

  3. Anon (2002) Chemistry & Industry, vol. 4. London, p 9

  4. According to the Tufts Center for the Study of Drug Development, the average cost to develop a new prescription drug is $802 million. Retrieved from, http://csdd.tufts.edu/NewsEvents/RecentNews.asp?newsid =6 (09/01/07)

  5. Drews J (1998) Drug Discov Today 3:491

    Article  Google Scholar 

  6. Horrobin DF (2000) J Royal Soc Med 93:341

    CAS  Google Scholar 

  7. Olsson T, Oprea TI (2001) Curr Opin Drug Discov Dev 4:308

    CAS  Google Scholar 

  8. Oprea TI (2002) J Comput Aided Mol Design 16:325

    Article  CAS  Google Scholar 

  9. Lipinski CA, Lombardo F, Dominy BW, Feeney PJ (1997) Adv Drug Deliv 23:3

    Article  CAS  Google Scholar 

  10. Oprea TI, Davis AM, Teague SJ, Leeson PD (2001) J Chem Inf Comput Sci 41:1308

    Article  CAS  Google Scholar 

  11. Teague SJ, Davis AM, Leeson PD, Oprea TI (1999) Angew Chem Int Ed 38 (1999) 3743. German version: Angew Chem 111 (1999) 3962

    Google Scholar 

  12. Hann MM, Leach AR, Harper G (2001) J Chem Inf Comput Sci 41:856

    Article  CAS  Google Scholar 

  13. Oprea TI (2000) J Comput Aided Mol Des 14:251

    Article  CAS  Google Scholar 

  14. Proudfoot JR (2002) Bio Org Med Chem Lett 12:1647

    Article  CAS  Google Scholar 

  15. Available from MDL Information Systems, Retrieved from, http://mdl.com/products/knowledge/drug_data_report/index.jsp, http://www.prous.com/index.html. (09/01/07)

  16. Olah M, Mracec M, Ostopovici L, Rad R, Bora A, Hadaruga N, Olah I, Banda M, Simon Z, Mracec M, Oprea, TI (ed) (2005) WOMBAT: World of Molecular Bioactivity, in Chemoinformatics in Drug Discovery., Wiley-VCH, New York, 2005, pp 223–239

  17. Allu TK, Oprea TI (2005) J Chem Inf Model 45:1237

    Article  CAS  Google Scholar 

  18. SMARTS tutorial, Retrieved from, http://www.daylight.com/dayhtml/doc/theory/theory.smarts.html (09/01/07)

  19. Leo A (1993) Chem Rev 93:1281

    Article  CAS  Google Scholar 

  20. ClogP is available from the Biobyte Corporation, http://biobyte.com/index.html, (09/01/07)

  21. Tetko IV, Gasteiger J, Todeschini R, Mauri A, Livingstone D, Ertl P, Palyulin VA, Radchenko EV, Zefirov NS, Makarenko AS, Tanchuk VY, Prokopenko VV (2005) J Comput Aided Mol Des 19:453

    Article  CAS  Google Scholar 

  22. Ran Y, Jain N, Yalkowsky SH (2001) J Chem Inf Comput Sci 41:1208

    Article  CAS  Google Scholar 

  23. Livingstone DJ, Ford MG, Huuskonen JJ, Salt DW (2001) J Comput Aided Mol Des 15:741

    Article  CAS  Google Scholar 

  24. Tetko IV, Tanchuk VY, Villa AE (2001) J Chem Inf Comput Sci 41:1407

    Article  CAS  Google Scholar 

  25. The EPI suite is available from the US Environmental Protection Agency, http://www.epa.gov/opptintr/exposure/pubs/episuite.htm (09/01/07)

  26. ALOGPS is available from the Virtual Computational Chemistry Laboratory, http://vcclab.org/lab/alogps/ (09/01/07)

  27. Tetko IV, Bruneau P (2004) J Pharm Sci 93:3103

    Article  CAS  Google Scholar 

  28. Tetko IV, Poda GI (2004) J Med Chem 47:5601

    Article  CAS  Google Scholar 

  29. Wenlock MC, Austin RP, Barton P, Davis AM, Leeson PD (2003) J Med Chem 46:1250

    Article  CAS  Google Scholar 

  30. Takagi T, Ramachandran C, Bermejo M, Yamashita S, Yu LX, Amidon GL (2007) Mol Pharmaceutics 3:631

    Article  Google Scholar 

Download references

Acknowledgement

This work was supported by National Institutes of Health grant U54 MH074425-01 (National Institutes of Health Molecular Libraries Initiative); and by the New Mexico Tobacco Settlement Fund (D.F. and T.I.O.). The calculated properties and SMILES for these 42,394 molecules will be available at the UNM Screening Center website (http://screening.health.unm.edu/). This paper is dedicated to Dr. Yvonne C. Martin, whose four decades of excellence in the areas of QSAR and computer-aided drug design has helped define the field of cheminformatics.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tudor I. Oprea.

Additional information

Dedicated to Yvonne C. Martin on her 70th birthday.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Oprea, T.I., Allu, T.K., Fara, D.C. et al. Lead-like, drug-like or “Pub-like”: how different are they?. J Comput Aided Mol Des 21, 113–119 (2007). https://doi.org/10.1007/s10822-007-9105-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10822-007-9105-3

Keywords

Navigation