Journal of Computer-Aided Molecular Design

, Volume 27, Issue 8, pp 675–679

Estimation of the size of drug-like chemical space based on GDB-17 data



The goal of this paper is to estimate the number of realistic drug-like molecules which could ever be synthesized. Unlike previous studies based on exhaustive enumeration of molecular graphs or on combinatorial enumeration preselected fragments, we used results of constrained graphs enumeration by Reymond to establish a correlation between the number of generated structures (M) and the number of heavy atoms (N): logM = 0.584 × N × logN + 0.356. The number of atoms limiting drug-like chemical space of molecules which follow Lipinsky’s rules (N = 36) has been obtained from the analysis of the PubChem database. This results in M ≈ 1033 which is in between the numbers estimated by Ertl (1023) and by Bohacek (1060).


Chemical space Drug-like chemical space Graphs enumeration 

Copyright information

© Springer Science+Business Media Dordrecht 2013

Authors and Affiliations

  • P. G. Polishchuk
    • 1
  • T. I. Madzhidov
    • 2
  • A. Varnek
    • 3
  1. 1.A.V. Bogatsky Physico-Chemical Institute of National Academy of Sciences of UkraineOdessaUkraine
  2. 2.A.M. Butlerov Institute of ChemistryKazan Federal UniversityKazanRussia
  3. 3.Laboratory of ChemoinformaticsUniversity of StrasbourgStrasbourgFrance

Personalised recommendations