Molecular Diversity

, Volume 10, Issue 3, pp 333–339

JEDA: Joint entropy diversity analysis. An information-theoretic method for choosing diverse and representative subsets from combinatorial libraries

Full–length Paper

DOI: 10.1007/s11030-006-9042-4

Cite this article as:
Landon, M.R. & Schaus, S.E. Mol Divers (2006) 10: 333. doi:10.1007/s11030-006-9042-4


The joint entropy-based diversity analysis (JEDA) program is a new method of selecting representative subsets of compounds from combinatorial libraries. Similar to other cell-based diversity analyses, a set of chemical descriptors is used to partition the chemical space of a library of compounds; however, unlike other metrics for choosing a compound from each partition, a Shannon-entropy based scoring function implemented in a probabilistic search algorithm determines a representative subset of compounds. This approach enables the selection of compounds that are not only diverse but that also represent the densities of chemical space occupied by the original chemical library. Additionally, JEDA permits the user to define the size of the subset that the chemist wishes to create so that restrictions on time and chemical reagents can be considered. Subsets created from a chemical library by JEDA are compared to subsets obtained using other partition-based diversity analyses, namely principal components analysis and median partitioning, on a combinatorial library derived from the Comprehensive Medical Chemistry Dataset.

Key words

chemical diversity Shannon entropy representative subset selection 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer Science + Business Media, Inc. 2006

Authors and Affiliations

  1. 1.Graduate Program in Bioinformatics and Systems BiologyBostonU.S.A
  2. 2.Center for Chemical Methodology and Library DevelopmentBostonU.S.A
  3. 3.Department of ChemistryBoston UniversityBostonU.S.A

Personalised recommendations