Abstract
Partitioning techniques are widely used to classify compound sets or databases according to specific chemical or biological criteria. Partitioning is conceptually related to, yet algorithmically distinct from, conventional clustering methods and is particularly suitable for efficient processing of very large compound sets. Currently, some of the most popular partitioning approaches in the chemoinformatics field involve dimension reduction of initially defined chemistry spaces and creation of subsections of low-dimensional space for molecular classification. These subsections are often called cells. Original chemical reference spaces are generated through selection of various descriptors of molecular structure and properties. Principles and methodological aspects of dimension reduction of chemical spaces and compound partitioning in low-dimensional space are described herein.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Willett, P., Wintermann, V., and Bawden, D. (1986) Implementation of nonhierarchic cluster analysis methods in chemical information systems: Selection of compounds for biological testing and clustering of substructure search output. J. Chem. Inf. Comput. Sci. 26, 109–118.
Barnard, J. M. and Downs, G. M. (1992) Clustering of chemical structures on the basis of two-dimensional similarity measures. J. Chem. Inf. Comput. Sci. 32, 644–649.
Mason, J. S. and Pickett, S. D. (1997) Partition-based selection. Perspect. Drug Discov. Design 7/8, 85–114.
Stahura, F. L. and Bajorath, J. (2003) Partitioning methods for the identification of active molecules. Curr. Med. Chem. 10, 707–715.
Livingstone, D. J. (2000) The characterization of chemical structures using molecular properties. A survey. J. Chem. Inf. Comput. Sci. 40, 195–209.
Bajorath, J. (2001) Selected concepts and investigations in compound classification, molecular descriptor analysis, and virtual screening. J. Chem. Inf. Comput. Sci. 41, 233–245.
Chen, X., Rusinko, A. III, and Young, S. S. (1998) Recursive partitioning analysis of a large structure-activity data set using three-dimensional descriptors. J. Chem. Inf. Comput. Sci. 38, 1054–1062.
Agrafiotis, D. K., Lobanov, V. S., and Salemme, R. F. (2002) Combinatorial informatics in the post-genomics era. Nature Drug Discov. Rev. 1, 337–346.
Bayley, M. J. and Willett, P. (1999) Binning schemes for partition-based compound selection. J. Mol. Graph. Model. 17, 10–18.
Pearlman, R. S. and Smith, K. M. (1998) Novel software tools for chemical diversity. Perspect. Drug Discov. Design 9, 339–353.
Pearlman, R. S. and Smith, K. M. (1999) Metric validation and the receptor-relevant subspace concept. J. Chem. Inf. Comput. Sci. 39, 28–35.
Schnur, D. (1999) Design and diversity analysis of large compound libraries using cell-based methods. J. Chem. Inf. Comput. Sci. 39, 36–45.
Pirard, B. and Pickett, S. D. (2000) Classification of kinase inhibitors using BCUT descriptors. J. Chem. Inf. Comput. Sci. 40, 1431–1440.
Glen, W. G., Dunn, W. J., and Scott, D. R. (1989) Principal component analysis and partial least squares regression. Tetrahedron Comput. Methodol. 2, 349–376.
Xue, L. and Bajorath, J. (2000) Molecular descriptors for effective classification of biologically active compounds based on principal component analysis identified by a genetic algorithm. J. Chem. Inf. Comput. Sci. 40, 801–809.
Xue, L. and Bajorath, J. (2002) Accurate partitioning of compounds belonging to diverse activity classes. J. Chem. Inf. Comput. Sci. 42, 757–764.
Forrest, S. (1993) Genetic algorithms—principles of natural selection applied to computation. Science 261, 872–878.
Johnson, M. A. and Maggiora, G. M. (eds.) (1990) Concepts and applications of molecular similarity. Wiley, New York, NY.
Bajorath, J. (2002) Integration of virtual and high-throughput screening. Nature Drug Discov. Rev. 1, 337–346.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Humana Press Inc.
About this protocol
Cite this protocol
Xue, L., Stahura, F.L., Bajorath, J. (2004). Cell-Based Partitioning. In: Bajorath, J. (eds) Chemoinformatics. Methods in Molecular Biology™, vol 275. Humana Press. https://doi.org/10.1385/1-59259-802-1:279
Download citation
DOI: https://doi.org/10.1385/1-59259-802-1:279
Publisher Name: Humana Press
Print ISBN: 978-1-58829-261-2
Online ISBN: 978-1-59259-802-1
eBook Packages: Springer Protocols