Design and characterization of chemical space networks for different compound data sets

Zwierzyna, Magdalena; Vogt, Martin; Maggiora, Gerald M.; Bajorath, Jürgen

doi:10.1007/s10822-014-9821-4

Design and characterization of chemical space networks for different compound data sets

Published: 03 December 2014

Volume 29, pages 113–125, (2015)
Cite this article

Journal of Computer-Aided Molecular Design Aims and scope Submit manuscript

Magdalena Zwierzyna¹,
Martin Vogt¹,
Gerald M. Maggiora^2,3 &
…
Jürgen Bajorath¹

773 Accesses
29 Citations
Explore all metrics

Abstract

Chemical Space Networks (CSNs) are generated for different compound data sets on the basis of pairwise similarity relationships. Such networks are thought to complement and further extend traditional coordinate-based views of chemical space. Our proof-of-concept study focuses on CSNs based upon fingerprint similarity relationships calculated using the conventional Tanimoto similarity metric. The resulting CSNs are characterized with statistical measures from network science and compared in different ways. We show that the homophily principle, which is widely considered in the context of social networks, is a major determinant of the topology of CSNs of bioactive compounds, designed as threshold networks, typically giving rise to community structures. Many properties of CSNs are influenced by numerical features of the conventional Tanimoto similarity metric and largely dominated by the edge density of the networks, which depends on chosen similarity threshold values. However, properties of different CSNs with constant edge density can be directly compared, revealing systematic differences between CSNs generated from randomly collected or bioactive compounds.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Advanced high-resolution chromatographic strategies for efficient isolation of natural products from complex biological matrices: from metabolite profiling to pure chemical entities

Article Open access 06 May 2024

A novel intelligent Fuzzy-AHP based evolutionary algorithm for detecting communities in complex networks

Article 29 February 2024

Role of computer-aided drug design in modern drug discovery

Article 25 July 2015

References

Dobson CM (2004) Chemical space and biology. Nature 432:824–828
Article CAS Google Scholar
Bohacek RS, McMartin C, Guida WC (1996) The art and practice of structure-based drug design: a molecular modelling perspective. Med Res Rev 16:3–50
Article CAS Google Scholar
Maggiora GM, Bajorath J (2014) Chemical space networks—a poweful new paradigm for the description of chemical space. J Comput-Aided Mol Des 28:795–802
Article CAS Google Scholar
Pearlman R, Smith K (2002) Novel software tools for chemical diversity. 3D QSAR in drug design. Three-dimens Quant Struct-Act Relat 2:339–353
Article Google Scholar
Maggiora GM, Vogt M, Stumpfe D, Bajorath J (2014) Molecular similarity in medicinal chemistry. J Med Chem 57:3186–3204
Article CAS Google Scholar
Wawer M, Peltason L, Weskamp N, Teckentrup A, Bajorath J (2008) Structure-activity relationship anatomy by network-like similarity graphs and local structure-activity relationship indices. J Med Chem 51:6075–6084
Article CAS Google Scholar
Tanaka N, Ohno K, Niimi T, Moritomo A, Mori K, Orita M (2009) Small-world phenomena in chemical library networks: application to fragment-based drug discovery. J Chem Inf Model 49:2677–2686
Article CAS Google Scholar
Krein MP, Sukumar N (2011) Exploration of the topology of chemical spaces with network measures. J Phys Chem A 115:12905–12918
Article CAS Google Scholar
Fourches D, Tropsha A (2013) Using graph indices for the analysis and comparison of chemical data sets. Mol Inf 32:827–842
Article CAS Google Scholar
Stumpfe D, Dimova D, Bajorath J (2014) Composition and topology of activity cliff clusters formed by bioactive compounds. J Chem Inf Model 54:451–461
Article CAS Google Scholar
Watts D, Strogatz S (1998) Collective dynamics of ‘small-world’ networks. Nature 393:440–442
Article CAS Google Scholar
Barabási A, Albert R (1999) Emergence of scaling in random networks. Science 286:509–512
Article Google Scholar
Newman M (2010) Networks—an introduction. Oxford University Press, New York
Google Scholar
Newman M (2003) The structure and function of complex networks. SIAM Rev 45:167–256
Article Google Scholar
Albert R, Barabási A (2002) Statistical mechanics of complex networks. Rev Mod Phys 74:47–97
Article Google Scholar
McPherson M, Smith-Lovin L, Cook J (2001) Birds of a feather: homophily in social networks. Annu Rev Sociol 27:415–444
Article Google Scholar
Willett P, Barnard JM, Downs GM (1998) Chemical similarity searching. J Chem Inf Comput Sci 38:983–996
Article CAS Google Scholar
Newman M, Park J (2003) Why social networks are different from other types of networks. Phys Rev E 68:036122
Article CAS Google Scholar
Foster D, Foster J, Grassberger P, Paczuski M (2011) Clustering drives assortativity and community structure in ensembles of networks. Phys Rev E 84:066117
Article Google Scholar
Newman M (2004) Fast algorithm for detecting community structure in networks. Phys Rev E 69:066133
Article CAS Google Scholar
Irwin JJ, Sterling T, Mysinger MM, Bolstad ES, Coleman RG (2012) ZINC: a free tool to discover chemistry for biology. J Chem Inf Model 52:1757–1768
Article CAS Google Scholar
Willett P (1999) Dissimilarity-based algorithms for selecting structurally diverse sets of compounds. J Comput Biol 6:447–457
Article CAS Google Scholar
MACCS Structural Keys; Accelrys, San Diego
Gaulton A, Bellis LJ, Bento AP, Chambers J, Davies M, Hersey A, Light Y, McGlinchey S, Michalovich D, Al-Lazikani B, Overington JP (2012) ChEMBL: a large-scale bioactivity database for drug discovery. Nucleic Acids Res 40(Database issue):D1100–D1107
Article CAS Google Scholar
Rogers D, Hahn M (2010) Extended-connectivity fingerprints. J Chem Inf Model 50:742–754
Article CAS Google Scholar
Java Universal Network/Graph Framework. http://jung.sourceforge.net. Accessed 12 Oct 2014
Gavin A-C, Bosche M, Krause R, Grandi P, Marzioch M, Bauer A, Schultz J, Rick JM, Michon A-M, Cruciat C-M (2002) Functional organization of the yeast proteome by systematic analysis of protein complexes. Nature 415:141–147
Article CAS Google Scholar

Download references

Acknowledgments

M. Z. has partly been supported by the German Academic Exchange Service (Deutscher Akademischer Austauschdienst, DAAD).

Author information

Authors and Affiliations

B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Department of Life Science Informatics, Rheinische Friedrich-Wilhelms-Universität, Dahlmannstr. 2, 53113, Bonn, Germany
Magdalena Zwierzyna, Martin Vogt & Jürgen Bajorath
BIO5 Institute, University of Arizona, 1657 East Helen Street, Tucson, AZ, 85721, USA
Gerald M. Maggiora
Translational Genomics Research Institute, 445 North Fifth Street, Phoenix, AZ, 85004, USA
Gerald M. Maggiora

Authors

Magdalena Zwierzyna
View author publications
You can also search for this author in PubMed Google Scholar
Martin Vogt
View author publications
You can also search for this author in PubMed Google Scholar
Gerald M. Maggiora
View author publications
You can also search for this author in PubMed Google Scholar
Jürgen Bajorath
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jürgen Bajorath.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zwierzyna, M., Vogt, M., Maggiora, G.M. et al. Design and characterization of chemical space networks for different compound data sets. J Comput Aided Mol Des 29, 113–125 (2015). https://doi.org/10.1007/s10822-014-9821-4

Download citation

Received: 15 October 2014
Accepted: 27 November 2014
Published: 03 December 2014
Issue Date: February 2015
DOI: https://doi.org/10.1007/s10822-014-9821-4

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Design and characterization of chemical space networks for different compound data sets

Abstract

Access this article

Similar content being viewed by others

Advanced high-resolution chromatographic strategies for efficient isolation of natural products from complex biological matrices: from metabolite profiling to pure chemical entities

A novel intelligent Fuzzy-AHP based evolutionary algorithm for detecting communities in complex networks

Role of computer-aided drug design in modern drug discovery

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Design and characterization of chemical space networks for different compound data sets

Abstract

Access this article

Similar content being viewed by others

Advanced high-resolution chromatographic strategies for efficient isolation of natural products from complex biological matrices: from metabolite profiling to pure chemical entities

A novel intelligent Fuzzy-AHP based evolutionary algorithm for detecting communities in complex networks

Role of computer-aided drug design in modern drug discovery

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation