Design of chemical space networks on the basis of Tversky similarity
Chemical space networks (CSNs) have been introduced as a coordinate-free representation of chemical space. In CSNs, nodes represent compounds and edges pairwise similarity relationships. These network representations are mostly used to navigate sections of biologically relevant chemical space. Different types of CSNs have been designed on the basis of alternative similarity measures including continuous numerical similarity values or substructure-based similarity criteria. CSNs can be characterized and compared on the basis of statistical concepts from network science. Herein, a new CSN design is introduced that is based upon asymmetric similarity assessment using the Tversky coefficient and termed TV-CSN. Compared to other CSNs, TV-CSNs have unique features. While CSNs typically contain separate compound communities and exhibit small world character, many TV-CSNs are also scale-free in nature and contain hubs, i.e., extensively connected central compounds. Compared to other CSNs, these hubs are a characteristic of TV-CSN topology. Hub-containing compound communities are of particular interest for the exploration of structure–activity relationships.
KeywordsChemical space networks Biologically relevant chemical space Structure–activity relationships Similarity metrics Tversky similarity Topology Network science
- 8.Maggiora GM, Shanmugasundaram V (2004) Molecular similarity measures. In: Bajorath J (ed) Chemoinformatics—concepts, methods, and tools for drug discovery. Humana Press, Totowa, pp 1–50Google Scholar
- 17.Bastian M, Heymann S, Jacomy M (2009) Gephi: an open source software for exploring and manipulating networks. In: International AAAI conference on weblogs and social mediaGoogle Scholar
- 19.Corder GW, Foreman DI (2014) Nonparametric statistics: a step-by-step approach. Wiley, Hoboken NJGoogle Scholar
- 23.Maggiora GM, Shanmugasundaram V (2010) Molecular similarity measures. In: Bajorath J (ed) Chemoinformatics and computational chemical biology. Humana Press, New York, pp 39–100Google Scholar