Predictive cartography of metal binders using generative topographic mapping
Generative topographic mapping (GTM) approach is used to visualize the chemical space of organic molecules (L) with respect to binding a wide range of 41 different metal cations (M) and also to build predictive models for stability constants (logK) of 1:1 (M:L) complexes using “density maps,” “activity landscapes,” and “selectivity landscapes” techniques. A two-dimensional map describing the entire set of 2962 metal binders reveals the selectivity and promiscuity zones with respect to individual metals or groups of metals with similar chemical properties (lanthanides, transition metals, etc). The GTM-based global (for entire set) and local (for selected subsets) models demonstrate a good predictive performance in the cross-validation procedure. It is also shown that the data likelihood could be used as a definition of the applicability domain of GTM-based models. Thus, the GTM approach represents an efficient tool for the predictive cartography of metal binders, which can both visualize their chemical space and predict the affinity profile of metals for new ligands.
KeywordsGenerative topographic mapping Metal binding Cartography of chemical space Activity landscapes
This work was supported by the Russian Science Foundation Grant No. 14-43-00052, base organization Photochemistry Center, Russian Academy of Sciences. AAB thanks for partial support from the improving of the competitiveness program of National Research Nuclear University “MEPhI”.
- 2.Cherkasov A, Muratov EN, Fourches D, Varnek A, Baskin II, Cronin M, Dearden J, Gramatica P, Martin YC, Todeschini R, Consonni V, Kuz’min VE, Cramer R, Benigni R, Yang C, Rathman J, Terfloth L, Gasteiger J, Richard A, Tropsha A (2015) QSAR modeling: where have you been? Where are you going to? J Med Chem 57(12):4977–5010. doi: 10.1021/jm4004285 CrossRefGoogle Scholar
- 5.Varnek A, Solov’ev V (2009) Quantitative structure-property relationships in solvent extraction and complexation of metals. In: Sengupta AK, Moyer BA (eds) Ion exchange and solvent extraction, a series of advances, vol 19. CRC Press, Taylor and Francis Group, Boca Raton, pp 319–358CrossRefGoogle Scholar
- 7.Solov’ev V, Sukhno I, Buzko V, Polushin A, Marcou G, Tsivadze A, Varnek A (2012) Stability constants of complexes of Zn2+, Cd2+, and Hg2+ with organic ligands: QSPR consensus modeling and design of new metal binders. J Incl Phenom Macrocycl Chem 72(3–4):309–321. doi: 10.1007/s10847-011-9978-6 CrossRefGoogle Scholar
- 8.Solov’ev VP, Tsivadze AY, Varnek AA (2012) New approach for accurate QSPR modeling of metal complexation: application to stability constants of complexes of lanthanide Ions Ln3+, Ag+, Zn2+, Cd2+ and Hg2+ with organic ligands in water. Macroheterocycles 5(4–5):404–410. doi: 10.6060/mhc2012.121104s CrossRefGoogle Scholar
- 17.Baskin I, Varnek A (2008) Fragment descriptors in SAR/QSAR/QSPR studies, molecular similarity analysis and in virtual screening. In: Varnek A, Tropsha A (eds) Chemoinformatics approaches to virtual screening RSC Publisher, Cambridge, pp 1–43Google Scholar
- 20.Tetko IV, Solov’ev VP, Antonov AV, Yao X, Doucet JP, Fan B, Hoonakker F, Fourches D, Jost P, Lachiche N, Varnek A (2006) Benchmarking of linear and nonlinear approaches for quantitative structure-property relationship studies of metal complexation with ionophores. J Chem Inf Model 46(2):808–819CrossRefGoogle Scholar
- 31.Gaspar HA, Marcou G, Horvath D, Arault A, Lozano S, Vayer P, Varnek A (2013) Generative topographic mapping-based classification models and their applicability domain: application to the biopharmaceutics drug disposition classification system (BDDCS). J Chem Inf Model 53(12):3318–3325. doi: 10.1021/ci400423c CrossRefGoogle Scholar
- 32.Gaspar HA, Sidorov P, Horvath D, Baskin II, Marcou G, Varnek A (2016) Generative topographic mapping approach to chemical space analysis. In: Frontiers in molecular design and chemical information science—Herman Skolnik Award Symposium 2015: Jürgen Bajorath, vol 1222. ACS symposium series, vol 1222. American Chemical Society, Washington, DC, pp 211–241. doi: 10.1021/bk-2016-1222.ch011 Google Scholar
- 33.The IUPAC Stability Constants Database, SC-Database. http://www.acadsoft.co.uk/. Accessed 24 June 2017
- 38.Monev V (2004) Introduction to similarity searching in chemistry. MATCH Commun Math Comput Chem 51:7–38Google Scholar
- 39.Csardi G, Nepusz T (2006) The igraph software package for complex network research. Int J Complex Syst 1695(5):1–9Google Scholar