Abstract
To validate QSAR models an external test set is increasingly used. However the definition of the compounds for the test set is still debated. We studied, co-evolutions of correlations between optimal descriptors and carcinogenicity (pTD50) for the subtraining, calibration, and test set. Weak correlations for the sub-training set are sometimes accompanied by quite good correlations for the external test set. This can be explained in terms of the probability theory and can help define a suitable test set. The simplified molecular input line entry system (SMILES) was used to represent the molecular structure. Correlation weights for calculating the optimal descriptors are related to fragments of the SMILES. The statistical quality of the model is: n=170, r2=0.6638, q2=0.6554, s=0.828, F=331 (sub-training set); n=170, r2=0.6609, r2 pred=0.6520, s=0.825, F=331 (calibration set); and n=61, r2=0.7796, r2 pred=0.7658, Rm 2=0.7448, s=0.563, F=221 (test set). The calculations were done with CORAL software (http://www.insilico.eu/coral/).
Similar content being viewed by others
References
P.P. Roy, J.T. Leonard, K. Roy, Chemomet. Intell. Lab. 90, 31 (2008)
W. Tong, Q. Xie, H. Hong, L. Shi, H. Fang, R. Perkins, Environ. Health. Persp. 112, 1249 (2004)
A.A. Toropov, A.P. Toropova, D.V. Mukhamedzhanova, I. Gutman, Indian J. Chem. 4A, 1545 (2005)
G. Melagraki, A. Afantitis, H. Sarimveis, P.A. Koutentis, G. Kollias, O. Igglessi-Markopoulou, Mol. Divers. 13, 301 (2009)
E. Vicente, P.R. Duchowicz, E.A. Castro, A. Monge, J. Mol. Graph. Model. 28, 28 (2009)
E. Benfenati (Ed.), Quantitative Structure-Activity Relationships (QSAR) for Pesticide Regulatory Purposes (Elsevier Science, Amsterdam, 2007)
A.A. Toropov, A.P. Toropova, E. Benfenati, A. Manganaro, Mol. Divers. 13, 367 (2009)
A.A. Toropov, A.P. Toropova, E. Benfenati, Int. J. Mol. Sci. 10, 3106 (2009)
P.P. Roy, K. Roy, QSAR Comb. Sci. 27, 302 (2008)
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
About this article
Cite this article
Toropova, A.P., Toropov, A.A., Diaza, R.G. et al. Analysis of the co-evolutions of correlations as a tool for QSAR-modeling of carcinogenicity: an unexpected good prediction based on a model that seems untrustworthy. cent.eur.j.chem. 9, 165–174 (2011). https://doi.org/10.2478/s11532-010-0135-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.2478/s11532-010-0135-7