Abstract
The present investigations include utility of latest statistical algorithm Support Vector Machine (SVM) to identify non-linear structure activity relationship between IC50 values and structures of C-aryl glucoside SGLT2 inhibitors. Training dataset consisted of forty molecules and the remaining six molecules were chosen for test set validation. SVM under Gaussian Kernel Function yielded non-linear QSAR models. Forward selection algorithm was applied after pruning and redundancy check on molecular descriptors. Internal validations of QSAR models have been achieved using R 2CV (LOO), PRESS, SDEP and Y-Scrambling. SVM aided non-linear models are more efficient when optimization of Gaussian Kernel Function was introduced. Non-linear QSAR studies further identified atomic van der Waals volumes, atomic masses, sum of geometrical distances between O..S and degree of unsaturation as molecular descriptors and crucial structural requirements to model IC50 of C-aryl glucoside derivatives.
Similar content being viewed by others
References
Aksyonova, T.I., Volkovich, V.V., Tetko, I.V. 2003. Robust polynomial neural networks in quantativestructure activity relationship studies. Syst Anal Model Simul 43, 1331–1339.
Bakris, G.L., Fonseca, V., Sharma, K., Wright, E. 2009. Renal sodium-glucose transport: Role in diabetes mellitus and potential clinical implications. Kidney Int 75, 1272–1277.
Berry, C.A., Rector, F.C. Jr. 1991. Renal transport of glucose, amino acids, sodium, chloride, and water. In: Brenner, B.M., Rector, F.C. Jr. (Eds.) The Kidney, 4th Edition, W.B. Saunders, Philadelphia, 245–282.
Brown, G.K. 2000. Glucose transporters: Structure, function and consequences of glucose deficiency. J Inherit Metab Dis 23, 237–246.
Cortes, C., Vapnik, V. 1995. Support-vector networks. Mach Learn 20, 273–297.
Dwarakanathan, A. 2006. Diabetes update. J Insur Med 38, 20–30.
Ehrenkranz, J.R., Lewis, N.G., Kahn, C.R., Roth, J. 2005. Phlorizin: A review. Diabetes Metab Res Rev 21, 31–38.
Furey, T., Cristianini, N., Duffy, N., Bednarski, D., Schummer, M., Haussler, D. 2000. Support vector machine classification and validation of cancer tissue samples using microarray expression data. Bioinformatics 16, 906–914.
Gerich, J.E., Woerle, H.J., Meyer, C., Stumvoll, M. 2001. Renal gluconeogenesis. Diabetes Care 24, 382–391.
International Diabetes Federation. 2009. Diabetes Atlas, 4th Edition, Montreal, Canada.
Kloeckener-Gruissem, B., Vandekerckhove, K., Nurnberg, G., Neidhardt, J., Zeitz, C., Nurnberg, P., Schipper, I., Berger, W. 2008. Mutation of solute carrier SLC16A12 associates with a syndrome combining juvenile cataract with microcornea and renal glucosuria. Am J Hum Genet 82, 772–779.
Kuhn, H.W., Tucker, A.W. 1951. Nonlinear programming. In: Neyman, J. (Ed.) Proceedings of the Second Berkeley Symposium on Mathematical Statistics and Probability, Berkeley, CA, University of California Press, Los Angeles, 481–492.
Lee, J., Lee, S.H., Seo, H.J., Son, E.J., Lee, S.H., Jung, M.E., Lee, M., Han, H.K., Kim, J., Kang, J., Lee, J. 2010. Novel C-aryl glucoside SGLT2 inhibitors as potential antidiabetic agents. 1,3,4-Thiadiazolylmethylphenyl glucoside congeners. Bioor & Med Chem 18, 2178–2194.
Norinder, U. 2003. Support vector machine models in drug design: Application to drug transport processes and QSAR using simplex optimizations and variable selection. Neurocomputing 55, 337–346.
Pavlidis, P., Wapinski, I., Noble, W.S. 2004. Support vector machine classification on the web. Bioinformatics 20, 586–587.
Rector, F.C. Jr. 1983. Sodium, bicarbonate, and chloride absorption by the proximal tubule. Am J Physiol 244, F461–F471.
Rossetti, L., Shulman, G.I., Zawalich, W., DeFronzo, R.A. 1987. Effect of chronic hyperglycemia on in vivo insulin secretion in partially pancreatectomized rats. J Clin Invest 80, 1037–1044.
Schölkopf, B., Smola, A.J. 2002. Learning with Kernels: Support Vector Machines, Regularization, Optimization and Beyond, MIT Press, Cambridge, MA, 185–208.
Smola, A.J., Schölkopf, B. 2004. A tutorial on support vector regression. Statistics and computing. Stat Comput 14, 199–222.
Soto, A.J., Cecchini, R.L., Vazquez, G.E., Ponzoni, I. 2008. An evolutionary approach for feature selection applied to ADMET prediction. Amer J Artif Intell 37, 55–63.
Vapnik, V. 1999. The Nature of Statistical Learning Theory, Verlag Springer, New York.
Wright, E.M., Hirayama, B.A., Loo, D.F. 2007. Active sugar transport in health and disease. J Intern Med 261, 32–43.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Prasoona, R.K., Jyoti, A., Mukesh, Y. et al. Optimization of Gaussian Kernel Function in Support Vector Machine aided QSAR studies of C-aryl glucoside SGLT2 inhibitors. Interdiscip Sci Comput Life Sci 5, 45–52 (2013). https://doi.org/10.1007/s12539-013-0156-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12539-013-0156-y