Data Mining Technique for Medical Diagnosis Using a New Smooth Support Vector Machine

  • Santi Wulan Purnami
  • Jasni Mohamad Zain
  • Abdullah Embong
Part of the Communications in Computer and Information Science book series (CCIS, volume 88)


In last decade, the uses of data mining techniques in medical studies are growing gradually. The aim of this paper is to present a recent research on the application of data mining technique for medical diagnosis problems. The proposed data mining technique is Multiple Knot Spline Smooth Support Vector Machine (MKS-SSVM). MKS-SSVM is a new SSVM which used multiple knot spline function to approximate the plus function instead the integral sigmoid function in SSVM. To evaluate the effectiveness of our method, we carried out on two medical dataset (diabetes disease and heart disease). The accuracy of previous results of these data still under 90% so far. The results of this study showed that MKS-SSVM was effective to diagnose medical dataset, especially diabetes disease and heart disease and this is very promising result compared to the previously reported results.


data mining technique classification medical diagnosis smooth support vector machine multiple knot spline function 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Chen, C.: Smoothing Methods in Mathematical Programming. PhD dissertation. Department of Computer Sciences. University of Wisconsin Madison (1995)Google Scholar
  2. 2.
    Das, R., Turkoglu, I., Sengur, A.: Effective diagnosis of heart disease through neural network ensembles. Experts Systems with Applications 36, 7675–7680 (2009)CrossRefGoogle Scholar
  3. 3.
    Goyal, M.: Computer Based Numerical and Statistical Techniques. Infinity Science Press LLC. Hingham (2007)Google Scholar
  4. 4.
    Huang, C.M., Lee, Y.J., Lin, D.K.J., Huang, S.Y.: Model Selection for Support Vector Machines via Uniform Design. A Special issue on Machine Learning and Robust Data Mining of Computational Statistics and Data Analysis 52, 335–346 (2007)zbMATHMathSciNetGoogle Scholar
  5. 5.
    Hsu, C.W., Chang, C.C., Lin, C.J.: Practical Guide To Support Vector Classification. Department of Computer Science and Information Engineering National Taiwan University (2003),,
  6. 6.
    Kahramanli, H., Allahverdi, N.: Design of A Hybrid System for The Diabetes and Heart Diseases. Expert Systems with Applications 35, 82–89 (2008)CrossRefGoogle Scholar
  7. 7.
    Kayaer, K., Yildirim, T.: Medical Diagnosis on Pima Indian Diabetes using General Regression Neural Networks. In: Proceedings of the International Conference on Artificial Neural Networks and Neural Information Processing, June 26-29, pp. 181–184. Springer, Istanbul-Turkey (2003)Google Scholar
  8. 8.
    Lee, Y.J., Mangasarian, O.L.: A Smooth Support Vector Machine. J. Comput. Optimiz. Appli. 20, 5–22 (2001)zbMATHCrossRefMathSciNetGoogle Scholar
  9. 9.
    Mangasarian, O.L.: Generalized Support Vector Machines. In: Smola, A., Bartlett, P., Scholkopf, B., Schurrmans, D. (eds.) Advances in Large Margin Classifiers, pp. 35–146. MIT Press, Cambridge (2000)Google Scholar
  10. 10.
    Newman, D.J., Hettich, S., Blake, C.L.S., Merz, C.J.: UCI repository of machine learning database. University of California, Dept. of Information and Computer Science, Irvine, CA (1998), Google Scholar
  11. 11.
    Ozsen, S., Gunes, S.: Attribute weighting via genetic algorithms for attribute weighted artificial immune system (AWAIS) and its application to heart disease and liver disorders problems. Expert Systems with Applications 36, 386–392 (2009)CrossRefGoogle Scholar
  12. 12.
    Polat, K., Gunes, S.: An Expert System Approach Based on Principal Component Analysis and Adaptive Neuro-Fuzzy Inference System to Diagnosis of Diabetes Disease. Digital Signal Processing 17, 702–710 (2007)CrossRefGoogle Scholar
  13. 13.
    Polat, K., Gunes, S., Aslan, A.: A Cascade Learning System for Classification of Diabetes Disease: Generalized Discriminant Analysis and Least Square Support Vector Machine. Expert System with Applications 34, 214–222 (2008)CrossRefGoogle Scholar
  14. 14.
    Purnami, S.W., Embong, A., Zain, J.M.: A New Smooth Support Vector Machine and Its Applications in Diabetes Disease Diagnosis. Journal of Computer Science 5(12), 1006–1011 (2009)Google Scholar
  15. 15.
    Temurtas, H., et al.: A Comparative Study on Diabetes Disease Using Neural Networks. Expert System with Applications 36, 8610–8615 (2009)CrossRefGoogle Scholar
  16. 16.
    Yuan, Y., Fan, W., Pu, D.: Spline Function Smooth Support Vector Machine for Classification. J. Ind. Manage. Optimiz. 3, 529–542 (2007)zbMATHMathSciNetGoogle Scholar
  17. 17.
    Vapnik, V.: The Nature of Statistical Learning Theory, 2nd edn. Springer, New York (1995)zbMATHGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Santi Wulan Purnami
    • 1
    • 2
  • Jasni Mohamad Zain
    • 1
  • Abdullah Embong
    • 1
  1. 1.Faculty of Computer System and Software EngineeringUniversity Malaysia PahangKuantan PahangMalaysia
  2. 2.Department of StatisticsInstitut Teknologi Sepuluh Nopember (ITS) SurabayaSurabayaIndonesia

Personalised recommendations