Skip to main content

Improving Performance of K-Means Clustering by Initializing Cluster Centers Using Genetic Algorithm and Entropy Based Fuzzy Clustering for Categorization of Diabetic Patients

  • Conference paper
Proceedings of International Conference on Advances in Computing

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 174))

Abstract

Medical Data mining is the process of extracting hidden patterns from medical data. Among the various clustering algorithms, k-means is the one of most widely used clustering technique. The performance of k-means clustering depends on the initial cluster centers and might converge to local optimum. K-Means does not guarantee unique clustering because it generates different results with randomly chosen initial clusters for different runs of k-means. This paper investigates the use of two methods namely Genetic Algorithm (GA) and Entropy based fuzzy clustering (EFC) to assign k-means initial cluster centers for clustering PIMA Indian diabetic dataset. Experimental results show markable improvement of 3.06% reduction in the classification error and execution time of k-means clustering initialized by GA and EFC when compared to k-means clustering with random cluster centers.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Han, Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kauffmann Publishers, San Francisco (2001)

    Google Scholar 

  2. Bradley, P.S., Fayyad, U.M.: Refining initial points for k-means algorithm. In: Proceedings of the 15th International Conference on Machine Learning (1998)

    Google Scholar 

  3. Al-Shbour, B., Myaeng, S.-H.: Initializing K-means using Genetic Algorithm. World Academy of Science, Engineering and Technlogy 54, 114–118 (2009)

    Google Scholar 

  4. Jimenez, J.F., Cuevas, F.J., Carpio, J.M.: Genetic Algorithms applied to Clustering Problem and Data Mining. In: Proceedings of the 7th WSEAS International Conference on Simulation, Modeling and Optimization, pp. 219–224 (2007)

    Google Scholar 

  5. Hao-Jun, Lang-Haun: Genetic Algorithm-based High-dimensional Data Clustering Technique. In: Sixth International Conference on Fuzzy Systems and Knowledge Discovery, pp. 485–489 (2009)

    Google Scholar 

  6. Eltibi, M.F., Ashour, W.M.: Initializing K-means Clustering Algorithm using Statistical Information. International Journal of Computer Applications 29, 51–55 (2011)

    Article  Google Scholar 

  7. Erisogly, M., Calis, N., Sakalliglu, S.: A new algorithms for initial cluster centers in k-means algorithm. Pattern Recognition Letters 32, 1701–1705 (2011)

    Article  Google Scholar 

  8. Dey, V., Pratihar, D.K., Datta, G.L.: Genetic algorithm-tuned entropy-based fuzzy C-means algorithm for obtaining distinct and compact clusters. Fuzzy Optimization Decision Making 10, 153–166 (2011)

    Article  MathSciNet  Google Scholar 

  9. Editorial, Diagnosis and Classification of Diabetes Mellitus, American Diabetes Association, Diabetes Care 27(suppl.1) (January 2004)

    Google Scholar 

  10. The Expert Committee on the Diagnosis and Classification of Diabetes Mellitus: Follow up report on the Diagnosis of Diabetes Mellitus. Diabetic Care 26, 3160–3167 (2003)

    Google Scholar 

  11. Breault, J.L.: Data Mining Diabetic Databases: Are rough Sets a Useful Addition? (2001), http://www.galaxy.gmu.edu/interface/I01/I2001Proceedings/Jbreault

  12. Mac Queen, J.: Some methods for the classification and analysis of multivariate observations. In: Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probility, pp. 281–297. University of California Press, Berkeley (1967)

    Google Scholar 

  13. Goldberg, D.: Genetic Algorithms in Search, Optimization, and Machine learning. Addison Wesley (1989)

    Google Scholar 

  14. Rajasekaran, S., Vijayalakshmi Pai, G.A.: Genetic Algorithm based Weight Determination for Backpropogation Networks. In: Proc. of the Fourth Int. Conf. on Advanced Computing, pp. 73–79 (1996)

    Google Scholar 

  15. Maulik, U., Bandopadhyay, S.: Genetic Algorithm-Based Clustering Technique. Pattern Recognition 33, 1455–1465 (1999)

    Article  Google Scholar 

  16. Yao, J., Dash, M., Tan, S.T., Liu, H.: Entropy based fuzzy clustering and fuzzy modeling. Fuzzy Sets and Systems 113, 381–388 (2000)

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Asha Gowda Karegowda .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer India

About this paper

Cite this paper

Karegowda, A.G., Vidya, T., Shama, Jayaram, M.A., Manjunath, A.S. (2013). Improving Performance of K-Means Clustering by Initializing Cluster Centers Using Genetic Algorithm and Entropy Based Fuzzy Clustering for Categorization of Diabetic Patients. In: Kumar M., A., R., S., Kumar, T. (eds) Proceedings of International Conference on Advances in Computing. Advances in Intelligent Systems and Computing, vol 174. Springer, New Delhi. https://doi.org/10.1007/978-81-322-0740-5_108

Download citation

  • DOI: https://doi.org/10.1007/978-81-322-0740-5_108

  • Publisher Name: Springer, New Delhi

  • Print ISBN: 978-81-322-0739-9

  • Online ISBN: 978-81-322-0740-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics