An Improved Heuristic K-Means Clustering Method Using Genetic Algorithm Based Initialization

Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 509)

Abstract

In this paper, we propose methods to remove the drawbacks that commonly afflict the k-means clustering algorithm. We use nature based heuristics to improve the clustering performance offered by the k-means algorithm and also ensure the creation of the requisite number of clusters. The use of GA is found to be adequate in this case to provide a good initialization to the algorithm, and this is followed by a differential evolution based heuristic to ensure that the requisite number of clusters is created without minimal increase in the running time of the algorithm.

Keywords

K-means GA Precomputation DE heuristic 

References

  1. 1.
    Hatamlou, A.: Black hole: a new heuristic optimization approach for data clustering. Inf. Sci. 222, 175–184 (2013)MathSciNetCrossRefGoogle Scholar
  2. 2.
    Pham, D., Karaboga, D.: Intelligence Optimization Techniques: Genetic Algorithms, Tabu Search, Simulated Annealing and Neural Networks. Springer Science and Business Media (2012)Google Scholar
  3. 3.
    Song, W., Qiao, Y., Park, S.C., Qian, X.: A hybrid evolutionary computation approach with its application for optimizing text document clustering. Expert Syst. Appl. 42(5), 2517–2524 (2015)CrossRefGoogle Scholar
  4. 4.
    İnkaya, T., Kayalıgil, S., Özdemirel, N.E.: Ant colony optimization based clustering methodology. Appl. Soft Comput. 28, 301–311 (2015)CrossRefGoogle Scholar
  5. 5.
    Bramer, M., Ellis, R., Petridis, M., Yang, X.S.: Firefly algorithm, Levy flights and global optimization. Research and development in Intelligent Systems, vol. XXVI, pp. 209–218. Springer, London (2010)Google Scholar
  6. 6.
    Oztruk, C., Hancer, E., Karaboga, D.: Improved clustering criterion for image clustering with artificial bee colony algorithm. Pattern Anal. Appl. (2014)Google Scholar
  7. 7.
    Gonzalez, J.,Petla, D., Cruz, C., Terrazas, G., Krasnogor, N., Yang, X.-S.: A new metaheuristic bat-inspired algorithm. Nature Inspired Cooperative Strategies for Optimization, NICSO 2010, pp. 65–74. Springer (2011)Google Scholar
  8. 8.
    Hatamlou, A.: A Robust data clustering approach using gravitational search algorithm and a heuristic search approach. Global J. Technol. 74–83 (2014)Google Scholar
  9. 9.
    Cui, X., Gao, J., Potok, T.E.: A flocking based algorithm for document clustering analysis. J. Syst. Arch. 52, 505–515 (2006) (Elsevier)Google Scholar
  10. 10.
    Kalogeratos, A., Likas, A.: Document clustering using synthetic cluster prototypes. Data Knowl. Eng. 70(3), 284–306 (2011)CrossRefGoogle Scholar
  11. 11.
    Singh, V.K., Tiwari, N., Garg, S.: Document clustering using k-means, heuristic k-means and fuzzy c-means. In: 2011 International Conference on Computational Intelligence and Communication Networks (CICN), pp. 297–301. IEEE (2011)Google Scholar
  12. 12.
    Tan, P.N., Steinbach, M., Kumar, V.: Introduction to Data Mining. Pearson Education (2006)Google Scholar
  13. 13.
    Rothlauf, F.: Representations for Genetic and Evolutionary Algorithm. Springer, Berlin (2006)Google Scholar
  14. 14.
    Premalatha, K., Natarajan, A.M.: Genetic algorithm for document clustering with simultaneous and ranked mutation. Modern Appl. Sci. 3(2) (2009)Google Scholar
  15. 15.
    Mukhopadhyay, A., Maulik, U., Bandyopadhyay, S., Coello, C.: A survey of multiobjective evolutionary algorithms for data mining: part I. IEEE Trans. Evol. Comput. 18(1), 4–19 (2014)CrossRefGoogle Scholar
  16. 16.
    Das, S., Abraham, A., Konar, A.: Automatic hard clustering using improved differential evolution algorithm, Stud. Comput. Intell. 137–174 (2009)Google Scholar

Copyright information

© Springer Science+Business Media Singapore 2017

Authors and Affiliations

  1. 1.Department of CSEBirla Institute of TechnologyMesraIndia

Personalised recommendations