Abstract
Data clustering is a popular data analysis approach for organizing similar objects without needing to be monitored - a collection of data, into meaningful groups (or clusters). Many approaches from different disciplines have been proposed as solutions for clustering problems and each algorithm has its advantages and drawbacks. This research paper offered a brief overview and performance comparison analysis between different existing data mining clustering strategies based on Particle Swarm Optimization, Genetic Algorithm, K-means and Hybrid PSO Solutions. The reasoning behind choosing these algorithms is that PSO has, due to the applicability of its advanced hybrid variants, reached a remarkable position in this field, and the Genetic Algorithm based unsupervised clustering technique provides a stable clustering performance with less computational time required. However, we found that Hybrid PSO solutions have been shown to produce excellent results in a wide variety of real-world data. In this paper, a brief review of these algorithms applied to data clustering has been described.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Abdel-Kadel, R.F.: Genetically improved PSO algorithm for efficient data clustering. In: Second International Conference on Machine Learning and Computing, pp. 71–75 (2010)
Acid, S., De Campos, L., Fernandez-Luna, J., Huete, J.: An information retrieval model based on simple bayesian networks. Int. J. Intell. Syst. 18(2), 251–265 (2003)
Ahmadyfard, A., Modares, H.: Combining PSO and k-means to enhance data clustering.In: International symposium on telecommunications, pp. 688–691 (2008)
Al-Maqaleh, B., Shahbazkia, H.: A genetic algorithm for discovering classification rules in data mining. Int. J. Comput. Appl. 41(8), 42–43 (2012)
Ballardini, A.L.: PSO Clustering Algorithm (2014). https://github.com/iralabdisco/pso-clustering. Accessed 28 April 2020
Borkar, P., Patil, L.: Web information retrieval using genetic algorithm-particle swarm optimization. Int. J. Future Comput. Commun. 2(6), 595–599 (2013)
Cui, X., Potok, T.E.: Document clustering analysis based on hybrid PSO + K-means algorithm. J. Comput. Sci (Special Issue) 27, 33 (2005)
Dragnic, N.: Konstrukcija i analiza klaster algoritma sa primenom u definisanju bihejvioralnih faktora rizika u populaciji odraslog stanovnistva Srbije. Univerzitet u Novom Sadu, Novi Sad (2015)
Glisovic, N., Davidovic, T., Raskovic, M.: Clustering when missing data by using the variable neighborhood search. In: XLIV Symposium on Operational Research, Zlatibor, Serbia, pp. 158–159 (2017)
Halkidi, M., Bastiakis, Y., Vazirgiannis, M.: On clustering validation techniques. J. Intell. Inf. Syst. 17(2-3), 108–109 (2001)
Holland, J.: Adaptation in natural and artificial systems. an introductory analysis with application to biology, control, and artificial intelligence. University of Michigan Press, Ann Arbour. (1975)
Kamel, N., Ouchen, I., Baali, K.: A sampling PSO-K-means algorithm for document clustering. In: Pan, J.S., Krömer, P., Snášel, V. (eds.) Genetic and Evolutionary Computing. Advances in Intelligent Systems and Computing, pp. 45–54. Springer, Cham. (2014). https://doi.org/10.1007/978-3-319-01796-9_5
Kovacic, Z.J.: Multivarijaciona analiza. Ekonomski fakultet, Beograd (1994)
Malini Devi, G., Lakshmi Prasanna, M., Seetha, M.: Performance analysis of genetic algorithm with PSO for data clustering. Int. J. Adv. Eng. Res. Sci. – IJAERS, 3(11), 115–119 (2016)
Matteucci, M.: A tutorial on Clustering Algorithms - Hierarchical Clustering Algorithms. http://home.deib.polimi.it/matteucc/Clustering/tutorial_html/hierarchical.html. Accessed 24 April 2020
Nayak, J., Kanungo, D.P., Behera, H.S.: an improved swarm based hybrid k-means clustering for optimal cluster centers. In: Advances in Intelligent Systems and Computing (2015)
Omran, M., Salman, A., Engelbrecht, A.: Image classification using particle swarm optimization. In: Proceedings of the 4th Asia-Pacific Conference on Simulated Evolution and Learning (2002)
Prascevic, Z., Prascevic, N.: Particle Swarm Optimization. Izgradnja 62, 339–345 (2008)
Premalatha, K., Natarajan, A.M.: Hybrid PSO and GA models for document clustering. Int. J. Adv. Soft Comput. Appl. 2(3) 309–310 (2010)
Rana, S., Jasola, S., Kumar, R.: A hybrid sequential approach for data clustering using K-Means and particle swarm optimization algorithm. Int. J. Eng. Sci. Technol. 2(6) (2010)
Sarkar, S., Roy, A., Purkayastha, B.: Application of particle swarm optimization in data clustering. Int. J. Comput. Appl. 65(25) (2013)
Saxena, A, Prasad, M, Gupta, A, et al.: A review of clustering techniques and developments. Neurocomputing 267, 664–681 (2017)
TREC. Text Retrieval Conference (1999). http://trec.nist.gov. Accessed 02 Aug 2020
Van der Merwe, D.W., Engelhrecht, A.P.: Data clustering using particle swarm optimization. In: Conference of Evolutionary Computation CEC 2003, Canberra, ACT, Australia, pp. 215–220 (2003)
Wang, L., Si, G.: Optimal location management in mobile computing with hybrid Genetic algorithm and Particle Swarm Optimization (GAPSO). In: IEEE Transaction Information Theory, pp. 1877–1886 (2012)
Acknowledgement
The work described in this paper was supported by The Natural Science Foundation of China (Grant No.71971143, 71571120), Natural Science Foundation of Guangdong Province (2020A1515010749), Key Research Foundation of Higher Education of Guangdong Provincial Education Bureau (2019KZDXM030), Guangdong Province Soft Science Project (2019A101002075), Guangdong Province Educational Science Plan 2019 (2019JKCY010) and Guangdong Province Postgraduate Education Innovation Research Project (2019SFKC46).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Gulan, M., Huang, K. (2020). An Analysis of K-Means, Particle Swarm Optimization and Genetic Algorithm with Data Clustering Technique. In: Huang, DS., Jo, KH. (eds) Intelligent Computing Theories and Application. ICIC 2020. Lecture Notes in Computer Science(), vol 12464. Springer, Cham. https://doi.org/10.1007/978-3-030-60802-6_41
Download citation
DOI: https://doi.org/10.1007/978-3-030-60802-6_41
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-60801-9
Online ISBN: 978-3-030-60802-6
eBook Packages: Computer ScienceComputer Science (R0)