Abstract
Clustering is one of the fundamental operations in data analysis. The efficiency of a clustering algorithm depends on factors such as the initialization of cluster centers, the shape of the clusters, the density of the dataset, and the complexity of the clustering mechanism. Previous work in clustering has achieved strong results, but only by tuning user-defined parameters through trial and error, and these parameters have a large bearing on the quality of the clusters formed. In this work, we propose a method that uses a Genetic Algorithm (GA) to optimize the user-defined parameters of a clustering algorithm called Probability Propagation (PP). To overcome the sensitivity of PP to these parameters, the GA searches for optimal values of the bandwidth \(\delta\) and the parameter s by maximizing inter-cluster spread and minimizing intra-cluster spread among the clusters being formed. The top chromosomes retrieved by the proposed method (pairs of bandwidth and s) were found to yield a similar number of clusters, thus eliminating the sensitivity to the user-defined parameters.
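The search described in the abstract can be illustrated with a minimal sketch. The abstract gives neither the PP update rules nor the exact fitness formula, so everything below is an assumption made for illustration: `stand_in_cluster` is a hypothetical placeholder clusterer (not Probability Propagation), the fitness is a Calinski-Harabasz-style ratio of inter-cluster to intra-cluster spread, s is treated as an integer neighbour count, and the parameter ranges, population size, and mutation settings are arbitrary.

```python
import math
import random


def stand_in_cluster(points, delta, s):
    """Placeholder clusterer (NOT Probability Propagation): link each point to
    its s nearest neighbours within distance delta, then label the connected
    components of the resulting graph."""
    n = len(points)
    parent = list(range(n))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    for i in range(n):
        near = sorted((math.dist(points[i], points[j]), j) for j in range(n) if j != i)
        for d, j in near[:s]:
            if d <= delta:
                parent[find(i)] = find(j)
    roots = [find(i) for i in range(n)]
    ids = {r: k for k, r in enumerate(dict.fromkeys(roots))}
    return [ids[r] for r in roots]


def centroid(pts):
    return tuple(sum(c) / len(pts) for c in zip(*pts))


def fitness(points, labels):
    """Assumed fitness: inter-cluster spread over intra-cluster spread,
    normalised in the style of the Calinski-Harabasz index (higher is better)."""
    groups = {}
    for p, lab in zip(points, labels):
        groups.setdefault(lab, []).append(p)
    k, n = len(groups), len(points)
    if k < 2 or k == n:
        return 0.0  # degenerate: a single cluster or all singletons
    overall = centroid(points)
    inter = sum(len(g) * math.dist(centroid(g), overall) ** 2 for g in groups.values())
    intra = sum(math.dist(p, centroid(g)) ** 2 for g in groups.values() for p in g)
    return (inter / (k - 1)) / (intra / (n - k) + 1e-12)


def evolve(points, generations=20, pop_size=12, seed=0):
    """Tiny elitist GA over chromosomes (delta, s); all settings are illustrative."""
    rng = random.Random(seed)

    def score(ch):
        return fitness(points, stand_in_cluster(points, ch[0], ch[1]))

    pop = [(rng.uniform(0.1, 5.0), rng.randint(1, 5)) for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=score, reverse=True)
        elite = pop[: pop_size // 2]  # keep the better half unchanged
        children = []
        while len(elite) + len(children) < pop_size:
            a, b = rng.sample(elite, 2)
            delta = (a[0] + b[0]) / 2 + rng.gauss(0, 0.1)  # blend crossover + mutation
            s = rng.choice((a[1], b[1]))
            if rng.random() < 0.2:
                s = max(1, s + rng.choice((-1, 1)))
            children.append((max(0.01, delta), s))
        pop = elite + children
    return max(pop, key=score)


if __name__ == "__main__":
    data = [(0, 0), (0, 1), (1, 0), (1, 1), (10, 10), (10, 11), (11, 10), (11, 11)]
    best_delta, best_s = evolve(data)
    print(best_delta, best_s, stand_in_cluster(data, best_delta, best_s))
```

In this sketch each chromosome is a (bandwidth, s) pair, matching the chromosome description in the abstract. Swapping `stand_in_cluster` for an actual PP implementation would leave the GA loop unchanged.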
Copyright information
© 2021 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Dalmia, S., Sriram, A., Ashwin, T.S. (2021). Genetic Algorithm-Based Optimization of Clustering Data Points by Propagating Probabilities. In: Mandal, J.K., Mukherjee, I., Bakshi, S., Chatterji, S., Sa, P.K. (eds) Computational Intelligence and Machine Learning. Advances in Intelligent Systems and Computing, vol 1276. Springer, Singapore. https://doi.org/10.1007/978-981-15-8610-1_2
DOI: https://doi.org/10.1007/978-981-15-8610-1_2
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-8609-5
Online ISBN: 978-981-15-8610-1
eBook Packages: Intelligent Technologies and Robotics (R0)