Abstract
The paper reports an efficient clustering technique for high-dimensional data using cycles of reversible finite cellular automata (CAs). As any arbitrary cellular automaton (CA) is not useful for maintaining less intra-cluster and high inter-cluster distance essential for an effective clustering, first the candidate CA rules have been identified based on theoretical properties of information flow and self-replication. Three stages of hierarchical clustering are incorporated over the encoded real dataset with any number of features. Because of the inherent parallelism of our algorithm, its running time is polynomial to the number of objects in the dataset which avoids the limitations of the existing CA-based clustering techniques. With respect to various standard benchmark performance metrics, our algorithm is at par with the other existing algorithms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Caliński, T., Harabasz, J.: A dendrite method for cluster analysis. Commun. Stat.-Theory Methods 3(1), 1–27 (1974)
Comaniciu, D., Meer, P.: Mean shift: a robust approach toward feature space analysis. IEEE Trans. Pattern Anal. Mach. Intell. 24(5), 603–619 (2002)
Dougherty, J., Kohavi, R., Sahami, M.: Supervised and unsupervised discretization of continuous features. In: Machine Learning Proceedings 1995, pp. 194–202. Elsevier (1995)
Dunn, J.C.: Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20, 53–65 (1987)
Ester, M., Kriegel, H.P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, pp. 226–231. KDD’96, AAAI Press (1996)
Hartigan, J.A., Wong, M.A.: Algorithm as 136: a k-means clustering algorithm. J. R. Stat. Soc. Ser. c (applied statistics) 28(1), 100–108 (1979)
Jain, A., Murthy, M., Flynn, P.: Data clustering: a review. ACM Comput. Surv. 31(3), 165–193 (1999)
Kamilya, S., Das, S.: A study of chaos in cellular automata. Int. J. Bifurc. Chaos 28(03), 1830008 (2018)
Likas, A., Vlassis, N., Verbeek, J.J.: The global k-means clustering algorithm. Pattern Recogn. 36(2), 451–461 (2003)
Mukherjee, S., Bhattacharjee, K., Das, S.: Clustering using cyclic spaces of reversible cellular automata. Complex Syst. 30(2), 205–237 (2021)
Mukherjee, S., Bhattacharjee, K., Das, S.: Reversible cellular automata: a natural clustering technique. J. Cellular Automata 16 (2021)
Xu, D., Tian, Y.A.: A comprehensive survey of clustering algorithms. Ann. Data Sci. 2, 165–193 (2015)
Xu, R., Wunsch, D.: Survey of clustering algorithms. IEEE Trans. Neural Netw. 16(3), 645–678 (2005)
Zhang, T., Ramakrishnan, R., Livny, M.: Birch: a new data clustering algorithm and its applications. Data Mining Knowl. Dis. 1(2), 141–182 (1997)
Acknowledgements
This work is partially supported by Start-up Research Grant (File number SRG/2022/002098), SERB, Department of Science & Technology, Government of India. The authors are grateful to Prof. Sukanta Das and Dr. Sukanya Mukherjee for their insightful comments and discussions which have been useful for this work. A special thanks goes to Mr. Subrata Paul for his help in coding.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Abhishek, S., Dharwish, M., Das, A., Bhattacharjee, K. (2023). A Cellular Automata-Based Clustering Technique for High-Dimensional Data. In: Das, S., Martinez, G.J. (eds) Proceedings of Second Asian Symposium on Cellular Automata Technology. ASCAT 2023. Advances in Intelligent Systems and Computing, vol 1443. Springer, Singapore. https://doi.org/10.1007/978-981-99-0688-8_4
Download citation
DOI: https://doi.org/10.1007/978-981-99-0688-8_4
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-0687-1
Online ISBN: 978-981-99-0688-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)