Abstract
Density peaks clustering algorithms calculate the local density based on the cutoff distance and the global distribution of the sample. They cannot capture the local characteristics of the sample well, and are prone to appear errors in the selection of density peaks; additionally, the allocation strategy has poor fault tolerance. Once a sample is allocated incorrectly, subsequent allocations will magnify the error. Hence, we proposed a density peaks clustering algorithm based on k-nearest neighbors (DPC-KNN). First, the k-nearest neighbors information of the sample is used to define the local density of the sample in order to find the cluster centers accordingly; the sample with the distance between cluster centers and k-nearest neighbors sample less than the set threshold is defined as the core sample, and the core sample is classified into the corresponding cluster to construct the core area of the cluster; after the degree of attribution of the remaining samples and various clusters are calculated, they are allocated to clusters with high degree of attribution. In order to verify the effectiveness of the proposed algorithm, eight synthetic datasets and ten UCI datasets are selected for experiments, and the proposed algorithm is compared with FKNN-DPC, DPCSA, FNDPC, DPC and DBSCAN. The experimental results indicated that the proposed algorithm had better clustering performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Berkhin, P.: A survey of clustering data mining techniques. Grouping Multidimensional Data 43(1), 25–71 (2006)
Xu, R., Wunsch, D.: Survey of clustering algorithm. IEEE Trans. Neural Netw. 16(3), 645–678 (2005)
Xu, D., Tian, Y.: A comprehensive survey of clustering algorithm. Annals Data Sci. 2(2), 165–193 (2015)
Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. 31(3), 264–323(1999)
Rodriguez, A., Laio, A.: Clustering by fast search and find of density peaks. Science 344(6191), 1492–1496 (2014)
Du, M., Ding, S., Jia, H.: Study on density peaks clustering based on K-nearest neighbors and principal component analysis. Knowl. Based Syst. 99, 135–145 (2016)
Xie, J., Gao, H., Xie, W.: K-nearest neighbors optimized clustering algorithm by fast search and finding the density peaks of a dataset. Scientia Sinica Informationis 46(2), 258–280 (2016)
Liu, R., Wang, H., Yu, X.: Shared-nearest-neighbor-based clustering by fast search and find of density peaks. Inf. Sci. 450, 200–226 (2018)
Lu, J., Zhu, Q.: An effective algorithm based on density clustering framework. IEEE Access 5, 4991–5000 (2017)
Seyedi, S.A., Lotfi, A., Moradi, P., Qader, N.N.: Dynamic graph-based label propagation for density peaks clustering. Expert Syst. Appl. 115, 314–328 (2019)
Xie, J., Gao, H., Xie, W., Liu, X., Grant, P.W.: Robust clustering by detecting density peaks and assigning points based on fuzzy weighted K-nearest neighbors. Inf. Sci. 354, 19–40 (2016)
Xue, X., Gao, S., Peng, H.: Density peaks clustering algorithm based on K-nearest neighbors and classes-merging. J. jilin Univ. (Science Edition) 57(1), 111–120 (2019)
Yu, D., Liu, G., Guo, M., Liu, X., Yao, S.: Density peaks clustering based on weighted local density sequence and nearest neighbor assignment. IEEE Access 7, 34301–34317 (2019)
Du, M., Ding, S., Xue, Y.: A robust density peaks clustering algorithm using fuzzy neighborhood. Int. J. Mach. Learn. Cybern. 9(7), 1131–1140 (2018)
Ester, M.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, pp. 226–231. Palo Alto, AAAI Press(1996)
Vinh, N., Epps, J., Bailey, J.: Information theoretic measures for clusterings comparison: Variants, properties, normalization and correction for chance. J. Mach. Learn. Res. 11(1), 2837–2854 (2010)
Fowlkes, E.B., Mallows, C.L.: A Method for Comparing Two Hierarchical Clusterings. J. Am. Stat. Assoc. 78(383), 553–569 (1983)
Jain, A.K., Law, M.H.: Data clustering: a user’s dilemma. In: Proceedings of the First International Conference on Pattern Recognition and Machine Intelligence, pp.1–10.Heidelberg, Springer (2005)
Chang, H., Yeung, D.Y.: Robust path-based spectral clustering. Pattern Recogn. 41(1), 191–203 (2008)
Gionis, A., Mannila, H., Tsaparas, P.: Clustering aggregation. ACM Trans. Knowl. Discov. Data 1(1), 1–30 (2007)
Fu, L., Medico, E.: FLAME, a novel fuzzy clustering method for the analysis of DNA microarray data. BMC Bioinform. 8(1), 3 (2007)
Veenman, C.J., Reinders, M.J.T., Backer, E.: A maximum variance cluster algorithm. IEEE Trans. Pattern Anal. Mach. Intell. 24(9), 1273–1280 (2002)
Frnti, P., Virmajoki, O.: Iterative shrinking method for clustering problems. Pattern Recogn. 39(5), 761–775 (2006)
UCI repository of machine learning database. http://archive.ics.uci.edu/ml/index.html
Street, W.N., Wolberg, W.H., Mangasarian, O.L.: Nuclear feature extraction for breast tumor diagnosis. In: Proceedings of the IS&T/SPIE International Symposium on Electronic Imaging:Science and Technology, pp. 861–870 (1993)
Charytanowicz, M., Niewczas, J., Kulczycki, P., et al.: Complete gradient clustering algorithm for features analysis of x-ray images. Adv. Intell. Soft Comput. 69, 15–24 (2010)
Sigillito, V.G., Wing, S.P., Hutton, L.V., et al.: Classification of radar returns from the ionosphere using neural networks. J. Hopkins APL Tech. Dig. 10(3), 262–266 (1989)
Acknowledgments
This research was supported by the Science and Technology Project of Jiangxi Province Department of Education (No. GJJ180940),the National Natural Science Foundation of China (61762063),the Scientific research project of the Department of Education (No. GJJ170991).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Yin, S., Wu, R., Li, P., Liu, B., Fu, X. (2022). Density Peaks Clustering Algorithm Based on K Nearest Neighbors. In: Zhang, JF., Chen, CM., Chu, SC., Kountchev, R. (eds) Advances in Intelligent Systems and Computing. Smart Innovation, Systems and Technologies, vol 268. Springer, Singapore. https://doi.org/10.1007/978-981-16-8048-9_13
Download citation
DOI: https://doi.org/10.1007/978-981-16-8048-9_13
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-8047-2
Online ISBN: 978-981-16-8048-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)