Density Peaks Clustering Algorithm Based on K Nearest Neighbors

Yin, Shihao; Wu, Runxiu; Li, Peiwu; Liu, Baohong; Fu, Xuefeng

doi:10.1007/978-981-16-8048-9_13

Part of the book series: Smart Innovation, Systems and Technologies (SIST, volume 268)

Abstract

Density peaks clustering (DPC) algorithms calculate local density from the cutoff distance and the global distribution of the samples. As a result, they capture the local characteristics of the samples poorly and are prone to errors when selecting density peaks. In addition, the allocation strategy has poor fault tolerance: once a sample is allocated incorrectly, subsequent allocations magnify the error. We therefore propose a density peaks clustering algorithm based on k-nearest neighbors (DPC-KNN). First, the k-nearest-neighbor information of each sample is used to define its local density, and the cluster centers are found accordingly. Next, a k-nearest-neighbor sample whose distance to a cluster center is less than a set threshold is defined as a core sample, and the core samples are assigned to the corresponding cluster to form the core area of that cluster. Finally, the degree of attribution of each remaining sample to each cluster is calculated, and the sample is allocated to the cluster for which its degree of attribution is highest. To verify the effectiveness of the proposed algorithm, experiments are run on eight synthetic datasets and ten UCI datasets, and the proposed algorithm is compared with FKNN-DPC, DPCSA, FNDPC, DPC and DBSCAN. The experimental results indicate that the proposed algorithm achieves better clustering performance.
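
The first step of the abstract (a k-nearest-neighbor local density followed by cluster-center selection) can be illustrated with a minimal sketch. The abstract does not give the density formula, so the one used here, the exponential of the negative mean distance to the k nearest neighbors, is an assumption made only for illustration. The relative distance `delta` and the ranking by `rho * delta` follow the original DPC of Rodriguez and Laio. The function name `knn_density_peaks` and its parameters are hypothetical and are not taken from the chapter.

```python
import math


def knn_density_peaks(points, k=3, n_centers=2):
    """Illustrative sketch: KNN-based density with DPC-style center selection."""
    n = len(points)
    # Pairwise Euclidean distances between all samples.
    dist = [[math.dist(p, q) for q in points] for p in points]

    # Assumed local density: larger when the k nearest neighbors are closer.
    rho = []
    for i in range(n):
        nearest = sorted(dist[i][j] for j in range(n) if j != i)[:k]
        rho.append(math.exp(-sum(nearest) / len(nearest)))

    # Relative distance, as in DPC: distance to the nearest sample of
    # higher density; the densest sample takes its largest distance instead.
    delta = []
    for i in range(n):
        higher = [dist[i][j] for j in range(n) if rho[j] > rho[i]]
        delta.append(min(higher) if higher else max(dist[i]))

    # Candidate centers are the samples with the largest rho * delta.
    order = sorted(range(n), key=lambda i: rho[i] * delta[i], reverse=True)
    return rho, delta, order[:n_centers]
```

Called on two well-separated groups of points, the function is expected to return one candidate center from each group, because only the densest sample of a group has no denser neighbor nearby and therefore a large `delta`. The core-sample and degree-of-attribution steps are not sketched, since the abstract does not specify the threshold or the attribution measure.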

Acknowledgments

This research was supported by the Science and Technology Project of Jiangxi Province Department of Education (No. GJJ180940), the National Natural Science Foundation of China (No. 61762063), and the Scientific Research Project of the Department of Education (No. GJJ170991).

Author information

Authors and Affiliations

School of Information Engineering, Nanchang Institute of Technology, Nanchang, 330099, China
Shihao Yin, Runxiu Wu, Peiwu Li, Baohong Liu & Xuefeng Fu

Corresponding author

Correspondence to Runxiu Wu.

Editor information

Editors and Affiliations

Institute of Intelligent Media Technology, Communication University of Zhejiang, Hangzhou City, China
Jie-Fang Zhang
College of Computer Science and Engineering, Shandong University of Science and Technology, Shandong, China
Chien-Ming Chen
College of Computer Science and Engineering, Shandong University of Science and Technology, Qingdao, Shandong, China
Shu-Chuan Chu
Department of Radio Communications and Video Technologies, Technical University of Sofia, Sofia, Bulgaria
Roumen Kountchev

About this paper

Cite this paper

Yin, S., Wu, R., Li, P., Liu, B., Fu, X. (2022). Density Peaks Clustering Algorithm Based on K Nearest Neighbors. In: Zhang, JF., Chen, CM., Chu, SC., Kountchev, R. (eds) Advances in Intelligent Systems and Computing. Smart Innovation, Systems and Technologies, vol 268. Springer, Singapore. https://doi.org/10.1007/978-981-16-8048-9_13

DOI: https://doi.org/10.1007/978-981-16-8048-9_13
Published: 22 February 2022
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-8047-2
Online ISBN: 978-981-16-8048-9
eBook Packages: Intelligent Technologies and Robotics (R0)
