Skip to main content

Density Peaks Clustering Algorithm Based on K Nearest Neighbors

  • Conference paper
  • First Online:
Advances in Intelligent Systems and Computing

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 268))

Abstract

Density peaks clustering algorithms calculate the local density based on the cutoff distance and the global distribution of the sample. They cannot capture the local characteristics of the sample well, and are prone to appear errors in the selection of density peaks; additionally, the allocation strategy has poor fault tolerance. Once a sample is allocated incorrectly, subsequent allocations will magnify the error. Hence, we proposed a density peaks clustering algorithm based on k-nearest neighbors (DPC-KNN). First, the k-nearest neighbors information of the sample is used to define the local density of the sample in order to find the cluster centers accordingly; the sample with the distance between cluster centers and k-nearest neighbors sample less than the set threshold is defined as the core sample, and the core sample is classified into the corresponding cluster to construct the core area of the cluster; after the degree of attribution of the remaining samples and various clusters are calculated, they are allocated to clusters with high degree of attribution. In order to verify the effectiveness of the proposed algorithm, eight synthetic datasets and ten UCI datasets are selected for experiments, and the proposed algorithm is compared with FKNN-DPC, DPCSA, FNDPC, DPC and DBSCAN. The experimental results indicated that the proposed algorithm had better clustering performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 249.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 249.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Berkhin, P.: A survey of clustering data mining techniques. Grouping Multidimensional Data 43(1), 25–71 (2006)

    Article  MathSciNet  Google Scholar 

  2. Xu, R., Wunsch, D.: Survey of clustering algorithm. IEEE Trans. Neural Netw. 16(3), 645–678 (2005)

    Article  Google Scholar 

  3. Xu, D., Tian, Y.: A comprehensive survey of clustering algorithm. Annals Data Sci. 2(2), 165–193 (2015)

    Article  MathSciNet  Google Scholar 

  4. Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. 31(3), 264–323(1999)

    Google Scholar 

  5. Rodriguez, A., Laio, A.: Clustering by fast search and find of density peaks. Science 344(6191), 1492–1496 (2014)

    Article  Google Scholar 

  6. Du, M., Ding, S., Jia, H.: Study on density peaks clustering based on K-nearest neighbors and principal component analysis. Knowl. Based Syst. 99, 135–145 (2016)

    Article  Google Scholar 

  7. Xie, J., Gao, H., Xie, W.: K-nearest neighbors optimized clustering algorithm by fast search and finding the density peaks of a dataset. Scientia Sinica Informationis 46(2), 258–280 (2016)

    Google Scholar 

  8. Liu, R., Wang, H., Yu, X.: Shared-nearest-neighbor-based clustering by fast search and find of density peaks. Inf. Sci. 450, 200–226 (2018)

    Article  MathSciNet  Google Scholar 

  9. Lu, J., Zhu, Q.: An effective algorithm based on density clustering framework. IEEE Access 5, 4991–5000 (2017)

    Article  Google Scholar 

  10. Seyedi, S.A., Lotfi, A., Moradi, P., Qader, N.N.: Dynamic graph-based label propagation for density peaks clustering. Expert Syst. Appl. 115, 314–328 (2019)

    Article  Google Scholar 

  11. Xie, J., Gao, H., Xie, W., Liu, X., Grant, P.W.: Robust clustering by detecting density peaks and assigning points based on fuzzy weighted K-nearest neighbors. Inf. Sci. 354, 19–40 (2016)

    Article  Google Scholar 

  12. Xue, X., Gao, S., Peng, H.: Density peaks clustering algorithm based on K-nearest neighbors and classes-merging. J. jilin Univ. (Science Edition) 57(1), 111–120 (2019)

    Google Scholar 

  13. Yu, D., Liu, G., Guo, M., Liu, X., Yao, S.: Density peaks clustering based on weighted local density sequence and nearest neighbor assignment. IEEE Access 7, 34301–34317 (2019)

    Article  Google Scholar 

  14. Du, M., Ding, S., Xue, Y.: A robust density peaks clustering algorithm using fuzzy neighborhood. Int. J. Mach. Learn. Cybern. 9(7), 1131–1140 (2018)

    Article  Google Scholar 

  15. Ester, M.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, pp. 226–231. Palo Alto, AAAI Press(1996)

    Google Scholar 

  16. Vinh, N., Epps, J., Bailey, J.: Information theoretic measures for clusterings comparison: Variants, properties, normalization and correction for chance. J. Mach. Learn. Res. 11(1), 2837–2854 (2010)

    MathSciNet  MATH  Google Scholar 

  17. Fowlkes, E.B., Mallows, C.L.: A Method for Comparing Two Hierarchical Clusterings. J. Am. Stat. Assoc. 78(383), 553–569 (1983)

    Article  Google Scholar 

  18. Jain, A.K., Law, M.H.: Data clustering: a user’s dilemma. In: Proceedings of the First International Conference on Pattern Recognition and Machine Intelligence, pp.1–10.Heidelberg, Springer (2005)

    Google Scholar 

  19. Chang, H., Yeung, D.Y.: Robust path-based spectral clustering. Pattern Recogn. 41(1), 191–203 (2008)

    Article  Google Scholar 

  20. Gionis, A., Mannila, H., Tsaparas, P.: Clustering aggregation. ACM Trans. Knowl. Discov. Data 1(1), 1–30 (2007)

    Article  Google Scholar 

  21. Fu, L., Medico, E.: FLAME, a novel fuzzy clustering method for the analysis of DNA microarray data. BMC Bioinform. 8(1), 3 (2007)

    Google Scholar 

  22. Veenman, C.J., Reinders, M.J.T., Backer, E.: A maximum variance cluster algorithm. IEEE Trans. Pattern Anal. Mach. Intell. 24(9), 1273–1280 (2002)

    Article  Google Scholar 

  23. Frnti, P., Virmajoki, O.: Iterative shrinking method for clustering problems. Pattern Recogn. 39(5), 761–775 (2006)

    Article  Google Scholar 

  24. UCI repository of machine learning database. http://archive.ics.uci.edu/ml/index.html

  25. Street, W.N., Wolberg, W.H., Mangasarian, O.L.: Nuclear feature extraction for breast tumor diagnosis. In: Proceedings of the IS&T/SPIE International Symposium on Electronic Imaging:Science and Technology, pp. 861–870 (1993)

    Google Scholar 

  26. Charytanowicz, M., Niewczas, J., Kulczycki, P., et al.: Complete gradient clustering algorithm for features analysis of x-ray images. Adv. Intell. Soft Comput. 69, 15–24 (2010)

    Article  Google Scholar 

  27. Sigillito, V.G., Wing, S.P., Hutton, L.V., et al.: Classification of radar returns from the ionosphere using neural networks. J. Hopkins APL Tech. Dig. 10(3), 262–266 (1989)

    Google Scholar 

Download references

Acknowledgments

This research was supported by the Science and Technology Project of Jiangxi Province Department of Education (No. GJJ180940),the National Natural Science Foundation of China (61762063),the Scientific research project of the Department of Education (No. GJJ170991).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Runxiu Wu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Yin, S., Wu, R., Li, P., Liu, B., Fu, X. (2022). Density Peaks Clustering Algorithm Based on K Nearest Neighbors. In: Zhang, JF., Chen, CM., Chu, SC., Kountchev, R. (eds) Advances in Intelligent Systems and Computing. Smart Innovation, Systems and Technologies, vol 268. Springer, Singapore. https://doi.org/10.1007/978-981-16-8048-9_13

Download citation

Publish with us

Policies and ethics