Abstract
Density-based clustering has gained increasing attention during the past decades as it allows the discovery of clusters with arbitrary shapes and is robust to noisy objects. However, existing density-based clustering approaches tend to fail if there exist multiple clusters with different densities in a sea of noise. In this paper, we propose a new multi-level clustering method by exploiting the dynamic local density with wavelet transform. Specifically, a concept of dynamic reverse k-nearest neighbor is first introduced, and its count distribution is modeled as a Poisson distribution. The dynamic local density, which is robust to density varieties, is further defined with the cumulative Poisson distribution function. Afterward, a cluster order is constructed based on the derived dynamic local density and finally used to yield the clusters by employing the wavelet transform. Compared to existing approaches, our proposed method can detect clusters with different densities and allows obtaining more clustering information such as the number of clusters, break points between clusters, the boundary of clusters, etc. Extensive experiments on both synthetic and real-world data sets have demonstrated that our proposed method is effective and produces better clustering results when compared to many state-of-the-art clustering algorithms.
Similar content being viewed by others
References
Saxena A, Prasad M, Gupta A, Bharill N, Patel O P, Tiwari A, Er M J, Ding W, Lin C-T (2017) A review of clustering techniques and developments. Neurocomputing 267:664–681
Campello R J G B, Kroeger P, Sander J, Zimek A (2020) Density-based clustering. Wiley Interdisciplinary Reviews-Data Mining and Knowledge Discovery, 10
Tang D, Zhang S, Chen J, Wang X (2021) The detection of low-rate dos attacks using the sadbscan algorithm. Inf Sci 565:229–247
Xie X, Duan L, Qiu T, Li J (2021) Quantum algorithm for mmng-based dbscan. Sci rep 11:15559
Rad M H, Abdolrazzagh-Nezhad M (2020) A new hybridization of dbscan and fuzzy earthworm optimization algorithm for data cube clustering. Soft Comput 24:15529–15549
Lv Y, Ma T, Tang M, Cao J, Tian Y, AL-Dhelaan A, Al-Rodhaan M (2016) An efficient and scalable density-based clustering algorithm for datasets with complex structures. Neurocomputing 171:9–22
Bryant A, Cios K (2018) Rnn-dbscan: A density-based clustering algorithm using reverse nearest neighbor density estimates. IEEE Trans Knowl Data Eng 30:1109–1121
Lu J, Zhu Q (2017) An effective algorithm based on density clustering framework. IEEE Access 5:4991–5000
Cassisi C, Ferro A, Giugno R, Pigola G, Pulvirenti A (2013) Enhancing density-based clustering: Parameter reduction and outlier detection. Inf Syst 38(3):317–330
Zhu Q, Feng J, Huang J (2016) Natural neighbor: A self-adaptive neighborhood method without parameter k. Pattern Recogn Lett 80:30–36
Xie H, Li P (2021) A density-based evolutionary clustering algorithm for intelligent development. Eng Appl Artif Intell, 104
Rodriguez A, Laio A (2014) Clustering by fast search and find of density peaks. Science 344:1492–1496
Du M, Ding S, Jia H (2016) Study on density peaks clustering based on k-nearest neighbors and principal component analysis. Knowl-Based Syst 99:135–145
Liu R, Wang H, Yu X (2018) Shared-nearest-neighbor-based clustering by fast search and find of density peaks. Inf Sci 450:200–226
Ni L, Luo W, Zhu W, Liu W (2019) Clustering by finding prominent peaks in density space. Eng Appl Artif Intell 85:727–739
Lu J, Zhu Q, Wu Q (2018) A novel data clustering algorithm using heuristic rules based on k-nearest neighbors chain. Eng Appl Artif Intell 72:213–227
Hou J, Zhang A, Qi N (2020) Density peak clustering based on relative density relationship. Pattern Recogn, 108
Wang S, Li Q, Zhao C, Zhu X, Yuan H, Dai T (2021) Extreme clustering - a clustering method via density extreme points. Inf Sci 542:24–39
Vadapalli S, Valluri S R, Karlapalem K (2006) A simple yet effective data clustering algorithm. In: IEEE International conference on data mining, p 1108
Ren Y, Wang N, Li M, Xu Z (2020) Deep density-based image clustering. Knowl-Based Syst 197:1–8
Guo S, Xu J, Chen D, Zhang C, Wang X, Zhao R (2020) Density-aware feature embedding for face clustering. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 6697– 6705
Qi C, Zhang J, Jia H, Mao Q, Wang L, Song H (2021) Deep face clustering using residual graph convolutional network. Knowl-Based Syst 211:2854–2871
Chen R, Tang Y, Tian L, Zhang C, Zhang W (2022) Deep convolutional self-paced clustering. Appl Intell 52(5):4858–4872
Inouye D, Yang E, Allen G, Ravikumar P (2017) A review of multivariate distributions for count data derived from the poisson distribution. Wiley interdisciplinary reviews Computational statistics, 9
Vagner A (2016) The gridoptics clustering algorithm. Intell Data Anal 20:1061–1084
Agrawal K P, Garg S, Sharma S, Patel P (2016) Development and validation of optics based spatio-temporal clustering technique. Inf Sci 369:388–401
Zhu X, Nie S, Wang C, Xi X, Wang J, Li D, Zhou H (2021) A noise removal algorithm based on optics for photon-counting lidar data. IEEE Geosci Remote Sens Lett 8:1471–1475
Franti P, Sieranoja S (2018) K-means properties on six clustering benchmark datasets. Appl Intell 48(12):4743–4759
Amin H U, Yusoff M Z, Ahmad R F (2020) A novel approach based on wavelet analysis and arithmetic coding for automated detection and diagnosis of epileptic seizure in eeg signals using machine learning techniques. Biomedical Signal Processing and Control, 56
Dogan S, Tuncer T (2021) A novel statistical decimal pattern-based surface electromyogram signal classification method using tunableq-factor wavelet transform. Soft Comput 25:1085– 1098
Suzuki T (2020) Wavelet-based spectral-spatial transforms for cfa-sampled raw camera image compression. IEEE Trans Image Process 29:433–444
Ma H, Liu D, Xiong R, Wu F (2020) iwave: Cnn-based wavelet-like transform for image compression. IEEE Trans Multimed 22(7):1667–1679
Daubechies I (1990) The wavelet transform, time-frequency localization and signal analysis. IEEE Trans Inf Theory 36:961–1005
Shyh-Jier H, Cheng-Tao H (2002) Coiflet wavelet transform applied to inspect power system disturbance-generated signals. IEEE Trans Aerosp Electron Syst 38:1
Bergil E, Oral C, Ergul E U (2021) Efficient hand movement detection using k-means clustering and k-nearest neighbor algorithms. J Med Biol Eng 41(1):11–24
Jafari M M, Ojaghlou H, Zare M, Schumann G J-P (2021) Application of a novel hybrid wavelet-anfis/fuzzy c-means clustering model to predict groundwater fluctuations. Atmosphere 12:1
Kulagin V P, Akimov D A, Pavelyev S A, Potapov D A (2019) Automated identification of critical malfunctions of aircraft engines based on modified wavelet transform and deep neural network clustering. In: 2019 Workshop on materials and engineering in aeronautics (MEA), vol 714
Sheikholeslami G, Chatterjee S, Zhang AD (2000) Wavecluster: a wavelet-based clustering approach for spatial data in very large databases. VLDB J 8(3-4):289–304
Xu Y, Zhu Q, Fan Z, Qiu M, Chen Y, Liu H (2013) Coarse to fine k nearest neighbor classifier. IEEE Trans Knowl Data Eng 34(9):980–986
Ester M, Kriegel HP, Sander J, Xu X (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the second international conference on knowledge discovery and data mining (KDD-96), pp 226–231
Ankerst M, Breunig M M, Kriegel H P, Plant C, Sander J (1999) Optics: ordering points to identify the clustering structure. In: SIGMOD 1999, Proceedings ACM SIGMOD international conference on management of data
von Luxburg U (2007) A tutorial on spectral clustering. Stat Comput 17(4):395–416
Liu X, Cheng H-M, Zhang Z-Y (2020) Evaluation of community detection methods. IEEE Trans Knowl Data Eng 32(9):1736–1746
Zhang S, Wong H-S, Shen Y (2012) Generalized adjusted rand indices for cluster ensembles. Pattern Recogn 45(6):2214–2226
Dua D, Graff C (2019) Uci machine learning repository [http://archive.ics.uci.edu/ml]. Irvine, CA: University of California, School of Information and Computer Science
Sampat M P, Wang Z, Gupta S, Bovik A C, Markey M K (2009) Complex wavelet structural similarity: a new image similarity index. IEEE Trans Image Process 18(11):2385–2401
Acknowledgements
This work is supported by the Fundamental Research Funds for the Central Universities (ZYGX2019Z014), National Natural Science Foundation of China (61976044, 52079026), Fok Ying-Tong Education Foundation (161062), Science and Technology Research Program of Chongqing Municipal Education Commission (KJQN202103109),and Sichuan Science and Technology Program (2020YFH0037).
We wish to confirm that there are no known conflicts of interest associated with this publication and there has been no significant financial support for this work that could have influenced its outcome.
We confirm that the manuscript has been read and approved by all named authors and that there are no other persons who satisfied the criteria for authorship but are not listed. We further confirm that the order of authors listed in the manuscript has been approved by all of us.
We understand that the Corresponding Author is the sole contact for the Editorial process (including Editorial Manager and direct communications with the office). He/she is responsible for communicating with the other authors about progress, submissions of revisions and final approval of proofs. We confirm that we have provided a current, correct email address which is accessible by the Corresponding Author.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Jianyun, L., Junming, S. & Chunling, W. Multi-level clustering based on cluster order constructed with dynamic local density. Appl Intell 53, 9744–9761 (2023). https://doi.org/10.1007/s10489-022-03830-8
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-022-03830-8