Abstract
DBSCAN algorithm discovers clusters of various shapes and sizes. But it fails to discover clusters of different density. This is due to its dependency on global value for Eps. This paper introduces an idea to deal with this problem. The offered method estimates local density for a point as the sum of distances to its k-nearest items, arranges items in ascending order according to their local density. The clustering process is started from the highest density point by adding un-clustered points that have similar density as first point in cluster. Also, the point is assigned to current cluster if the sum of distances to its Minpts-nearest neighbors is less than or equal to the density of first point (core point condition in DBSCAN). Experimental results display the efficiency of the proposed method in discovering varied density clusters from data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Fahim, A.M., Salem, A.M., Torkey, F.A., Ramadan, M.: An efficient enhanced k-means clustering algorithm. J. Zhejiang Univ. Sci. A 7(10), 1626–1633 (2006)
Kaufman, L., Rousseeuw, P.J.: Finding groups in data: an introduction to cluster analysis. In: Partitioning Around Medoids (Program PAM). Wiley (1990)
Ng, R.T., Han, J.: Efficient and effective clustering methods for spatial data mining. In: Proceedings of the 20th International Conference on Very Large Databases, Santiago, Chile, pp. 145–155 (1994)
Sibson, R.: SLINK: an optimally efficient algorithm for the single-link cluster method. Comput. J. 16(1), 30–34 (1973)
Seifoddini, H.K.: Single linkage versus average linkage clustering in machine cells formation applications. Comput. Ind. Eng. 16(3), 419–426 (1989)
Defays, D.: An efficient algorithm for a complete link method. Comput. J. 20(4), 364–366 (1977)
Karypis, G., Han, E.H., Kumar, V.: CHAMELEON: a hierarchical clustering algorithm using dynamic modeling. Computer 32(8), 68–75 (1999)
Zhang, T., Ramakrishnan, R., Livny, M.: BIRCH: an efficient data clustering method for very large databases. In: Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data, SIGMOD 1996, pp. 103–114. ACM, New York (1996)
Guha, S., Rastogi, R., Shim, K.: Cure: an efficient clustering algorithm for large databases. In: Haas, L.M., Tiwary, A. (eds.) Proceedings ACM SIGMOD International Conference on Management of Data, Seattle, Washington, USA, 2–4 June 1998, pp. 73–84. ACM Press (1998)
Ester, M., Krigel, H.P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the International Conference on Knowledge Discovery and Data Mining, pp. 226–231 (1996)
Ankerst, M., Breunig, M.M., Kriegel, H.P.: OPTICS: ordering points to identify the clustering structure. In: Proceedings of ACM SIGMOD, pp. 49–60 (1999)
Hinneburg, A., Keim, D.A.: An efficient approach to clustering in large multimedia databases with noise. In: Proceedings of the 4th International Conference on Knowledge Discovery and Data Mining, New York, September 1998, pp. 58–65 (1998)
Idrissi, A., Rehioui, H., Laghrissi, A., Retal, S.: An improvement of DENCLUE algorithm for the data clustering. In: 2015 5th International Conference on Information and Communication Technology and Accessibility (ICTA), Marrakech, pp. 1–6 (2015)
Fahim, A.: Homogeneous densities clustering algorithm. Int. J. Inf. Technol. Comput. Sci. (IJITCS) 10(10), 1–10 (2018)
Chen, X., Min, Y., Zhao, Y., Wang, P.: GMDBSCAN: multi-density DBSCAN cluster based on grid. In: Proceedings of the IEEE International Conference on e-Business Engineering, ICEBE 2008, China, October 2008, pp. 780–783 (2008)
Alhanjouri, M.A., Ahmed, R.D.: New density-based clustering technique: GMDBSCAN-UR. Int. J. Adv. Res. Comput. Sci. 3(1), 1–9 (2012)
Liu, P., Zhou, D., Wu, N.: VDBSCAN: varied density based spatial clustering of applications with noise. In: Proceedings of the ICSSSM 2007: 2007 International Conference on Service Systems and Service Management, China, June 2007
Xiong, Z., Chen, R., Zhang, Y., Zhang, X.: Multi-density DBSCAN algorithm based on density levels partitioning. J. Inf. Comput. Sci. 9(10), 2739–2749 (2012)
Louhichi, S., Gzara, M., Abdallah, H.: A density based algorithm for discovering clusters with varied density. In: 2014 World Congress on Computer Applications and Information Systems (WCCAIS), Hammamet, pp. 1–6 (2014)
Hou, J., Gao, H., Li, X.: DSets-DBSCAN: a parameter-free clustering algorithm. IEEE Trans. Image Process. 25(7), 3182–3193 (2016)
Debnath, M., Tripathi, P.K., Elmasri, R.: K-DBSCAN: identifying spatial clusters with differing density levels. In: Proceedings of the 2015 International Workshop on Data Mining with Industrial Applications, DMIA 2015, Paraguay, September 2015, pp. 51–60 (2015)
Ashour, W., Sunoallah, S.: Multi density DBSCAN. In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics): Preface, vol. 6936, pp. 446–453 (2011)
Jungan, C., Jinyin, C., Dongyong, Y., Jun, L.: A k-deviation density based clustering algorithm. Math. Probl. Eng. 2018, 1–16 (2018)
Fahim, A.: A clustering algorithm based on local density of points. Int. J. Mod. Educ. Comput. Sci. (IJMECS) 9(12), 9–16 (2017)
Acknowledgements
This project was supported by the Deanship of Scientific Research at Prince Sattam Bin Abdulaziz University under the research project no. 2017/01/7120.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Fahim, A. (2020). A Clustering Algorithm for Multi-density Datasets. In: Jain, L., Peng, SL., Alhadidi, B., Pal, S. (eds) Intelligent Computing Paradigm and Cutting-edge Technologies. ICICCT 2019. Learning and Analytics in Intelligent Systems, vol 9. Springer, Cham. https://doi.org/10.1007/978-3-030-38501-9_2
Download citation
DOI: https://doi.org/10.1007/978-3-030-38501-9_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-38500-2
Online ISBN: 978-3-030-38501-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)