A Clustering Algorithm for Multi-density Datasets

Fahim, Ahmed

doi:10.1007/978-3-030-38501-9_2

Ahmed Fahim^8,9

Part of the book series: Learning and Analytics in Intelligent Systems ((LAIS,volume 9))

Included in the following conference series:

International Conference on Information, Communication and Computing Technology

644 Accesses

Abstract

DBSCAN algorithm discovers clusters of various shapes and sizes. But it fails to discover clusters of different density. This is due to its dependency on global value for Eps. This paper introduces an idea to deal with this problem. The offered method estimates local density for a point as the sum of distances to its k-nearest items, arranges items in ascending order according to their local density. The clustering process is started from the highest density point by adding un-clustered points that have similar density as first point in cluster. Also, the point is assigned to current cluster if the sum of distances to its Minpts-nearest neighbors is less than or equal to the density of first point (core point condition in DBSCAN). Experimental results display the efficiency of the proposed method in discovering varied density clusters from data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Fahim, A.M., Salem, A.M., Torkey, F.A., Ramadan, M.: An efficient enhanced k-means clustering algorithm. J. Zhejiang Univ. Sci. A 7(10), 1626–1633 (2006)
Article MATH Google Scholar
Kaufman, L., Rousseeuw, P.J.: Finding groups in data: an introduction to cluster analysis. In: Partitioning Around Medoids (Program PAM). Wiley (1990)
Google Scholar
Ng, R.T., Han, J.: Efficient and effective clustering methods for spatial data mining. In: Proceedings of the 20th International Conference on Very Large Databases, Santiago, Chile, pp. 145–155 (1994)
Google Scholar
Sibson, R.: SLINK: an optimally efficient algorithm for the single-link cluster method. Comput. J. 16(1), 30–34 (1973)
Article MathSciNet Google Scholar
Seifoddini, H.K.: Single linkage versus average linkage clustering in machine cells formation applications. Comput. Ind. Eng. 16(3), 419–426 (1989)
Article Google Scholar
Defays, D.: An efficient algorithm for a complete link method. Comput. J. 20(4), 364–366 (1977)
Article MathSciNet MATH Google Scholar
Karypis, G., Han, E.H., Kumar, V.: CHAMELEON: a hierarchical clustering algorithm using dynamic modeling. Computer 32(8), 68–75 (1999)
Article Google Scholar
Zhang, T., Ramakrishnan, R., Livny, M.: BIRCH: an efficient data clustering method for very large databases. In: Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data, SIGMOD 1996, pp. 103–114. ACM, New York (1996)
Google Scholar
Guha, S., Rastogi, R., Shim, K.: Cure: an efficient clustering algorithm for large databases. In: Haas, L.M., Tiwary, A. (eds.) Proceedings ACM SIGMOD International Conference on Management of Data, Seattle, Washington, USA, 2–4 June 1998, pp. 73–84. ACM Press (1998)
Article Google Scholar
Ester, M., Krigel, H.P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the International Conference on Knowledge Discovery and Data Mining, pp. 226–231 (1996)
Google Scholar
Ankerst, M., Breunig, M.M., Kriegel, H.P.: OPTICS: ordering points to identify the clustering structure. In: Proceedings of ACM SIGMOD, pp. 49–60 (1999)
Article Google Scholar
Hinneburg, A., Keim, D.A.: An efficient approach to clustering in large multimedia databases with noise. In: Proceedings of the 4th International Conference on Knowledge Discovery and Data Mining, New York, September 1998, pp. 58–65 (1998)
Google Scholar
Idrissi, A., Rehioui, H., Laghrissi, A., Retal, S.: An improvement of DENCLUE algorithm for the data clustering. In: 2015 5th International Conference on Information and Communication Technology and Accessibility (ICTA), Marrakech, pp. 1–6 (2015)
Google Scholar
Fahim, A.: Homogeneous densities clustering algorithm. Int. J. Inf. Technol. Comput. Sci. (IJITCS) 10(10), 1–10 (2018)
Google Scholar
Chen, X., Min, Y., Zhao, Y., Wang, P.: GMDBSCAN: multi-density DBSCAN cluster based on grid. In: Proceedings of the IEEE International Conference on e-Business Engineering, ICEBE 2008, China, October 2008, pp. 780–783 (2008)
Google Scholar
Alhanjouri, M.A., Ahmed, R.D.: New density-based clustering technique: GMDBSCAN-UR. Int. J. Adv. Res. Comput. Sci. 3(1), 1–9 (2012)
Google Scholar
Liu, P., Zhou, D., Wu, N.: VDBSCAN: varied density based spatial clustering of applications with noise. In: Proceedings of the ICSSSM 2007: 2007 International Conference on Service Systems and Service Management, China, June 2007
Google Scholar
Xiong, Z., Chen, R., Zhang, Y., Zhang, X.: Multi-density DBSCAN algorithm based on density levels partitioning. J. Inf. Comput. Sci. 9(10), 2739–2749 (2012)
Google Scholar
Louhichi, S., Gzara, M., Abdallah, H.: A density based algorithm for discovering clusters with varied density. In: 2014 World Congress on Computer Applications and Information Systems (WCCAIS), Hammamet, pp. 1–6 (2014)
Google Scholar
Hou, J., Gao, H., Li, X.: DSets-DBSCAN: a parameter-free clustering algorithm. IEEE Trans. Image Process. 25(7), 3182–3193 (2016)
Article MathSciNet MATH Google Scholar
Debnath, M., Tripathi, P.K., Elmasri, R.: K-DBSCAN: identifying spatial clusters with differing density levels. In: Proceedings of the 2015 International Workshop on Data Mining with Industrial Applications, DMIA 2015, Paraguay, September 2015, pp. 51–60 (2015)
Google Scholar
Ashour, W., Sunoallah, S.: Multi density DBSCAN. In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics): Preface, vol. 6936, pp. 446–453 (2011)
Google Scholar
Jungan, C., Jinyin, C., Dongyong, Y., Jun, L.: A k-deviation density based clustering algorithm. Math. Probl. Eng. 2018, 1–16 (2018)
Article Google Scholar
Fahim, A.: A clustering algorithm based on local density of points. Int. J. Mod. Educ. Comput. Sci. (IJMECS) 9(12), 9–16 (2017)
Article Google Scholar

Download references

Acknowledgements

This project was supported by the Deanship of Scientific Research at Prince Sattam Bin Abdulaziz University under the research project no. 2017/01/7120.

Author information

Authors and Affiliations

Faculty of Sciences and Humanitarian Study, Prince Sattam Bin Abdulaziz University, Al-Aflaj, Saudi Arabia
Ahmed Fahim
Faculty of Computers and Information, Suez University, Suez, Egypt
Ahmed Fahim

Authors

Ahmed Fahim
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ahmed Fahim .

Editor information

Editors and Affiliations

University of Technology Sydney, Sydney, Australia
Lakhmi C. Jain
CSIE Department, National Dong Hwa University, New Taipei City, Taiwan
Sheng-Lung Peng
Al-Balqa’ Applied University, Salt, Jordan
Basim Alhadidi
Department of Computer Science, Brainware University, Kolkata, West Bengal, India
Souvik Pal

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Fahim, A. (2020). A Clustering Algorithm for Multi-density Datasets. In: Jain, L., Peng, SL., Alhadidi, B., Pal, S. (eds) Intelligent Computing Paradigm and Cutting-edge Technologies. ICICCT 2019. Learning and Analytics in Intelligent Systems, vol 9. Springer, Cham. https://doi.org/10.1007/978-3-030-38501-9_2

Download citation

DOI: https://doi.org/10.1007/978-3-030-38501-9_2
Published: 18 January 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-38500-2
Online ISBN: 978-3-030-38501-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics