Skip to main content

A Clustering Algorithm for Multi-density Datasets

  • Conference paper
  • First Online:
Intelligent Computing Paradigm and Cutting-edge Technologies (ICICCT 2019)

Part of the book series: Learning and Analytics in Intelligent Systems ((LAIS,volume 9))

  • 644 Accesses

Abstract

DBSCAN algorithm discovers clusters of various shapes and sizes. But it fails to discover clusters of different density. This is due to its dependency on global value for Eps. This paper introduces an idea to deal with this problem. The offered method estimates local density for a point as the sum of distances to its k-nearest items, arranges items in ascending order according to their local density. The clustering process is started from the highest density point by adding un-clustered points that have similar density as first point in cluster. Also, the point is assigned to current cluster if the sum of distances to its Minpts-nearest neighbors is less than or equal to the density of first point (core point condition in DBSCAN). Experimental results display the efficiency of the proposed method in discovering varied density clusters from data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Fahim, A.M., Salem, A.M., Torkey, F.A., Ramadan, M.: An efficient enhanced k-means clustering algorithm. J. Zhejiang Univ. Sci. A 7(10), 1626–1633 (2006)

    Article  MATH  Google Scholar 

  2. Kaufman, L., Rousseeuw, P.J.: Finding groups in data: an introduction to cluster analysis. In: Partitioning Around Medoids (Program PAM). Wiley (1990)

    Google Scholar 

  3. Ng, R.T., Han, J.: Efficient and effective clustering methods for spatial data mining. In: Proceedings of the 20th International Conference on Very Large Databases, Santiago, Chile, pp. 145–155 (1994)

    Google Scholar 

  4. Sibson, R.: SLINK: an optimally efficient algorithm for the single-link cluster method. Comput. J. 16(1), 30–34 (1973)

    Article  MathSciNet  Google Scholar 

  5. Seifoddini, H.K.: Single linkage versus average linkage clustering in machine cells formation applications. Comput. Ind. Eng. 16(3), 419–426 (1989)

    Article  Google Scholar 

  6. Defays, D.: An efficient algorithm for a complete link method. Comput. J. 20(4), 364–366 (1977)

    Article  MathSciNet  MATH  Google Scholar 

  7. Karypis, G., Han, E.H., Kumar, V.: CHAMELEON: a hierarchical clustering algorithm using dynamic modeling. Computer 32(8), 68–75 (1999)

    Article  Google Scholar 

  8. Zhang, T., Ramakrishnan, R., Livny, M.: BIRCH: an efficient data clustering method for very large databases. In: Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data, SIGMOD 1996, pp. 103–114. ACM, New York (1996)

    Google Scholar 

  9. Guha, S., Rastogi, R., Shim, K.: Cure: an efficient clustering algorithm for large databases. In: Haas, L.M., Tiwary, A. (eds.) Proceedings ACM SIGMOD International Conference on Management of Data, Seattle, Washington, USA, 2–4 June 1998, pp. 73–84. ACM Press (1998)

    Article  Google Scholar 

  10. Ester, M., Krigel, H.P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the International Conference on Knowledge Discovery and Data Mining, pp. 226–231 (1996)

    Google Scholar 

  11. Ankerst, M., Breunig, M.M., Kriegel, H.P.: OPTICS: ordering points to identify the clustering structure. In: Proceedings of ACM SIGMOD, pp. 49–60 (1999)

    Article  Google Scholar 

  12. Hinneburg, A., Keim, D.A.: An efficient approach to clustering in large multimedia databases with noise. In: Proceedings of the 4th International Conference on Knowledge Discovery and Data Mining, New York, September 1998, pp. 58–65 (1998)

    Google Scholar 

  13. Idrissi, A., Rehioui, H., Laghrissi, A., Retal, S.: An improvement of DENCLUE algorithm for the data clustering. In: 2015 5th International Conference on Information and Communication Technology and Accessibility (ICTA), Marrakech, pp. 1–6 (2015)

    Google Scholar 

  14. Fahim, A.: Homogeneous densities clustering algorithm. Int. J. Inf. Technol. Comput. Sci. (IJITCS) 10(10), 1–10 (2018)

    Google Scholar 

  15. Chen, X., Min, Y., Zhao, Y., Wang, P.: GMDBSCAN: multi-density DBSCAN cluster based on grid. In: Proceedings of the IEEE International Conference on e-Business Engineering, ICEBE 2008, China, October 2008, pp. 780–783 (2008)

    Google Scholar 

  16. Alhanjouri, M.A., Ahmed, R.D.: New density-based clustering technique: GMDBSCAN-UR. Int. J. Adv. Res. Comput. Sci. 3(1), 1–9 (2012)

    Google Scholar 

  17. Liu, P., Zhou, D., Wu, N.: VDBSCAN: varied density based spatial clustering of applications with noise. In: Proceedings of the ICSSSM 2007: 2007 International Conference on Service Systems and Service Management, China, June 2007

    Google Scholar 

  18. Xiong, Z., Chen, R., Zhang, Y., Zhang, X.: Multi-density DBSCAN algorithm based on density levels partitioning. J. Inf. Comput. Sci. 9(10), 2739–2749 (2012)

    Google Scholar 

  19. Louhichi, S., Gzara, M., Abdallah, H.: A density based algorithm for discovering clusters with varied density. In: 2014 World Congress on Computer Applications and Information Systems (WCCAIS), Hammamet, pp. 1–6 (2014)

    Google Scholar 

  20. Hou, J., Gao, H., Li, X.: DSets-DBSCAN: a parameter-free clustering algorithm. IEEE Trans. Image Process. 25(7), 3182–3193 (2016)

    Article  MathSciNet  MATH  Google Scholar 

  21. Debnath, M., Tripathi, P.K., Elmasri, R.: K-DBSCAN: identifying spatial clusters with differing density levels. In: Proceedings of the 2015 International Workshop on Data Mining with Industrial Applications, DMIA 2015, Paraguay, September 2015, pp. 51–60 (2015)

    Google Scholar 

  22. Ashour, W., Sunoallah, S.: Multi density DBSCAN. In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics): Preface, vol. 6936, pp. 446–453 (2011)

    Google Scholar 

  23. Jungan, C., Jinyin, C., Dongyong, Y., Jun, L.: A k-deviation density based clustering algorithm. Math. Probl. Eng. 2018, 1–16 (2018)

    Article  Google Scholar 

  24. Fahim, A.: A clustering algorithm based on local density of points. Int. J. Mod. Educ. Comput. Sci. (IJMECS) 9(12), 9–16 (2017)

    Article  Google Scholar 

Download references

Acknowledgements

This project was supported by the Deanship of Scientific Research at Prince Sattam Bin Abdulaziz University under the research project no. 2017/01/7120.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ahmed Fahim .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Fahim, A. (2020). A Clustering Algorithm for Multi-density Datasets. In: Jain, L., Peng, SL., Alhadidi, B., Pal, S. (eds) Intelligent Computing Paradigm and Cutting-edge Technologies. ICICCT 2019. Learning and Analytics in Intelligent Systems, vol 9. Springer, Cham. https://doi.org/10.1007/978-3-030-38501-9_2

Download citation

Publish with us

Policies and ethics