Advertisement

Soft Computing

, Volume 22, Issue 5, pp 1719–1730 | Cite as

Fuzzy extensions of the DBScan clustering algorithm

  • Dino Ienco
  • Gloria Bordogna
Methodologies and Application

Abstract

The DBSCAN algorithm is a well-known density-based clustering approach particularly useful in spatial data mining for its ability to find objects’ groups with heterogeneous shapes and homogeneous local density distributions in the feature space. Furthermore, it can be suitable as scaling down approach to deal with big data for its ability to remove noise. Nevertheless, it suffers for some limitations, mainly the inability to identify clusters with variable density distributions and partially overlapping borders, which is often a characteristics of both scientific data and real-world data. To this end, in this work, we propose three fuzzy extensions of the \(\textit{DBSCAN}\) algorithm to generate clusters with distinct fuzzy density characteristics. The original version of \(\textit{DBSCAN}\) requires two precise parameters (minPts and \(\epsilon \)) to define locally dense areas which serve as seeds of the clusters. Nevertheless, precise values of both parameters may be not appropriate in all regions of the dataset. In the proposed extensions of \(\textit{DBSCAN}\), we define soft constraints to model approximate values of the input parameters. The first extension, named \(\textit{Fuzzy Core DBSCAN}\), relaxes the constraint on the neighbourhood’s density to generate clusters with fuzzy core points, i.e. cores with distinct density; the second extension, named \(\textit{Fuzzy Border DBSCAN}\), relaxes \(\epsilon \) to allow the generation of clusters with overlapping borders. Finally, the third extension, named \(\textit{Fuzzy DBSCAN}\) subsumes the previous ones, thus allowing to generate clusters with both fuzzy cores and fuzzy overlapping borders. Our proposals are compared w.r.t. state of the art fuzzy clustering methods over real-world datasets.

Keywords

Fuzzy clustering Density-based clustering DBSCAN clustering 

Notes

Compliance with ethical standards

Conflict of interest

Dino Ienco and Gloria Bordogna declares that they have no conflict of interest.

Ethical standard

This article does not contain any studies with human participants or animals performed by any of the authors.

References

  1. Bezdek JC, Ehrlich R, Full W (1984) FCM: the fuzzy c-means clustering algorithm. Comput Geosci 10(2):191–203CrossRefGoogle Scholar
  2. Bordogna, G., Ienco, D.: Fuzzy core dbscan clustering algorithm. In: IPMU, pp. 100–109 (2014)Google Scholar
  3. Ester M, Kriegel H, Sander J, Xu X (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. KDD 160:226–231Google Scholar
  4. Guillén A, González J, Rojas I, Pomares H, Herrera LJ, Valenzuela O, Prieto A (2007) Using fuzzy logic to improve a clustering technique for function approximation. Neurocomputing 70(16–18):2853–2860CrossRefGoogle Scholar
  5. Ji Z, Xia Y, Sun Q, Cao G (2014) Interval-valued possibilistic fuzzy c-means clustering algorithm. Fuzzy Sets Syst 253:138–156MathSciNetCrossRefGoogle Scholar
  6. Kriegel H, Pfeifle M (2005) Density-based clustering of uncertain data. In: KDD’05 vol 17, pp 672–677Google Scholar
  7. Nasibov EN, Ulutagay G (2009) Robustness of density-based clustering methods with various neighborhood relations. Fuzzy Sets Syst 160(24):3601–3615MathSciNetCrossRefzbMATHGoogle Scholar
  8. Pal NR, Pal K, Keller JM, Bezdek JC (2005) A possibilistic fuzzy c-means clustering algorithm. IEEE Trans Fuzzy Syst 13(4):517–530CrossRefGoogle Scholar
  9. Parker J, Downs J (2013) Footprint generation using fuzzy-neighborhood clustering. Geoinformatica 17:283–299Google Scholar
  10. Parker J, Hall L, Kandel A (2010) Scalable fuzzy neighborhood dbscan. In: IEEE-fuzzy, pp 1–8Google Scholar
  11. Sander J, Ester M, Kriegel H, Xu X (1998) Density-based clustering in spatial databases: the algorithm GDBSCAN and its applications. Data Min Knowl Discov 2(2):169–194CrossRefGoogle Scholar
  12. Shamshirband S, Amini A, Anuar NB, Kiah LM, Wah TY, Furnell S (2014) D-ficca: a density-based fuzzy imperialist competitive clustering algorithm for intrusion detection in wireless sensor networks. Measurement 55:212–226CrossRefGoogle Scholar
  13. Smiti A, Eloudi Z (2013) Soft dbscan: improving dbscan clustering method using fuzzy set theory. Hum Syst Interact 1:380–385Google Scholar
  14. Suanmali L, Salim N, Binwahlan MS (2009) Fuzzy logic based method for improving text summarization. CoRR arXiv:0906.4690
  15. Ulutagaya G, Nasibov E (2012) Fuzzy and crisp clustering methods based on the neighborhood concept: a comprehensive review. J Intell Fuzzy Syst 23:1–11Google Scholar
  16. Yager R, Filev D (1994) Approximate clustering via the mountain method. IEEE Trans Syst Man Cybern 24(8):1279–1284CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2016

Authors and Affiliations

  1. 1.IRSTEA, UMR TETISMontpellierFrance
  2. 2.LIRMMMontpellierFrance
  3. 3.CNR IREAMilanItaly

Personalised recommendations