Neighbourhood Contrast: A Better Means to Detect Clusters Than Density
Most density-based clustering algorithms suffer from large density variations among clusters. This paper proposes a new measure called Neighbourhood Contrast (NC) as a better alternative to density in detecting clusters. The proposed NC admits all local density maxima, regardless of their densities, to have similar NC values. Due to this unique property, NC is a better means to detect clusters in a dataset with large density variations among clusters. We provide two applications of NC. First, replacing density with NC in the current state-of-the-art clustering procedure DP leads to significantly improved clustering performance. Second, we devise a new clustering algorithm called Neighbourhood Contrast Clustering (NCC) which does not require density or distance calculations, and therefore has a linear time complexity in terms of dataset size. Our empirical evaluation shows that both NC-based methods outperform density-based methods including the current state-of-the-art.
KeywordsNeighbourhood Contrast Clustering
Bo Chen is supported by scholarships provided by Data61, CSIRO and Faculty of IT, Monash University.
- 4.Ester, M., Kriegel, H.P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining, pp. 226–231 (1996)Google Scholar
- 8.Lichman, M.: UCI machine learning repository (2013). http://archive.ics.uci.edu/ml
- 10.Ram, A., Sharma, A., Jalal, A.S., Agrawal, A., Singh, R.: An enhanced density based spatial clustering of applications with noise. In: Proceedings of the IEEE International Advance Computing Conference, pp. 1475–1478. IEEE (2009)Google Scholar