Abstract
A method for clustering large amounts of data is presented. It is a sequential composition of two algorithms: the first partitions the input space into Voronoi regions, and the second groups those regions into clusters. First, a model of clusters as high-density regions of the input space is introduced. It is then shown how a Voronoi partition and its topological map (a) can be built and (b) can be used as a low-complexity approximation of the input space. In step (b), the “watershed” algorithm is applied; it has previously been used for image segmentation, and the authors propose its first application to partitioning a data space.
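The two-stage idea can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on several assumptions: the codebook is fixed by hand rather than learned by a vector quantizer, the density of a Voronoi region is approximated by its hit count, the topological map links the two codebook vectors nearest to each data point, and the watershed step is simplified to steepest ascent over that map. The names `quantize`, `watershed`, `codebook` and `points` are hypothetical.

```python
from collections import Counter
from math import dist


def quantize(points, codebook):
    """Stage 1: count hits per Voronoi region and build the topological map.

    Each point falls into the region of its nearest codebook vector; an edge
    is added between its nearest and second-nearest vectors (assumed rule).
    """
    hits = Counter()
    edges = {i: set() for i in range(len(codebook))}
    for p in points:
        order = sorted(range(len(codebook)), key=lambda i: dist(p, codebook[i]))
        first, second = order[0], order[1]
        hits[first] += 1
        edges[first].add(second)
        edges[second].add(first)
    return hits, edges


def watershed(hits, edges):
    """Stage 2: label each region by the density peak it climbs to.

    Hit counts stand in for density (an assumption). Ties are broken by the
    lower region index, so every step strictly increases the sort key and
    the ascent always terminates.
    """
    def uphill(i):
        best = max(edges[i] | {i}, key=lambda j: (hits[j], -j))
        return i if best == i else uphill(best)

    return {i: uphill(i) for i in edges}


# Hypothetical 1-D data: two dense groups joined by one sparse point at 3.9.
codebook = [(0.0,), (1.0,), (2.0,), (6.0,), (7.0,), (8.0,)]
points = [(0.1,), (0.9,), (1.05,), (1.1,), (1.9,), (3.9,),
          (6.1,), (6.9,), (6.95,), (7.1,), (7.2,), (7.9,)]

hits, edges = quantize(points, codebook)
labels = watershed(hits, edges)
clusters = sorted(set(labels.values()))
print(labels)    # region index -> index of the peak region it belongs to
print(clusters)  # peak regions, one per cluster
```

In this toy data, regions 0 to 2 climb to region 1 and regions 3 to 5 climb to region 4, so the sparse link between the two groups acts as the watershed line. The second stage touches only the codebook and its edges, never the raw points, which is the sense in which the quantized map serves as a low-complexity approximation of the input space.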
Additional information
The article is published in the original.
About this article
Cite this article
Mitsyn, S.V., Ososkov, G.A. Watershed on vector quantization for clustering of big data. Phys. Part. Nuclei Lett. 12, 170–172 (2015). https://doi.org/10.1134/S1547477115010173
DOI: https://doi.org/10.1134/S1547477115010173