Abstract
We prove that the MSSC problem (the problem of clustering the set of the vectors in the Euclidean space which minimizes the sum of squares) is NP-complete in the case when the dimension of the space is an input parameter of the problem, while the number of clusters is not an input parameter.
Similar content being viewed by others
References
A. V. Kel’manov and A. V. Pyatkin, “On Complexity of Some Problems of Finding a Subset of a Vector Set and Cluster Analysis,” Zh. Vychisl. Mat. Mat. Fiz. 49(11), 2059–2067 (2009).
D. Aloise and P. Hansen, “On the Complexity of Minimum Sum-of-Squares Clustering,” Les Cahiers du GERAD, G-2007-50 (2007).
D. Aloise, A. Deshpande, P. Hansen, P. Popat, “NP-Hardness of Euclidean Sum-of-Squares Clustering,” Les Cahiers du GERAD, G-2008-33 (2008).
P. Drineas, A. Frieze, R. Kannan, S. Vempala, and V. Vinay, “Clustering Large Graphs via the Singular Value Decomposition,” Machine Learning 56, 9–33 (2004).
M. R. Garey and D. S. Johnson, Computers and Intractability: A Guide to the Theory of NP-Completeness (Freeman, San Francisco, CA, 1979).
M. Inaba, N. Katch, and H. Imai, “Applications of Weighted Voronoi Diagrams and Randomization to Variance-Dased Clustering,” in Proceedings of Annual Symposium on Computational Geometry (Stony Brook, New York, 1994), pp. 332–339.
J. B. MacQueen, “Some Methods for Classification and Analysis of Multivariate Observations,” in Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability, Vol. 1 (Univ. of California Press, Berkeley, 1967), pp. 281–297.
M. Mahajan, P. Nimbhorkar, and K. Varadarajan, “The Planar k-Means Problem is NP-Hard,” in Lecture Notes in Computer Science, Vol. 5431 (Springer, Berlin, 2009), pp. 284–285.
M. Rao, “Cluster Analysis and Mathematical Programming,” J. Amer. Stat. Assoc. 66, 622–626 (1971).
Author information
Authors and Affiliations
Corresponding author
Additional information
Original Russian Text © A.V. Dolgushev, A.V. Kel’manov, 2010, published in Diskretnyi Analiz i Issledovanie Operatsii, 2010, Vol. 17, No. 2, pp. 39–45.
Rights and permissions
About this article
Cite this article
Dolgushev, A.V., Kel’manov, A.V. On the algorithmic complexity of a problem in cluster analysis. J. Appl. Ind. Math. 5, 191–194 (2011). https://doi.org/10.1134/S1990478911020050
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1134/S1990478911020050