Skip to main content
Log in

On the algorithmic complexity of a problem in cluster analysis

  • Published:
Journal of Applied and Industrial Mathematics Aims and scope Submit manuscript

Abstract

We prove that the MSSC problem (the problem of clustering the set of the vectors in the Euclidean space which minimizes the sum of squares) is NP-complete in the case when the dimension of the space is an input parameter of the problem, while the number of clusters is not an input parameter.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. A. V. Kel’manov and A. V. Pyatkin, “On Complexity of Some Problems of Finding a Subset of a Vector Set and Cluster Analysis,” Zh. Vychisl. Mat. Mat. Fiz. 49(11), 2059–2067 (2009).

    MathSciNet  MATH  Google Scholar 

  2. D. Aloise and P. Hansen, “On the Complexity of Minimum Sum-of-Squares Clustering,” Les Cahiers du GERAD, G-2007-50 (2007).

  3. D. Aloise, A. Deshpande, P. Hansen, P. Popat, “NP-Hardness of Euclidean Sum-of-Squares Clustering,” Les Cahiers du GERAD, G-2008-33 (2008).

  4. P. Drineas, A. Frieze, R. Kannan, S. Vempala, and V. Vinay, “Clustering Large Graphs via the Singular Value Decomposition,” Machine Learning 56, 9–33 (2004).

    Article  MATH  Google Scholar 

  5. M. R. Garey and D. S. Johnson, Computers and Intractability: A Guide to the Theory of NP-Completeness (Freeman, San Francisco, CA, 1979).

    MATH  Google Scholar 

  6. M. Inaba, N. Katch, and H. Imai, “Applications of Weighted Voronoi Diagrams and Randomization to Variance-Dased Clustering,” in Proceedings of Annual Symposium on Computational Geometry (Stony Brook, New York, 1994), pp. 332–339.

    Chapter  Google Scholar 

  7. J. B. MacQueen, “Some Methods for Classification and Analysis of Multivariate Observations,” in Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability, Vol. 1 (Univ. of California Press, Berkeley, 1967), pp. 281–297.

    Google Scholar 

  8. M. Mahajan, P. Nimbhorkar, and K. Varadarajan, “The Planar k-Means Problem is NP-Hard,” in Lecture Notes in Computer Science, Vol. 5431 (Springer, Berlin, 2009), pp. 284–285.

    Google Scholar 

  9. M. Rao, “Cluster Analysis and Mathematical Programming,” J. Amer. Stat. Assoc. 66, 622–626 (1971).

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to A. V. Dolgushev.

Additional information

Original Russian Text © A.V. Dolgushev, A.V. Kel’manov, 2010, published in Diskretnyi Analiz i Issledovanie Operatsii, 2010, Vol. 17, No. 2, pp. 39–45.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Dolgushev, A.V., Kel’manov, A.V. On the algorithmic complexity of a problem in cluster analysis. J. Appl. Ind. Math. 5, 191–194 (2011). https://doi.org/10.1134/S1990478911020050

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1134/S1990478911020050

Keywords

Navigation