Abstract
We consider the strongly NP-hard problem of partitioning a finite set of points of Euclidean space into two clusters of given cardinalities under the minimum criterion for the sum over the clusters of the intracluster sums of squared distances from elements of the cluster to its center. It is assumed that the center of one of the clusters is given (without loss of generality, at the origin). The center of the second cluster is unknown and is defined as the mean value over all elements in this cluster. A polynomial-time approximation scheme (PTAS) is provided.
Similar content being viewed by others
References
D. Aloise, A. Deshpande, P. Hansen, and P. Popat, “NP-hardness of Euclidean sum-of-squares clustering,” Machine Learning 75 (2), 245–248 (2009).
A. K. Jain, “Data clustering: 50 years beyond k-means,” Pattern Recogn. Lett. 31, 651–666 (2010).
M. R. Garey and D. S. Johnson, Computers and Intractability: A Guide to the Theory of NP-Completeness (Freeman, San Fransisco, 1979).
E. Kh. Gimadi, A. V. Kel’manov, M. A. Kel’manova, and S. A. Khamidullin, “A posteriori detecting a quasiperiodic fragment in a numerical sequence,” Pattern Recogn. Image Anal. 18 (1), 30–42 (2008).
A. Kel’manov and V. Khandeev, “”An exact pseudopolynomial algorithm for a problem of the two-cluster partitioning of a set of vectors,” J. Appl. Ind. Math. 9 (4), 497–502 (2015).
N. Wirth, Algorithms + Data Structures = Programs (Prentice Hall, Englewood Cliffs, NJ, 1976).
A. E. Baburin, E. Kh. Gimadi, N. I. Glebov, and A. V. Pyatkin, “The problem of finding a subset of vectors with the maximum total weight,” J. Appl. Ind. Math. 2 (1), 32–38 (2008).
E. Kh. Gimadi, Yu. V. Glazkov, and I. A. Rykov, “On two problems of choosing some subset of vectors with integer coordinates that has maximum norm of the sum of elements in Euclidean space,” J. Appl. Ind. Math. 3 (3), 343–352 (2009).
E. Kh. Gimadi, A. V. Kel’manov, M. A. Kel’manova, and S. A. Khamidullin, “A posteriori detection of a quasiperiodic fragment with a given number of repetitions in a numerical sequence,” Sib. Zh. Ind. Mat. 9 (1), 55–74 (2006).
E. Kh. Gimadi, A. V. Pyatkin, and I. A. Rykov, “On polynomial solvability of some problems of a vector subset choice in a Euclidean space of fixed dimension,” J. Appl. Ind. Math. 4 (1), 48–53 (2010).
A. V. Dolgushev and A. V. Kel’manov, “An approximation algorithm for solving a problem of cluster analysis,” J. Appl. Ind. Math. 5 (4), 551–558 (2011).
A. V. Kel’manov, “On the complexity of some data analysis problems,” Comp. Math. Math. Phys. 50 (11), 1941–1947 (2010).
A. V. Kel’manov, “On the complexity of some cluster analysis problems,” Comp. Math. Math. Phys. 51 (11), 1983–1988 (2011).
A. V. Kel’manov and A. V. Pyatkin, “On the complexity of a search for a subset of “similar” vectors,” Dokl. Math. 78 (1), 574–575 (2008).
A. V. Kel’manov and A. V. Pyatkin, “On a version of the problem of choosing a vector subset,” J. Appl. Ind. Math. 3 (4), 447–455 (2009).
A. V. Kel’manov and A. V. Pyatkin, “Complexity of certain problems of searching for subsets of vectors and cluster analysis,” Comp. Math. Math. Phys. 49 (11), 1966–1971 (2009).
A. V. Kel’manov and V. I. Khandeev, “Fully polynomial-time approximation scheme for a special case of a quadratic Euclidean 2-clustering problem,” Comp. Math. Math. Phys. 56 (2), 334–341 (2016).
A. V. Kel’manov and V. I. Khandeev, “A randomized algorithm for two-cluster partition of a set of vectors,” Comp. Math. Math. Phys. 55 (2), 330–339 (2015).
V. V. Shenmaier, “An approximation scheme for a problem of search for a vector subset,” J. Appl. Ind. Math. 6 (3), 381–386 (2012).
Author information
Authors and Affiliations
Corresponding author
Additional information
Original Russian Text © A.V. Dolgushev, A.V. Kel’manov, V.V. Shenmaier, 2015, published in Trudy Instituta Matematiki i Mekhaniki UrO RAN, 2015, Vol. 21, No. 3.
Rights and permissions
About this article
Cite this article
Dolgushev, A.V., Kel’manov, A.V. & Shenmaier, V.V. Polynomial-time approximation scheme for a problem of partitioning a finite set into two clusters. Proc. Steklov Inst. Math. 295 (Suppl 1), 47–56 (2016). https://doi.org/10.1134/S0081543816090066
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1134/S0081543816090066