Abstract
We consider three related problems of partitioning an N-element set of points in d-dimensional Euclidean space into two clusters balancing the value of the intracluster quadratic variance normalized by the cluster size in the first problem, the intracluster quadratic variance in the second problem, and the size-weighted intracluster variance in the third problem. The NP-completeness of all these problems is proved.
Similar content being viewed by others
REFERENCES
C. C. Aggarwal, Data Mining (Springer, Berlin, 2015).
T. Hastie, R. Tibshirani, and J. Friedman, The Elements of Statistical Learning: Data Mining, Inference, and Prediction (Springer, New York, 2001).
J. Han, M. Kamber, and J. Pei, Data Mining: Concepts and Techniques, 3rd ed. Morgan Kaufmann, Waltham, MA, 2012).
A. S. Shirkhorshidi, S. Aghabozorgi, T. Y. Wah, and T. Herawan, “Big data clustering: A review,” Lect. Notes Comput. Sci. 8583, 707–720 (2014).
D. Aloise, A. Deshpande, P. Hansen, and P. Popat, “NP-hardness of Euclidean sum-of-squares clustering,” Mach. Learn. 75 (2), 245–248 (2009).
M. Mahajan, P. Nimbhorkar, and K. Varadarajan, “The planar k-means problem is np-hard,” Theor. Comput. Sci. 442, 13–21 (2012).
S. Arora, P. Raghavan, and S. Rao, “Approximation schemes for Euclidean k-medians and related problems,” Proceedings of the 30th Annual ACM Symposium on Theory of Computing (1998), pp. 106–113.
C. H. Papadimitriou, “Worst-case and probabilistic analysis of a geometric location problem,” SIAM J. Comput. 10 (3), 542–557 (1981).
S. Masuyama, T. Ibaraki, and T. Hasegawa, “The computational complexity of the m-center problems in the plane,” IEEE Trans. IECE Jpn. 64 (2), 57–64 (1981).
D. S. Hochbaum and D. B. Shmoys, “A best possible heuristic for the k-center problem,” Math. Oper. Res. 10 (2), 180–184 (1985).
H. Aggarwal, N. Imai, N. Katoh, and S. Suri, “Finding k points with minimum diameter and related problems,” J. Algorithms 12 (1), 38–56 (1991).
P. Brucker, “On the complexity of clustering problems,” Lect. Notes Econ. Math. Syst. 157, 45–54 (1978).
P. Indyk, “A sublinear time approximation scheme for clustering in metric space,” Proceedings of the 40th Annual IEEE Symposium on Foundations of Computer Science (FOCS) (IEEE, New York, 1999), pp. 154–159.
P. Hansen, B. Jaumard, and N. Mladenovich, “Minimum sum of squares clustering in a low dimensional space,” J. Classif. 15, 37–55 (1998).
P. Hansen and B. Jaumard, “Cluster analysis and mathematical programming,” Math. Program. 79, 191–215 (1997).
G. W. Snedecor and W. G. Cochran, Statistical Methods, 8th ed. (Iowa State University Press, Iowa, 1989).
M. R. Garey and D. S. Johnson, Computers and Intractability: A Guide to the Theory of NP-Completeness (Freeman, San Francisco, 1979).
Funding
This work was supported by the Russian Foundation for Basic Research (project nos. 19-01-00308 and 18-31-00398), by Basic Research Programs of the Russian Academy of Sciences (project nos. 0314-2019-0015 and 0314-2019-0014), and by the Top-5-100 Program of the Ministry of Education and Science of the Russian Federation.
Author information
Authors and Affiliations
Corresponding authors
Additional information
Translated by I. Ruzanova
Rights and permissions
About this article
Cite this article
Kel’manov, A.V., Pyatkin, A.V. & Khandeev, V.I. Complexity of Some Problems of Quadratic Partitioning of a Finite Set of Points in Euclidean Space into Balanced Clusters. Comput. Math. and Math. Phys. 60, 163–170 (2020). https://doi.org/10.1134/S096554251911006X
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1134/S096554251911006X