Abstract
We consider the problem of partitioning a finite sequence of Euclidean points into a given number of clusters (subsequences) using the criterion of the minimal sum (over all clusters) of intercluster sums of squared distances from the elements of the clusters to their centers. It is assumed that the center of one of the desired clusters is at the origin, while the center of each of the other clusters is unknown and determined as the mean value over all elements in this cluster. Additionally, the partition obeys two structural constraints on the indices of sequence elements contained in the clusters with unknown centers: (1) the concatenation of the indices of elements in these clusters is an increasing sequence, and (2) the difference between an index and the preceding one is bounded above and below by prescribed constants. It is shown that this problem is strongly NP-hard. A 2-approximation algorithm is constructed that is polynomial-time for a fixed number of clusters.
Similar content being viewed by others
References
Tak-chung Fu, “A review on time series data mining,” Eng. Appl. Artificial Intelligence 24 (1), 164–181 (2011).
C. Kuenzer, S. Dech, and W. Wagner, “Remote sensing time series,” Remote Sensing and Digital Image Processing (Springer, Switzerland, 2015), Vol.22.
T. W. Liao, “Clustering of time series data: A survey,” Pattern Recogn. 38 (11), 1857–1874 (2005).
C. C. Aggarwal, Data Mining (Springer, Berlin, 2015).
A. V. Kel’manov and A. V. Pyatkin, “On complexity of some problems of cluster analysis of vector sequences,” J. Appl. Ind. Math. 7 (3), 363–369 (2013).
A. V. Kel’manov and S. A. Khamidullin, “An approximation polynomial-time algorithm for a sequence bi-clustering problem,” Comput. Math. Math. Phys. 55 (6), 1068–1076 (2015).
A. V. Kel’manov and L. V. Mikhailova, “A posteriori joint detection of reference fragments in a quasi-periodic sequence,” Comput. Math. Math. Phys. 48 (5), 850–865 (2008).
A. V. Kel’manov and S. M. Romanchenko, “An FPTAS for a vector subset search problem,” J. Appl. Ind. Math. 8 (3), 329–336 (2014).
Author information
Authors and Affiliations
Corresponding author
Additional information
Original Russian Text © A.V. Kel’manov, L.V. Mikhailova, S.A. Khamidullin, V.I. Khandeev, 2017, published in Zhurnal Vychislitel’noi Matematiki i Matematicheskoi Fiziki, 2017, Vol. 57, No. 8, pp. 1392–1400.
Rights and permissions
About this article
Cite this article
Kel’manov, A.V., Mikhailova, L.V., Khamidullin, S.A. et al. Approximation algorithm for the problem of partitioning a sequence into clusters. Comput. Math. and Math. Phys. 57, 1376–1383 (2017). https://doi.org/10.1134/S0965542517080085
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1134/S0965542517080085