Skip to main content
Log in

An approximation algorithm for the spherical k-means problem with outliers by local search

  • Published:
Journal of Combinatorial Optimization Aims and scope Submit manuscript

Abstract

We consider the spherical k-means problem with outliers, an extension of the k-means problem. In this clustering problem, all sample points are on the unit sphere. Given two integers k and z, we can ignore at most z points (outliers) and need to find at most k cluster centers on the unit sphere and assign remaining points to these centers to minimize the k-means objective. It has been proved that any algorithm with a bounded approximation ratio cannot return a feasible solution for this problem. Our contribution is to present a local search bi-criteria approximation algorithm for the spherical k-means problem.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Ahmadian S, Norouzi-Fard A, Svensson O, Ward J (2017) Better guarantees for \(k\)-means and Euclidean \(k\)-median by primal-dual algorithms. In: Proceedings of FOCS, pp 61-72

  • Arthur D, Vassilvitskii S (2007) \(k\)-means++: The advantages of careful seeding. In: Proceedings of SODA, pp 1027-1035

  • Charikar M, Khuller S, Mount DM, Narasimhan G (2001) Algorithms for facility location problems with outliers. In: Proceedings of SODA, pp 642-651

  • Chen K (2008) A constant factor approximation algorithm for \(k\)-median clustering with outliers. In: Proceedings of SODA, pp 826-835

  • Dhillon IS, Modha DS (2001) Concept decompositions for large sparse text data using clustering. Mach Learn 42:143–175

    Article  Google Scholar 

  • Endo Y, Miyamoto S (2015) Spherical \(k\)-means\(++\) clustering. In: Proceedings of MDAI, pp 103-114

  • Friggstad Z, Khodamoradi K, Rezapour M, Salavatipour MR (2019) Approximation schemes for clustering with outliers. ACM Trans Algorithms 15(2):26

    Article  MathSciNet  Google Scholar 

  • Friggstad Z, Rezapour M, Salavatipour MR (2016) Local search yields a PTAS for \(k\)-means in doubling metrics. In: Proceedings of FOCS, pp 365-374

  • Gupta S, Kumar R, Lu K, Moseley B, Vassilvitskii S (2017) Local search methods for \(k\)-means with outliers. In: Proceedings of VLDB, pp 757-768

  • Hornik K, Feinerer I, Kober M, Buchta C (2012) Spherical \(k\)-means clustering. J Stat Softw 50(10):1–22

    Article  Google Scholar 

  • Jain K, Vazirani VV (2001) Approximation algorithms for metric facility location and \(k\)-median problems using the primal-dual schema and Lagrangian relaxation. J ACM 48:274–296

    Article  MathSciNet  Google Scholar 

  • Kanungo T, Mount DM, Netanyahu NS, Piatko CD, Silverman R, Wu AY (2004) A local search approximation algorithm for \(k\)-means clustering. Comput Geom Theory Appl 28:89–112

    Article  MathSciNet  Google Scholar 

  • Li M (2020) The bi-criteria seeding algorithms for two variants of \(k\)-means problem. J Comb Optim. https://doi.org/10.1007/s10878-020-00537-9

    Article  Google Scholar 

  • Li M, Xu D, Zhang D, Zou J (2020) The seeding algorithms for spherical \(k\)-means clustering. J Glob Optim 76:695–708

    Article  MathSciNet  Google Scholar 

  • Lloyd S (1982) Least squares quantization in PCM. IEEE Trans Inf Theory 28:129–137

    Article  MathSciNet  Google Scholar 

  • Mahajan M, Nimbhorkar P, Varadarajan K (2009) The planar \(k\)-means problem is NP-hard. In: Proceedings of WALCOM, pp 274-285

  • Makarychev K, Makarychev Y, Sviridenko M, Ward J (2016) A bi-criteria approximation algorithm for \(k\)-means. In: Proceedings of APPROX and RANDOM, pp 14:1-14:20

  • Wei D (2016) A constant-factor bi-criteria approximation guarantee for \(k\)-means++. In: Proceedings of NIPS, pp 604-612

  • Zhang D, Cheng Y, Li M, Wang Y, Xu D (2020) Approximation algorithms for spherical \(k\)-means problem using local search scheme. Theor Comput Sci. https://doi.org/10.1016/j.tcs.2020.06.029

    Article  MATH  Google Scholar 

Download references

Acknowledgements

The first author is funded by NSFC (No. 12001039) and the Fundamental Research Funds for the Central Universities, USTB (FRF-TP-20-074A1). The second author is funded by NSFC (No. 11971349). The third author is funded by NSFC (No. 11871081). The fourth author is funded by NSFC (No. 11801310).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dongmei Zhang.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

A preliminary version of this paper appeared in Proceedings of the 14th International Conference on Algorithmic Applications in Management, pp 141-148, 2020.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, Y., Wu, C., Zhang, D. et al. An approximation algorithm for the spherical k-means problem with outliers by local search. J Comb Optim 44, 2410–2422 (2022). https://doi.org/10.1007/s10878-021-00734-0

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10878-021-00734-0

Keywords

Navigation