Initializing K-means Batch Clustering: A Critical Evaluation of Several Techniques

Steinley, Douglas; Brusco, Michael J.

doi:10.1007/s00357-007-0003-0

Published: June 2007

Volume 24, pages 99–121, (2007)

Journal of Classification

Abstract

K-means clustering is arguably the most popular technique for partitioning data. Unfortunately, K-means suffers from the well-known problem of locally optimal solutions. Furthermore, the final partition is dependent upon the initial configuration, making the choice of starting partitions all the more important. This paper evaluates 12 procedures proposed in the literature and provides recommendations for best practices.
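The dependence on the starting configuration can be illustrated with a small sketch. The code below is not taken from the paper: the four-point data set, the two sets of initial centers, and the helper names `sq_dist` and `batch_kmeans` are all assumptions chosen for illustration. It runs a plain batch K-means (every point is reassigned before the centers are updated) from two different starts and compares the resulting within-cluster sum of squares.

```python
def sq_dist(p, q):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def batch_kmeans(points, centers, max_iter=100):
    """Batch K-means from the given initial centers (illustrative helper).

    Returns (labels, centers, sse), where sse is the within-cluster
    sum of squared distances to the final centers.
    """
    centers = [tuple(c) for c in centers]
    labels = None
    for _ in range(max_iter):
        # Assignment step: every point goes to its nearest current center.
        new_labels = [
            min(range(len(centers)), key=lambda k: sq_dist(p, centers[k]))
            for p in points
        ]
        if new_labels == labels:  # no reassignment, so the partition is stable
            break
        labels = new_labels
        # Update step: each center moves to the mean of its members.
        for k in range(len(centers)):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:  # an empty cluster keeps its previous center
                centers[k] = tuple(sum(x) / len(members) for x in zip(*members))
    sse = sum(sq_dist(p, centers[lab]) for p, lab in zip(points, labels))
    return labels, centers, sse


# Hypothetical data: corners of a 4-by-1 rectangle, so the natural
# two-cluster split is left pair versus right pair.
points = [(0.0, 0.0), (0.0, 1.0), (4.0, 0.0), (4.0, 1.0)]

# Start A: one center near each short side.
labels_a, centers_a, sse_a = batch_kmeans(points, [(0.0, 0.5), (4.0, 0.5)])

# Start B: both centers on the vertical midline; the algorithm settles on a
# bottom-versus-top split that no single reassignment step can escape.
labels_b, centers_b, sse_b = batch_kmeans(points, [(2.0, 0.0), (2.0, 1.0)])

print("start A:", labels_a, "SSE =", sse_a)
print("start B:", labels_b, "SSE =", sse_b)
```

Both runs converge, but to different partitions with different sums of squares, which is the locally optimal behavior the abstract refers to. Comparing many starts and keeping the lowest sum of squares is one common response; the procedures the paper actually evaluates are not reproduced here.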

Author information

Authors and Affiliations

University of Missouri-Columbia, Columbia, MO, USA
Douglas Steinley
Florida State University, Tallahassee, FL, USA
Michael J. Brusco

About this article

Cite this article

Steinley, D., Brusco, M. Initializing K-means Batch Clustering: A Critical Evaluation of Several Techniques. Journal of Classification 24, 99–121 (2007). https://doi.org/10.1007/s00357-007-0003-0

Issue Date: June 2007
DOI: https://doi.org/10.1007/s00357-007-0003-0
