Abstract
Similarity indices can be used to compare partitions (clusterings) of a data set. Many such indices have been introduced in the literature over the years. We show that, of the 28 indices we were able to track down, 22 are distinct. Although their values differ for the same pair of clusterings, after correction for agreement attributable to chance alone their values become similar, and some indices even become equivalent. Consequently, the choice of index for comparing clusterings becomes less important.
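To make the idea of correcting for chance concrete, the following is a minimal Python sketch. It is an illustration only and is not taken from the abstract, which names no specific index. It assumes the Rand index as the example index and the commonly used correction of the form (index - expected) / (maximum - expected), with the expected value computed under a fixed-marginals (hypergeometric) assumption. The function names `pair_counts`, `rand_index` and `adjusted_rand_index` are hypothetical.

```python
from collections import Counter
from math import comb


def pair_counts(labels_a, labels_b):
    """Count pairs of points from the contingency table of two partitions."""
    if len(labels_a) != len(labels_b):
        raise ValueError("both labelings must cover the same points")
    n = len(labels_a)
    if n < 2:
        raise ValueError("at least two points are needed to form a pair")
    # Pairs placed together in both partitions (contingency table cells).
    both = sum(comb(c, 2) for c in Counter(zip(labels_a, labels_b)).values())
    # Pairs placed together in the first / second partition (marginals).
    in_a = sum(comb(c, 2) for c in Counter(labels_a).values())
    in_b = sum(comb(c, 2) for c in Counter(labels_b).values())
    return both, in_a, in_b, comb(n, 2)


def rand_index(labels_a, labels_b):
    """Fraction of pairs on which the two partitions agree (uncorrected)."""
    both, in_a, in_b, total = pair_counts(labels_a, labels_b)
    # Agreements: together in both, plus apart in both.
    return (total + 2 * both - in_a - in_b) / total


def adjusted_rand_index(labels_a, labels_b):
    """Chance-corrected version: (index - expected) / (maximum - expected)."""
    both, in_a, in_b, total = pair_counts(labels_a, labels_b)
    expected = in_a * in_b / total   # assumption: marginals held fixed
    maximum = (in_a + in_b) / 2
    if maximum == expected:          # degenerate case, e.g. a single cluster
        return 1.0
    return (both - expected) / (maximum - expected)


if __name__ == "__main__":
    a = [0, 0, 1, 1]
    b = [0, 1, 0, 1]
    print(rand_index(a, b), adjusted_rand_index(a, b))
```

In this sketch, two partitions that are identical up to relabeling score 1 on both measures, while the uncorrected and corrected values can differ markedly for dissimilar partitions.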
About this article
Cite this article
Albatineh, A., Niewiadomska-Bugaj, M. & Mihalko, D. On Similarity Indices and Correction for Chance Agreement. Journal of Classification 23, 301–313 (2006). https://doi.org/10.1007/s00357-006-0017-z
DOI: https://doi.org/10.1007/s00357-006-0017-z