Skip to main content
  • 267 Accesses

Clustering is the partitioning of a data set into subsets or clusters, so that the degree of association is strong between members of the same cluster and weak between members of different clusters according to some defined distance measure.

Several methods of performing cluster analysis exist:

  • Partitional clustering

  • Hierarchical clustering.

HISTORY

See classification and data analysis.

MATHEMATICAL ASPECTS

To carry out cluster analysis on a set of n objects, we need to define a distance between the objects (or more generally a measure of the similarity between the objects) that need to be classified. The existence of some kind of structure within the set of objects is assumed.

To carry out a hierarchical classification of a set E of objects \( { \{x_1,x_2,\ldots,x_n\} } \), it is necessary to define a distance associated with E that can be used to obtain a distance table between the objects of E. Similarly, a distance must also be defined for any subsets of E.

One approach to...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 299.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

REFERENCES

  1. Celeux, G., Diday, E., Govaert, G., Lechevallier, Y., Ralambondrainy, H.: Classification automatique des données—aspects statistiques et informatiques. Dunod, Paris (1989)

    Google Scholar 

  2. Everitt, B.S.: Cluster Analysis. Halstead, London (1974)

    Google Scholar 

  3. Gordon, A.D.: Classification. Methods for the Exploratory Analysis of Multivariate Data. Chapman & Hall, London (1981)

    MATH  Google Scholar 

  4. Jambu, M., Lebeaux, M.O.: Classification automatique pour l'analyse de données. Dunod, Paris (1978)

    Google Scholar 

  5. Kaufman, L., Rousseeuw, P.J.: Finding Groups in Data: An Introduction to Cluster Analysis. Wiley, New York (1990)

    Google Scholar 

  6. Lerman, L.C.: Classification et analyse ordinale des données. Dunod, Paris (1981)

    MATH  Google Scholar 

  7. Tomassone, R., Daudin, J.J., Danzart, M., Masson, J.P.: Discrimination et classement. Masson, Paris (1988)

    MATH  Google Scholar 

Download references

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag

About this entry

Cite this entry

(2008). Cluster Analysis. In: The Concise Encyclopedia of Statistics. Springer, New York, NY. https://doi.org/10.1007/978-0-387-32833-1_60

Download citation

Publish with us

Policies and ethics