Similarity, Distance, and Clustering

Computational Genome Analysis

Abstract

In this chapter, we explore quantitative approaches to clustering, the process of identifying groups of like objects. The grouping is based on similarities or differences measured by the characters that the objects possess. Clustering is closely related to classification, which assigns objects to predetermined categories. Assignment to a category is likewise based on the particular states of the characters associated with the object. We discussed classification in the last chapter and will say a little more at the end of this chapter. For more extensive discussions of clustering and classification, see Dunn and Everitt (1982), Everitt and Dunn (2001), and Johnson and Wichern (2002).

References

  • Dunn G, Everitt BS (1982) Introduction to Mathematical Taxonomy. Cambridge: Cambridge University Press.

  • Everitt BS, Dunn G (2001) Applied Multivariate Data Analysis (2nd edition). Oxford: Oxford University Press.

  • Hartigan J, Wong M (1979) A K-means clustering algorithm. Applied Statistics, 28:100–108.

  • Johnson RA, Wichern DW (2002) Applied Multivariate Statistical Analysis. Englewood Cliffs, NJ: Prentice-Hall.

Copyright information

© 2005 Springer Science+Business Media, Inc.

About this chapter

Cite this chapter

(2005). Similarity, Distance, and Clustering. In: Computational Genome Analysis. Springer, New York, NY. https://doi.org/10.1007/0-387-28807-4_10

  • DOI: https://doi.org/10.1007/0-387-28807-4_10

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-0-387-98785-9

  • Online ISBN: 978-0-387-28807-9

  • eBook Packages: Computer Science, Computer Science (R0)
