Skip to main content

Unsupervised Learning: Clustering

  • Chapter
  • First Online:
Advanced Data Analytics Using Python
  • 12k Accesses

Abstract

Clustering classifies objects into groups based on similarity or distance measure. This is an example of unsupervised learning. The main difference between clustering and classification is that the latter has well-defined target classes. The characteristics of target classes are defined by the training data and the models learned from it. That is why classification is supervised in nature. In contrast, clustering tries to define meaningful classes based on data and its similarity or distance. Figure 4-1 illustrates a document clustering process.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 54.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Sayan Mukhopadhyay

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Mukhopadhyay, S. (2018). Unsupervised Learning: Clustering. In: Advanced Data Analytics Using Python. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-3450-1_4

Download citation

Publish with us

Policies and ethics