Abstract
Clustering classifies objects into groups based on similarity or distance measure. This is an example of unsupervised learning. The main difference between clustering and classification is that the latter has well-defined target classes. The characteristics of target classes are defined by the training data and the models learned from it. That is why classification is supervised in nature. In contrast, clustering tries to define meaningful classes based on data and its similarity or distance. Figure 4-1 illustrates a document clustering process.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2018 Sayan Mukhopadhyay
About this chapter
Cite this chapter
Mukhopadhyay, S. (2018). Unsupervised Learning: Clustering. In: Advanced Data Analytics Using Python. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-3450-1_4
Download citation
DOI: https://doi.org/10.1007/978-1-4842-3450-1_4
Published:
Publisher Name: Apress, Berkeley, CA
Print ISBN: 978-1-4842-3449-5
Online ISBN: 978-1-4842-3450-1
eBook Packages: Professional and Applied ComputingApress Access BooksProfessional and Applied Computing (R0)