Clustering: Models of High Dimensional Data

  • David Forsyth


High-dimensional data comes with problems. Data points tend not to be where you think; they can scattered quite far apart, and can be quite far from the mean. There is an important rule of thumb for coping with high dimensional data: Use simple models. One very good, very simple, model for high dimensional data is to assume that it consists of multiple blobs. To build models like this, we must determine which datapoints belong to which blob by collecting together data points that are close and forming blobs out of them.

Copyright information

© Springer International Publishing AG 2018

Authors and Affiliations

  • David Forsyth
    • 1
  1. 1.Computer Science DepartmentUniversity of Illinois at Urbana ChampaignUrbanaUSA

Personalised recommendations