High Dimensional Data

Applied Machine Learning

Abstract

We have a dataset that is a collection of d-dimensional vectors. This chapter introduces the nasty tricks that such data can play. A dataset like this is hard to plot, though Sect. 4.1 suggests some tricks that are helpful. Most readers will already know the mean as a summary (it’s an easy generalization of the 1D mean). The covariance matrix may be less familiar. This is a collection of all covariances between pairs of components. We use covariances, rather than correlations, because covariances can be represented in a matrix easily. High dimensional data has some nasty properties (it’s usual to lump these under the name “the curse of dimension”). The data isn’t where you think it is, and this can be a serious nuisance, making it difficult to fit complex probability models.
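The two ideas in the abstract can be sketched numerically. The block below is an illustration only, not code from the chapter: the function names, the 1/N normalization of the covariance, the cube [-1, 1]^d, and the margin `eps` are all assumptions made for the example. It computes the mean and covariance matrix of a dataset stored one item per row, then estimates how much of a uniform sample in a d-dimensional cube lies close to the cube's boundary.

```python
import numpy as np


def mean_and_covariance(x):
    """Mean vector and covariance matrix of x, an (N, d) array with one item per row."""
    mean = x.mean(axis=0)
    centered = x - mean
    # Entry (j, k) is the covariance between components j and k.
    # Assumption: 1/N normalization; 1/(N - 1) is also in common use.
    covmat = centered.T @ centered / x.shape[0]
    return mean, covmat


def fraction_near_boundary(d, eps=0.05, n=20_000, seed=0):
    """Estimate the fraction of points drawn uniformly from [-1, 1]^d that
    have at least one coordinate with absolute value above 1 - eps."""
    rng = np.random.default_rng(seed)
    pts = rng.uniform(-1.0, 1.0, size=(n, d))
    return np.mean(np.max(np.abs(pts), axis=1) > 1.0 - eps)


# Hypothetical data: 500 items in 3 dimensions.
rng = np.random.default_rng(0)
x = rng.normal(size=(500, 3))
mean, covmat = mean_and_covariance(x)
print(mean.shape, covmat.shape)

# The estimated fraction grows toward 1 as d increases.
for d in (2, 10, 100):
    print(d, fraction_near_boundary(d))
```

Under these assumptions the exact fraction is 1 - (1 - eps)^d, so in high dimensions almost every sample sits near the boundary, which is one way to read the remark that the data isn't where you think it is.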


Copyright information

© 2019 Springer Nature Switzerland AG

About this chapter

Cite this chapter

Forsyth, D. (2019). High Dimensional Data. In: Applied Machine Learning. Springer, Cham. https://doi.org/10.1007/978-3-030-18114-7_4

  • DOI: https://doi.org/10.1007/978-3-030-18114-7_4

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-18113-0

  • Online ISBN: 978-3-030-18114-7

  • eBook Packages: Computer Science, Computer Science (R0)
