Abstract
We have a dataset that is a collection of d-dimensional vectors. This chapter introduces the nasty tricks that such data can play. A dataset like this is hard to plot, though Sect. 4.1 suggests some tricks that are helpful. Most readers will already know the mean as a summary (it’s an easy generalization of the 1D mean). The covariance matrix may be less familiar. This is a collection of all covariances between pairs of components. We use covariances, rather than correlations, because covariances can be represented in a matrix easily. High dimensional data has some nasty properties (it’s usual to lump these under the name “the curse of dimension”). The data isn’t where you think it is, and this can be a serious nuisance, making it difficult to fit complex probability models.
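The mean vector and covariance matrix described above can be sketched in a few lines of numpy. This is a minimal illustration, not code from the chapter; the small random dataset is a made-up example.

```python
import numpy as np

# Hypothetical dataset: 5 samples, each a 3-dimensional vector.
rng = np.random.default_rng(0)
X = rng.normal(size=(5, 3))

# The mean is computed component-wise, generalizing the 1D mean.
mean = X.mean(axis=0)

# The covariance matrix collects the covariances between every pair of
# components: entry (i, j) is the covariance of components i and j.
cov = np.cov(X, rowvar=False)  # rows are samples, columns are components

print(mean.shape)  # (3,)
print(cov.shape)   # (3, 3)
```

Note that the covariance matrix is symmetric (the covariance of components i and j is the same as that of j and i), and its diagonal holds the variance of each component.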
Copyright information
© 2019 Springer Nature Switzerland AG
Cite this chapter
Forsyth, D. (2019). High Dimensional Data. In: Applied Machine Learning. Springer, Cham. https://doi.org/10.1007/978-3-030-18114-7_4
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-18113-0
Online ISBN: 978-3-030-18114-7
eBook Packages: Computer Science (R0)