Encyclopedia of Database Systems

2018 Edition
| Editors: Ling Liu, M. Tamer Özsu

Nonparametric Data Reduction Techniques

  • Rui Zhang
Reference work entry
DOI: https://doi.org/10.1007/978-1-4614-8265-9_546

Definition

A nonparametric data reduction technique is a data reduction technique that does not assume any model for the data.

Key Points

Nonparametric data reduction (NDR) techniques is opposite to parametric data reduction (PDR) techniques. A PDR technique must assume a certain model for the data. Parameters of the model are determined before the data reduction is performed. A NDR technique does not assume any model and is applied to the data directly. The data reduction effectiveness of a PDR technique heavily depends on whether the model suits the data well. If well-suited, good accuracy as well as substantial data reduction can be achieved; otherwise, both cannot be achieved at the same time. A NDR technique yields more uniform effectiveness irrespective of the data, but it may not achieve as high data reduction as a well-suited PDR technique.

Popular NDR techniques include histograms, clustering and indexes. Histograms are used to approximate data distributions. An equidepth...

This is a preview of subscription content, log in to check access.

Recommended Reading

  1. 1.
    Barbará D, DuMouchel W, Faloutsos C, Haas PJ, Hellerstein JM, Ioannidis YE, Jagadish HV, Johnson T, Ng RT, Poosala V, Ross KA, Sevcik KC. The New Jersey data reduction report. Q Bull IEEE TC Data Eng. 1997;20(4):3–45.Google Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  1. 1.University of MelbourneMelbourneAustralia
  2. 2.Dataware VenturesTucsonUSA
  3. 3.Dataware VenturesRedondo BeachCAUSA

Section editors and affiliations

  • Xiaofang Zhou
    • 1
  1. 1.School of Inf. Tech. & Elec. Eng.Univ. of QueenslandBrisbaneAustralia