Skip to main content

Exploratory Data Analysis

  • 164 Accesses

Definition

Exploratory data analysis (EDA) is an approach to data analysis that employs a number of different techniques to:

  1. 1.

    Look at data to see what it seems to say.

  2. 2.

    Uncover underlying structures.

  3. 3.

    Isolate important variables.

  4. 4.

    Detect outliers and other anomalies.

  5. 5.

    Suggest suitable models for conventional statistics.

Key Points

The term “exploratory data analysis” was introduced by John W. Tukey who in [2] shows how simple graphical and quantitative techniques can be used to open-mindedly explore data.

Typical graphical techniques are:

  1. 1.

    Plotting the raw data (e.g., stem-and-leaf diagrams, histograms, scatter plots)

  2. 2.

    Plotting simple statistics (e.g., mean plots, box plots, residual plots)

  3. 3.

    Positioning (multiple) plots to amplify cognition

Typical quantitative techniques are:

  1. 1.

    Interval estimation

  2. 2.

    Measures of location or of scale

  3. 3.

    Shapes of distributions

Exploratory data analysis can help to improve the results of statistical hypothesis testing by forcing one to...

Keywords

  • Exploratory Data Analysis (EDA)
  • Confirmatory Data Analysis
  • Conventional Conditions
  • Validation Stage
  • Plot Means

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

This is a preview of subscription content, access via your institution.

Recommended Reading

  1. Berry MJA, Linoff GS. Mastering data mining. New York: Wiley; 2000.

    Google Scholar 

  2. Tukey JW. Exploratory data analysis. Reading: Addison Wesley; 1977.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hans Hinterberger .

Editor information

Editors and Affiliations

Section Editor information

Rights and permissions

Reprints and Permissions

Copyright information

© 2017 Springer Science+Business Media LLC

About this entry

Cite this entry

Hinterberger, H. (2017). Exploratory Data Analysis. In: Liu, L., Özsu, M. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4899-7993-3_1384-2

Download citation

  • DOI: https://doi.org/10.1007/978-1-4899-7993-3_1384-2

  • Received:

  • Accepted:

  • Published:

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-1-4899-7993-3

  • Online ISBN: 978-1-4899-7993-3

  • eBook Packages: Springer Reference Computer SciencesReference Module Computer Science and Engineering