Skip to main content

Exploratory Data Analysis

  • Reference work entry
  • First Online:
International Encyclopedia of Statistical Science

Exploratory data analysis (EDA) is a term first utilized by John Tukey (1977), and is intended to contrast with the more traditional statistical approach to data analysis that starts with hypothesis testing and model building. Instead of using confirmatory data analysis (CDA) methods to verify or refute suspected hypotheses, Tukey (1977) advocated the use of basic descriptive statistics and visualization methods to generate information that would lead to the development of hypotheses to test. The objectives of EDA (see Velleman and Hoaglin 1981) are therefore to provide statistical summaries of the data as a pre-processing step before building a model or testing hypotheses. The EDA phase enables new hypotheses to be suggested about the causes of the relationships in the data, and provides an opportunity to determine whether the assumption that various models rely upon are valid for the particular dataset. It provides us with a very rapid feel for the data: the shape of its...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 1,100.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 549.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References and Further Reading

  • Borg I, Groenen PJF (2005) Modern multidimensional scaling: theory and applications. Springer-Verlag, New York

    Google Scholar 

  • Deboeck GJ, Kohonen TK (1998) Visual explorations in finance. Springer-Verlag, New York

    MATH  Google Scholar 

  • Emerson JD, Hoaglin DC (1983) Analysis of two-way tables by medians. In: Hoaglin DC, Mosteller F, Tukey JW (eds) Understanding robust and exploratory data analysis Wiley, New York, pp 165–210

    Google Scholar 

  • Hair JF (2006) Multivariate data analysis, Prentice-Hall, New Jersey

    Google Scholar 

  • Kohonen T (1989) Self-organization and associative memory. Springer-Verlag, Berlin

    Google Scholar 

  • Maimon OZ, Rokash L (2005) Data mining and knowledge discovery handbook, Springer, New York

    MATH  Google Scholar 

  • Pyle D (1999) Data preparation for data mining, Morgan Kaufmann, San Francisco

    Google Scholar 

  • Tukey JW (1977) Exploratory data analysis. Addison-Wesley, Reading, MA

    MATH  Google Scholar 

  • Velleman PF, Hoaglin DC (1981) The ABC’s of EDA. Duxbury Press, Boston, MA

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this entry

Cite this entry

Smith-Miles, K. (2011). Exploratory Data Analysis. In: Lovric, M. (eds) International Encyclopedia of Statistical Science. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04898-2_242

Download citation

Publish with us

Policies and ethics