Skip to main content
  • 1351 Accesses

Abstract

This chapter presents Exploratory Data Analysis (EDA) as an approach for gaining understanding and insight about a particular dataset, in order to support and validate statistical findings and also to potentially generate, identify, and create new hypotheses based on patterns in data. Examples of EDA are provided and interpretations are discussed. EDA may be used at any stage in the data analysis process from cleaning through transformation and descriptive analysis, as well as using results from every stage. Discovery of patterns may inspire a new direction in intervention effectiveness research, as well as further supporting or validating existing projects. Data visualization skills are essential for individuals working with large datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 49.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 49.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Tukey JW (1977) Exploratory data analysis. Addison-Wesley, Reading, MA

    Google Scholar 

  2. Monsen KA, Peterson JJ, Mathiason MA, Kim E, Lee S, Chi CL, Pieczkiewicz DS (2015) Data visualization techniques to showcase nursing care quality. Comput Inform Nurs 33(10):417–426

    Article  PubMed  Google Scholar 

  3. Dzemyda G, Kurasova O, Zilinskas J (2013) Strategies for multidimensional data visualization. In Multidimensional data visualization: Methods and applications. Springer, New York, NY, pp. 5–40

    Google Scholar 

  4. Lee S, Kim E, Monsen KA (2015) Public health nurse perceptions of Omaha System data visualization. Int J Med Inform 84(10):826–834

    Article  PubMed  Google Scholar 

  5. Kirk A (2012) Data visualization: a successful design process. Packt, Birmingham

    Google Scholar 

  6. Gehlenborg N, Wong B (2012) Points of view: heat maps. Nat Methods 9(3):213

    Article  CAS  PubMed  Google Scholar 

  7. Aigner W, Miksch S, Schumann H, Tominski C (2011) Visualization of time-oriented data. Springer, London

    Book  Google Scholar 

  8. Kim E, Monsen K, Pieczkiewicz D (2013) Visualization of Omaha System data enables data-driven analysis of outcomes. In: American Medical Informatics Association Annual Meeting, Washington, DC

    Google Scholar 

  9. Monsen KA, Peterson JJ, Mathiason MA, Kim E, Votova B, Pieczkiewicz DS (2017) Discovering public health nurse–specific family home visiting intervention patterns using visualization techniques. West J Nurs Res 39(1):127–146

    Article  Google Scholar 

  10. Mendenhall W, Beaver RJ, Beaver BM (2012) Introduction to probability and statistics. Cengage Learning, Boston, MA

    Google Scholar 

  11. What is R? [Internet]. [cited 12 May 2017]. Available from https://www.r-project.org/about.html

  12. What is Tableau? [Internet]. [cited 12 May 2017]. Available from http://www.tableau.com/products/desktop#TGETWeLmfvacBMZt.99

  13. D3 Data Driven Documents [Internet]. [cited 12 May 2017]. Available from https://d3js.org/

  14. Anscombe FJ (1973) Graphs in statistical analysis. Am Stat 27(1):17–21. doi:10.2307/2682899

    Google Scholar 

  15. Green SB, Salkind NJ (2010) Using SPSS for windows and macintosh: analyzing and understanding data. Prentice Hall, Upper Saddle River, NJ

    Google Scholar 

  16. Radhakrishnan K, Monsen KA, Bae SH, Zhang W (2016) Visual analytics for pattern discovery in home care. Appl Clin Inform 7(3):711–730

    Article  PubMed  PubMed Central  Google Scholar 

  17. Chi C (2015) Data mining for translation to practice. Presented at 2nd international conference on research methods for standard terminologies, West Paul, MN

    Google Scholar 

  18. Harter JM, Wu X, Alabi OS, Phadke M, Pinto L, Dougherty D, Petersen H, Bass S, Taylor II RM (2012) Increasing the perceptual salience of relationships in parallel coordinate plots. In: IS&T/SPIE Electronic Imaging, 2012 Jan 22. International Society for Optics and Photonics, pp 82940T–82940T

    Google Scholar 

  19. Inselberg A (2009) Parallel coordinates: visual multidimensional geometry and its applications. Springer, New York, pp 1–554

    Book  Google Scholar 

  20. Few S. Multivariate analysis using parallel coordinates. Perceptual Edge web site. [Internet]. [cited 12 May 2017]. Available from http://www.perceptualedge.com/articles/b-eye/parallel_coordinates.pdf. Published 12 Sept 2006. Updated 2015

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this chapter

Cite this chapter

Monsen, K.A. (2018). Exploratory Data Analysis. In: Intervention Effectiveness Research: Quality Improvement and Program Evaluation. Springer, Cham. https://doi.org/10.1007/978-3-319-61246-1_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-61246-1_7

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-61245-4

  • Online ISBN: 978-3-319-61246-1

  • eBook Packages: MedicineMedicine (R0)

Publish with us

Policies and ethics