Encyclopedia of Big Data

Living Edition
| Editors: Laurie A. Schintler, Connie L. McNeely

Data Discovery

  • Anirudh Prabhu
Living reference work entry
DOI: https://doi.org/10.1007/978-3-319-32001-4_301-1



Broadly defined, data discovery is the process of finding patterns and trends in processed, analyzed, or visualized data. The reason data discovery must be defined “broadly” is because this process is popular across domains. These patterns and trends can be “discovered” from the data using different methods depending on the context and domain of the work.


Recently, the term “data discovery” has been popularized as a process in Business Intelligence, with a lot of software applications and tools aiding the user in discovering trends, patterns, outliers, clusters, etc. The data discovery process itself has a longer history that dates back to the beginning of data mining. Data mining started as a trend in the 1980s and was a process of extracting information by examining databases (under human control). Other names for data mining include knowledge extraction, information...

This is a preview of subscription content, log in to check access.

Further Readings

  1. Begoli, E., & Horey, J. (2012). Design principles for effective knowledge discovery from big data. In Software Architecture (WICSA) and European Conference on Software Architecture (ECSA), 2012 joint working IEEE/IFIP conference on IEEE (pp. 215–218). IEEE.Google Scholar
  2. Fayyad, U., Piatetsky-Shapiro, G., & Smyth, P. (1996). From data mining to knowledge discovery in databases. AI Magazine, 17(3), 37.Google Scholar
  3. Haan, K. (2016). So what is data discovery anyway? 5 key facts for BI. Retrieved Sept 24, 2017, from https://www.ironsidegroup.com/2016/03/21/data-discovery-5-facts-bi/.
  4. Kurgan, L. A., & Musilek, P. (2006). A survey of knowledge discovery and data mining process models. The Knowledge Engineering Review, 21(1), 1–24.CrossRefGoogle Scholar
  5. McGarry, K. (2005). A survey of interestingness measures for knowledge discovery. The Knowledge Engineering Review, 20(1), 39–61.CrossRefGoogle Scholar

Copyright information

© Springer International Publishing AG 2018

Authors and Affiliations

  1. 1.Tetherless World ConstellationRensselaer Polytechnic InstituteTroyUSA