Synonyms
Data mining; Text data mining; Web content mining; Web data mining; Web mining; Web structure mining; Web usage mining
Definition
The healthcare domain presents numerous opportunities for extracting information from heterogeneous sources ranging from structured data (e.g., laboratory results and diagnoses) to unstructured data (e.g., clinical documents such as discharge summaries) to usage data (e.g., audit logs that record user activity for clinical applications). To accommodate the unique characteristics of these disparate types of data and support the subsequent use of extracted information, several existing techniques have been adapted and applied including Data Mining, Text Mining, and Web Mining [1]. This entry provides an overview of each of these mining techniques (with a focus on Web usage mining) and example applications in healthcare.
Historical Background
Given the exponential growth of data in all domains, there has been an increasing amount of work focused on the...
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsRecommended Reading
Data Mining, Web Mining, Text Mining, Knowledge Discovery. www.kdnuggets.com
Dunham M. Data mining introductory and advanced topics. Englewood Cliffs: Prentice-Hall; 2003.
Fayyad U, Piatetsky-Shapiro G, Smyth P, Uthurusamy R. Advances in knowledge discovery and data mining. Menlo Park: AAAI/MIT; 1996.
Chen H, Fuller S, Friedman C, Hersh W. Knowledge management and data mining in biomedicine. New York: Springer; 2005.
Hearst M. Untangling text data mining. In: Proceedings of the 27th Annual Meeting of the Association for Computational Linguistics; 1999.
Konchady M. Text mining application programming. Charles River Media Boston; 2006 2.
Scime A. Web mining: applications and techniques. Hershey: Idea Group Inc.; 2005.
Srivastava J, Cooley R, Deshpande M, Tan P. Web usage mining: discovery and applications of usage patterns from web data. SIGKDD Explor. 2000;1(2):12–23.
Kosala R, Blockeel H. Web mining research: a survey. SIGKDD Explor. 2000;2(1):1–15.
Cooley R, Mobasher B, Srivastava J. Web mining: information and pattern discovery on the World Wide Web. In: Proceedings of the 9th IEEE International Conference on Tools with Artificial Intelligence; 1997. p. 558–67.
Doddi S, Marathe A, Ravi SS, Torney DC. Discovery of association rules in medical data. Med Inform Internet Med. 2001;26(1):25–33.
Prather JC, Lobach DF, Goodwin LK, Hales JW, Hage ML, Hammond WE. Medical data mining: knowledge discovery in a clinical data warehouse. In: Proceedings of the AMIA Annual Fall Symposium; 1997. p. 101–5.
Mullins IM, Siadaty MS, Lyman J, Scully K, Garrett CT, Greg MW, et al. Data mining and clinical data repositories: insights from a 667,000 patient data set. Comput Biol Med. 2006;36(12):1351–77.
Hripcsak G, Bakken S, Stetson PD, Patel VL. Mining complex clinical data for patient safety research: a framework for event discovery. J Biomed Inform. 2003;36(1–2):120–30.
Cao H, Markatou M, Melton GB, Chiang MF, Hripcsak G. Mining a clinical data warehouse to discover disease-finding associations using co-occurrence statistics. In: Proceedings of the AMIA Annual Symposium; 2005. p. 106–10.
Heinze DT, Morsch ML, Holbrook J. Mining free-text medical records. In: Proceedings of the AMIA Symposium; 2001. p. 254–8.
Eirinaki M, Vazirgiannis M. Web mining for web personalization. ACM Trans Internet Technol. 2003;3(1):1–27.
Pierrakos D, Paliouras G, Papatheodorou C, Spyropoulos C. Web usage mining as a tool for personalization: a survey. User Model User-Adap. 2003;13(4):311–72.
Mobasher B, Cooley R, Srivastava J. Automatic personalization based on web usage mining. Commun ACM. 2000;43(8):142–51.
Malin BA. Correlating web usage of health information with patient medical data. In: Proceedings of the AMIA Symposium; 2002. p. 484–8.
Johnson HA, Wagner MM, Hogan WR, Chapman W, Olszewski RT, Dowling J, et al. Analysis of web access logs for surveillance of influenza. In: Proceedings of the 11th World Congress on Medical Informatics; 2004. p. 1202.
Heino J, Toivonen H. Automated detection of epidemics from the usage logs of a physicians’s; reference database. In: Principles of Data Mining and Knowledge Discovery, 7th European Conference; 2003. p. 180–91.
Zhang D, Zambrowicz C, Zhou H, Roderer N. User information seeking behavior in a medical web portal environment: a preliminary study. J Am Soc Inf Sci Technol. 2004;55(8):670–84.
Rozic-Hristovski A, Hristovski D, Todorovski L. Users’ information-seeking behavior on a medical library Website. J Med Libr Assoc. 2002;90(2):210–7.
Bracke PJ. Web usage mining at an academic health sciences library: an exploratory study. J Med Libr Assoc. 2004;92(4):421–8.
Chen ES, Cimino JJ. Automated discovery of patient-specific clinician information needs using clinical information system log files. In: Proceedings of the AMIA Annual Symposium; 2003. p. 145–9.
Chen ES, Cimino JJ. Patterns of usage for a web-based clinical information system. In: Proceedings of the 11th World Congress on Medical Informatics; 2004. p. 18–22.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Section Editor information
Rights and permissions
Copyright information
© 2018 Springer Science+Business Media, LLC, part of Springer Nature
About this entry
Cite this entry
Chen, E.S. (2018). Data, Text, and Web Mining in Healthcare. In: Liu, L., Özsu, M.T. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-8265-9_94
Download citation
DOI: https://doi.org/10.1007/978-1-4614-8265-9_94
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-8266-6
Online ISBN: 978-1-4614-8265-9
eBook Packages: Computer ScienceReference Module Computer Science and Engineering