Stream Mining

Han, Jiawei; Ding, Bolin

doi:10.1007/978-1-4899-7993-3_369-2

Living reference work entry
First Online: 01 January 2016

Synonyms

Stream data analysis

Definition

Stream mining is the process of discovering knowledge or patterns from continuous data streams. Unlike a traditional data set, a data stream is a sequence of data instances that flows into and out of a system continuously, at varying update rates. Data streams are temporally ordered, fast changing, massive, and potentially infinite. Examples include data generated by communication networks, Internet traffic, online stock or business transactions, electric power grids, industrial production processes, scientific and engineering experiments, and video, audio, or remote sensing data from cameras, satellites, and sensor networks. Because the volume of a data stream usually makes it impossible to store the stream in full or to scan it multiple times, most stream mining algorithms are restricted to reading the data once, or a small number of times, with limited computing and storage resources. Moreover, much of stream data resides at...
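
The one-pass, bounded-memory constraint described in the definition can be illustrated with a small sketch. The example below uses the Misra-Gries frequent-items summary, a classic technique that is not taken from this entry; the function name misra_gries and the parameter k are illustrative assumptions. It reads each item exactly once and never holds more than k - 1 counters, however long the stream is.

```python
from typing import Dict, Hashable, Iterable


def misra_gries(stream: Iterable[Hashable], k: int) -> Dict[Hashable, int]:
    """One-pass frequent-items summary that keeps at most k - 1 counters.

    Any item occurring more than n / k times in a stream of n items is
    guaranteed to appear in the result. Each stored count underestimates
    the true count by at most n / k.
    """
    if k < 2:
        raise ValueError("k must be at least 2")
    counters: Dict[Hashable, int] = {}
    for item in stream:  # each item is seen once and never stored
        if item in counters:
            counters[item] += 1
        elif len(counters) < k - 1:
            counters[item] = 1
        else:
            # No free counter: decrement every counter and drop the
            # ones that reach zero. The new item itself is discarded.
            for key in list(counters):
                counters[key] -= 1
                if counters[key] == 0:
                    del counters[key]
    return counters


if __name__ == "__main__":
    # Hypothetical stream of nine events in which "a" dominates.
    events = iter("abacadaea")
    print(misra_gries(events, k=3))
```

The stored counts are lower bounds rather than exact frequencies, which is the usual trade-off in this setting: accuracy is exchanged for a memory footprint that does not grow with the stream.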

Recommended Reading

Aggarwal CC. Data streams: models and algorithms. Kluwer Academic; 2006.
Aggarwal CC, Han J, Wang J, Yu PS. A framework for clustering evolving data streams. In: Proceedings of the 29th international conference on very large data bases; 2003. p. 81–92.
Aggarwal CC, Han J, Wang J, Yu PS. On demand classification of data streams. In: Proceedings of the 10th ACM SIGKDD international conference on knowledge discovery and data mining; 2004. p. 503–8.
Babcock B, Babu S, Datar M, Motwani R, Widom J. Models and issues in data stream systems. In: Proceedings of the 21st ACM SIGACT-SIGMOD-SIGART symposium on principles of database systems; 2002. p. 1–16.
Cai YD, Clutter D, Pape G, Han J, Welge M, Auvil L. MAIDS: mining alarming incidents from data streams. In: Proceedings of the ACM SIGMOD international conference on management of data; 2004. p. 919–20.
Chen Y, Dong G, Han J, Wah BW, Wang J. Multi-dimensional regression analysis of time-series data streams. In: Proceedings of the 28th international conference on very large data bases; 2002. p. 323–34.
Cormode G, Muthukrishnan S. What’s hot and what’s not: tracking most frequent items dynamically. In: Proceedings of the 22nd ACM SIGACT-SIGMOD-SIGART symposium on principles of database systems; 2003. p. 296–306.
Gao J, Fan W, Han J, Yu PS. A general framework for mining concept-drifting data streams with skewed distributions. In: Proceedings of the SIAM international conference on data mining; 2007.
Guha S, Mishra N, Motwani R, O’Callaghan L. Clustering data streams. In: Proceedings of the 41st annual symposium on foundations of computer science; 2000. p. 359–66.
Hulten G, Spencer L, Domingos P. Mining time-changing data streams. In: Proceedings of the 7th ACM SIGKDD international conference on knowledge discovery and data mining; 2001.
Kargupta H, Bhargava B, Liu K, Powers M, Blair P, Bushra S, Dull J, Sarkar K, Klein M, Vasa M, Handy D. VEDAS: a mobile and distributed data stream mining system for real-time vehicle monitoring. In: Proceedings of the SIAM international conference on data mining; 2004.
Manku G, Motwani R. Approximate frequency counts over data streams. In: Proceedings of the 28th international conference on very large data bases; 2002. p. 346–57.
Mendes L, Ding B, Han J. Stream sequential pattern mining with precise error bounds. In: Proceedings of the 2008 IEEE international conference on data mining; 2008.
O’Callaghan L, Meyerson A, Motwani R, Mishra N, Guha S. Streaming-data algorithms for high-quality clustering. In: Proceedings of the 18th international conference on data engineering; 2002. p. 685–96.
Shasha D, Zhu Y. High performance discovery in time series: techniques and case studies. Springer; 2004.
Wang H, Fan W, Yu PS, Han J. Mining concept-drifting data streams using ensemble classifiers. In: Proceedings of the 9th ACM SIGKDD international conference on knowledge discovery and data mining; 2003. p. 226–35.

Author information

Authors and Affiliations

University of Illinois at Urbana-Champaign, Urbana, IL, USA
Jiawei Han & Bolin Ding

Corresponding author

Correspondence to Jiawei Han.

Editor information

Editors and Affiliations

Georgia Institute of Technology College of Computing, Atlanta, Georgia, USA
Ling Liu
University of Waterloo School of Computer Science, Waterloo, Ontario, Canada
M. Tamer Özsu

Section Editor information

AT&T Labs - Research, AT&T, 1 AT&T Way, 07921, Bedminster, NJ, USA
Divesh Srivastava

Cite this entry

Han, J., Ding, B. (2016). Stream Mining. In: Liu, L., Özsu, M. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4899-7993-3_369-2

DOI: https://doi.org/10.1007/978-1-4899-7993-3_369-2
Received: 26 April 2016
Accepted: 21 October 2016
Published: 05 December 2016
Publisher Name: Springer, New York, NY
Online ISBN: 978-1-4899-7993-3
eBook Packages: Springer Reference Computer Sciences, Reference Module Computer Science and Engineering
