Skip to main content

Stream Mining

  • Living reference work entry
  • First Online:
  • 45 Accesses

Synonyms

Stream data analysis

Definition

Stream mining is the process of discovering knowledge or patterns from continuous data streams. Unlike traditional data sets, data streams consist of sequences of data instances that flow in and out of a system continuously and with varying update rates. They are temporally ordered, fast changing, massive, and potentially infinite. Examples of data streams include data generated by communication networks, Internet traffic, online stock or business transactions, electric power grids, industry production processes, scientific and engineering experiments, and video, audio or remote sensing data from cameras, satellites, and sensor networks. Since it is usually impossible to store an entire data stream, or to scan through it multiple times due to its tremendous volume, most stream mining algorithms are confined to reading only once or a small number of times using limited computing and storage capabilities. Moreover, much of stream data resides at...

This is a preview of subscription content, log in via an institution.

Recommended Reading

  1. Aggarwal CC. Data streams: models and algorithms. Kluwer Academic; 2006.

    Google Scholar 

  2. Aggarwal CC, Han J, Wang J, Yu PS. A framework for clustering evolving data streams. In: Proceedings of the 29th international conference on very large data bases; 2003. p. 81–92.

    Google Scholar 

  3. Aggarwal CC, Han J, Wang J, Yu PS. On demand classification of data streams. In: Proceedings of the 10th ACM SIGKDD international conference on knowledge discovery and data mining; 2004. p. 503–8.

    Google Scholar 

  4. Babcock B, Babu S, Datar M, Motwani R, Widom J. Models and issues in data stream systems. In: Proceedings of the 21st ACM SIGACT-SIGMOD-SIGART symposium on principles of database systems; 2002. p. 1–16.

    Google Scholar 

  5. Cai YD, Clutter D, Pape G, Han J, Welge M, Auvil L. MAIDS: mining alarming incidents from data streams. In: Proceedings of the ACM SIGMOD international conference on management of data; 2004. p. 919–20.

    Google Scholar 

  6. Chen Y, Dong G, Han J, Wah BW, Wang J. Multi-dimensional regression analysis of time-series data streams. In: Proceedings of the 28th international conference on very large data bases; 2002. p. 323–34.

    Google Scholar 

  7. Cormode G, Muthukrishnan S. What’s hot and what’s not: tracking most frequent items dynamically. In: Proceedings of the 22nd ACM SIGACT-SIGMOD-SIGART symposium on principles of database systems; 2003. p. 296–306.

    Google Scholar 

  8. Gao J, Fan W, Han J, Yu PS. A general framework for mining concept-drifting data streams with skewed distributions. In: Proceedings of the SIAM international conference on data mining; 2007.

    Google Scholar 

  9. Guha S, Mishra N, Motwani R, O’Callaghan L. Clustering data streams. In: Proceedings of the 41st annual symposium on foundations of computer science; 2000.p. 359–66.

    Google Scholar 

  10. Hulten G, Spencer L, Domingos P. Mining time-changing data streams. In: Proceedings of the 7th ACM SIGKDD international conference on knowledge discovery and data mining; 2001.

    Google Scholar 

  11. Kargupta H, Bhargava B, Liu K, Powers M, Blair P, Bushra S, Dull J, Sarkar K, Klein M, Vasa M, Handy D. VEDAS: a mobile and distributed data stream mining system for real-time vehicle monitoring. In: Proceedings of the SIAM international conference on data mining; 2004.

    Google Scholar 

  12. Manku G, Motwani R. Approximate frequency counts over data streams. In: Proceedings of the 28th international conference on very large data bases; 2002. p. 346–57.

    Google Scholar 

  13. Mendes L, Ding B, Han J. Stream sequential pattern mining with precise error bounds. In: Proceedings of the 2008 IEEE international conference on data mining; 2008.

    Google Scholar 

  14. O’Callaghan L, Meyerson A, Motwani R, Mishra N, Guha S. Streaming-data algorithms for high-quality clustering. In: Proceedings of the 18th international conference on data engineering; 2002. p. 685–96.

    Google Scholar 

  15. Shasha D, Zhu Y. High performance discovery in time series: techniques and case studies: Springer; 2004.

    Google Scholar 

  16. Wang H, Fan W, Yu PS, Han J. Mining concept-drifting data streams using ensemble classifiers. In: Proceedings of the 9th ACM SIGKDD international conferenc on knowledge discovery and data mining. 2003; p. 226–35.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jiawei Han .

Editor information

Editors and Affiliations

Section Editor information

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer Science+Business Media LLC

About this entry

Cite this entry

Han, J., Ding, B. (2016). Stream Mining. In: Liu, L., Özsu, M. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4899-7993-3_369-2

Download citation

  • DOI: https://doi.org/10.1007/978-1-4899-7993-3_369-2

  • Received:

  • Accepted:

  • Published:

  • Publisher Name: Springer, New York, NY

  • Online ISBN: 978-1-4899-7993-3

  • eBook Packages: Springer Reference Computer SciencesReference Module Computer Science and Engineering

Publish with us

Policies and ethics