Abstract
Compared to batch learning from static data, constructing classifiers from data streams implies new requirements for algorithms, such as constraints on memory usage, restricted processing time, and one scan of incoming examples. Additionally, streams classifiers have to adapt to concept drifts. The entry discusses the following stream classification issues: data stream specific requirements, processing schemes, categorization of concept drifts, classifier evaluation criteria and procedures, forgetting mechanisms, change detection methods, main algorithms for supervised learning of single classifiers and ensembles, open problems, areas of application.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Recommended Reading
Aggarwal CC (ed) (2007) Data streams – models and algorithms. Volume 31 of Advances in database systems. Springer, New York
Bifet A, Gavaldà R (2007) Learning from time-changing data with adaptive windowing. In: Proceedings of the 7th SIAM international conference on data mining, Minneapolis, pp 443–448
Bifet A, Holmes G, Kirkby R, Pfahringer B (2010) MOA: massive online analysis. J Mach Learn Res 11:1601–1604
Brzezinski D, Stefanowski J (2014) Reacting to different types of concept drift: the accuracy updated ensemble algorithm. IEEE Trans Neural Netw Learn Syst 25:81–94
Deckert M (2013) Incremental rule-based learners for handling concept drift: an overview. Found Comput Decis Sci 38(1):35–65
Ditzler G, Roveri M, Alippi C, Polikar R (2015) Learning in nonstationary environments: a survey. IEEE Comput Intell Mag 10(4):12–25
Domingos P, Hulten G (2000) Mining high-speed data streams. In: Proceedings of the 6th ACM SIGKDD international conference on knowledge discovery and data mining, Boston, pp 71–80
Gama J (2010) Knowledge discovery from data streams. Chapman and Hall/CRC, Boca Raton
Gama J, Žliobaitė I, Bifet A, Pechenizkiy M, Bouchachia A (2014) A survey on concept drift adaptation. ACM Comput Surv 46(4):44:1–44:37
Gomes JB, Gaber MM, Sousa PAC, Ruiz EM (2014) Mining recurring concepts in a dynamic feature space. IEEE Trans Neural Netw Learn Syst 25(1):95–110
Kolter JZ, Maloof MA (2007) Dynamic weighted majority: an ensemble method for drifting concepts. J Mach Learn Res 8:2755–2790
Kosina P, Gama J (2015) Very fast decision rules for classification in data streams. Data Min Knowl Discov 29(1):168–202
Krempl G, Žliobaitė I, Brzezinski D, Hüllermeier E, Last M, Lemaire V, Noack T, Shaker A, Sievi S, Spiliopoulou M, Stefanowski J (2014) Open challenges for data stream mining research. SIGKDD Explor 16(1):1–10
Kuncheva LI (2004) Classifier ensembles for changing environments. In: Proceedings of 5th international workshop on multiple classifier systems, MCS 04, Cagliari. Volume 3077 of Springer LNCS, pp 1–15
Littlestone N, Warmuth MK (1994) The weighted majority algorithm. Inf Comput 108(2):212–261
Masud M, Gao J, Khan L, Thuraisingham B (2008) A practical approach to classify evolving data streams: training with limited amount of labeled data. In: Proceedings of the 8th IEEE international conference on data mining, Pisa, pp 929–934
Oza NC, Russell SJ (2001) Experimental comparisons of online and batch versions of bagging and boosting. In: Proceedings of the 7th ACM SIGKDD international conference on knowledge discovery and data mining, San Francisco, pp 359–364
Spiliopoulou M, Krempl G (2013) Tutorial mining multiple threads of streaming data. In: Proceedings of the Pacific-Asia conference on knowledge discovery and data mining, PAKDD 2013, Gold Coast
Wang H, Fan W, Yu PS, Han J (2003) Mining concept-drifting data streams using ensemble classifiers. In: Proceedings of the 9th ACM SIGKDD international conference on knowledge discovery and data mining, Washington, DC, pp 226–235
Wang S, Minku L, Yao X (2015) Resampling-based ensemble methods for online class imbalance learning. IEEE Trans Knowl Data Eng 27(5):1356–1368
Žliobaitė I, Bifet A, Pfahringer B, Holmes G (2011) Active learning with evolving streaming data. In: Proceedings of the 2011 European conference on machine learning and knowledge discovery in databases, Athens. Volume 6913 of Springer LNCS. pp 597–612
Zliobaite I, Pechenizkiy M, Gama J (2015) An overview of concept drift applications. In: Japkowicz N, Stefanowski J (eds) Big data analysis: new algorithms for a new society. Springer, Cham, pp 91–114
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Science+Business Media New York
About this entry
Cite this entry
Stefanowski, J., Brzezinski, D. (2017). Stream Classification. In: Sammut, C., Webb, G.I. (eds) Encyclopedia of Machine Learning and Data Mining. Springer, Boston, MA. https://doi.org/10.1007/978-1-4899-7687-1_908
Download citation
DOI: https://doi.org/10.1007/978-1-4899-7687-1_908
Published:
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4899-7685-7
Online ISBN: 978-1-4899-7687-1
eBook Packages: Computer ScienceReference Module Computer Science and Engineering