Abstract
The digital universe is growing rapidly. The volume of data generated per annum is in the order of zeta bytes due to the proliferation of the Internet. Many real-world applications generate data that are continuous. This type of data is known as data streams. Examples of applications generating this kind of data are business transactions, Web logs, sensors networks, etc. The data stream is analyzed, and the underlying concepts are extracted to make predictions and decisions in real time. But as data streams evolve over time, they undergo concept drift. Concept drift means the statistical properties of the data stream change over time in unforeseen ways. This causes problems because the predictions based on the data streams become less accurate as time passes. To understand the behavior of data streams, it is important to investigate the changes of the data distributions and the causes of the changes. Therefore, periodic retraining, also known as refreshing, of any model is necessary. The survey covers the various techniques available in the literature to handle concept drift in data streams.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Tsymbal, A.: The problem of concept drift: definitions and related work. Technical report TCD-CS-2004-15, Trinity College Dublin, Ireland, pp. 1â7 (2004)
Masud, M., Chen, Q., Khan, L., Aggarwal, C., Gao, J., Han, J., Thuraisingham, B.: Addressing concept-evolution in concept-drifting data streams. IEEE International Conference on Data Mining, pp. 929â934 (2010)
Alippi, C., Boracchi, G.,Roveri, M.: An effective just-in-time adaptive classifier for gradual concept drifts. In: International Joint Conference on Neural Networks, pp. 1675â1682 (2011)
Brzezinski, D., Stefanowski, J.: Accuracy updated ensemble for data streams with concept drift. In: 6th International Conference on Hybrid Artificial Intelligent Systems. Lecture Notes in Computer Science, vol. 6679, pp. 155â162. Springer (2011)
Shetty, S., Mukkavilli, S.K., Keel, L.H.: An integrated machine learning and control theoretic model for mining concept-drifting data streams. In: IEEE International Conference on Technologies for Homeland Security (HST), pp. 75â80 (2011)
Breve, F., Zhao, L.: Particle competition and cooperation in networks for semi-supervised learning with concept drift. In: The 2012 International Joint Conference on Neural Networks (IJCNN), pp. 1â6 (2012)
Bifet, A., Read, J., Pfahringer, B., Holmes, G.: Efficient data stream classification via probabilistic adaptive windows. In: Proceedings of the 28th Annual ACM Symposium on Applied Computing, pp. 801â806 (2013)
Turkov, P., Krasotkina, O., Mottl, V.: The bayesian logistic regression in pattern recognition problems under concept drift. In: International Conference on Pattern Recognition, pp. 2976â2979 (2012)
Minku, L.L., Yao, X.: DDD: New ensemble approach for dealing with concept drift. IEEE Trans. Knowl. Data Eng. 24(4), 619â633 (2012)
Gama, J., Medas, P., Castillo, G., Pedro Rodrigues, P.: Learning with drift detection. In: Advances in Artificial Intelligence, pp. 286â295. Springer, Berlin Heidelberg (2004)
Ross, G.J., Adams, N.M., Tasoulis, D.K., Hand, D.J.: Exponentially weighted moving average charts for detecting concept drift. Pattern Recognit. Lett. 33(2), 191â198 (Elsiever) (2012)
Brzezinski, D., Stefanowski, J.: Reacting to different types of concept drift: the accuracy updated ensemble algorithm. IEEE Trans. Neural Netw. Learn. Syst. 25(1), 81â94 (IEEE) (2013)
Susnjak, T., Barczak, A.L.C., Hawick, K.A.: Adaptive ensemble based learning in non-stationary environments. In: International Conference on Neural Information Processing. LNCS, vol. 6443, pp. 438â445 (2010)
Deckert, M.: Batch weighted ensemble for mining data streams with concept drift. In: International Symposium on Methodologies for Intelligent Systems. LNAI, vol. 6804, pp. 290â299 (2011)
Yeh, Y., Wang, Y.F.: A rank-one update method for least squares linear discriminant analysis with concept drift. Pattern Recogn. 46(5), 1267â1276 (2013)
Zhu, Q., Hu, X., Zhang, Y., Li, P., Wu, X.: A double-window-based classification algorithm for concept drifting data streams. In: IEEE International Conference on Granular Computing, pp. 639â644 (2010)
Alippi, C., Boracchi, G., Roveri, M.: Just-In-Time classifiers for recurrent concepts. IEEE Trans. Neural Netw. 24(4), 620â634 (IEEE) (2013)
Li, P., Wu, X., Hu, X.: Mining recurring concept drifts with limited labeled streaming data. ACM Trans. Intell. Syst. Technol. 3(2), 29:1â29:32 (ACM) (2012)
Gomes, J.B., Menasalvas, E., Sousa, P.A.: CALDS: Context-aware learning from data streams. In: Proceedings of the First International Workshop on Novel Data Stream Pattern Mining Techniques (StreamKDDâ10), pp. 16â24 (2010)
Gomes, J.B., Menasalvas, E., Sousa, P.A.: Learning recurring concepts from data streams with a context-aware ensemble. In: Proceedings of the 2011 ACM Symposium on Applied Computing (SACâ11), pp. 994â999 (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
Âİ 2015 Springer India
About this paper
Cite this paper
Akila, V., Zayaraz, G. (2015). A Brief Survey on Concept Drift. In: Jain, L., Patnaik, S., Ichalkaranje, N. (eds) Intelligent Computing, Communication and Devices. Advances in Intelligent Systems and Computing, vol 308. Springer, New Delhi. https://doi.org/10.1007/978-81-322-2012-1_31
Download citation
DOI: https://doi.org/10.1007/978-81-322-2012-1_31
Published:
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-2011-4
Online ISBN: 978-81-322-2012-1
eBook Packages: EngineeringEngineering (R0)