Tracking Drift Types in Changing Data Streams
- 3 Citations
- 2.1k Downloads
Abstract
The rate of change of drift in a data stream can be of interest. It could show, for example, that a strand of bacteria is becoming more resistant to a drug, or that a machine is becoming unreliable and requires maintenance. While concept drift in data streams has been widely studied, no one has studied the rate of change in concept drift. In this paper we define three new drift types: relative abrupt drift, relative moderate drift and relative gradual drift. We propose a novel algorithm that tracks changes in drift intensity relative to previous drift points within the stream. The algorithm is based on mapping drift patterns to a Gaussian function. Our experimental results show that the algorithm is robust and achieving accuracy levels above 90%.
Keywords
Data Stream Relative Drift Types Gaussian CurvePreview
Unable to display preview. Download preview PDF.
References
- 1.Baena-García, M., del Campo-Ávila, J., Fidalgo, R., Bifet, A., Gavaldá, R., Morales-Bueno, R.: Early drift detection method. In: Fourth International Workshop on Knowledge Discovery from Data Streams (2006)Google Scholar
- 2.Bartlett, P., Ben-David, S., Kulkarni, S.: Learning changing concepts by exploiting the structure of change. Machine Learning 41(2), 153–174 (2000)CrossRefzbMATHGoogle Scholar
- 3.Bifet, A., Gavaldá, R.: Learning from time-changing data with adaptive windowing. In: SIAM International Conference on Data Mining (2007)Google Scholar
- 4.Gama, J., Medas, P., Castillo, G., Rodrigues, P.: Learning with drift detection. In: Bazzan, A.L.C., Labidi, S. (eds.) SBIA 2004. LNCS (LNAI), vol. 3171, pp. 286–295. Springer, Heidelberg (2004)CrossRefGoogle Scholar
- 5.Hoeffding, W.: Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association 58, 13–29 (1963)CrossRefzbMATHMathSciNetGoogle Scholar
- 6.Kifer, D., Ben-David, S., Gehrke, J.: Detecting change in data streams. In: Proceedings of the Thirtieth International Conference on VLDB, vol. 30, pp. 180–191. VLDB Endowment (2004)Google Scholar
- 7.Kosina, P., Gama, J., Sebastião, R.: Drift severity metric. In: Proceedings of the 2010 Conference on ECAI 2010: 19th European Conference on Artificial Intelligence, pp. 1119–1120. IOS Press, Amsterdam (2010)Google Scholar
- 8.Sebastião, R., Gama, J.: A study on change detection methods. In: 4th Portuguese Conf. on Artificial Intelligence, Lisbon (2009)Google Scholar
- 9.Vitter, J.S.: Random sampling with a reservoir. ACM Trans. Math. Softw. 11(1), 37–57 (1985)CrossRefzbMATHMathSciNetGoogle Scholar
- 10.Zhang, P., Zhu, X., Shi, Y.: Categorizing and mining concept drifting data streams. In: Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2008, pp. 812–820. ACM, New York (2008)Google Scholar