Abstract
Concept drift has potential in smart grid analysis because the socio-economic behaviour of consumers is not governed by the laws of physics. Likewise there are also applications in wind power forecasting. In this paper we present decision tree ensemble classification method based on the Random Forest algorithm for concept drift. The weighted majority voting ensemble aggregation rule is employed based on the ideas of Accuracy Weighted Ensemble (AWE) method. Base learner weight in our case is computed for each sample evaluation using base learners accuracy and intrinsic proximity measure of Random Forest. Our algorithm exploits ensemble pruning as a forgetting strategy. We present results of empirical comparison of our method and other state-of-the-art concept-drfit classifiers.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Zhukov, A., Kurbatsky, V., Tomin, N., Sidorov, D., Panasetsky, D., Foley, A.: Ensemble methods of classification for power systems security assessment. arXiv, Artificial Intelligence (cs.AI), pp. 1–6. arXiv:1601.01675 (2016)
Tomin, N., Zhukov, A., Sidorov, D., Kurbatsky, V., Panasetsky, D., Spiryaev, V.: Random forest based model for preventing large-scale emergencies in power systems. Int. J. Artif. Intell. 13, 211–228 (2015)
Breiman, L.: Bagging predictors. Mach. Learn. 24(2), 123–140 (1996)
Ho, T.K.: The random subspace method for constructing decision forests. IEEE Trans. Pattern Anal. Mach. Intell. 20(8), 832–844 (1998)
Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)
Saffari, A., Leistner, C., Santner, J., Godec, M., Bischof, H.: On-line random forests. In: 2009 IEEE 12th International Conference on Computer Vision Workshops (ICCV Workshops), 1393–1400. IEEE (2009)
Sidorov, D.: Modelling of non-linear dynamic systems by Volterra series. In: Attractors, Signals, and Synergetics Workshopp, vol. 2000, pp. 276–282. Pabst Science Publication, USA-Germany (2002)
Sidorov, D.: Integral Dynamical Models: Singularities, Signals and Control. World Scientific Publishing, Singapore (2015)
Wang, H., Fan, W., Yu, P.S., Han, J.: Mining concept-drifting data streams using ensemble classifiers. In: Proceedings of SIGKDD, 24–27 August 2003, Washington, DC, USA, pp. 226–235 (2003)
Gama, J.: Knowledge discovery from data streams. CRC Press Publishing, Singapore (2010)
Kuncheva, L.: Classier ensembles for changing environment. In: Roli, F., Kittler, J., Windeatt, T. (eds.) 2004 5th International Workshop on Multiple Classier Systems, pp. 1–15. Springer, Heidelberg (2004)
Turkov, P., Krasotkina, O., Mottl, V.: Dynamic programming for Bayesian logistic regression learning under concept drift. In: Maji, P., Ghosh, A., Murty, M.N., Ghosh, K., Pal, S.K. (eds.) PReMI 2013. LNCS, vol. 8251, pp. 190–195. Springer, Heidelberg (2013)
Wang, H., Fan, W., Yu, P.S., Han, J.: Mining concept-drifting data streams using ensemble classifiers. In: Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 226–235. ACM (2003)
Zhukov, A., Kurbatsky, V., Tomin, N., Sidorov, D., Panasetsky, D., Spiryaev, V.: Random forest based model for emergency state monitoring in power systems. In: Mathematical Method for Pattern Recognition: Book of abstract of the 17th All-Russian Conference with Interneational Participation, p. 274. TORUS PRESS, Svetlogorsk (2015)
Geurts, P., Ernst, D., Wehenkel, L.: Extremely randomized trees. Mach. Learn. 63(1), 3–42 (2006)
Blake, C.L., Merz, C.J.: UCI repository of machine learning databases (1998)
Brzezinski, D.: Mining data streams with concept drift. Dissertion MS thesis. Department of Computing Science and Management, Poznan University of Technology (2010)
Brzezinski, D., Stefanowski, J.: Reacting to different types of concept drift: the accuracy updated ensemble algorithm. IEEE Trans. Neural Netw. Learn. Syst. 25(1), 81–94 (2014)
Acknowledgment
This work is funded by the RSF grant No. 14-19-00054 and by the International science and technology cooperation program of China, project 2015DFR70850, NSFC Grant No. 61673398.
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Zhukov, A.V., Sidorov, D.N., Foley, A.M. (2017). Random Forest Based Approach for Concept Drift Handling. In: Ignatov, D., et al. Analysis of Images, Social Networks and Texts. AIST 2016. Communications in Computer and Information Science, vol 661. Springer, Cham. https://doi.org/10.1007/978-3-319-52920-2_7
Download citation
DOI: https://doi.org/10.1007/978-3-319-52920-2_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-52919-6
Online ISBN: 978-3-319-52920-2
eBook Packages: Computer ScienceComputer Science (R0)