Skip to main content

Random Forest Based Approach for Concept Drift Handling

  • Conference paper
  • First Online:
Analysis of Images, Social Networks and Texts (AIST 2016)

Abstract

Concept drift has potential in smart grid analysis because the socio-economic behaviour of consumers is not governed by the laws of physics. Likewise there are also applications in wind power forecasting. In this paper we present decision tree ensemble classification method based on the Random Forest algorithm for concept drift. The weighted majority voting ensemble aggregation rule is employed based on the ideas of Accuracy Weighted Ensemble (AWE) method. Base learner weight in our case is computed for each sample evaluation using base learners accuracy and intrinsic proximity measure of Random Forest. Our algorithm exploits ensemble pruning as a forgetting strategy. We present results of empirical comparison of our method and other state-of-the-art concept-drfit classifiers.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Zhukov, A., Kurbatsky, V., Tomin, N., Sidorov, D., Panasetsky, D., Foley, A.: Ensemble methods of classification for power systems security assessment. arXiv, Artificial Intelligence (cs.AI), pp. 1–6. arXiv:1601.01675 (2016)

  2. Tomin, N., Zhukov, A., Sidorov, D., Kurbatsky, V., Panasetsky, D., Spiryaev, V.: Random forest based model for preventing large-scale emergencies in power systems. Int. J. Artif. Intell. 13, 211–228 (2015)

    Google Scholar 

  3. Breiman, L.: Bagging predictors. Mach. Learn. 24(2), 123–140 (1996)

    MATH  Google Scholar 

  4. Ho, T.K.: The random subspace method for constructing decision forests. IEEE Trans. Pattern Anal. Mach. Intell. 20(8), 832–844 (1998)

    Article  Google Scholar 

  5. Breiman, L.: Random forests. Mach. Learn. 45(1), 5–32 (2001)

    Article  MATH  Google Scholar 

  6. Saffari, A., Leistner, C., Santner, J., Godec, M., Bischof, H.: On-line random forests. In: 2009 IEEE 12th International Conference on Computer Vision Workshops (ICCV Workshops), 1393–1400. IEEE (2009)

    Google Scholar 

  7. Sidorov, D.: Modelling of non-linear dynamic systems by Volterra series. In: Attractors, Signals, and Synergetics Workshopp, vol. 2000, pp. 276–282. Pabst Science Publication, USA-Germany (2002)

    Google Scholar 

  8. Sidorov, D.: Integral Dynamical Models: Singularities, Signals and Control. World Scientific Publishing, Singapore (2015)

    MATH  Google Scholar 

  9. Wang, H., Fan, W., Yu, P.S., Han, J.: Mining concept-drifting data streams using ensemble classifiers. In: Proceedings of SIGKDD, 24–27 August 2003, Washington, DC, USA, pp. 226–235 (2003)

    Google Scholar 

  10. Gama, J.: Knowledge discovery from data streams. CRC Press Publishing, Singapore (2010)

    Book  MATH  Google Scholar 

  11. Kuncheva, L.: Classier ensembles for changing environment. In: Roli, F., Kittler, J., Windeatt, T. (eds.) 2004 5th International Workshop on Multiple Classier Systems, pp. 1–15. Springer, Heidelberg (2004)

    Google Scholar 

  12. Turkov, P., Krasotkina, O., Mottl, V.: Dynamic programming for Bayesian logistic regression learning under concept drift. In: Maji, P., Ghosh, A., Murty, M.N., Ghosh, K., Pal, S.K. (eds.) PReMI 2013. LNCS, vol. 8251, pp. 190–195. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  13. Wang, H., Fan, W., Yu, P.S., Han, J.: Mining concept-drifting data streams using ensemble classifiers. In: Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 226–235. ACM (2003)

    Google Scholar 

  14. Zhukov, A., Kurbatsky, V., Tomin, N., Sidorov, D., Panasetsky, D., Spiryaev, V.: Random forest based model for emergency state monitoring in power systems. In: Mathematical Method for Pattern Recognition: Book of abstract of the 17th All-Russian Conference with Interneational Participation, p. 274. TORUS PRESS, Svetlogorsk (2015)

    Google Scholar 

  15. Geurts, P., Ernst, D., Wehenkel, L.: Extremely randomized trees. Mach. Learn. 63(1), 3–42 (2006)

    Article  MATH  Google Scholar 

  16. Blake, C.L., Merz, C.J.: UCI repository of machine learning databases (1998)

    Google Scholar 

  17. Brzezinski, D.: Mining data streams with concept drift. Dissertion MS thesis. Department of Computing Science and Management, Poznan University of Technology (2010)

    Google Scholar 

  18. Brzezinski, D., Stefanowski, J.: Reacting to different types of concept drift: the accuracy updated ensemble algorithm. IEEE Trans. Neural Netw. Learn. Syst. 25(1), 81–94 (2014)

    Article  Google Scholar 

Download references

Acknowledgment

This work is funded by the RSF grant No. 14-19-00054 and by the International science and technology cooperation program of China, project 2015DFR70850, NSFC Grant No. 61673398.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Aleksei V. Zhukov , Denis N. Sidorov or Aoife M. Foley .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Zhukov, A.V., Sidorov, D.N., Foley, A.M. (2017). Random Forest Based Approach for Concept Drift Handling. In: Ignatov, D., et al. Analysis of Images, Social Networks and Texts. AIST 2016. Communications in Computer and Information Science, vol 661. Springer, Cham. https://doi.org/10.1007/978-3-319-52920-2_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-52920-2_7

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-52919-6

  • Online ISBN: 978-3-319-52920-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics