Advertisement

A First Attempt to Construct Effective Concept Drift Detector Ensembles

  • Michał WoźniakEmail author
  • Paweł Ksieniewicz
  • Andrzej Kasprzak
  • Karol Puchała
  • Przemysław Ryba
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 525)

Abstract

The big data is usually described by so-called 5Vs (Volume, Velocity, Variety, Veracity, Value). The business success in the big data era strongly depends on the smart analytical software which can help to make efficient decisions (Value for enterprise). Therefore, the decision support software should take into consideration especially that we deal with massive data (Volume) and that data usually comes continuously in the form of so-called data stream (Velocity). Unfortunately, most of the traditional data analysis methods are not ready to efficiently analyze fast growing amount of the stored records. Additionally, one should also consider phenomenon appearing in data stream called concept drift, which means that the parameters of an using model are changing, what could dramatically decrease the analytical model quality. This work is focusing on the classification task, which is very popular in many practical cases as fraud detection, network security, or medical diagnosis. We propose how to detect the changes in the data stream using combined concept drift detection model. The experimental evaluations show that it is an interesting direction, what encourage us to use it in practical applications.

Keywords

Data stream Concept drift Pattern classification Drift detector 

Notes

Acknowledgements

This work was supported by the statutory funds of the Department of Systems and Computer Networks, Faculty of Electronics, Wroclaw University of Science and Technology and by the Polish National Science Centre under the grant no. DEC-2013/09/B/ST6/02264. All computer experiments were carried out using computer equipment sponsored by EC under FP7, Coordination and Support Action, Grant Agreement Number 316097, ENGINE European Research Centre of Network Intelligence for Innovation Enhancement (http://engine.pwr.edu.pl/).

References

  1. 1.
    Bifet, A., Holmes, G., Kirkby, R., Pfahringer, B.: Moa: massive online analysis. J. Mach. Learn. Res. 11, 1601–1604 (2010). http://dl.acm.org/citation.cfm?id=1756006.1859903 Google Scholar
  2. 2.
    Bifet, A., Read, J., Pfahringer, B., Holmes, G., Žliobaitė, I.: CD-MOA: change detection framework for massive online analysis. In: Tucker, A., Höppner, F., Siebes, A., Swift, S. (eds.) IDA 2013. LNCS, vol. 8207, pp. 92–103. Springer, Heidelberg (2013). doi: 10.1007/978-3-642-41398-8_9 CrossRefGoogle Scholar
  3. 3.
    Blanco, I.I.F., del Campo-Avila, J., Ramos-Jimenez, G., Bueno, R.M., Diaz, A.A.O., Mota, Y.C.: Online and non-parametric drift detection methods based on hoeffding’s bounds. IEEE Trans. Knowl. Data Eng. 27(3), 810–823 (2015). http://dx.doi.org/10.1109/TKDE.2014.2345382 CrossRefGoogle Scholar
  4. 4.
    Gama, J., Zliobaite, I., Bifet, A., Pechenizkiy, M., Bouchachia, A.: A survey on concept drift adaptation. ACM Comput. Surv. (CSUR), 46(4) (2014). Surveys Homepage archive. Article No. 44Google Scholar
  5. 5.
    Greiner, R., Grove, A.J., Roth, D.: Learning cost-sensitive active classifiers. Artif. Intell. 139(2), 137–174 (2002)MathSciNetCrossRefGoogle Scholar
  6. 6.
    Gustafsson, F.: Adaptive Filtering and Change Detection. Wiley, October 2000. http://www.wiley.com/WileyCDA/WileyTitle/productCd-0471492876,descCd-description.html
  7. 7.
    Jackowski, K., Krawczyk, B., Wozniak, M.: Adass+ the hybrid training method of a classifier based on a feature space partitioning. Int. J. Neural Syst. 24(3), 1430007 (2014)CrossRefGoogle Scholar
  8. 8.
    Kuncheva, L.I.: Combining Pattern Classifiers: Methods and Algorithms. Wiley-Interscience, Chichester (2004)CrossRefzbMATHGoogle Scholar
  9. 9.
    Sobolewski, P., Wozniak, M.: Concept drift detection and model selection with simulated recurrence and ensembles of statistical detectors. J. Univ. Comput. Sci. 19(4), 462–483 (2013)Google Scholar
  10. 10.
    Widmer, G., Kubat, M.: Effective learning in dynamic environments by explicit context tracking. In: Brazdil, P.B. (ed.) ECML 1993. LNCS, vol. 667, pp. 227–243. Springer, Heidelberg (1993). doi: 10.1007/3-540-56602-3_139 CrossRefGoogle Scholar
  11. 11.
    Widmer, G., Kubat, M.: Learning in the presence of concept drift and hidden contexts. Mach. Learn. 23(1), 69–101 (1996)Google Scholar
  12. 12.
    Wozniak, M., Grana, M., Corchado, E.: A survey of multiple classifier systems as hybrid systems. Inf. Fusion 16, 3–17 (2014). Special Issue on Information Fusion in Hybrid Intelligent Fusion Systems. http://www.sciencedirect.com/science/article/pii/S156625351300047X CrossRefGoogle Scholar

Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  • Michał Woźniak
    • 1
    Email author
  • Paweł Ksieniewicz
    • 1
  • Andrzej Kasprzak
    • 1
  • Karol Puchała
    • 1
  • Przemysław Ryba
    • 1
  1. 1.Faculty of Electronics, Department of Systems and Computer NetworksWroclaw University of Science and TechnologyWroclawPoland

Personalised recommendations