Advertisement

Regression

  • Leszek RutkowskiEmail author
  • Maciej Jaworski
  • Piotr Duda
Chapter
Part of the Studies in Big Data book series (SBD, volume 56)

Abstract

Although ensembles of classifiers are one of the most popular tools to deal with data streams classification task [1, 2, 3, 4, 5], in the literature there is a lack of new approaches to creating ensembles of regression estimators [6, 7]. Most of the latest developments focus on the application of the regression estimators to solve very important real-world problems. In [8] the authors propose to create an ensemble composed of decision trees, gradient boosted trees and random forest to forecast electricity consumptions. The algorithm uses different weights for each component based on its previous performance. As it was shown in [9], the regression can be applied to enhanced prediction of occurrence of the concept-drift. The authors propose an ensemble method which utilizes constrained penalized regression as a combiner to track a drifting concept in a classification setting. The data stream approach to system fault prediction has been examined in [10]. In this paper different data-stream-based linear regression prediction methods have been tested and compared with a newly developed fault detection system. The applied and evaluated data stream mining algorithms were: grid-based classifier, polygon-based method, and one-class support vector machines. The results showed that the linear regression method generally achieved good performance in predicting short-term data.

References

  1. 1.
    Pietruczuk, L., Rutkowski, L., Jaworski, M., Duda, P.: A method for automatic adjustment of ensemble size in stream data mining. In: 2016 International Joint Conference on Neural Networks (IJCNN), pp. 9–15 (2016)Google Scholar
  2. 2.
    Pietruczuk, L., Rutkowski, L., Jaworski, M., Duda, P.: How to adjust an ensemble size in stream data mining? Inf. Sci. 381, 46–54 (2017)MathSciNetCrossRefGoogle Scholar
  3. 3.
    Street, W.N., Kim, Y.: A streaming ensemble algorithm (sea) for large-scale classification. In: Proceedings of the Seventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 377–382. ACM (2001)Google Scholar
  4. 4.
    Minku, L.L., Yao, X.: DDD: a new ensemble approach for dealing with concept drift. IEEE Trans. Knowl. Data Eng. 24(4), 619–633 (2012)CrossRefGoogle Scholar
  5. 5.
    Sun, J., Fujita, H., Chen, P., Li, H.: Dynamic financial distress prediction with concept drift based on time weighting combined with adaboost support vector machine ensemble. Knowl. Based Syst. 120, 4–14 (2017)CrossRefGoogle Scholar
  6. 6.
    Ikonomovska, E., Gama, J., Džeroski, S.: Online tree-based ensembles and option trees for regression on evolving data streams. Neurocomputing 150, 458–470 (2015)CrossRefGoogle Scholar
  7. 7.
    Duda, P., Jaworski, M., Rutkowski, L.: Online GRNN-based ensembles for regression on evolving data streams. In: International Symposium on Neural Networks, pp. 221–228. Springer, Berlin (2018)Google Scholar
  8. 8.
    Galicia, A., Talavera-Llames, R., Troncoso, A., Koprinska, I., Martínez-Álvarez, F.: Multi-step forecasting for big data time series based on ensemble learning. Knowl. Based Syst. (2018)Google Scholar
  9. 9.
    Wang, L.-Y., Park, C., Choi, H., Yeon, K.: A classifier ensemble for concept drift using a constrained penalized regression combiner. Procedia Comput. Sci. 91, 252–259 (2016)CrossRefGoogle Scholar
  10. 10.
    Alzghoul, A., Löfstrand, M., Backe, B.: Data stream forecasting for system fault prediction. Comput. Ind. Eng. 62(4), 972–978 (2012)CrossRefGoogle Scholar

Copyright information

© Springer Nature Switzerland AG 2020

Authors and Affiliations

  • Leszek Rutkowski
    • 1
    • 2
    Email author
  • Maciej Jaworski
    • 1
  • Piotr Duda
    • 1
  1. 1.Institute of Computational IntelligenceCzestochowa University of TechnologyCzęstochowaPoland
  2. 2.Information Technology InstituteUniversity of Social SciencesLodzPoland

Personalised recommendations