Evolving Systems

, Volume 5, Issue 4, pp 239–257

Self-adaptive and local strategies for a smooth treatment of drifts in data streams

Original Paper

DOI: 10.1007/s12530-014-9108-y

Cite this article as:
Shaker, A. & Lughofer, E. Evolving Systems (2014) 5: 239. doi:10.1007/s12530-014-9108-y

Abstract

In this paper, we are dealing with a new concept for handling drifts in data streams during the run of on-line, evolving modeling processes in a regression context. Drifts require a specific attention in evolving modeling methods, as they usually change the underlying data distribution making previously learnt model parameters and structure outdated. Our approach comes with three new stages for an appropriate drift handling: (1) drifts are not only detected, but also quantified with a new extended version of the Page-Hinkley test; (2) we integrate an adaptive forgetting factor changing over time and which steers the degree of forgetting in dependency of the current drift intensity in the data stream; (3) we introduce local forgetting factors by addressing the different local regions of the feature space with a different forgetting intensity; this is achieved by using fuzzy model architecture within stream learning whose structural components (fuzzy rules) provide a local partitioning of the feature space and furthermore ensure smooth transitions of drift handling topology between neighboring regions. Additionally, our approach foresees an early drift recognition variant, which relies on divergence measures, indicating the degree of divergence in local parts of the feature space separately already before the global model error may start to rise significantly. Thus, it can be seen as an attempt regarding drift prevention on global model level. The new approach is successfully evaluated and compared with fixed forgetting and no forgetting on high-dimensional real-world data streams, including different types of drifts.

Keywords

Data stream learning Adaptive local and global drift handling Drift and forgetting intensity Component divergence Early drift recognition Evolving granular models 

Copyright information

© Springer-Verlag Berlin Heidelberg 2014

Authors and Affiliations

  1. 1.Department of Mathematics and Computer SciencePhilipps-University MarburgMarburgGermany
  2. 2.Department of Knowledge-based Mathematical SystemsJohannes Kepler University of LinzLinzAustria

Personalised recommendations