Progress in Artificial Intelligence

, Volume 3, Issue 1, pp 15–28

Constructing fading histograms from data streams

Regular Paper

DOI: 10.1007/s13748-014-0050-9

Cite this article as:
Sebastião, R., Gama, J. & Mendonça, T. Prog Artif Intell (2014) 3: 15. doi:10.1007/s13748-014-0050-9

Abstract

The ability to collect data is changing drastically. Nowadays, data are gathered in the form of transient and finite data streams. Memory restrictions preclude keeping all received data in memory. When dealing with massive data streams, it is mandatory to create compact representations of data, also known as synopses structures or summaries. Reducing memory occupancy is of utmost importance when handling a huge amount of data. This paper addresses the problem of constructing histograms from data streams under error constraints. When constructing online histograms from data streams there are two main characteristics to embrace: the updating facility and the error of the histogram. Moreover, in dynamic environments, besides the need of compact summaries to capture the most important properties of data, it is also essential to forget old data. Therefore, this paper presents sliding histograms and fading histograms, an abrupt and a smooth strategies to forget outdated data.

Keywords

Data streams Online histograms  Error constraints Fading histograms 

Copyright information

© Springer-Verlag Berlin Heidelberg 2014

Authors and Affiliations

  • Raquel Sebastião
    • 1
    • 2
  • João Gama
    • 1
    • 3
  • Teresa Mendonça
    • 2
    • 4
  1. 1.LIAAD, INESC TECPortoPortugal
  2. 2.Dep. MatemáticaFac. Ciências da Universidade do Porto (FCUP)PortoPortugal
  3. 3.Fac. Economia da Universidade do Porto (FEP)PortoPortugal
  4. 4.Dep. de MatemáticaCenter for Research and Developments in Mathematics and Applications (CIDMA)AveiroPortugal