Data Streams and Data Synopses for Massive Data Sets (Invited Talk)
With the proliferation of data intensive applications, it has become necessary to develop new techniques to handle massive data sets. Traditional algorithmic techniques and data structures are not always suitable to handle the amount of data that is required and the fact that the data often streams by and cannot be accessed again. A field of research established over the past decade is that of handling massive data sets using data synopses, and developing algorithmic techniques for data stream models. We will discuss some of the research work that has been done in the field, and provide a decades’ perspective to data synopses and data streams.
- 2.Babcock, B., Babu, S., Datar, M., Motwani, R., Widom, J.: Models and issues in data stream systems. In: Proc. Symposium on Principles of Database Systems, pp. 1–16 (2002)Google Scholar
- 3.Gibbons, P.B., Matias, Y.: Synopses data structures for massive data sets. External memory algorithms, DIMACS Series Discrete Math. & TCS, AMS 50 (1999), Also SODA 1999Google Scholar
- 4.Matias, Y.: Data streams and data synopses for massive data sets, http://www.cs.tau.ac.il/~matias/streams/
- 5.Muthukrishnan, S.: Data streams: Algorithms and applications, http://www.cs.rutgers.edu/~muthu/stream-1-1.ps