Wavelets on Streams
Unlike conventional database query-processing engines that require several passes over a static data image, streaming data-analysis algorithms must often rely on building concise, approximate (but highly accurate) synopses of the input stream(s) in real-time (i.e., in one pass over the streaming data). Such synopses typically require space that is significantly sublinear in the size of the data and can be used to provide approximate query answers.
The collection of the top (i.e., largest) coefficients in the wavelet transform (or, decomposition) of an input data vector is one example of such a key feature of the stream. Wavelets provide a mathematical tool for the hierarchical decomposition of functions, with a long history of successful applications in signal and image processing . Applying the wavelet transform to a (one- or multi-dimensional) data vector and retaining a select small collection of the largest wavelet coefficient gives a very effective form of lossy...
- 1.Alon N, Gibbons PB, Matias Y, Szegedy M. Tracking join and self-join sizes in limited storage. In: Proceedings of the 18th ACM SIGACT-SIGMOD-SIGART symposium on principles of database systems. 1999. p. 10–20.Google Scholar
- 2.Alon N, Matias Y, Szegedy M. The space complexity of approximating the frequency moments. In: Proceedings of the 28th annual ACM symposium on theory of computing. 1996. p. 20–9.Google Scholar
- 3.Chakrabarti K, Garofalakis M, Rastogi R, Shim K. Approximate query processing using wavelets. In: Proceedings of the 26th international conference on very large data bases. 2000. p. 111–22.Google Scholar
- 5.Cormode G, Garofalakis M, Sacharidis D. Fast approximate wavelet tracking on streams. In: Advances in database technology, proceedings of the 10th international conference on extending database technology. 2006. p. 4–22.Google Scholar
- 8.Guha S, Harb B. Wavelet synopsis for data streams: minimizing non-euclidean error. In: Proceedings of the 11th ACM SIGKDD international conference on knowledge discovery and data mining. 2005. p. 88–97.Google Scholar
- 9.Matias Y, Vitter JS, Wang M. Wavelet-based histograms for selectivity estimation. In: Proceedings of the ACM SIGMOD international conference on management of data. 1998. p. 448–59.Google Scholar
- 10.Stollnitz EJ, DeRose TD, Salesin DH. Wavelets for computer graphics - theory and applications. San Francisco: Morgan Kaufmann; 1996.Google Scholar
- 11.Vitter JS, Wang M. Approximate computation of multidimensional aggregates of sparse data using wavelets. In: Proceedings of the ACM SIGMOD international conference on management of data. 1999. p. 193–204.Google Scholar