Skip to main content

Analytical Synopses for Approximate Query Answering in OLAP Environments

  • Conference paper
Database and Expert Systems Applications (DEXA 2004)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3180))

Included in the following conference series:

Abstract

In this paper we present a technique based on an analytical interpretation of multi-dimensional data and on the well-known Least Squares Approximation (LSA) method for supporting approximate aggregate query answering in OLAP environments, the most common application interfaces for a Data Warehouse Server (DWS). Our technique consists in building data synopses by interpreting the original data distribution as a set of discrete functions. These synopses, called Δ-Syn, are obtained by approximating data with a set of polynomial coefficients, and storing these coefficients instead of the original data. Queries are issued on the compressed representation, thus reducing the number of disk accesses needed to evaluate the answer. We also provide some experimental results on several kinds of synthetic OLAP data cubes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bruno, N., Chaudhuri, S., Gravano, L.: STHoles: A Multidimensional Workload- Aware Histogram. Microsoft Technical Report MSR-TR-2001-36 (2001)

    Google Scholar 

  2. Gibbons, P.B., Matias, Y.: New Sampling-Based Summary Statistics for Improving Approximate Query Answers. In: Proc. of the 1998 ACM SIGMOD, Seattle, WA, USA, pp. 331–342 (1998)

    Google Scholar 

  3. Gibbons, P.B., Matias, Y., Poosala, V.: Fast Incremental Maintenance of Approximate Histograms. In: Proc. of the 23rd VLDB, Athens, Greece, pp. 466–475 (1997)

    Google Scholar 

  4. Hellerstein, J.M., Haas, P.J., Wang, H.J.: Online Aggregation. In: Proc. of the 1997 ACM SIGMOD, Tucson, AZ, USA, pp. 171–182 (1997)

    Google Scholar 

  5. Ho, C.-T., Agrawal, R., Megiddo, N., Srikant, R.: Range Queries in OLAP Data Cubes. In: Proc. of the 1997 ACM SIGMOD, Tucson, AZ, USA, pp. 73–88 (1997)

    Google Scholar 

  6. Ioannidis, Y.E., Poosala, V.: Histogram-based Approximation of Set-Valued Query Answers. In: Proc. of the 25th VLDB, Edinburgh, Scotland, pp. 174–185 (1999)

    Google Scholar 

  7. Jagadish, H.V., Koudas, N., Muthukrishnan, S., Poosala, V., Sevcik, K., Suel, T.: Optimal Histograms with Quality Guarantees. In: Proc. of the 24th VLDB, New York City, NY, USA, pp. 275–286 (1998)

    Google Scholar 

  8. Kenney, J.F., Keeping, E.S.: Skewness. In: Mathematics of Statistics, Pt. 1, 3rd edn., pp. 100–101. Van Nostrand, Princeton (1962)

    Google Scholar 

  9. Matias, Y., Vitter, J.S., Wang, M.: Wavelet-Based Histograms for Selectivity Estimation. In: Proc. of the 1998 ACM SIGMOD, Seattle, WA, USA, pp. 448–459 (1998)

    Google Scholar 

  10. Papoulis, A.: Probability, Random Variables, and Stochastic Processes, 2nd edn. McGraw-Hill, New York City (1984)

    MATH  Google Scholar 

  11. Poosala, V., Ioannidis, Y.E., Haas, P.J., Shekita, E.: Improved Histograms for Selectivity Estimation of Range Predicates. In: Proc. of the 1996 ACM SIGMOD, Montreal, Canada, pp. 294–305 (1996)

    Google Scholar 

  12. Poosala, V., Ioannidis, Y.E.: Selectivity Estimation without the Attribute Value Independence Assumption. In: Proc. of the 23rd VLDB, Athens, Greece, pp. 486–495 (1997)

    Google Scholar 

  13. Poosala, V., Ganti, V.: Fast Approximate Answers to Aggregate Queries on a Data Cube. In: Proc. of the 11th SSDBM, Cleveland, OH, USA, pp. 24–33 (1999)

    Google Scholar 

  14. Powell, M.J.D.: Approximation Theory and Methods. Cambridge University Press, Cambridge (1982)

    Google Scholar 

  15. Stuart, A., Ord, J.K.: Kendall’s Advanced Theory of Statistics, 6th edn. Distribution Theory, vol. 1. Oxford University Press, Oxford (1998)

    Google Scholar 

  16. Vitter, J.S., Wang, M., Iyer, B.: Data Cube Approximation and Histograms via Wavelets. In: Proc. of the 7th ACM CIKM, Bethesda, ML, USA, pp. 96–104 (1998)

    Google Scholar 

  17. Vitter, J.S., Wang, M.: Approximate Computation of Multidimensional Aggregates of Sparse Data Using Wavelets. In: Proc. of the 1999 ACM SIGMOD, Philadelphia, PA, USA, pp. 194–204 (1999)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Cuzzocrea, A., Matrangolo, U. (2004). Analytical Synopses for Approximate Query Answering in OLAP Environments. In: Galindo, F., Takizawa, M., Traunmüller, R. (eds) Database and Expert Systems Applications. DEXA 2004. Lecture Notes in Computer Science, vol 3180. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30075-5_35

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-30075-5_35

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22936-0

  • Online ISBN: 978-3-540-30075-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics