Skip to main content

Top-Down Compression of Data Cubes in the Presence of Simultaneous Multiple Hierarchical Range Queries

  • Conference paper
Foundations of Intelligent Systems (ISMIS 2008)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4994))

Included in the following conference series:

Abstract

A novel top-down compression technique for data cubes is introduced and experimentally assessed in this paper. This technique considers the previously unrecognized case in which multiple Hierarchical Range Queries (HRQ), a very useful class of OLAP queries, must be evaluated against the target data cube simultaneously. This scenario makes traditional data cube compression techniques ineffective, as, contrarily to the aim of our work, these techniques take into consideration one constraint only (e.g., a given space bound). The result of our study consists in introducing an innovative multiple-objective OLAP computational paradigm, and a hierarchical multidimensional histogram, whose main benefit is meaningfully implementing an intermediate compression of the input data cube able to simultaneously accommodate an even large family of different-in-nature HRQ. A complementary contribution of our work is represen-ted by a wide experimental evaluation of the query performance of our technique against both benchmark and real-life data cubes, also in comparison with state-of-the-art histogram-based compression techniques.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Acharya, S., Poosala, V., Ramaswamy, S.: Selectivity Estimation in Spatial Databases. ACM SIGMOD (1999)

    Google Scholar 

  2. Agrawal, R., Wimmers, E.L.: A Framework for Expressing and Combining Preferences. ACM SIGMOD (2000)

    Google Scholar 

  3. Balke, W.-T., Güntzer, U.: Multi-Objective Query Processing for Database Systems. VLDB (2004)

    Google Scholar 

  4. Börzsönyi, S., Kossmann, D., Stocker, K.: The Skyline Operator. IEEE ICDE (2001)

    Google Scholar 

  5. Bowman, I.T., Salem, K.: Optimization of Query Streams using Semantic Prefetching. ACM TODS 30(4) (2005)

    Google Scholar 

  6. Bruno, N., Chaudhuri, S., Gravano, L.: STHoles: A Multidimensional Workload-Aware Histogram. ACM SIGMOD (2001)

    Google Scholar 

  7. Buccafurri, F., et al.: A Quad-Tree based Multiresolution Approach for Two-Dimensional Summary Data. IEEE SSDBM (2003)

    Google Scholar 

  8. Chaudhuri, S., et al.: Overcoming Limitations of Sampling for Aggregation Queries. IEEE ICDE (2001)

    Google Scholar 

  9. Chen, Z., Narasayya, V.: Efficient Computation of Multiple Group By Queries. ACM SIGMOD (2005)

    Google Scholar 

  10. Colliat, G.: OLAP, Relational, and Multidimensional Database Systems. ACM SIGMOD Record 25(3) (1996)

    Google Scholar 

  11. Cuzzocrea, A.: Overcoming Limitations of Approximate Query Answering in OLAP. IEEE IDEAS (2005)

    Google Scholar 

  12. Cuzzocrea, A.: Providing Probabilistically-Bounded Approximate Answers to Non-Holistic Aggregate Range Queries in OLAP. ACM DOLAP (2005)

    Google Scholar 

  13. Cuzzocrea, A.: Improving Range-Sum Query Evaluation on Data Cubes via Polynomial Approximation. Data & Knowledge Engineering 56(2) (2006)

    Google Scholar 

  14. Cuzzocrea, A.: Accuracy Control in Compressed Multidimensional Data Cubes for Quality of Answer-based OLAP Tools. IEEE SSDBM (2006)

    Google Scholar 

  15. Cuzzocrea, A., et al.: Semantics-aware Advanced OLAP Visualization of Multidimensional Data Cubes. International Journal of Data Warehousing and Mining 3(4) (2007)

    Google Scholar 

  16. Cuzzocrea, A., Wang, W.: Approximate Range-Sum Query Answering on Data Cubes with Probabilistic Guarantees. Journal of Intelligent Information Systems 28(2) (2007)

    Google Scholar 

  17. Doan, A., Levy, A.Y.: Efficiently Ordering Plans for Data Integration. IEEE ICDE (2002)

    Google Scholar 

  18. Fan, J., Kambhampati, S.: Multi-Objective Query Processing for Data Aggregation. ASU CSE TR (2006)

    Google Scholar 

  19. Fang, M., et al.: Computing Iceberg Queries Efficiently. VLDB (1998)

    Google Scholar 

  20. Garofalakis, M.N., Gibbons, P.B.: Wavelet Synopses with Error Guarantees. ACM SIGMOD (2002)

    Google Scholar 

  21. Garofalakis, M.N., Kumar, A.: Deterministic Wavelet Thresholding for Maximum-Error Metrics. ACM PODS (2004)

    Google Scholar 

  22. Gunopulos, D., et al.: Approximating Multi-Dimensional Aggregate Range Queries over Real Attributes. ACM SIGMOD (2000)

    Google Scholar 

  23. Han, J., Kamber, M.: Data Mining: Concepts and Techniques. Morgan Kauffmann Publishers, San Francisco (2000)

    Google Scholar 

  24. Ho, C.-T., et al.: Range Queries in OLAP Data Cubes. ACM SIGMOD (1997)

    Google Scholar 

  25. Ioannidis, Y., Poosala, V.: Histogram-based Approximation of Set-Valued Query Answers. VLDB (1999)

    Google Scholar 

  26. Ives, Z.G., et al.: An Adaptive Query Execution System for Data Integration. ACM SIGMOD (1999)

    Google Scholar 

  27. Jin, R., et al.: A Framework to Support Multiple Query Optimization for Complex Mining Tasks. MMDM (2005)

    Google Scholar 

  28. Jin, R., et al.: Simultaneous Optimization of Complex Mining Tasks with a Knowledgeable Cache. ACM SIGKDD (2005)

    Google Scholar 

  29. Kalnis, P., Papadias, D.: Multi-Query Optimization for On-Line Analytical Processing. Information Systems 28(5) (2003)

    Google Scholar 

  30. Koudas, N., et al.: Optimal Histograms for Hierarchical Range Queries. ACM PODS (2000)

    Google Scholar 

  31. Mistry, H., et al.: Materialized View Selection and Maintenance using Multi-Query Optimization. ACM SIGMOD (2001)

    Google Scholar 

  32. Nie, Z., Kambhampati, S.: Joint Optimization of Cost and Coverage of Query Plans in Data Integration. ACM CIKM, 223–230 (2001)

    Google Scholar 

  33. Papadias, D., et al.: An Optimal and Progressive Algorithm for Skyline Queries. ACM SIGMOD (2003)

    Google Scholar 

  34. Poosala, V., Ganti, V.: Fast Approximate Answers to Aggregate Queries on a Data Cube. IEEE SSDBM (1999)

    Google Scholar 

  35. Poosala, V., Ioannidis, Y.: Selectivity Estimation Without the Attribute Value Independence Assumption. VLDB (1997)

    Google Scholar 

  36. Roy, P., et al.: Efficient and Extensible Algorithms for Multi-Query Optimization. ACM SIGMOD (2000)

    Google Scholar 

  37. Sellis, T., Ghosh, S.: On the Multiple-Query Optimization Problem. IEEE TKDE 2(2) (1990)

    Google Scholar 

  38. Sellis, T.: Multiple-Query Optimization. ACM TODS 13(1) (1988)

    Google Scholar 

  39. Transaction Processing Council: TPC Benchmark H. (2006), http://www.tpc.org/tpch/

  40. University of California, Irvine: 1990 US Census Data (2001), http://kdd.ics.uci.edu/databases/census1990/USCensus1990.html

  41. Wang, S., et al.: State-Slice: A New Paradigm of Multi-Query Optimization of Window-Based Stream Queries. VLDB (2006)

    Google Scholar 

  42. Xin, D., et al.: Answering Top-k Queries with Multi-Dimensional Selections: The Ranking Cube Approach. VLDB (2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Aijun An Stan Matwin Zbigniew W. Raś Dominik Ślęzak

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Cuzzocrea, A. (2008). Top-Down Compression of Data Cubes in the Presence of Simultaneous Multiple Hierarchical Range Queries. In: An, A., Matwin, S., Raś, Z.W., Ślęzak, D. (eds) Foundations of Intelligent Systems. ISMIS 2008. Lecture Notes in Computer Science(), vol 4994. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68123-6_39

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-68123-6_39

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-68122-9

  • Online ISBN: 978-3-540-68123-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics