Efficient Computation of Multi-feature Data Cubes

Zhang, Shichao; Wang, Rifeng; Guo, Yanping

doi:10.1007/11811220_52

Shichao Zhang^21,22,
Rifeng Wang²¹ &
Yanping Guo²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4092))

Included in the following conference series:

International Conference on Knowledge Science, Engineering and Management

1098 Accesses

Abstract

A Multi-Feature Cube (MF-Cube) query is a complex-data-mining query based on data cubes, which computes the dependent complex aggregates at multiple granularities. Existing computations designed for simple data cube queries can be used to compute distributive and algebraic MF-Cubes queries. In this paper we propose an efficient computation of holistic MF-Cubes queries. This method computes holistic MF-Cubes with PDAP (Part Distributive Aggregate Property). The efficiency is gained by using dynamic subset data selection strategy (Iceberg query technique) to reduce the size of materialized data cube. Also for efficiency, this approach adopts the chunk-based caching technique to reuse the output of previous queries. We experimentally evaluate our algorithm using synthetic and real-world datasets, and demonstrate that our approach delivers up to about twice the performance of traditional computations.

This work is partially supported by Australian large ARC grants (DP0449535, DP0559536 and DP0667060), a China NSFC major research Program (60496327), a China NSFC grant (60463003) and a grant from Overseas Outstanding Talent Research Program of Chinese Academy of Sciences (06S3011S01).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Ross, K.A., Srivastava, D., Chatziantoniou, D.: Complex Aggregation at Multiple Granularities. In: Schek, H.-J., Saltor, F., Ramos, I., Alonso, G. (eds.) EDBT 1998. LNCS, vol. 1377, pp. 263–277. Springer, Heidelberg (1998)
Chapter Google Scholar
Ross, K., Srivastava, D.: Fast computation of sparse datacubes. In: Proc. Int. Conf. Very Large Data Bases, August 1997, Athens, Greece, pp. 116–125 (1997)
Google Scholar
Zhao, Y., Deshpande, P.M., Naughton, J.F.: An Array-Based Algorithm for Simultaneous Multidimensional Aggregates. In: Proc. of ACM SIGMOD, pp. 159–170 (1997)
Google Scholar
Beyer, K.S., Ramakrishnan, R.: Bottom-Up Computation of Sparse and Iceberg Cubes. In: Proc. Of ACM SIGMOD, pp. 359–370 (1999)
Google Scholar
Agarwal, S., Agrawal, R., Deshpande, P.M., Gupta, A., Naughton, J.F., Ramakrishnan, R., Sarawagi, S.: On the Computation of Multidimensional Aggregates. In: Proc. Int’ Conf. Very Large Data Bases, pp. 506–521 (1996)
Google Scholar
Gray, A., Bosworth, A., Layman, A., Pirahesh, H.: Datacube: A relational aggregation operator generalizing group-by, cross-tab, and sub-totals. In: Proc. of the IEEE ICDE, pp. 152–159 (1996)
Google Scholar
Fang, M., Shivakumar, N., Garcia-Molina, H., Motwani, R., Ullman, J.D.: Computing iceberg queries efficiently. In: Proc. of 24th VLDB Conf., New York, pp. 299–310 (1998)
Google Scholar
Han, J.P., Dong, G., Wang, K.: Efficient Computation of Iceberg Cubes with Complex Measures. In: Proc. ACM-SIGMOD Int’l Conf. Management of Data, pp. 1–12 (2001)
Google Scholar
Deshpande, P.M., Ramasamy, K., Shukla, A.: Caching Multidimensional Queries Using Chunks. In: Proc. ACM SIGMOD, pp. 259–270 (1998)
Google Scholar
Dong, G., Han, J., Joyce, M.W.L.: Mining Constrained Gradients in Large Databases. In: IEEE TKDE (2003)
Google Scholar
Hahn, C., Warren, S., London, J.: Edited synoptic cloud reports from ships and land stations over the globe (1996), Available on: http://cdiac.esd.ornl.gov/cdiac/ndps/ndp026b.html

Download references

Author information

Authors and Affiliations

Department of Computer Science, Guangxi Normal University, Guilin, China
Shichao Zhang, Rifeng Wang & Yanping Guo
Faculty of Information Technology, University of Technology, Sydney, Australia
Shichao Zhang

Authors

Shichao Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Rifeng Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yanping Guo
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

IRIT, UPS,, F-31062, Toulouse Cédex 9, France
Jérôme Lang
Department of Computer Science, Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong
Fangzhen Lin
Guangxi Normal University, Guilin, China
Ju Wang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, S., Wang, R., Guo, Y. (2006). Efficient Computation of Multi-feature Data Cubes. In: Lang, J., Lin, F., Wang, J. (eds) Knowledge Science, Engineering and Management. KSEM 2006. Lecture Notes in Computer Science(), vol 4092. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11811220_52

Download citation

DOI: https://doi.org/10.1007/11811220_52
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37033-8
Online ISBN: 978-3-540-37035-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics