Abstract
Enhancing on line analytical processing through efficient cube computation plays a key role in Data Warehouse management. Hashing, grouping and mining techniques are commonly used to improve cube pre-computation. BitCube, a fast cubing method which uses bitmaps as inverted indexes for grouping, is presented. It horizontally partitions data according to the values of one dimension and for each resulting fragment it performs grouping following bottom-up criteria. BitCube allows also partial materialization based on iceberg conditions to treat large datasets for which a full cube pre-computation is too expensive. Space requirement of bitmaps is optimized by applying an adaption of the WAH compression technique. Experimental analysis, on both synthetic and real datasets, shows that BitCube outperforms previous algorithms for full cube computation and results comparable on iceberg cubing.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Agarwal, S., Agrawal, R., Deshpande, P., Gupta, A., Naughton, J.F., Ramakrishnan, R., Sarawagi, S.: On the computation of multidimensional aggregates. In: Proceedings of 22th International Conference on Very Large Data Bases (VLDB 1996), pp. 506–521 (1996)
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: Proc. of the 20th VLDB Conf., pp. 487–499 (1994)
Beyer, K., Ramakrishnan, R.: Bottom-up computation of sparse and iceberg cubes. In: Proceedings of the 1999 ACM SIGMOD international conference on Management of data, pp. 359–370 (1999)
Chen, Z., Narasayya, V.: Efficient computation of multiple group by queries. In: Proceedings of the 2005 ACM SIGMOD international conference on Management of data, pp. 263–274 (2005)
Feng, Y., Agrawal, D., Abbadi, A.E., Metwally, A.: Range cube: Efficient cube computation by exploiting data correlation. In: Proceedings of the 20th International Conference on Data Engineering (ICDE 2004), pp. 658–669 (2004)
Gray, J., Chaudhuri, S., Bosworth, A., Layman, A., Reichart, D., Venkatrao, M., Pellow, F., Pirahesh, H.: Data cube: A relational aggregation operator generalizing group-by. Data Mining and Knowledge Discovery 1, 29–54 (1997)
Li, X., Han, J., Gonzalez, H.: High-dimensional olap: A minimal cubing approach. In: Proc. Int’l Conf. Very Large Data Bases (VLDB 2004), pp. 528–539 (2004)
Morfonios, K., Ioannidis, Y.: Cure for cubes: cubing using a rolap engine. In: VLDB 2006: Proceedings of the 32nd international conference on Very large data bases, pp. 379–390 (2006)
Morfonios, K., Konakas, S., Ioannidis, Y., Kotsis, N.: Rolap implementations of the data cube. ACM Comput. Surv. 39(4), 12 (2007)
Ross, K.A., Srivastava, D.: Fast computation of sparse datacubes. In: Proceedings of 23rd International Conference on Very Large Data Bases (VLDB 1997), pp. 116–125 (1997)
Shao, Z., Han, J., Xin, D.: Mm-cubing: Computing iceberg cubes by factorizing the lattice space. In: Proc. 2004 Int. Conf. on Scientific and Statistical Database Management (SSDBM 2004), pp. 213–222 (2004)
Wu, K., Otoo, E.J., Shoshani, A.: A performance comparison of bitmap indexes. In: CIKM 2001: Proceedings of the tenth ACM international conference on Information and knowledge management, pp. 559–561 (2001)
Xin, D., Han, J., Li, X., Shao, Z., Wah, B.W.: Computing iceberg cubes by top-down and bottom-up integration: The starcubing approach. IEEE Transaction on Knowoledge and Data Engineering 19(1), 111–126 (2007)
Xin, D., Shao, Z., Han, J., Liu, H.: C-cubing: Efficient computation of closed cubes by aggregation-based checking. In: Proc. Int’l Conf. Data Eng (ICDE 2006), vol. 4 (2006)
Zhao, Y., Deshpande, P.M., Naughton, J.F.: An array-based algorithm for simultaneous multidimensional aggregates. In: Proceedings of the 1997 ACM SIGMOD international conference on Management of data, pp. 159–170 (1997)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ferro, A., Giugno, R., Puglisi, P.L., Pulvirenti, A. (2009). BitCube: A Bottom-Up Cubing Engineering. In: Pedersen, T.B., Mohania, M.K., Tjoa, A.M. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2009. Lecture Notes in Computer Science, vol 5691. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03730-6_16
Download citation
DOI: https://doi.org/10.1007/978-3-642-03730-6_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03729-0
Online ISBN: 978-3-642-03730-6
eBook Packages: Computer ScienceComputer Science (R0)