Skip to main content

Efficient Query Processing for Multidimensional Data Cubes

  • Conference paper
  • First Online:
Cyber Security and Computer Science (ICONCS 2020)

Abstract

Data cubes come up with a suitable paradigm for storing, accessing, processing and analysis multidimensional data. Conventional Multidimensional Arrays (CMA) are the basic data structure to process such multidimensional data. But the performance of the MDAs degrades when the number of dimension increases. In this paper, we propose a new approach for computing multidimensional data cube using conversion of dimensions of the multidimensional array. We design efficient algorithms for Multidimensional On Line Analytical Processing (MOLAP) operations using the Converted two dimensional Array (C2A). We represent the MOLAP array as a Converted two dimensional Array where n-dimension is converted into two dimension. Then we apply the operations of data cube namely slice and dice on both CMA and C2A. We calculate the time for slice and dice operations for CMA and C2A. The proposed model requires less time for index computation when number of dimension is high. The cache miss rate is also lower for C2A based implementation. Therefore, our proposed algorithm shows superior performance than the traditional scheme.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 119.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 159.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Merticariu, V., Baumann, P.: Massively distributed data cube processing. In: IEEE International Geoscience and Remote Sensing Symposium (2019). https://doi.org/10.1109/IGARSS.2019.8900432

  2. Wang, Z., Chu, Y., Tan, K.L., Agrawal, D., Abbadi, A.E., Xu, X.: Scalable Data Cube Analysis over Big Data. Published in arXiv (2013)

    Google Scholar 

  3. Xin, D., Han, J., Li, X., Shao, Z., Wah, B.: Computing iceberg cubes by top-down and bottom-up integration: the starcubing approach. IEEE Trans. Knowl. Data Eng. 19(1), 111–126 (2007). https://doi.org/10.1145/1807167.1807271

    Article  Google Scholar 

  4. Hasan, K.M.A., Tsuji, T., Higuchi, K.: An efficient implementation for MOLAP basic data structure and its evaluation. In: Kotagiri, R., Krishna, P.R., Mohania, M., Nantajeewarawat, E. (eds.) DASFAA 2007. LNCS, vol. 4443, pp. 288–299. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-71703-4_26

    Chapter  Google Scholar 

  5. Zhang, Y., Zhou, X., Zhang, Y., Zhang, Y., Su, M., Wang, S.: Virtual denormalization via array index reference for main memory OLAP. IEEE Trans. Knowl. Data Eng. 28(4), 1061–1074 (2016). https://doi.org/10.1109/KDE.2015.2499199

    Article  Google Scholar 

  6. Zhang, Y., Ordonez, C., García-García, J., Bellatreche, L., Carrillo, H.: The percentage cube. Inf. Syst. 79, 20–31 (2019)

    Article  Google Scholar 

  7. Zhao, Y., Deshpande, P.M., Naughton, J.F.: An array-based algorithm for simultaneous multidimensional aggregates. In: Proceedings of SIGMOD Conference, pp. 159–170 (1997). https://doi.org/10.1145/253260.253288

  8. Plattner, H.: SanssouciDB: an in-memory database for processing enterprise workloads. In: Datenbanksysteme for Business, Technologie und Web (BTW). Gesellschaft for Informatik, Bonn, pp. 2–21 (2011)

    Google Scholar 

  9. Sarawagi, S., Stonebraker, M.: Efficient organization of large multidimensional arrays. In: Proceedings of 10th International Conference on Data Engineering (ICDE), Houston, Texas, pp. 328–386 (1994). https://doi.org/10.1109/ICDE.1994.283048

  10. Hasan, K.M.A., Shaikh, M.A.H.: Efficient representation of higher-dimensional arrays by dimension transformations. J. Supercomput. 73(6), 2801–2822 (2017). https://doi.org/10.1007/s11227-016-1954-x

    Article  Google Scholar 

  11. Hasan, K.M.A., Shaikh, M.A.H.: Representing higher dimensional arrays into generalized two-dimensional array: G2A. In: Park, J.J.J.H., Yi, G., Jeong, Y.-S., Shen, H. (eds.) Advances in Parallel and Distributed Computing and Ubiquitous Services. LNEE, vol. 368, pp. 39–46. Springer, Singapore (2016). https://doi.org/10.1007/978-981-10-0068-3_5

    Chapter  Google Scholar 

  12. Deshpande, P., Ramasamy, K., Shukla, A., Naughton, J.F.: Caching multidimensional queries using chunks. In: Proceedings of the ACM SIGMOD Conference on Management of Data, pp. 259–270 (1998). https://doi.org/10.1145/276305.276328

  13. Hasan, K.M.A., Kuroda, M., Azuma, N., Tsuji, T., Higuchi, K.: An extendible array based implementation of relational tables for multi dimensional databases. In: Tjoa, A.M., Trujillo, J. (eds.) DaWaK 2005. LNCS, vol. 3589, pp. 233–242. Springer, Heidelberg (2005). https://doi.org/10.1007/11546849_23

    Chapter  Google Scholar 

  14. Yan, J., et al.: Trace-oriented feature analysis for large-scale text data dimension reduction. IEEE Trans. Knowl. Data Eng. 23(7), 1103–1117 (2011)

    Article  Google Scholar 

  15. Sun, J., Tao, D., Papadimitriou, S., Yu, P.S., Faloutsos, C.: Incremental tensor analysis: theory and applications. ACM Trans. Knowl. Discov. Data 2(3), 1–37 (2008)

    Article  Google Scholar 

  16. Kim, S., Lee, S., Kim, J., Yoon, Y.-I.: MRTensorCube: tensor factorization with data reduction for context-aware recommendations. J. Supercomput. (6), 1–11 (2017). https://doi.org/10.1007/s11227-017-2002-1

  17. Miranda, F., Lins, L., Klosowski, J.T., Silva, C.: TOPKUBE: a rank-aware data cube for real-time exploration of spatiotemporal data. IEEE Trans. Vis. Comput. Graph. 24, 1394–1407 (2018)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to K. M. Azharul Hasan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Rimi, R.T., Hasan, K.M.A. (2020). Efficient Query Processing for Multidimensional Data Cubes. In: Bhuiyan, T., Rahman, M.M., Ali, M.A. (eds) Cyber Security and Computer Science. ICONCS 2020. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 325. Springer, Cham. https://doi.org/10.1007/978-3-030-52856-0_51

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-52856-0_51

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-52855-3

  • Online ISBN: 978-3-030-52856-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics