Implementation of Multidimensional Databases in Column-Oriented NoSQL Systems

  • Max Chevalier
  • Mohammed El Malki
  • Arlind Kopliku
  • Olivier Teste
  • Ronan Tournier
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9282)

Abstract

NoSQL (Not Only SQL) systems are becoming popular due to known advantages such as horizontal scalability and elasticity. In this paper, we study the implementation of multidimensional data warehouses on column-oriented NoSQL systems. To do this, we define a set of mapping rules that transform a conceptual multidimensional data model into a logical column-oriented model. We consider three ways (three sub-models) of structuring a conceptual multidimensional data model as a column-oriented model. Then, we show an implementation of the proposed rules. Finally, we focus, through experiments, on data loading, model-to-model conversion and OLAP cuboid computation.
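
To make the idea of a conceptual-to-logical mapping concrete, here is a minimal sketch in Python. It is not the paper's rule set: the abstract does not spell out the three sub-models, so the sketch assumes one plausible structuring (a single table with one column family per dimension and one for the fact's measures). The `Sales` fact, its dimensions, and the helper names `to_column_families`, `make_row` and `cuboid` are all hypothetical, and plain dictionaries stand in for a real column-oriented store.

```python
from collections import defaultdict

# Hypothetical conceptual model: one fact with measures, linked to dimensions.
FACT = {"name": "Sales", "measures": ["amount"]}
DIMENSIONS = {
    "Customer": ["city", "country"],
    "Time": ["day", "month", "year"],
}


def to_column_families(fact, dimensions):
    """Assumed mapping: one table named after the fact, one column family
    per dimension, plus one column family holding the measures."""
    families = {dim: list(attrs) for dim, attrs in dimensions.items()}
    families[fact["name"]] = list(fact["measures"])
    return {"table": fact["name"], "families": families}


def make_row(values):
    """Build one denormalized row as {"family:qualifier": value}."""
    return {f"{family}:{column}": value for (family, column), value in values.items()}


def cuboid(rows, group_by, measure):
    """Sum `measure` grouped by the `group_by` columns (one OLAP cuboid)."""
    totals = defaultdict(float)
    for cells in rows.values():
        key = tuple(cells[column] for column in group_by)
        totals[key] += cells[measure]
    return dict(totals)


schema = to_column_families(FACT, DIMENSIONS)

# Illustrative data only; dict keys play the role of row keys.
rows = {
    "r1": make_row({("Customer", "country"): "FR", ("Time", "year"): 2014, ("Sales", "amount"): 10.0}),
    "r2": make_row({("Customer", "country"): "FR", ("Time", "year"): 2015, ("Sales", "amount"): 15.0}),
    "r3": make_row({("Customer", "country"): "DE", ("Time", "year"): 2014, ("Sales", "amount"): 7.0}),
}

by_country = cuboid(rows, ["Customer:country"], "Sales:amount")
by_country_year = cuboid(rows, ["Customer:country", "Time:year"], "Sales:amount")
```

Here `by_country` aggregates the measure over the country attribute alone, while `by_country_year` is the finer cuboid over country and year. Keeping dimension attributes in the same row as the measures avoids joins, at the cost of repeating dimension values in every row.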

Keywords

  • Data warehouse design
  • Multidimensional modelling
  • NoSQL databases
  • Model transformation rules
  • Column-oriented NoSQL model

Notes

Acknowledgements

These studies are supported by the ANRT funding under CIFRE-Capgemini partnership.

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  • Max Chevalier (1)
  • Mohammed El Malki (1)
  • Arlind Kopliku (1)
  • Olivier Teste (1)
  • Ronan Tournier (1)
  1. Université de Toulouse, Toulouse, France
