Skip to main content

ETL Based Framework for NoSQL Warehousing

  • Conference paper
  • First Online:
Information Systems (EMCIS 2017)

Part of the book series: Lecture Notes in Business Information Processing ((LNBIP,volume 299))

Abstract

Over the last few years, NoSQL systems are gaining strong popularity and a number of decision makers are using it to implement their warehouses.

Building the ETL process is one of the important tasks of creating NoSQL warehouse. Traditional ETL tools require the structure of the target system to be known at advance. As NoSQL databases are schema-free, this increases the need for extending the existing ETL tool in order to be able to designing schema while integrating data. In spite of the importance of ETL processes in the NoSQL warehousing, little researches have been done in this area due to its complexity.

In this paper, we propose an ETL-based platform for transforming a multidimensional conceptual model into document-oriented one. We model the transformation rules using the Business Process Modeling Notation (BPMN). The resulting warehouse was evaluated in term of “Write Request Latency” and “Read Request Latency” using TPC-DS benchmark.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Favre, C., Bentayeb, F., Boussaid, O., Darmont, J., Gavin, G., Harbi, N., Kabachi, N., Loudcher, S.: Les entrepots de donnees pour les nuls .ou pas. 2eme Atelier d’aide à la Decision a tous les Etages (2013)

    Google Scholar 

  2. Chandwani, G.: NoSQL data-warehouse. Int. J. Innovative Res. Comput. Commun. Eng. 4, 96–104 (2016)

    Google Scholar 

  3. Zhao, H., Ye, X.: A practice of TPC-DS multidimensional implementation on NoSQL database systems. In: Nambiar, R., Poess, M. (eds.) TPCTC 2013. LNCS, vol. 8391, pp. 93–108. Springer, Cham (2014). doi:10.1007/978-3-319-04936-6_7

    Chapter  Google Scholar 

  4. Dehdouh, K., Boussaid, O., Bentayeb, F.: Columnar NoSQL star schema benchmark. In: Ait Ameur, Y., Bellatreche, L., Papadopoulos, G.A. (eds.) MEDI 2014. LNCS, vol. 8748, pp. 281–288. Springer, Cham (2014). doi:10.1007/978-3-319-11587-0_26

    Google Scholar 

  5. Dehdouh, K., Boussaid, O., Bentayeb, F.: Using the column oriented NoSQL model for implementing big data warehouses. In: Proceedings of the 21st International Conference on Parallel and Distributed Processing Techniques and Applications, pp. 469–475 (2015)

    Google Scholar 

  6. Chevalier, M., El Malki, M., Kopliku, A., Teste, O., Tournier, R.: Implementing multidimensional data warehouses into NoSQL. In: Proceedings of the 17th International Conference on Enterprise Information Systems, pp. 172–183 (2015)

    Google Scholar 

  7. Chevalier, M., El Malki, M., Kopliku, A., Teste, O., Tournier, R.: Document-oriented models for data warehouses. In: Proceedings of the 18th International Conference on Enterprise Information Systems, pp. 142–149 (2016)

    Google Scholar 

  8. Santos, M.Y., Martinho, B., Costa, C.: Modelling and implementing big data warehouses for decision support. J. Manag. Anal. 4, 1–19 (2017)

    Google Scholar 

  9. Bicevska, Z., Oditis, I.: Towards NoSQL-based data warehouse solutions. Proc. Comput. Sci. 104, 104–111 (2017)

    Article  Google Scholar 

  10. Wilkinson, K., Simitsis, A., Castellanos, M., Dayal, U.: Leveraging business process models for ETL design. In: Parsons, J., Saeki, M., Shoval, P., Woo, C., Wand, Y. (eds.) ER 2010. LNCS, vol. 6412, pp. 15–30. Springer, Heidelberg (2010). doi:10.1007/978-3-642-16373-9_2

    Chapter  Google Scholar 

  11. El Akkaoui, Z., Mazon, J., Vaisman, A., Zimanyi, E.: Defining ETL worfklows using BPMN and BPEL. In: Data Warehousing and OLAP, pp. 41–48 (2009)

    Google Scholar 

  12. El Akkaoui, Z., Mazón, J.-N., Vaisman, A., Zimányi, E.: BPMN-based conceptual modeling of ETL processes. In: Cuzzocrea, A., Dayal, U. (eds.) DaWaK 2012. LNCS, vol. 7448, pp. 1–14. Springer, Heidelberg (2012). doi:10.1007/978-3-642-32584-7_1

    Chapter  Google Scholar 

  13. Oliveira, B., Belo, O.: BPMN patterns for ETL conceptual modelling and validation. In: Chen, L., Felfernig, A., Liu, J., Raś, Z.W. (eds.) ISMIS 2012. LNCS, vol. 7661, pp. 445–454. Springer, Heidelberg (2012). doi:10.1007/978-3-642-34624-8_50

    Chapter  Google Scholar 

  14. Delgado, A., Marotta, A., González, L.: Towards the construction of quality-aware web warehouses with BPMN 2.0 business processes. In: RCIS, pp. 1–6 (2014)

    Google Scholar 

  15. Marotta, A., Delgado, A.: Data quality management in web warehouses using BPM. In: ICIQ, pp. 18–27 (2016)

    Google Scholar 

  16. Sahiet, D., Asanka, P.D.: ETL framework design for NoSQL databases in dataware housing. Int. J. Res. Comput. Appl. Rob. 3, 67–75 (2015)

    Google Scholar 

  17. Sadalage, P.J., Fowler, M.: NoSQL Distilled: A Brief Guide to the Emerging World of Polyglot Persistence. Pearson Education, London (2012)

    Google Scholar 

  18. Chaudhuri, S., Dayal, U., Ganti, V.: Database technology for decision support systems. IEEE Comput. Soc. 34, 48–55 (2002)

    Article  Google Scholar 

  19. Sharma, V., Dave, M.: SQL and NoSQL databases. Int. J. Adv. Res. Comput. Sci. Softw. Eng. 2, 20–27 (2012)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rania Yangui .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Yangui, R., Nabli, A., Gargouri, F. (2017). ETL Based Framework for NoSQL Warehousing. In: Themistocleous, M., Morabito, V. (eds) Information Systems. EMCIS 2017. Lecture Notes in Business Information Processing, vol 299. Springer, Cham. https://doi.org/10.1007/978-3-319-65930-5_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-65930-5_4

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-65929-9

  • Online ISBN: 978-3-319-65930-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics