Abstract
Over the last few years, NoSQL systems are gaining strong popularity and a number of decision makers are using it to implement their warehouses.
Building the ETL process is one of the important tasks of creating NoSQL warehouse. Traditional ETL tools require the structure of the target system to be known at advance. As NoSQL databases are schema-free, this increases the need for extending the existing ETL tool in order to be able to designing schema while integrating data. In spite of the importance of ETL processes in the NoSQL warehousing, little researches have been done in this area due to its complexity.
In this paper, we propose an ETL-based platform for transforming a multidimensional conceptual model into document-oriented one. We model the transformation rules using the Business Process Modeling Notation (BPMN). The resulting warehouse was evaluated in term of “Write Request Latency” and “Read Request Latency” using TPC-DS benchmark.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Favre, C., Bentayeb, F., Boussaid, O., Darmont, J., Gavin, G., Harbi, N., Kabachi, N., Loudcher, S.: Les entrepots de donnees pour les nuls .ou pas. 2eme Atelier d’aide à la Decision a tous les Etages (2013)
Chandwani, G.: NoSQL data-warehouse. Int. J. Innovative Res. Comput. Commun. Eng. 4, 96–104 (2016)
Zhao, H., Ye, X.: A practice of TPC-DS multidimensional implementation on NoSQL database systems. In: Nambiar, R., Poess, M. (eds.) TPCTC 2013. LNCS, vol. 8391, pp. 93–108. Springer, Cham (2014). doi:10.1007/978-3-319-04936-6_7
Dehdouh, K., Boussaid, O., Bentayeb, F.: Columnar NoSQL star schema benchmark. In: Ait Ameur, Y., Bellatreche, L., Papadopoulos, G.A. (eds.) MEDI 2014. LNCS, vol. 8748, pp. 281–288. Springer, Cham (2014). doi:10.1007/978-3-319-11587-0_26
Dehdouh, K., Boussaid, O., Bentayeb, F.: Using the column oriented NoSQL model for implementing big data warehouses. In: Proceedings of the 21st International Conference on Parallel and Distributed Processing Techniques and Applications, pp. 469–475 (2015)
Chevalier, M., El Malki, M., Kopliku, A., Teste, O., Tournier, R.: Implementing multidimensional data warehouses into NoSQL. In: Proceedings of the 17th International Conference on Enterprise Information Systems, pp. 172–183 (2015)
Chevalier, M., El Malki, M., Kopliku, A., Teste, O., Tournier, R.: Document-oriented models for data warehouses. In: Proceedings of the 18th International Conference on Enterprise Information Systems, pp. 142–149 (2016)
Santos, M.Y., Martinho, B., Costa, C.: Modelling and implementing big data warehouses for decision support. J. Manag. Anal. 4, 1–19 (2017)
Bicevska, Z., Oditis, I.: Towards NoSQL-based data warehouse solutions. Proc. Comput. Sci. 104, 104–111 (2017)
Wilkinson, K., Simitsis, A., Castellanos, M., Dayal, U.: Leveraging business process models for ETL design. In: Parsons, J., Saeki, M., Shoval, P., Woo, C., Wand, Y. (eds.) ER 2010. LNCS, vol. 6412, pp. 15–30. Springer, Heidelberg (2010). doi:10.1007/978-3-642-16373-9_2
El Akkaoui, Z., Mazon, J., Vaisman, A., Zimanyi, E.: Defining ETL worfklows using BPMN and BPEL. In: Data Warehousing and OLAP, pp. 41–48 (2009)
El Akkaoui, Z., Mazón, J.-N., Vaisman, A., Zimányi, E.: BPMN-based conceptual modeling of ETL processes. In: Cuzzocrea, A., Dayal, U. (eds.) DaWaK 2012. LNCS, vol. 7448, pp. 1–14. Springer, Heidelberg (2012). doi:10.1007/978-3-642-32584-7_1
Oliveira, B., Belo, O.: BPMN patterns for ETL conceptual modelling and validation. In: Chen, L., Felfernig, A., Liu, J., Raś, Z.W. (eds.) ISMIS 2012. LNCS, vol. 7661, pp. 445–454. Springer, Heidelberg (2012). doi:10.1007/978-3-642-34624-8_50
Delgado, A., Marotta, A., González, L.: Towards the construction of quality-aware web warehouses with BPMN 2.0 business processes. In: RCIS, pp. 1–6 (2014)
Marotta, A., Delgado, A.: Data quality management in web warehouses using BPM. In: ICIQ, pp. 18–27 (2016)
Sahiet, D., Asanka, P.D.: ETL framework design for NoSQL databases in dataware housing. Int. J. Res. Comput. Appl. Rob. 3, 67–75 (2015)
Sadalage, P.J., Fowler, M.: NoSQL Distilled: A Brief Guide to the Emerging World of Polyglot Persistence. Pearson Education, London (2012)
Chaudhuri, S., Dayal, U., Ganti, V.: Database technology for decision support systems. IEEE Comput. Soc. 34, 48–55 (2002)
Sharma, V., Dave, M.: SQL and NoSQL databases. Int. J. Adv. Res. Comput. Sci. Softw. Eng. 2, 20–27 (2012)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Yangui, R., Nabli, A., Gargouri, F. (2017). ETL Based Framework for NoSQL Warehousing. In: Themistocleous, M., Morabito, V. (eds) Information Systems. EMCIS 2017. Lecture Notes in Business Information Processing, vol 299. Springer, Cham. https://doi.org/10.1007/978-3-319-65930-5_4
Download citation
DOI: https://doi.org/10.1007/978-3-319-65930-5_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-65929-9
Online ISBN: 978-3-319-65930-5
eBook Packages: Computer ScienceComputer Science (R0)