Abstract
ETL is the bedrock of a data warehouse. As the data accelerates in the diversified fields, it is substantive to integrate the data to extract the prerequisites for business advancement and policy formation in the arena of health care, education, smart cities, transportation, and many other areas. ETL (Extract–Transform–Load) is the process used to incorporate a range of data sources in the data warehouse for business intelligence. As per the requirement of data integration and analysis, several categories of ETL tools have developed like code-based, GUI-based, cloud-based, Metadata support, Real-time support, and batch processing. Selecting appropriate ETL tools is a crucial task in any business, with each tool having its dominant and stumbling blocks. This paper aims to first, the study of existing ETL tools and its features, and secondly, how to identify vital functions of the ETL tool for an organization.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
C. Thomsen, ETL. Springer International Publishing AG, Part of Springer Nature (2018)
R. Mukherjee, P. Kar, A comparative review of data warehousing ETL tools with new trends and industry insight, in IEEE 7th International Advance Computing Conference (2017), pp. 943–948
J. Awiti, E. Zimányi, An XML interchange format for ETL models, in New Trends in Databases and Information Systems. ADBIS 2019, ed. by T. Welzer et al. Communications in Computer and Information Science, vol. 1064 (Springer, 2019)
R. Aluvalu, M. Jabbar, Handling data analytics on unstructured data using MongoDB, in Smart Cities Symposium (2018), pp. 1–5
R. Yangui, A. Nabli, F. Gargouri, ETL Based Framework for NoSQL Warehousing, Lecture Notes in Business Information Processing (Springer, Cham, 2017)
J. Wang, W. Zhao, T. Fan, S. Yang, H. Lv, An improved join free snowflake schema for ETL and OLAP of the data warehouse. Concurr. Comput. Pract. Exper. (2019)
N. Biswas, A. Sarkar, K.C. Mondal, Empirical analysis of programmable ETL tools, in Computational, ed. by Intelligence, Communications, and Business Analytics, CICBA 2018, ed. by J. Mandal, S. Mukhopadhyay, P. Dutta, K. Dasgupta. Communications in Computer and Information Science, vol. 1031 (Springer, Singapore, 2019)
J. Nwokeji, F. Aqlan, A. Apoorva, A. Olagunju, Big Data ETL implementation approaches: a systematic literature review, in Conference of Software Engineering and Knowledge Engineering (2018)
S.M.F. Ali, R. Wrembel, Towards a cost model to optimize user-defined functions in an ETL workflow based on user-defined performance metrics, in Advances in Databases and Information Systems. ADBIS 2019, ed. by T. Welzer, J. Eder, V. Podgorelec, A. Kamišalić Latifić. Lecture Notes in Computer Science, vol. 11695 (Springer, Cham, 2019)
V. Para, A. Mohammad, A. Syed, M. Halgamuge, Pentaho and Jaspersoft: A comparative study of business intelligence open source tools processing big data to evaluate performances. Int. J. Adv. Comput. Sci. Appl. (2016)
A. Amine, R. Daoud, B. Bouikhalene, Efficiency comparison and evaluation between two ETL extraction tools. Indones. J. Electric. Eng. Comput. Sci. 174–181 (2016)
J. Awiti, A. Vaisman, E. Zimányi, From conceptual to logical ETL design using BPMN and relational algebra, in Big Data Analytics and Knowledge Discovery, DaWaK 2019, ed. by C. Ordonez, I.Y. Song, G. Anderst-Kotsis, A. Tjoa, I. Khalil. Lecture Notes in Computer Science, vol. 11708 (Springer, 2019)
S. Yousuf, S. Rizvi, A comparative study of ETL tools, https://www.academia.edu/354387
P. Diouf, A. Boly, S. Ndiaye, Performance of the ETL processes in terms of volume and velocity in the cloud: state of the art, in 4th IEEE International Conference on Engineering Technologies and Applied Sciences (2017)
J. Chakraborty, A. Padki, S. Bansal, Semantic ETL—state-of-art and open research challenges, in IEEE 11th International Conference on Semantic Computing. San Diego, CA (2017)
P. Diouf, A. Boly, S. Ndiaye, Variety of data in the ETL processes in the cloud: state of the art, in IEEE International Conference on Innovative Research and Development, Bangkok, Thailand (2018)
I. Oditis, Z. Bicevska, J. Bicevskis, G. Karnitis, Implementation of NoSQL-based data warehouses. Baltic J. Modern Comput. 6, 45–55 (2018)
Panoply Blog Home page, https://blog.panoply.io/top-9-mongodb-etl-tools
M. Moly, O. Roy, A. Hossain, An advanced ETL technique for error-free data in data warehousing environment. Int. J. Sci. Res. Eng. Trends, 554–558 (2019)
A. Pall, J. Singh, ETL Methodologies, limitations, and framework for the selection and development of an ETL tool. Int. J. Res. Eng. Appl. Sci. 6 (2016)
M.B. Biplob, G.A. Sheraji, S.I. Khan, Comparison of different extraction transformation and loading tools for data warehousing, in 2018 International Conference on Innovations in Science, Engineering, and Technology (2018), pp. 262–267
R. Katragadda, S.S. Tirumala, D. Nandigam, ETL tools for data warehousing: an empirical study of open source Talend Studio versus Microsoft SSIS
I.I. Kholod, M.S. Efimova, S.Y. Kulikov, Using ETL tools for developing a virtual data warehouse, in 2016 XIX IEEE International Conference on Soft Computing and Measurements (2016), pp. 351–354
Gartner Peer Insights, https://www.gartner.com/reviews/market/data-integration-tools. Accessed 15 Sep 2019
H. Mallek, F. Ghozzi, O. Teste, F. Gargouri, BigDimETL with NoSQL database, in 22nd International Conference on Knowledge-based and Intelligent Information & Engineering Systems (2018)
B. Pan, G. Zhang, X. Qin, Design and realization of an ETL method in business intelligence project, in 3rd IEEE International Conference on Cloud Computing and Big Data Analytics (2018)
B. Nabila, B. Ladjel, K. Selma, Towards a conceptualization of ETL and physical storage of semantic data warehouses as a service. Cluster Comput. 16(4), 915–931 (2013)
Data Warehouse Guide, https://panoply.io/data-warehouse-guide/etl-tools/. Accessed 2 Sep 2019
MongoDB Home page, https://www.mongodb.com/use-cases/real-time-analytics. Accessed 5 Sep 2019
S.M.F. Ali, R. Wrembel, From conceptual design to performance optimization of ETL workflows: current state of research and open problems. VLDB J. (2017)
B. Nabila, B. Ladjel, G. Laurent, ETL processes in the era of variety. Large-Scale Data- and Knowl.-Centered Syst. 39, 98–129 (2018)
G2, https://www.g2.com/search?utf8=%E2%9C%93&query=ETL+tools. Accessed 17 Sep 2019
J. Smith, M. Rege, The Data Warehousing Evolution: Where’s it headed next? (ACM, ICCDA, 2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Patel, M., Patel, D.B. (2021). Progressive Growth of ETL Tools: A Literature Review of Past to Equip Future. In: Rathore, V.S., Dey, N., Piuri, V., Babo, R., Polkowski, Z., Tavares, J.M.R.S. (eds) Rising Threats in Expert Applications and Solutions. Advances in Intelligent Systems and Computing, vol 1187. Springer, Singapore. https://doi.org/10.1007/978-981-15-6014-9_45
Download citation
DOI: https://doi.org/10.1007/978-981-15-6014-9_45
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-6013-2
Online ISBN: 978-981-15-6014-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)