Skip to main content

Progressive Growth of ETL Tools: A Literature Review of Past to Equip Future

  • Conference paper
  • First Online:
Rising Threats in Expert Applications and Solutions

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1187))

Abstract

ETL is the bedrock of a data warehouse. As the data accelerates in the diversified fields, it is substantive to integrate the data to extract the prerequisites for business advancement and policy formation in the arena of health care, education, smart cities, transportation, and many other areas. ETL (Extract–Transform–Load) is the process used to incorporate a range of data sources in the data warehouse for business intelligence. As per the requirement of data integration and analysis, several categories of ETL tools have developed like code-based, GUI-based, cloud-based, Metadata support, Real-time support, and batch processing. Selecting appropriate ETL tools is a crucial task in any business, with each tool having its dominant and stumbling blocks. This paper aims to first, the study of existing ETL tools and its features, and secondly, how to identify vital functions of the ETL tool for an organization.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. C. Thomsen, ETL. Springer International Publishing AG, Part of Springer Nature (2018)

    Google Scholar 

  2. R. Mukherjee, P. Kar, A comparative review of data warehousing ETL tools with new trends and industry insight, in IEEE 7th International Advance Computing Conference (2017), pp. 943–948

    Google Scholar 

  3. J. Awiti, E. Zimányi, An XML interchange format for ETL models, in New Trends in Databases and Information Systems. ADBIS 2019, ed. by T. Welzer et al. Communications in Computer and Information Science, vol. 1064 (Springer, 2019)

    Google Scholar 

  4. R. Aluvalu, M. Jabbar, Handling data analytics on unstructured data using MongoDB, in Smart Cities Symposium (2018), pp. 1–5

    Google Scholar 

  5. R. Yangui, A. Nabli, F. Gargouri, ETL Based Framework for NoSQL Warehousing, Lecture Notes in Business Information Processing (Springer, Cham, 2017)

    Book  Google Scholar 

  6. J. Wang, W. Zhao, T. Fan, S. Yang, H. Lv, An improved join free snowflake schema for ETL and OLAP of the data warehouse. Concurr. Comput. Pract. Exper. (2019)

    Google Scholar 

  7. N. Biswas, A. Sarkar, K.C. Mondal, Empirical analysis of programmable ETL tools, in Computational, ed. by Intelligence, Communications, and Business Analytics, CICBA 2018, ed. by J. Mandal, S. Mukhopadhyay, P. Dutta, K. Dasgupta. Communications in Computer and Information Science, vol. 1031 (Springer, Singapore, 2019)

    Google Scholar 

  8. J. Nwokeji, F. Aqlan, A. Apoorva, A. Olagunju, Big Data ETL implementation approaches: a systematic literature review, in Conference of Software Engineering and Knowledge Engineering (2018)

    Google Scholar 

  9. S.M.F. Ali, R. Wrembel, Towards a cost model to optimize user-defined functions in an ETL workflow based on user-defined performance metrics, in Advances in Databases and Information Systems. ADBIS 2019, ed. by T. Welzer, J. Eder, V. Podgorelec, A. Kamišalić Latifić. Lecture Notes in Computer Science, vol. 11695 (Springer, Cham, 2019)

    Google Scholar 

  10. V. Para, A. Mohammad, A. Syed, M. Halgamuge, Pentaho and Jaspersoft: A comparative study of business intelligence open source tools processing big data to evaluate performances. Int. J. Adv. Comput. Sci. Appl. (2016)

    Google Scholar 

  11. A. Amine, R. Daoud, B. Bouikhalene, Efficiency comparison and evaluation between two ETL extraction tools. Indones. J. Electric. Eng. Comput. Sci. 174–181 (2016)

    Google Scholar 

  12. J. Awiti, A. Vaisman, E. Zimányi, From conceptual to logical ETL design using BPMN and relational algebra, in Big Data Analytics and Knowledge Discovery, DaWaK 2019, ed. by C. Ordonez, I.Y. Song, G. Anderst-Kotsis, A. Tjoa, I. Khalil. Lecture Notes in Computer Science, vol. 11708 (Springer, 2019)

    Google Scholar 

  13. S. Yousuf, S. Rizvi, A comparative study of ETL tools, https://www.academia.edu/354387

  14. P. Diouf, A. Boly, S. Ndiaye, Performance of the ETL processes in terms of volume and velocity in the cloud: state of the art, in 4th IEEE International Conference on Engineering Technologies and Applied Sciences (2017)

    Google Scholar 

  15. J. Chakraborty, A. Padki, S. Bansal, Semantic ETL—state-of-art and open research challenges, in IEEE 11th International Conference on Semantic Computing. San Diego, CA (2017)

    Google Scholar 

  16. P. Diouf, A. Boly, S. Ndiaye, Variety of data in the ETL processes in the cloud: state of the art, in IEEE International Conference on Innovative Research and Development, Bangkok, Thailand (2018)

    Google Scholar 

  17. I. Oditis, Z. Bicevska, J. Bicevskis, G. Karnitis, Implementation of NoSQL-based data warehouses. Baltic J. Modern Comput. 6, 45–55 (2018)

    Article  Google Scholar 

  18. Panoply Blog Home page, https://blog.panoply.io/top-9-mongodb-etl-tools

  19. M. Moly, O. Roy, A. Hossain, An advanced ETL technique for error-free data in data warehousing environment. Int. J. Sci. Res. Eng. Trends, 554–558 (2019)

    Google Scholar 

  20. A. Pall, J. Singh, ETL Methodologies, limitations, and framework for the selection and development of an ETL tool. Int. J. Res. Eng. Appl. Sci. 6 (2016)

    Google Scholar 

  21. M.B. Biplob, G.A. Sheraji, S.I. Khan, Comparison of different extraction transformation and loading tools for data warehousing, in 2018 International Conference on Innovations in Science, Engineering, and Technology (2018), pp. 262–267

    Google Scholar 

  22. R. Katragadda, S.S. Tirumala, D. Nandigam, ETL tools for data warehousing: an empirical study of open source Talend Studio versus Microsoft SSIS

    Google Scholar 

  23. I.I. Kholod, M.S. Efimova, S.Y. Kulikov, Using ETL tools for developing a virtual data warehouse, in 2016 XIX IEEE International Conference on Soft Computing and Measurements (2016), pp. 351–354

    Google Scholar 

  24. Gartner Peer Insights, https://www.gartner.com/reviews/market/data-integration-tools. Accessed 15 Sep 2019

  25. H. Mallek, F. Ghozzi, O. Teste, F. Gargouri, BigDimETL with NoSQL database, in 22nd International Conference on Knowledge-based and Intelligent Information & Engineering Systems (2018)

    Google Scholar 

  26. B. Pan, G. Zhang, X. Qin, Design and realization of an ETL method in business intelligence project, in 3rd IEEE International Conference on Cloud Computing and Big Data Analytics (2018)

    Google Scholar 

  27. B. Nabila, B. Ladjel, K. Selma, Towards a conceptualization of ETL and physical storage of semantic data warehouses as a service. Cluster Comput. 16(4), 915–931 (2013)

    Article  Google Scholar 

  28. Data Warehouse Guide, https://panoply.io/data-warehouse-guide/etl-tools/. Accessed 2 Sep 2019

  29. MongoDB Home page, https://www.mongodb.com/use-cases/real-time-analytics. Accessed 5 Sep 2019

  30. S.M.F. Ali, R. Wrembel, From conceptual design to performance optimization of ETL workflows: current state of research and open problems. VLDB J. (2017)

    Google Scholar 

  31. B. Nabila, B. Ladjel, G. Laurent, ETL processes in the era of variety. Large-Scale Data- and Knowl.-Centered Syst. 39, 98–129 (2018)

    Google Scholar 

  32. G2, https://www.g2.com/search?utf8=%E2%9C%93&query=ETL+tools. Accessed 17 Sep 2019

  33. J. Smith, M. Rege, The Data Warehousing Evolution: Where’s it headed next? (ACM, ICCDA, 2017)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Monika Patel .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Patel, M., Patel, D.B. (2021). Progressive Growth of ETL Tools: A Literature Review of Past to Equip Future. In: Rathore, V.S., Dey, N., Piuri, V., Babo, R., Polkowski, Z., Tavares, J.M.R.S. (eds) Rising Threats in Expert Applications and Solutions. Advances in Intelligent Systems and Computing, vol 1187. Springer, Singapore. https://doi.org/10.1007/978-981-15-6014-9_45

Download citation

Publish with us

Policies and ethics