Abstract
Traceability has been used as a quality attribute for software for several decades. It can be defined as the ability to follow the life of software artifacts. Unfortunately, traceability for data warehouses (DW) has not gained the same momentum as it has for software systems. Nowadays, DW systems evolve in a dynamic environment, where DW design becomes a complex task involving many resources and artifacts. To facilitate this task, a design life-cycle comprising five main phases has been defined. Because of the specific characteristics of DW development, a tailored traceability approach is required. In this paper, we propose a novel DW traceability approach driven by the design life-cycle. The approach covers the whole cycle and takes its inter-relationships into account. This study required (i) the formalization of each life-cycle phase and (ii) the identification of the interactions between and within these phases. The traceability approach consists of two main activities: the identification of trace artifacts and links, which are materialized in a traceability model, and the recording of that model. The approach is illustrated using the TPC-H and ETL benchmarks and is implemented using the Postgres DBMS.
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Khouri, S., Semassel, K., Bellatreche, L. (2015). Managing Data Warehouse Traceability: A Life-Cycle Driven Approach. In: Zdravkovic, J., Kirikova, M., Johannesson, P. (eds) Advanced Information Systems Engineering. CAiSE 2015. Lecture Notes in Computer Science, vol 9097. Springer, Cham. https://doi.org/10.1007/978-3-319-19069-3_13
DOI: https://doi.org/10.1007/978-3-319-19069-3_13
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-19068-6
Online ISBN: 978-3-319-19069-3
eBook Packages: Computer Science, Computer Science (R0)