Abstract
We present a technique for discovering and representing changes between versions of data warehouse structures. We select a tree comparison algorithm, adapt it for the particularities of multidimensional data structures and extend it with a module for detection of node renamings. The result of these algorithms are so called editscripts consisting of transformation operations which, when executed in sequence, transform the earlier version to the later, and thus show the relationships between the elements of different versions of data warehouse structures. This procedure helps data warehouse administrators to register changes. We describe a prototypical implementation of the concept which imports multidimensional structures from Hyperion Essbase data warehouses, compares these versions and generates a list of differences.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Eder, J., Koncilia, C.: Changes of dimension data in temporal data warehouses. In: Kambayashi, Y., Winiwarter, W., Arikawa, M. (eds.) DaWaK 2001. LNCS, vol. 2114, p. 284. Springer, Heidelberg (2001)
Eder, J., Koncilia, C., Mitsche, D.: Automatic Detection of Structural Changes in Data Warehouses. In: Kambayashi, Y., Mohania, M., Wöß, W. (eds.) DaWaK 2003. LNCS, vol. 2737, pp. 119–128. Springer, Heidelberg (2003)
Kimball, R.: Slowly Changing Dimensions, Data Warehouse Architect. DBMS Magazine 9 (1996), http://www.dbmsmag.com/
Chamoni, P., Stock, S.: Temporal Structures in Data Warehousing. In: Mohania, M., Tjoa, A.M. (eds.) DaWaK 1999. LNCS, vol. 1676, pp. 353–358. Springer, Heidelberg (1999)
Yang, J.: Temporal Data Warehousing. PhD thesis, Stanford University (2001)
Vaisman, A.: Updates, View Maintenance and Time Management in Multidimensional Databases. PhD thesis, Universidad de Buenos Aires (2001)
Blaschka, M.: FIESTA: A Framework for Schema Evolution in Multidimensional Information Systems. PhD thesis, Technische Universität München (2000)
Eder, J., Koncilia, C., Morzy, T.: The COMET Metamodel for Temporal Data Warehouses. In: Proc. of the 14th Intl. Conf. on Advanced Information Systems Engineering 2002 (2002)
Zhang, K., Shasha, D.: Simple fast algorithms for the editing distance between trees and related problems. SIAM journal on computing 18, 1245–1262 (1989)
Chawathe, S., Rajaraman, A., Garcia-Molina, H., Widom, J.: Change detection in hierarchically structured information. In: Proc. of the 1996 ACM SIGMOD (1996)
Chawathe, S., Garcia-Molina, H.: Meaningful change detection in structured data. In: Proc. of the 1997 ACM SIGMOD (1997)
Cobena, G., Abiteboul, S., Marian, A.: Detecting changes in XML documents. In: Proc. of the 18th Intl. Conf. on Data Engineering (2002)
Wang, Y., DeWitt, D., Cai, J.Y.: X-diff: An effective change detection algorithm for XML documents. In: Proc. of the 19th Intl. Conf. on Data Engineering (2003)
Zhang, L.: On matching nodes between trees. Tech. Rep. 2003–67, HP Labs (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Eder, J., Koncilia, C., Wiggisser, K. (2005). A Tree Comparison Approach to Detect Changes in Data Warehouse Structures. In: Tjoa, A.M., Trujillo, J. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2005. Lecture Notes in Computer Science, vol 3589. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11546849_1
Download citation
DOI: https://doi.org/10.1007/11546849_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28558-8
Online ISBN: 978-3-540-31732-6
eBook Packages: Computer ScienceComputer Science (R0)