Abstract
On-Line Analytical Processing (OLAP) dimensions are usually modelled as a hierarchical set of categories (the dimension schema), and dimension instances. The latter consist in a set of elements for each category, and relations between these elements (denoted rollup). To guarantee summarizability, a dimension is required to be strict, that is, every element of the dimension instance must have a unique ancestor in each of its ancestor categories. In practice, elements in a dimension instance are often reclassified, meaning that their rollups are changed (e.g., if the current available information is proved to be wrong). After this operation the dimension may become non-strict. To fix this problem, we propose to compute a set of minimal r-repairs for the new non-strict dimension. Each minimal r-repair is a strict dimension that keeps the result of the reclassification, and is obtained by performing a minimum number of insertions and deletions to the instance graph. We show that, although in the general case finding an r-repair is NP-complete, for real-world dimension schemas, computing such repairs can be done in polynomial time. We present algorithms for this, and discuss their computational complexity.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Chaudhuri, S., Dayal, U.: An Overview of Data Warehousing and OLAP Technology. SIGMOD Record 26, 65–74 (1997)
Bertossi, L., Bravo, L., Caniupán, M.: Consistent query answering in data warehouses. In: AMW (2009)
Hurtado, C., Gutierrez, C., Mendelzon, A.: Capturing Summarizability with Integrity Constraints in OLAP. ACM Transacations on Database Systems 30, 854–886 (2005)
Lenz, H., Shoshani, A.: Summarizability in OLAP and Statistical Data Bases. In: SSDBM, pp. 132–143 (1997)
Rafanelli, M., Shoshani, A.: STORM: a Statistical Object Representation Model. In: Michalewicz, Z. (ed.) SSDBM 1990. LNCS, vol. 420, pp. 14–29. Springer, Heidelberg (1990)
Hurtado, C., Mendelzon, A., Vaisman, A.: Maintaining Data Cubes under Dimension Updates. In: ICDE, pp. 346–355 (1999)
Hurtado, C., Mendelzon, A., Vaisman, A.: Updating OLAP Dimensions. In: DOLAP, pp. 60–66 (1999)
Caniupán, M., Bravo, L., Hurtado, C.: A logic programming approach for repairing inconsistent dimensions in data warehouses. Submitted to Data and Knowledge Engineering (2010)
Dodge, G., Gorman, T.: Essential Oracle8i Data Warehousing: Designing, Building, and Managing Oracle Data Warehouses (with Website). John Wiley & Sons, Inc., Chichester (2000)
Kimball, R., Ross, M.: The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling. John Wiley & Sons, Inc., Chichester (2002)
Hurtado, C., Mendelzon, A.: Reasoning about summarizability in heterogeneous multidimensional schemas. In: Van den Bussche, J., Vianu, V. (eds.) ICDT 2001. LNCS, vol. 1973, p. 375. Springer, Heidelberg (2000)
Pedersen, T., Jensen, C., Dyreson, C.: Extending Practical Pre-Aggregation in On-Line Analytical Processing. In: VLDB, pp. 663–674 (1999)
Vaisman, A.: Updates, View Maintenance and Materialized Views in Multidimnensional Databases. PhD thesis, Universidad de Buenos Aires (2001)
Bertossi, L.: Consistent query answering in databases. ACM Sigmod Record 35, 68–76 (2006)
Zhuge, Y., Garcia-Molina, H., Wiener, J.L.: Multiple View Consistency for Data Warehousing. In: ICDE, pp. 289–300 (1997)
Gupta, H., Mumick, I.S.: Selection of views to materialize under a maintenance cost constraint. In: Beeri, C., Bruneman, P. (eds.) ICDT 1999. LNCS, vol. 1540, pp. 453–470. Springer, Heidelberg (1998)
Schlesinger, L., Lehner, W.: Extending Data Warehouses by Semiconsistent Views. In: DMDW, pp. 43–51 (2002)
Letz, C., Henn, E.T., Vossen, G.: Consistency in Data Warehouse Dimensions. In: IDEAS, pp. 224–232 (2002)
Bravo, L., Caniupán, M., Hurtado, C.: Logic programs for repairing inconsistent dimensions in data warehouses. In: AMW (2010)
Espil, M.M., Vaisman, A., Terribile, L.: Revising data cubes with exceptions: a rule-based perspective. In: DMDW, pp. 72–81 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Caniupán, M., Vaisman, A. (2011). Repairing Dimension Hierarchies under Inconsistent Reclassification. In: De Troyer, O., Bauzer Medeiros, C., Billen, R., Hallot, P., Simitsis, A., Van Mingroot, H. (eds) Advances in Conceptual Modeling. Recent Developments and New Directions. ER 2011. Lecture Notes in Computer Science, vol 6999. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24574-9_11
Download citation
DOI: https://doi.org/10.1007/978-3-642-24574-9_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24573-2
Online ISBN: 978-3-642-24574-9
eBook Packages: Computer ScienceComputer Science (R0)