Skip to main content

Repairing Dimension Hierarchies under Inconsistent Reclassification

  • Conference paper
  • 995 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6999))

Abstract

On-Line Analytical Processing (OLAP) dimensions are usually modelled as a hierarchical set of categories (the dimension schema), and dimension instances. The latter consist in a set of elements for each category, and relations between these elements (denoted rollup). To guarantee summarizability, a dimension is required to be strict, that is, every element of the dimension instance must have a unique ancestor in each of its ancestor categories. In practice, elements in a dimension instance are often reclassified, meaning that their rollups are changed (e.g., if the current available information is proved to be wrong). After this operation the dimension may become non-strict. To fix this problem, we propose to compute a set of minimal r-repairs for the new non-strict dimension. Each minimal r-repair is a strict dimension that keeps the result of the reclassification, and is obtained by performing a minimum number of insertions and deletions to the instance graph. We show that, although in the general case finding an r-repair is NP-complete, for real-world dimension schemas, computing such repairs can be done in polynomial time. We present algorithms for this, and discuss their computational complexity.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Chaudhuri, S., Dayal, U.: An Overview of Data Warehousing and OLAP Technology. SIGMOD Record 26, 65–74 (1997)

    Article  Google Scholar 

  2. Bertossi, L., Bravo, L., Caniupán, M.: Consistent query answering in data warehouses. In: AMW (2009)

    Google Scholar 

  3. Hurtado, C., Gutierrez, C., Mendelzon, A.: Capturing Summarizability with Integrity Constraints in OLAP. ACM Transacations on Database Systems 30, 854–886 (2005)

    Article  Google Scholar 

  4. Lenz, H., Shoshani, A.: Summarizability in OLAP and Statistical Data Bases. In: SSDBM, pp. 132–143 (1997)

    Google Scholar 

  5. Rafanelli, M., Shoshani, A.: STORM: a Statistical Object Representation Model. In: Michalewicz, Z. (ed.) SSDBM 1990. LNCS, vol. 420, pp. 14–29. Springer, Heidelberg (1990)

    Chapter  Google Scholar 

  6. Hurtado, C., Mendelzon, A., Vaisman, A.: Maintaining Data Cubes under Dimension Updates. In: ICDE, pp. 346–355 (1999)

    Google Scholar 

  7. Hurtado, C., Mendelzon, A., Vaisman, A.: Updating OLAP Dimensions. In: DOLAP, pp. 60–66 (1999)

    Google Scholar 

  8. Caniupán, M., Bravo, L., Hurtado, C.: A logic programming approach for repairing inconsistent dimensions in data warehouses. Submitted to Data and Knowledge Engineering (2010)

    Google Scholar 

  9. Dodge, G., Gorman, T.: Essential Oracle8i Data Warehousing: Designing, Building, and Managing Oracle Data Warehouses (with Website). John Wiley & Sons, Inc., Chichester (2000)

    Google Scholar 

  10. Kimball, R., Ross, M.: The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling. John Wiley & Sons, Inc., Chichester (2002)

    Google Scholar 

  11. Hurtado, C., Mendelzon, A.: Reasoning about summarizability in heterogeneous multidimensional schemas. In: Van den Bussche, J., Vianu, V. (eds.) ICDT 2001. LNCS, vol. 1973, p. 375. Springer, Heidelberg (2000)

    Chapter  Google Scholar 

  12. Pedersen, T., Jensen, C., Dyreson, C.: Extending Practical Pre-Aggregation in On-Line Analytical Processing. In: VLDB, pp. 663–674 (1999)

    Google Scholar 

  13. Vaisman, A.: Updates, View Maintenance and Materialized Views in Multidimnensional Databases. PhD thesis, Universidad de Buenos Aires (2001)

    Google Scholar 

  14. Bertossi, L.: Consistent query answering in databases. ACM Sigmod Record 35, 68–76 (2006)

    Article  Google Scholar 

  15. Zhuge, Y., Garcia-Molina, H., Wiener, J.L.: Multiple View Consistency for Data Warehousing. In: ICDE, pp. 289–300 (1997)

    Google Scholar 

  16. Gupta, H., Mumick, I.S.: Selection of views to materialize under a maintenance cost constraint. In: Beeri, C., Bruneman, P. (eds.) ICDT 1999. LNCS, vol. 1540, pp. 453–470. Springer, Heidelberg (1998)

    Chapter  Google Scholar 

  17. Schlesinger, L., Lehner, W.: Extending Data Warehouses by Semiconsistent Views. In: DMDW, pp. 43–51 (2002)

    Google Scholar 

  18. Letz, C., Henn, E.T., Vossen, G.: Consistency in Data Warehouse Dimensions. In: IDEAS, pp. 224–232 (2002)

    Google Scholar 

  19. Bravo, L., Caniupán, M., Hurtado, C.: Logic programs for repairing inconsistent dimensions in data warehouses. In: AMW (2010)

    Google Scholar 

  20. Espil, M.M., Vaisman, A., Terribile, L.: Revising data cubes with exceptions: a rule-based perspective. In: DMDW, pp. 72–81 (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Caniupán, M., Vaisman, A. (2011). Repairing Dimension Hierarchies under Inconsistent Reclassification. In: De Troyer, O., Bauzer Medeiros, C., Billen, R., Hallot, P., Simitsis, A., Van Mingroot, H. (eds) Advances in Conceptual Modeling. Recent Developments and New Directions. ER 2011. Lecture Notes in Computer Science, vol 6999. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24574-9_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-24574-9_11

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-24573-2

  • Online ISBN: 978-3-642-24574-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics