Efficient Representation of Multidimensional Data over Hierarchical Domains

  • Nieves R. Brisaboa
  • Ana Cerdeira-Pena
  • Narciso López-López
  • Gonzalo Navarro
  • Miguel R. Penabad
  • Fernando Silva-Coira
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9954)

Abstract

We consider the problem of representing multidimensional data where the domain of each dimension is organized hierarchically, and queries ask for summary information at some node of each dimension's hierarchy, possibly at a different level in each dimension. This is the typical case in OLAP databases. A basic approach is to lay out each hierarchy along a one-dimensional line and recast the queries as multidimensional range queries. This approach can be implemented compactly by generalizing to more dimensions the \(k^2\)-treap, a compact representation of two-dimensional points that supports efficient summarization queries over generic ranges. We propose a more flexible generalization that, rather than using a generic quadtree-like partition of the space, follows the domain hierarchy of each dimension to organize the partitioning. The resulting structure is much more efficient than a generic multidimensional structure, since queries are resolved by aggregating far fewer nodes of the tree.
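
The basic approach mentioned in the abstract, laying out each hierarchy along a line and turning a hierarchical query into a multidimensional range query, can be illustrated with a minimal Python sketch. This is not the compact \(k^2\)-treap-based structure of the paper: cells are kept in a plain dictionary and scanned linearly, and the names Hierarchy, range_sum, query, and the toy place/period data are all assumptions made for illustration.

```python
class Hierarchy:
    """A dimension hierarchy given as {node: [children]}; leaves have no entry."""

    def __init__(self, tree, root):
        self.range = {}   # node -> (first_leaf_pos, last_leaf_pos), inclusive
        self._next = 0    # next free position on the one-dimensional line
        self._assign(tree, root)

    def _assign(self, tree, node):
        # Depth-first traversal: the leaves below any node end up contiguous.
        children = tree.get(node, [])
        first = self._next
        if not children:          # a leaf takes the next free position
            self._next += 1
        for child in children:
            self._assign(tree, child)
        self.range[node] = (first, self._next - 1)


def range_sum(cells, ranges):
    """Sum the values of all cells lying inside every per-dimension range.

    cells  : {(x0, x1, ...): value}, coordinates are leaf positions
    ranges : one inclusive (lo, hi) pair per dimension
    """
    return sum(
        value
        for coords, value in cells.items()
        if all(lo <= c <= hi for c, (lo, hi) in zip(coords, ranges))
    )


# Hypothetical toy data: a place hierarchy and a time-period hierarchy.
place = Hierarchy(
    {"World": ["Spain", "Chile"],
     "Spain": ["A Coruña", "Madrid"],
     "Chile": ["Santiago"]},
    "World",
)
period = Hierarchy(
    {"2016": ["Q1", "Q2"],
     "Q1": ["Jan", "Feb", "Mar"],
     "Q2": ["Apr", "May", "Jun"]},
    "2016",
)

# Facts keyed by (place leaf, period leaf), re-keyed by leaf position.
facts = {("A Coruña", "Jan"): 5, ("Madrid", "Feb"): 7,
         ("Madrid", "Apr"): 2, ("Santiago", "Mar"): 4}
cells = {(place.range[p][0], period.range[t][0]): v
         for (p, t), v in facts.items()}


def query(place_node, period_node):
    """Aggregate at one hierarchy node per dimension via a 2D range sum."""
    return range_sum(cells, [place.range[place_node], period.range[period_node]])


print(query("Spain", "Q1"))    # 12
print(query("World", "2016"))  # 18
```

In this sketch every hierarchy node maps to one contiguous interval of leaf positions, so a query naming one node per dimension becomes a single axis-aligned box. A generic space partition is unaware of where those intervals begin and end, which is the inefficiency the hierarchy-following partition of the paper is meant to avoid.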

Copyright information

© Springer International Publishing AG 2016

Authors and Affiliations

  • Nieves R. Brisaboa (1)
  • Ana Cerdeira-Pena (1)
  • Narciso López-López (1)
  • Gonzalo Navarro (2)
  • Miguel R. Penabad (1)
  • Fernando Silva-Coira (1)
  1. Database Laboratory, University of A Coruña, A Coruña, Spain
  2. Department of Computer Science, University of Chile, Santiago, Chile
