Advertisement

Towards a Scalable, Performance-Oriented OLAP Storage Engine

  • Todd Eavis
  • Ahmad Taleb
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7239)

Abstract

Over the past generation, data warehousing and OLAP applications have become the cornerstone of contemporary decision support environments. Typically, OLAP servers are implemented on top of either proprietary array-based storage engines (MOLAP) or as extensions to conventional relational DBMSs (ROLAP). While MOLAP systems do indeed provide impressive performance on common analytics queries, they tend to have limited scalability. Conversely, ROLAP’s table oriented model scales quite nicely, but offers mediocre performance at best relative to the MOLAP systems. In this paper, we describe a storage and indexing framework that aims to provide both MOLAP like performance and ROLAP like scalability by essentially combining some of the best features of both. Based upon a combination of R-trees and bitmap indexes, the storage engine has been integrated with a robust OLAP query engine prototype that is able to fully exploit the efficiency of the proposed storage model. Experimental results demonstrate that not only does the framework improve upon more naive approaches, but that it does indeed offer the potential to optimize both query performance and scalability.

Keywords

Dimension Table Data Cube Query Performance Fact Table View Manager 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Abouzeid, A., Bajda-Pawlikowski, K., Abadi, D., Silberschatz, A., Rasin, A.: Hadoopdb: an architectural hybrid of mapreduce and dbms technologies for analytical workloads. Proc. VLDB Endow. 2, 922–933 (2009)Google Scholar
  2. 2.
  3. 3.
    Dean, J., Ghemawat, S.: Mapreduce: a flexible data processing tool. Commununications of the ACM 53, 72–77 (2010)CrossRefGoogle Scholar
  4. 4.
    Dehne, F., E.T., Rau-Chaplin, A.: Rcube: Parallel multi-dimensional rolap indexing. Journal of Data Warehousing and Mining 4, 1–14 (2008)Google Scholar
  5. 5.
    Eavis, T., Cueva, D.: The lbf r-tree: Efficient multidimensional indexing with graceful degradation. In: 22nd International Database Engineering and Applications Symposium, IDEAS 2007 (2007)Google Scholar
  6. 6.
    Eavis, T., Taleb, A.: Mapgraph: efficient methods for complex olap hierarchies. In: Conference on Information and Knowledge Management, pp. 465–474 (2007)Google Scholar
  7. 7.
    Zimanyi, E., Malinowski, E.: Hierarchies in a conceptual mode, from conceptual modeling to logical representation. In: Data & KNowledge Engineering (2005)Google Scholar
  8. 8.
  9. 9.
    Gray, J., Bosworth, A., Layman, A., Pirahesh, H.: Data cube: A relational aggregation operator generalizing group-by, cross-tab, and sub-total. In: International Conference on Data Engineering (ICDE), pp. 152–159. IEEE Computer Society, Washington, DC (1996)Google Scholar
  10. 10.
    Gupta, H., Harinarayan, V., Rajaraman, A., Ullman, J.D.: Index selection for olap. In: Proceedings of the Thirteenth International Conference on Data Engineering, ICDE 1997, pp. 208–219. IEEE Computer Society, Washington, DC (1997)CrossRefGoogle Scholar
  11. 11.
    Lakshmanan, L.V.S., Pei, J., Zhao, Y.: Qc-trees: an efficient summary structure for semantic olap. In: Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, SIGMOD 2003, pp. 64–75. ACM, New York (2003)CrossRefGoogle Scholar
  12. 12.
  13. 13.
  14. 14.
    Morfonios, K., Ioannidis, Y.: Cure for cubes: cubing using a rolap engine. In: Proceedings of the 32nd International Conference on Very Large Data Bases, VLDB 2006, pp. 379–390. VLDB Endowment (2006)Google Scholar
  15. 15.
  16. 16.
    Plattner, H.: A common database approach for oltp and olap using an in-memory column database. In: Proceedings of the 35th SIGMOD International Conference on Management of Data, SIGMOD 2009, pp. 1–2 (2009)Google Scholar
  17. 17.
    Roussopoulos, N., Kotidis, Y., Roussopoulos, M.: Cubetree: organization of and bulk incremental updates on the data cube. In: Proceedings of the 1997 ACM SIGMOD International Conference on Management of Data, SIGMOD 1997, pp. 89–99. ACM, New York (1997)CrossRefGoogle Scholar
  18. 18.
    Sismanis, Y., Deligiannakis, A., Roussopoulos, N., Kotidis, Y.: Dwarf: shrinking the PetaCube. In: Proceedings of the 2002 ACM SIGMOD Conference, pp. 464–475 (2002)Google Scholar
  19. 19.
    Stonebraker, M., Abadi, D., DeWitt, D.J., Madden, S., Paulson, E., Pavlo, A., Rasin, A.: Mapreduce and parallel dbmss: friends or foes? Commun. ACM 53, 64–71 (2010)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Todd Eavis
    • 1
  • Ahmad Taleb
    • 2
  1. 1.Concordia UniversityMontrealCanada
  2. 2.College of Computer Science and Information SystemsNajran UniversitySaudi Arabia

Personalised recommendations