Ad Hoc Star Join Query Processing in Cluster Architectures

  • Josep Aguilar-Saborit
  • Victor Muntés-Mulero
  • Calisto Zuzarte
  • Josep-L. Larriba-Pey
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3589)

Abstract

Processing of large amounts of data in data warehouses is increasingly being done in cluster architectures to achieve scalability. In this paper we look into the problem of ad hoc star join query processing in clusters architectures. We propose a new technique, the Star Hash Join (SHJ), which exploits a combination of multiple bit filter strategies in such architectures. SHJ is a generalization of the Pushed Down Bit Filters for clusters. The objectives of the technique are to reduce (i) the amount of data communicated, (ii) the amount of data spilled to disk during the execution of intermediate joins in the query plan, and (iii) amount of memory used by auxiliary data structures such as bit filters.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Top500 Supercomputer Sites, http://www.top500.org
  2. 2.
    Transaction processing and database benchmarks, http://www.tpc.org
  3. 3.
    Aguilar-Saborit, J., Muntes-Mulero, V., Larriba-Pey, J.-L.: Pushing down bit filters in the pipelined execution of large queries. In: Kosch, H., Böszörményi, L., Hellwagner, H. (eds.) Euro-Par 2003. LNCS, vol. 2790, pp. 328–337. Springer, Heidelberg (2003)CrossRefGoogle Scholar
  4. 4.
    Aguilar-Saborit, J., Muntes-Mulero, V., Larriba-Pey, J.-L., Zuzarte, C.: Ad-hoc star hash join processing in clusters of smp. Technical Report. Universitat Politecnica de Catalunya UPC-DAC-RR-GEN-2005-4Google Scholar
  5. 5.
    Aguilar-Saborit, J., Muntes-Mulero, V., Larriba-Pey, J.-L., Zuzarte, C., Pereyra, H.: On the use of bit filters in shared nothing partitioned systems. To appear in IWIA 2005 (2005)Google Scholar
  6. 6.
    Bernstein, P.A., Chiu, D.M.: Using semijoins to solve relational queries. J. ACM 28(1), 25–40 (1981)MATHCrossRefMathSciNetGoogle Scholar
  7. 7.
    Bernstein, P.A., Goodman, N.: The power of natural joins. SIAM J. Computi. 10, 751–771 (1981)MATHCrossRefMathSciNetGoogle Scholar
  8. 8.
    Bloom, B.H.: Space/time trade-offs in hash coding with allowable errors. Communications of the ACM 13(7), 422–426 (1970)MATHCrossRefGoogle Scholar
  9. 9.
    Chan, C.-Y., Ioannidis, Y.E.: Bitmap index design and evaluation. In: Proc. of the SIGMOD Conf. on the Management of Data, pp. 355–366 (1998)Google Scholar
  10. 10.
    Chaudhuri, S., Dayal, U.: Data warehousing and olap for decision support (turorial). In: SIGMOD Conference 1997, pp. 507–508 (1997)Google Scholar
  11. 11.
    Chaudhuri, S., Dayal, U.: An overview of data warehousing and olap technology. In: SIGMOD, vol. 26, pp. 65–74 (1997)Google Scholar
  12. 12.
    Deshpande, P., Ramasamy, K., Shuckla, A., Naughton, J.F.: Caching multidimensional queries using chunks. In: SIGMOD Conference, pp. 259–270 (1998)Google Scholar
  13. 13.
    DeWitt, D.J., Katz, R., Olken, F., Shapiro, L., Stonebreaker, M., Wood, D.: Implementation Techniques for Main Memory Database Systems. In: Proceedings of the SIGMOD Int’l. Conf. on the Management of Data, pp. 1–8. ACM, New York (1984)Google Scholar
  14. 14.
    Markl, V., Ramsak, F., Bayer, R.: Improving olap performance by multidimensional hierarchical clustering. In: Proc. of the Intl. Database Enfineering and Applications Symposium, pp. 165–177 (1999)Google Scholar
  15. 15.
    Mehta, M., DeWitt, D.J.: Parallel database systems: The future of high performance database processing. In: Proceedings of the 21st VLDB Conference (1995)Google Scholar
  16. 16.
    O’Neil, P., Graefe, G.: Multi-Table Joins Through Bitmapped Join Indices. SIGMOD Record 24(3), 8–11 (1995)CrossRefGoogle Scholar
  17. 17.
    O’Neil, P., Quass, D.: Improved query performance with variant indexes. In: Proc. ACM SIGMOD Int. Conf. on Management of Data, pp. 38–49 (1997)Google Scholar
  18. 18.
    Roussopoulos, R.: Materialized Views and Data Warehouses. SIGMOD Record 27(1), 21–26 (1998)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • Josep Aguilar-Saborit
    • 1
  • Victor Muntés-Mulero
    • 1
  • Calisto Zuzarte
    • 2
  • Josep-L. Larriba-Pey
    • 1
  1. 1.Computer Architecture DepartmentUniversitat Politecnica de CatalunyaBarcelonaSpain
  2. 2.IBM Toronto Lab.MarkhamCanada

Personalised recommendations