Filter-based join algorithms on uniprocessor and distributed-memory multiprocessor database machines

  • Ghassan Z. Qadah
Efficient Data Access
Part of the Lecture Notes in Computer Science book series (LNCS, volume 303)


The hybrid-hash algorithm and its parallel variant have been recently found to outperform all other algorithms in joining disk-based large relations on uniprocessor and ring-interconnected distributed-memory multiprocessor database machines. This paper presents several extensions to the centralized and distributed hybrid-hash algorithms. These extensions are based on the usage of one or more bit-vectors as filters. A comparative performance study to the presented algorithms is carried out. In a uniprocessor environment, this study shows that one of the proposed filter-based algorithms outperforms all of the other ones, including the hybrid-hash algorithm. In a distributed environment, the filter-based algorithms are found to suffer from a serious problem, namely, overloading the interconnection network with the transmission of large size bit-vectors. Different compression schemes are proposed to reduce the size of a transmitted bit-vector. The augmentation of the distributed version of best-performing centralized algorithm with one of the proposed compression schemes have been found to outperform all of the other algorithms and substantially improves the performance of the join operation.


Hash Function Main Memory Hash Table Processing Node Compression Scheme 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. [Babb79]
    Babb, E., "Implementing a Relational Database by Means of Specialized Hardware." ACM Trans. on Database Systems, Vol. 4, No. 6 (June 1979), pp. 414–429.Google Scholar
  2. [Baru87]
    Baru, C. and Frieder, F., "Implementing Relational Database Operations in a Cube-Connected Multicomputer System." Proceedings of the Third International Conference on Data Engineering, 1987.Google Scholar
  3. [Baru86]
    Baru, C. K. and Su, S. Y. W., "The Architecture of SM3: A Dynamically Partitionable Multicomputer System." IEEE Transaction on Computers, Vol. C-35, No. 9(September 1986), pp. 790–802.Google Scholar
  4. [Bitt83]
    Bitton, D., et al., "Parallel Algorithms for the Execution of Relational Database Operations." ACM Trans. on Database Systems, Vol.8, No. 3(September 1983), pp. 324–353.Google Scholar
  5. [Dewi79]
    Dewitt, D. J., "DIRECT-A Multiprocessor Organization for Supporting Relational Database Management Systems." IEEE Transaction on Computers, Vol. C-28, No. 6(June 1979), pp. 395–408.Google Scholar
  6. [Dewi87]
    Dewitt, D. J., et al., "A Single User Evaluation for the Gamma Database Machine." Proceedings of the 5th International Workshop on Database Machines, 1987.Google Scholar
  7. [Dewi86]
    Dewitt, D., J., et al., "GAMMA — A High Performance Dataflow Database Machine." Proceedings of the 12th International Conference on Very Large Databases, 1986, pp. 228–237.Google Scholar
  8. [Dewi85]
    Dewitt, D., J. and Gerber, R., "Multiprocessor Hash-Based Join Algorithms." Proceedings of VLDB, 1985, pp. 151–164.Google Scholar
  9. [Dewi84]
    Dewitt, D. J. et al., "Implementation Techniques for Large Main Memory Database Systems." Proceedings of SIGMOD, 1984.Google Scholar
  10. [Gard81]
    Gardarin, G., "An Introduction to SABRE: A Multi-Microprocessor Database Machine." Proceedings of the 6th Workshop on Computer Architecture for Non-numeric Processing, Hyeres, France, June 1981.Google Scholar
  11. [Gerb86]
    Gerber, R., "Dataflow Query Processing Using Multiprocessor Hash-Partitioned Algorithms." Tech. Report #672, Computer Sciences Department, University of Wisconsin-Madison, Oct. 1986.Google Scholar
  12. [Good81]
    Goodman, J. R., "An Investigation of Multiprocessor Structures and Algorithms for Database Management." Memo No. UCB/ERLM81 (May 81), Electronic Research Lab., College of Engineering, University of California, Berkeley.Google Scholar
  13. [Hsia81]
    Hsiao, D. K., and Menon, M. J., "Design and Analysis of a Multi-Backend Database System for Performance Improvements, Functionality Expansion and Capacity Growth (Part I and II)." Technical Reports, OSU-CISRC-TR-81-7 and OSU-CISRC-TR-81-8, The Ohio State University, Columbus, Ohio, 1981.Google Scholar
  14. [Kim79]
    Kim, W., "Relational Database Systems." ACM Computing Survey, Vol. 11, No.3, 1979, pp. 185–211.Google Scholar
  15. [Kits84]
    Kitsuregawa, et al., "Architecture and Performance of Relational Algebra Machine GRACE." Proceedings of the International Conference on Parallel Processing, 1984, pp. 241–250.Google Scholar
  16. [Qada]
    Qadah, G. Z. and Irani, K. B., "The Join Operation on A Shared-memory Multiprocessor Database Machine." to appear in the IEEE Transaction on Software Engineering.Google Scholar
  17. [Qada87]
    Qadah, G. Z., "Filter-based Algorithms on Uniprocessor and Distributed-memory Multiprocessor Database Machines." Technical Report # 87-06-DBM-03, EECS Department, Northwestern University, 1987.Google Scholar
  18. [Qada85a]
    Qadah, G. Z., "Database Machines: A Survey." Proceedings of the National Computer Conference, AFIPS Press, 1985, pp.211–223.Google Scholar
  19. [Qada85b]
    Qadah, G. Z. and Irani, K. B., "A Database Machine for Very Large Relational Databases." IEEE Transaction on Computers, Vol. C-34, No. 11(November 1985), pp. 1015–1025.Google Scholar
  20. [Schw83]
    Schweppe, H., Zeidler, H., Hell, W., Leilich, H., Stiege, G. and Teich, W., "RDBM-A Dedicated Multiprocessor System for Database Management." Advanced Database Architecture, Hsiao, D. K.(ed.), Prentice-Hall, 1983, pp. 36–86.Google Scholar
  21. [Seit85]
    Seitz, C., "The Cosmic Cube." Communication of ACM, Vol. 28, No. 1 (Jan. 1985), pp. 22–33.Google Scholar
  22. [Shul87]
    Shultz, R. and Miller, lla, "Tree Structured Multiple Processor Join Methods." Proceedings of the 3rd Data Engineering Conference, 1987, pp. 190–199.Google Scholar
  23. [Tera83]
    Teradata: DBC/1012 Data Base Computer Concepts and Facilities, Teradata Corp. Document No. C02-0001-01, 1984.Google Scholar
  24. [Vald84]
    Valduriez, P. and Gardarin, G., "Join and Semijoin Algorithms for a Multiprocessor Database Machine." ACM Trans. on Database Systems, Vol. 9, No. 1(March 1984), pp. 133–161.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 1988

Authors and Affiliations

  • Ghassan Z. Qadah
    • 1
  1. 1.Electrical Engineering and Computer Science DepartmentNorthwestern UniversityEvanstonUSA

Personalised recommendations