Encyclopedia of Database Systems

2018 Edition
| Editors: Ling Liu, M. Tamer Özsu

Parallel Query Optimization

  • Hans Zeller
  • Goetz Graefe
Reference work entry
DOI: https://doi.org/10.1007/978-1-4614-8265-9_1079

Synonyms

Optimization of parallel query plans

Definition

Parallel query optimization is the process of finding a plan for database queries that employs parallel hardware effectively. The details of this process depend on the types of parallelism supported by the underlying hardware, but the most common method is partitioning of the data across multiple processors.

Historical Background

Most parallel database systems today can trace part of their heritage back to the Gamma project at the University of Wisconsin, Madison in the 1980s – with the exception of the Teradata system, which predates Gamma by several years. Also influential were the GRACE database machine, developed at the University of Tokyo, and work at the Norwegian Institute of Technology, University of Trondheim. These projects did not publish a description of their parallel query optimization algorithms, however. Later projects, like XPRS (University of California, Berkeley) and IBM DB2 Parallel Edition describe that...

This is a preview of subscription content, log in to check access.

Recommended Reading

  1. 1.
    Ballinger C, Fryer R. Born to be parallel. why parallel origins give teradata an enduring performance edge. IEEE Data Eng Bull. 1997;20(2):3–12.Google Scholar
  2. 2.
    Barclay T, Barnes R, Gray J, Sundaresan P. Loading databases using dataflow parallelism. ACM SIGMOD Rec. 1994;23(4):72–83.CrossRefGoogle Scholar
  3. 3.
    Baru CK, Fecteau G, Goyal A, Hsiao H-I, Jhingran A, Padmanabhan S, Wilson WG. An overview of DB2 parallel edition. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 1995. p. 460–2.Google Scholar
  4. 4.
    Boral H, Alexander W, Clay L, Copeland GP, Danforth S, Franklin MJ, Hart BE, Smith MG, Valduriez P. Prototyping Bubba, a highly parallel database system. IEEE Trans Knowl Data Eng. 1990;2(1): 4–24.CrossRefGoogle Scholar
  5. 5.
    Bratbergsengen K. Algebra operations on a parallel computer – performance evaluation. In: Proceedings of the 5th International Workshop on Data Machines; 1987. p. 415–28.Google Scholar
  6. 6.
    Chen A, Kao Y-F, Pong M, Shak D, Sharma S, Vaishnav J, Zeller H. Query processing in nonstop SQL. IEEE Data Eng Bull. 1993;16(4):29–41.Google Scholar
  7. 7.
    DeWitt DJ, Gerber RH, Graefe G, Heytens ML, Kumar KB, Muralikrishna M. GAMMA – a high performance dataflow database machine. In: Proceedings of the 12th International Conference on Very Large Data Bases; 1986. p. 228–37.Google Scholar
  8. 8.
    DeWitt DJ, Gray J. Parallel database systems: the future of high performance database systems. Commun ACM. 1992;35(6):85–98.CrossRefGoogle Scholar
  9. 9.
    DeWitt DJ, Smith M, Boral H. A single-user performance evaluation of the Teradata database machine. In: Proceedings of the 2nd International Workshop on High Performance Transaction Systems; 1987.Google Scholar
  10. 10.
    Fushimi S, Kitsuregawa M, Tanaka H. An overview of the system software of a parallel relational database machine GRACE. In: Proceedings of the 12th International Conference on Very Large Data Bases; 1986. p. 209–19.Google Scholar
  11. 11.
    Graefe G. Encapsulation of parallelism in the volcano query processing system. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 1990. p. 102–11.Google Scholar
  12. 12.
    Graefe G, Davison DL. Encapsulation of parallelism and architecture-independence in extensible database query execution. IEEE Trans Softw Eng. 1993;19(8):749–64.CrossRefGoogle Scholar
  13. 13.
    Jhingran A, Malkemus T, Padmanabhan S. Query optimization in DB2 parallel edition. IEEE Data Eng. Bull. 1997;20(2):27–34.Google Scholar
  14. 14.
    Lee MK, Freytag JC, Lohman GM. Implementing an interpreter for functional rules in a query optimizer. In: Proceedings of the 14th International Conference on Very Large Data Bases; 1988. p. 218–29.Google Scholar
  15. 15.
    Mohan C, Pirahesh H, Tang WG, Wang Y. Parallelism in relational database management systems. IBM Sys J. 1994;33(2):349–71.CrossRefGoogle Scholar
  16. 16.
    Neches PM. The anatomy of a database computer system. In: Digest of papers – COMPCON; 1985. p. 252–4.Google Scholar
  17. 17.
    Selinger P, Astrahan M, Chamberlin D, Lorie R, Price T. Access path selection in a relational database management system. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 1979. p. 23–34.Google Scholar
  18. 18.
    Stonebraker M, Katz RH, Patterson DA, Ousterhout JK. The design of XPRS. In: Proceedings of the 14th International Conference on Very Large Data Bases; 1988. p. 318–30.Google Scholar
  19. 19.
    von Bueltzingsloewen G. Optimizing SQL queries for parallel execution. In: Proceedings of the ODBF Workshop on Database Query Optimization; 1989.Google Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  1. 1.Hewlett-Packard LaboratoriesPalo AltoUSA
  2. 2.Google, Inc.Mountain ViewUSA

Section editors and affiliations

  • Patrick Valduriez
    • 1
  1. 1.INRIALINANantesFrance