Tutorial on parallel database systems

  • Gerhard Weikum
Tutorials
Part of the Lecture Notes in Computer Science book series (LNCS, volume 893)

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

General references

  1. 1.
    DeWitt, D.J., Gray, J.: Parallel Database Systems: The Future of High Performance Database Systems, Communications of the ACM Vol.35 No.6, 1992, pp. 85–98Google Scholar
  2. 2.
    Graefe, G.: Query Evaluation Techniques for Large Databases, ACM Computing Surveys Vol.25 No.2, 1993, pp. 73–170, particularly Sections 9 and 10 on Parallel Query Execution and Parallel AlgorithmsGoogle Scholar
  3. 3.
    Mohan, C., Pirahesh, H., Tang, W.G., Wang, Y.: Parallelism in Relational Database Management Systems, IBM Systems Journal Vol.33 No.2, 1994, pp. 349–371Google Scholar
  4. 4.
    Valduriez, P.: Parallel Database Systems: Open Problems and New Issues, Distributed and Parallel Databases Vol.1 No.2, 1993, pp. 137–165Google Scholar

Data placement issues

  1. 5.
    Abdel-Ghaffar K.A.S., El Abbadi, A., Optimal Disk Allocation for Partial Match Queries, ACM Transactions on Database Systems Vol.18 No.1, 1993, pp. 132–156Google Scholar
  2. 6.
    Azar, Y., Broder, A.Z., Karlin, A.R., Upfal, E.: Balanced Allocations, ACM SIGACT Symposium on Theory of Computing, Montreal, 1994Google Scholar
  3. 7.
    Azar, Y., Naor, J., Rom, R.: The Competitiveness of Online Assignment, 3rd ACM/SIAM Symposium on Discrete Algorithms, 1992Google Scholar
  4. 8.
    Berson, S., Ghandharizadeh, S., Muntz, R., Ju, X.: Staggered Striping in Multimedia Information Systems, ACM SIGMOD International Conference on Management of Data, Minneapolis, 1994Google Scholar
  5. 9.
    Chen, P.M., Patterson, D.A.: Maximizing Performance in a Striped Disk Array, ACM SIGARCH International Symposium on Computer Architecture, Seattle, 1990Google Scholar
  6. 10.
    Copeland, G., Alexander, W., Boughter, E., Keller, T.: Data Placement in Bubba, ACM SIGMOD International Conference on Management of Data, Chicago, 1988Google Scholar
  7. 11.
    Devine, R.: Design and Implementation of DDH: A Distributed Dynamic Hashing Algorithm, 4th International Conference on Foundations of Data Organization and Algorithms, Chicago, 1993Google Scholar
  8. 12.
    Dowdy, L.W., Foster, D.V.: Comparative Models of the File Assignment Problem, ACM Computing Surveys Vol.14 No.2, 1982, pp. 287–313Google Scholar
  9. 13.
    Du, H.C., Sobolewski, J.S., Disk Allocation for Cartesian Product Files on Multiple Disk Systems, ACM Transactions on Database Systems Vol. 7 No.1, 1982, pp. 82–101Google Scholar
  10. 14.
    Faloutsos, C., Metaxas, D., Disk Allocation Methods Using Error Correcting Codes, IEEE Transactions on Computers Vol.40 No.8, 1991, pp. 907–914Google Scholar
  11. 15.
    Faloutsos, C., Bhagwat, P., Declustering Using Fractals, 2nd International Conference on Parallel and Distributed Information Systems, San Diego, 1993Google Scholar
  12. 16.
    Ghandeharizadeh, S., DeWitt, D.J.: A Multiuser Performance Analysis of Alternative Declustering Strategies, 6th IEEE International Conference on Data Engineering, Los Angeles, 1990Google Scholar
  13. 17.
    Ghandeharizadeh, S., DeWitt, D.J.: Hybrid-range Partitioning Strategy: A New Declustering Strategy for Multiprocessor Database Machines, 16th International Conference on Very Large Data Bases, Brisbane, 1990Google Scholar
  14. 18.
    Ghandeharizadeh, S., Ramos, L., Asad, Z., Qureshi, W.: Object Placement in Parallel Hypermedia Systems, 17th International Conference on Very Large Data Bases, Barcelona, 1991Google Scholar
  15. 19.
    Ghandeharizadeh, S., Meyer, R., Schultz, G., Yackel, J.: Optimal Balanced Partitions and a Parallel Database Application, Operations Research Society of America (ORSA) Journal of Computing Vol.5 No.2, 1993Google Scholar
  16. 20.
    Ghandeharizadeh, S., Wilhite, D., Lin, K., Zhao, X.: Object Placement in Parallel Object-Oriented Database Systems, 10th IEEE International Conference on Data Engineering, Houston, 1994Google Scholar
  17. 21.
    Ghandeharizadeh, S., DeWitt, D.J.: MAGIC: A Multiattribute Declustering Mechanism for Multiprocessor Database Machines, IEEE Transactions on Parallel and Distributed Systems Vol.5 No.5, 1994, pp. 509–524Google Scholar
  18. 22.
    Himatsingka, B., Srivastava, J., Li, J., Rotem, D.: Latin Hypercubes: A Class of Multidimensional Declustering Techniques, Technical Report TR 94-09, Computer Science Department, University of Minnesota, Minneapolis, 1994Google Scholar
  19. 23.
    Himatsingka, B., Srivastava, J.: Performance Evaluation of Grid Based Multi-Attribute Record Declustering Methods, 10th IEEE International Conference on Data Engineering, Houston, 1994Google Scholar
  20. 24.
    Houtsma, M.A.W., Apers, P.M.G., Schipper, G.L.V.: Data Fragmentation for Parallel Transitive Closure Strategies, 9th International Conference on Data Engineering, Vienna, 1993Google Scholar
  21. 25.
    Hua, K., Lee, C.: An Adaptive Data Placement Scheme for Parallel Database Computer Systems, 16th International Conference on Very Large Data Bases, Brisbane, 1990Google Scholar
  22. 26.
    Johnson, T., Krishna, P.: Lazy Updates for Distributed Search Structures, ACM SIGMOD International Conference on Management of Data, Washington, 1993Google Scholar
  23. 27.
    Kamel, I., Faloutsos, C.: Parallel R-Trees, ACM SIGMOD International Conference on Management of Data, San Diego, 1992Google Scholar
  24. 28.
    Kim, M.H., Pramanik, S., Optimal File Distribution for Partial Match Queries, ACM SIGMOD International Conference on Management of Data, Chicago, 1988Google Scholar
  25. 29.
    Kim, M.Y., Tantawi, A.N.: Asynchronous Disk Interleaving: Approximating Access Delays, IEEE Transactions on Computers Vol.40 No.7, 1991, pp. 801–810Google Scholar
  26. 30.
    Kouramajian, V., Elmasri, R., Chaudhry, A.: Declustering Techniques for Parallelizing Temporal Access Structures, 10th IEEE International Conference on Data Engineering, Houston, 1994Google Scholar
  27. 31.
    Kroell, B., Widmayer, P.: Distributing a Search Tree Among a Growing Number of Processors, ACM SIGMOD International Conference on Management of Data, Minneapolis, 1994Google Scholar
  28. 32.
    Lee, E.K., Katz, R.H.: An Analytic Performance Model of Disk Arrays and its Application, ACM SIGMETRICS International Conference on Measure ment and Modeling of Computer Systems, Santa Clara, 1993Google Scholar
  29. 33.
    Lee, L.-W.: Optimization of Load-Balanced File Allocation, Ph.D. Dissertation, Department of Electrical Engineering and Computer Science, Northwestern University, Evanston, Illinois, 1994Google Scholar
  30. 34.
    Li, J., Srivastava, J., Rotem, D.: CMD: A Multidimensional Declustering Method for Parallel Database Systems, 18th International Conference on Very Large Data Bases, Vancouver, 1992Google Scholar
  31. 35.
    Litwin, W., Neimat, M.-A., Schneider, D.A.: LH — Linear Hashing for Distributed Files, ACM SIGMOD International Conference on Management of Data, Washington, 1993Google Scholar
  32. 36.
    Litwin, W., Neimat, M.-A., Schneider, D.A.: RP: A Family of Order-Preserving Scalable Distributed Data Structures, 20th International Conference on Very Large Data Bases, Santiago de Chile, 1994Google Scholar
  33. 37.
    Matsliach, G., Shmueli, O.: A Combined Method for Maintaining Large Indices in Multiprocessor Multidisk Environments, IEEE Transactions on Knowledge and Data Engineering Vol.6 No.3, 1994, pp. 479–496Google Scholar
  34. 38.
    Nelson, R., Tantawi, A.N.: Approximate Analysis of Fork/Join Synchronization in Parallel Queues, IEEE Transactions on Computers Vol.37 No.6, 1988, pp. 739–743Google Scholar
  35. 39.
    Scheuermann, P., Weikum, G., Zabback, P.: Data Partitioning and Load Balancing in Parallel Disk Systems, Technical Report 209, Department of Computer Science, ETH Zurich, January 1994, submitted for publicationGoogle Scholar
  36. 40.
    Scheuermann, P., Weikum, G., Zabback, P., “Disk Cooling” in Parallel Disk Systems, Bulletin of the IEEE TC on Data Engineering Vol.17 No.3, September 1994Google Scholar
  37. 41.
    Seeger, B., Larson, P.-A.: Multi-Disk B-Trees, ACM SIGMOD International Conference on Management of Data, Denver, 1991Google Scholar
  38. 42.
    Severance, C., Pramanik, S., Wolberg, P.: Distributed Linear Hashing and Parallel Projection in Main Memory Databases, 16th International Conference on Very Large Data Bases, Brisbane, 1990Google Scholar
  39. 43.
    Srivastava, J., Niccum, T.M., Himatsingka, B., Data Declustering in PADMA: A PArallel Database MAnager, Bulletin of the IEEE TC on Data Engineering Vol.17 No.3, September 1994Google Scholar
  40. 44.
    Tomasic, A., Garcia-Molina, H.: Query Processing and Inverted Indices in Shared-Nothing Text Document Information Retrieval Systems, VLDB Journal Vol.2 No.3, 1993, pp. 243–275Google Scholar
  41. 45.
    Vingralek, R., Breitbart, Y., Weikum, G.: Distributed File Organization with Scalable Cost/Performance, ACM SIGMOD International Conference on Management of Data, Minneapolis, 1994Google Scholar
  42. 46.
    Wolf, J.L.: The Placement Optimization Program: A Practical Solution to the Disk File Assignment Problem, ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, Berkeley, California, 1989Google Scholar
  43. 47.
    Zabback, P.: I/O Parallelism in Database Systems (in German), Doctoral Thesis, Department of Computer Science, ETH Zurich, April 1994Google Scholar
  44. 48.
    Zhou, Y., Shekhar, S., Coyle, M.: Disk Allocation Methods for Parallelizing Grid Files, 10th IEEE International Conference on Data Engineering, Houston, 1994Google Scholar

Other optimization issues

  1. 49.
    DeWitt, D.J., Naughton, J.F., Schneider, D.A., Seshadri, S.: Practical Skew Handling in Parallel Joins, 18th International Conference on Very Large Data Bases, Vancouver, 1992Google Scholar
  2. 50.
    Frieder, O., Baru, C.K.: Site and Query Scheduling Policies in Multicomputer Database Systems, IEEE Transactions on Knowledge and Data Engineering Vol.6 No.4, 1994, pp. 609–619Google Scholar
  3. 51.
    Ganguly, S., Hasan, W., Krishnamurthy, R.: Query Optimization for Parallel Execution, ACM SIGMOD International Conference on Management of Data, San Diego, 1992Google Scholar
  4. 52.
    Hasan, W., Motwani, R.: Optimization Algorithms for Exploiting the Parallelism-Communication Tradeoff in Pipelined Parallelism, 20th International Conference on Very Large Data Bases, Santiago de Chile, 1994Google Scholar
  5. 53.
    Hong, W., Stonebraker, M.: Optimization of Parallel Query Execution Plans in XPRS, Distributed and Parallel Databases Vol.1 No.1, 1993, pp. 9–32Google Scholar
  6. 54.
    Lo, M.-L., Chen, M.-S., Ravishankar, C.V., Yu, P.S.: On Optimal Processor Allocation to Support Pipelined Hash Joins, ACM SIGMOD International Conference on Management of Data, Washington, 1993Google Scholar
  7. 55.
    Rahm, E., Marek, R.: Analysis of Dynamic Load Balancing Strategies for Parallel Shared Nothing Database Systems, 19th International Conference on Very Large Data Bases, Dublin, 1993Google Scholar
  8. 56.
    Shekita, E.J., Young, H.C., Tan, K.-L.: Multi-Join Optimization for Symmetric Multiprocessors, 19th International Conference on Very Large Data Bases, Dublin, 1993Google Scholar
  9. 57.
    Wolf, J.L., Yu, P.S., Turek, J., Dias, D.M.: A Parallel Hash Join Algorithm for Managing Data Skew, IEEE Transactions on Parallel and Distributed Systems Vol.4 No. 12, 1993, pp. 1355–1371Google Scholar
  10. 58.
    Wolf, J.L., Turek, J., Chen, M.-S., Yu, P.S.: Scheduling Multiple Queries on a Parallel Machine, ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, Nashville, Tennessee, 1994Google Scholar

Copyright information

© Springer-Verlag 1995

Authors and Affiliations

  • Gerhard Weikum
    • 1
  1. 1.Department of Computer ScienceUniversity of the SaarlandSaarbrückenGermany

Personalised recommendations