Skip to main content

Improving the Performance of Parallel I/O Using Distributed Scheduling Algorithms

  • Chapter
Book cover Input/Output in Parallel and Distributed Computer Systems

Abstract

The cost of data transfers, and in particular of I/O operations, is a growing problem in parallel computing. This performance bottleneck is especially severe for data-intensive applications such as multimedia information systems, databases, and Grand Challenge problems. A promising approach to alleviating this bottleneck is to schedule parallel I/O operations explicitly.

Although centralized algorithms for batch scheduling of parallel I/O operations have previously been developed, they are not be appropriate for all applications and architectures. We develop a class of decentralized algorithms for scheduling parallel I/O operations, where the objective is to reduce the time required to complete a given set of transfers. These algorithms, based on edge-coloring and matching of bipartite graphs, rely upon simple heuristics to obtain shorter schedules. We present simulation results indicating that the best of our algorithms can produce schedules whose length (or makespan) is within 2–20% of the optimal schedule, a substantial improvement on previous decentralized algorithms. We discuss theoretical and experimental work in progress and possible extensions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. A. Aggarwal and J. S. Vitter. The Input/Output complexity of sorting and related problems. Communications of the ACM, pages 1116–1127, Sep. 1988.

    Google Scholar 

  2. T. E. Anderson, S.S. Owicki, J. B. Saxe, and C. P. Thacker. High-Speed Switch Scheduling for Local-Area Networks. ACM Transactions on Computer Systems, 11 (4): 319–352, November 1993.

    Article  Google Scholar 

  3. M. Beeler. Inside the TC2000 Computer, First Rel. BBN Advanced Computers Inc., 1990.

    Google Scholar 

  4. Claude Berge. Graphs. North Holland, 1985.

    MATH  Google Scholar 

  5. L. Bianco, J. Blazewicz, P. Dell’Olmo, Dell’Olmo P, and M. Drozdowski. Scheduling multiprocessor tasks on a dynamic configuration of dedicated processors. Technical Report R-92/045, Institute of Computing Science, TU Poznan, 1992.

    Google Scholar 

  6. L. Bianco, J. Blazewicz, P. Dell’Olmo P, and M. Drozdowski. Scheduling preemptive multiprocessor tasks on dedicated processors. Perf. Eval., 1994. To appear.

    Google Scholar 

  7. E. G. Coffman, Jr., editor. Computer and Job-Shop Scheduling Theory. John Wiley, 1976.

    MATH  Google Scholar 

  8. E. G. Coffman, Jr., M. R. Garey, D. S. Johnson, and A. S. LaPaugh. Scheduling file transfers. SIAM Journal of Computing, 3:744–780,1985.

    Article  MathSciNet  Google Scholar 

  9. P. F. Corbett, S. J. Baylor, and D. G. Feitelson. Overview of the Vesta Parallel File System. In The 1993 Workshop on Input/Output in Parallel Computer Systems, pages 1–17,1993.

    Google Scholar 

  10. Thomas H. Corman. Fast Permuting on Disk Arrays. Journal of Parallel and Distributed Computing, 17:41–57, January 1993.

    Article  MATH  Google Scholar 

  11. T. H. Cormen and D. Kotz. Integrating Theory and Practise in Parallel File Systems. In Proceedings of the DAGS 93 Symposium on Parallel I/O and Databases, pages 64–74,1993.

    Google Scholar 

  12. Juan Miguel del Rosario, Rajesh Bordawekar, and Alok Choudhary. Improved Parallel I/O via a Two-phase Run-time Access Strategy. In The 1993 Workshop on Inputl Output in Parallel Computer Systems, pages 56–70,1993.

    Google Scholar 

  13. P. J. Denning. Effects of scheduling on file memory operations. In Proc. AFIPS Spring Joint Comp. Conf., pages 9–21,1967.

    Google Scholar 

  14. M.D. Durand, T. Montaut, L. Kervella, and W. Jalby. Impact of Memory Contention on Dynamic Scheduling on NUMA Multiprocessors. In Proceedings of the 1993 International Conference on Parallel Processing, August 1993.

    Google Scholar 

  15. Gereb-Graus and Tsantilas. Efficient Optical Communication in Parallel Computers. In 1992 Symposium on Parallel Algorithms and Architectures, pages 41–48,1992.

    Google Scholar 

  16. Mario Gonzalez, Jr. Deterministic Processor Scheduling. Computing Surveys, 9:173, Sept. 1977.

    Article  MathSciNet  MATH  Google Scholar 

  17. R. Jain, K. Somalwar, J. Werth, and J.C. Browne. Scheduling Parallel I/O Operations in Multiple Bus Systems. Journal of Parallel and Distributed Computing, 16:352–362, December 1992.

    Article  MATH  Google Scholar 

  18. R. Jain, K. Somalwar, J. Werth, and J.C. Browne. Heuristics for Scheduling Parallel I/O Operations. IEEE Trans. Par. and Distrib. Sys., to appear, 1996.

    Google Scholar 

  19. Ravi Jain. Scheduling data transfers in parallel computers and communications systems. Technical Report TR-93–03, Univ. Texas at Austin, Dept. of Comp. Sci., Feb. 1993.

    Google Scholar 

  20. H. J. Karloff and D. B. Schmoys. Efficient Parallel Algorithms for Edge Coloring Problems. Journal of Algorithms, pages 39 -52, August 1987.

    Google Scholar 

  21. D. Kotz. Multiprocessor file system interfaces. In Proc. 2nd Intl. Conf. Par. Distrib. Info. Sys., pages 194–201,1993.

    Google Scholar 

  22. O. Kreiger and M. Stumm. HFS: A Flexible File System for large-scale Multiprocessors. In Proceedings of the DAGS 93 Symposium on Parallel I/O and Databases, pages 6–14,1993.

    Google Scholar 

  23. E. L. Lawler, J. K. Lenstra, and A. H. G. Rinnooy Kan. Recent developments in deterministic sequencing and scheduling: A survey. In Deterministic and Stochastic Scheduling, pages 35–73. D. Reidel Publishing, 1982.

    Google Scholar 

  24. M. Luby. Removing Randomness in Parallel Computation without a Processor Penalty. In Proceedings of the IEEE Symposium on Foundations of Computer Science, pages 162–173,1988.

    Google Scholar 

  25. M. Nodine and J. S. Vitter. Paradigms for optimal sorting with multiple disks. In Proc. 26th Hawaii Intl. Conf Sys. Sci., page 50, 1993.

    Google Scholar 

  26. Krishna Palem. On the Complexity of Precedence Constrained Scheduling. PhD thesis, Univ. Texas at Austin, Dept. of Comp. Sci., 1986. Available as Tech. Rept. TR-86–11.

    Google Scholar 

  27. A. Panconesi and A Srinavasan. Fast Randomized Algorithms for Distributed Edge Coloring. In Proceedings of the 1992 ACM Symposium on Parallel and Distributed Computing, pages 251–262, August 1992.

    Google Scholar 

  28. David Patterson, Garth Gibson, and Randy Katz. A case for redundant arrays of inexpensive disks (RAID). In ACM SIGMOD Conference, pages 109–116, June 1988.

    Google Scholar 

  29. R. H. Patterson, G. A. Gibson, and M. Satyanarayanan. Informed Prefetching: Converting High Throughput to Low Latency. In Proceedings of the DAGS 93 Symposium on Parallel I/O and Databases, pages 41–55,1993.

    Google Scholar 

  30. A. Silberschatz and J. Peterson. Operating systems concepts. Addison-Wesley, 1988.

    MATH  Google Scholar 

  31. M. Stonebraker and G. A. Schloss. Distributed RAID — a new multiple copy algorithm. In Proc. 6th Intl. Conf. Data Eng., pages 430–437,1990.

    Chapter  Google Scholar 

  32. J.S. Vitter and M.H. Nodine. Large-scale sorting in uniform memory hierarchies. Journal of Parallel and Distributed Computing, pages 107–4114,Jan./Feb.1993.

    Google Scholar 

  33. J.S. Vitter and E. A. M. Shriver.Optimal disk I/O with parallel block transfer.In Proc. ACM Symp. Theory of Comp., 1990.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1996 Kluwer Academic Publishers

About this chapter

Cite this chapter

Durand, D., Jain, R., Tseytlin, D. (1996). Improving the Performance of Parallel I/O Using Distributed Scheduling Algorithms. In: Jain, R., Werth, J., Browne, J.C. (eds) Input/Output in Parallel and Distributed Computer Systems. The Kluwer International Series in Engineering and Computer Science, vol 362. Springer, Boston, MA. https://doi.org/10.1007/978-1-4613-1401-1_11

Download citation

  • DOI: https://doi.org/10.1007/978-1-4613-1401-1_11

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4612-8607-3

  • Online ISBN: 978-1-4613-1401-1

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics