Improving the Performance of Parallel I/O Using Distributed Scheduling Algorithms

Durand, Dannie; Jain, Ravi; Tseytlin, David

doi:10.1007/978-1-4613-1401-1_11

Dannie Durand⁴,
Ravi Jain³ &
David Tseytlin³

Part of the book series: The Kluwer International Series in Engineering and Computer Science ((SECS,volume 362))

127 Accesses
3 Citations

Abstract

The cost of data transfers, and in particular of I/O operations, is a growing problem in parallel computing. This performance bottleneck is especially severe for data-intensive applications such as multimedia information systems, databases, and Grand Challenge problems. A promising approach to alleviating this bottleneck is to schedule parallel I/O operations explicitly.

Although centralized algorithms for batch scheduling of parallel I/O operations have previously been developed, they are not be appropriate for all applications and architectures. We develop a class of decentralized algorithms for scheduling parallel I/O operations, where the objective is to reduce the time required to complete a given set of transfers. These algorithms, based on edge-coloring and matching of bipartite graphs, rely upon simple heuristics to obtain shorter schedules. We present simulation results indicating that the best of our algorithms can produce schedules whose length (or makespan) is within 2–20% of the optimal schedule, a substantial improvement on previous decentralized algorithms. We discuss theoretical and experimental work in progress and possible extensions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

A. Aggarwal and J. S. Vitter. The Input/Output complexity of sorting and related problems. Communications of the ACM, pages 1116–1127, Sep. 1988.
Google Scholar
T. E. Anderson, S.S. Owicki, J. B. Saxe, and C. P. Thacker. High-Speed Switch Scheduling for Local-Area Networks. ACM Transactions on Computer Systems, 11 (4): 319–352, November 1993.
Article Google Scholar
M. Beeler. Inside the TC2000 Computer, First Rel. BBN Advanced Computers Inc., 1990.
Google Scholar
Claude Berge. Graphs. North Holland, 1985.
MATH Google Scholar
L. Bianco, J. Blazewicz, P. Dell’Olmo, Dell’Olmo P, and M. Drozdowski. Scheduling multiprocessor tasks on a dynamic configuration of dedicated processors. Technical Report R-92/045, Institute of Computing Science, TU Poznan, 1992.
Google Scholar
L. Bianco, J. Blazewicz, P. Dell’Olmo P, and M. Drozdowski. Scheduling preemptive multiprocessor tasks on dedicated processors. Perf. Eval., 1994. To appear.
Google Scholar
E. G. Coffman, Jr., editor. Computer and Job-Shop Scheduling Theory. John Wiley, 1976.
MATH Google Scholar
E. G. Coffman, Jr., M. R. Garey, D. S. Johnson, and A. S. LaPaugh. Scheduling file transfers. SIAM Journal of Computing, 3:744–780,1985.
Article MathSciNet Google Scholar
P. F. Corbett, S. J. Baylor, and D. G. Feitelson. Overview of the Vesta Parallel File System. In The 1993 Workshop on Input/Output in Parallel Computer Systems, pages 1–17,1993.
Google Scholar
Thomas H. Corman. Fast Permuting on Disk Arrays. Journal of Parallel and Distributed Computing, 17:41–57, January 1993.
Article MATH Google Scholar
T. H. Cormen and D. Kotz. Integrating Theory and Practise in Parallel File Systems. In Proceedings of the DAGS 93 Symposium on Parallel I/O and Databases, pages 64–74,1993.
Google Scholar
Juan Miguel del Rosario, Rajesh Bordawekar, and Alok Choudhary. Improved Parallel I/O via a Two-phase Run-time Access Strategy. In The 1993 Workshop on Inputl Output in Parallel Computer Systems, pages 56–70,1993.
Google Scholar
P. J. Denning. Effects of scheduling on file memory operations. In Proc. AFIPS Spring Joint Comp. Conf., pages 9–21,1967.
Google Scholar
M.D. Durand, T. Montaut, L. Kervella, and W. Jalby. Impact of Memory Contention on Dynamic Scheduling on NUMA Multiprocessors. In Proceedings of the 1993 International Conference on Parallel Processing, August 1993.
Google Scholar
Gereb-Graus and Tsantilas. Efficient Optical Communication in Parallel Computers. In 1992 Symposium on Parallel Algorithms and Architectures, pages 41–48,1992.
Google Scholar
Mario Gonzalez, Jr. Deterministic Processor Scheduling. Computing Surveys, 9:173, Sept. 1977.
Article MathSciNet MATH Google Scholar
R. Jain, K. Somalwar, J. Werth, and J.C. Browne. Scheduling Parallel I/O Operations in Multiple Bus Systems. Journal of Parallel and Distributed Computing, 16:352–362, December 1992.
Article MATH Google Scholar
R. Jain, K. Somalwar, J. Werth, and J.C. Browne. Heuristics for Scheduling Parallel I/O Operations. IEEE Trans. Par. and Distrib. Sys., to appear, 1996.
Google Scholar
Ravi Jain. Scheduling data transfers in parallel computers and communications systems. Technical Report TR-93–03, Univ. Texas at Austin, Dept. of Comp. Sci., Feb. 1993.
Google Scholar
H. J. Karloff and D. B. Schmoys. Efficient Parallel Algorithms for Edge Coloring Problems. Journal of Algorithms, pages 39 -52, August 1987.
Google Scholar
D. Kotz. Multiprocessor file system interfaces. In Proc. 2nd Intl. Conf. Par. Distrib. Info. Sys., pages 194–201,1993.
Google Scholar
O. Kreiger and M. Stumm. HFS: A Flexible File System for large-scale Multiprocessors. In Proceedings of the DAGS 93 Symposium on Parallel I/O and Databases, pages 6–14,1993.
Google Scholar
E. L. Lawler, J. K. Lenstra, and A. H. G. Rinnooy Kan. Recent developments in deterministic sequencing and scheduling: A survey. In Deterministic and Stochastic Scheduling, pages 35–73. D. Reidel Publishing, 1982.
Google Scholar
M. Luby. Removing Randomness in Parallel Computation without a Processor Penalty. In Proceedings of the IEEE Symposium on Foundations of Computer Science, pages 162–173,1988.
Google Scholar
M. Nodine and J. S. Vitter. Paradigms for optimal sorting with multiple disks. In Proc. 26th Hawaii Intl. Conf Sys. Sci., page 50, 1993.
Google Scholar
Krishna Palem. On the Complexity of Precedence Constrained Scheduling. PhD thesis, Univ. Texas at Austin, Dept. of Comp. Sci., 1986. Available as Tech. Rept. TR-86–11.
Google Scholar
A. Panconesi and A Srinavasan. Fast Randomized Algorithms for Distributed Edge Coloring. In Proceedings of the 1992 ACM Symposium on Parallel and Distributed Computing, pages 251–262, August 1992.
Google Scholar
David Patterson, Garth Gibson, and Randy Katz. A case for redundant arrays of inexpensive disks (RAID). In ACM SIGMOD Conference, pages 109–116, June 1988.
Google Scholar
R. H. Patterson, G. A. Gibson, and M. Satyanarayanan. Informed Prefetching: Converting High Throughput to Low Latency. In Proceedings of the DAGS 93 Symposium on Parallel I/O and Databases, pages 41–55,1993.
Google Scholar
A. Silberschatz and J. Peterson. Operating systems concepts. Addison-Wesley, 1988.
MATH Google Scholar
M. Stonebraker and G. A. Schloss. Distributed RAID — a new multiple copy algorithm. In Proc. 6th Intl. Conf. Data Eng., pages 430–437,1990.
Chapter Google Scholar
J.S. Vitter and M.H. Nodine. Large-scale sorting in uniform memory hierarchies. Journal of Parallel and Distributed Computing, pages 107–4114,Jan./Feb.1993.
Google Scholar
J.S. Vitter and E. A. M. Shriver.Optimal disk I/O with parallel block transfer.In Proc. ACM Symp. Theory of Comp., 1990.
Google Scholar

Download references

Author information

Authors and Affiliations

Bellcore, Morristown, NJ, 07960, USA
Ravi Jain & David Tseytlin
CIS, University of Pennsylvania, Philadelphia, PA, 19104, USA
Dannie Durand

Authors

Dannie Durand
View author publications
You can also search for this author in PubMed Google Scholar
Ravi Jain
View author publications
You can also search for this author in PubMed Google Scholar
David Tseytlin
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Bell Communications Research, Morristown, New Jersey, USA
Ravi Jain
University of Texas at Austin, Austin, Texas, USA
John Werth & James C. Browne &

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Durand, D., Jain, R., Tseytlin, D. (1996). Improving the Performance of Parallel I/O Using Distributed Scheduling Algorithms. In: Jain, R., Werth, J., Browne, J.C. (eds) Input/Output in Parallel and Distributed Computer Systems. The Kluwer International Series in Engineering and Computer Science, vol 362. Springer, Boston, MA. https://doi.org/10.1007/978-1-4613-1401-1_11

Download citation

DOI: https://doi.org/10.1007/978-1-4613-1401-1_11
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4612-8607-3
Online ISBN: 978-1-4613-1401-1
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics