Abstract
In this paper, we address the problem of effective management of parallel I/O in multiprogrammed cluster computing systems by using appropriate I/O scheduling strategies. The performance of I/O devices lags behind the performance of processors and network systems, resulting in I/O becoming the bottleneck in current systems. This gap is expected to increase in the future since I/O performance is limited by physical motion. Therefore, it is imperative that novel techniques for improving I/O performance be developed. Parallel I/O is a promising approach to alleviating this bottleneck. However, very little work exists with respect to scheduling parallel I/O operations explicitly. We propose new I/O scheduling algorithms and evaluate the relative performance of the proposed policies against the most commonly used approaches. Our preliminary results show that the proposed policies outperform current methods and can substantially enhance the performance of real-world scientific applications.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
R. Ross and W. Ligon, “Server-Side Scheduling in Cluster Parallel I/O Systems,” Special Issue of Calculateurs Paralleles, 2003.
Fangyu Chen, Shikharesh Majumdar, “Performance of Parallel I/O Scheduling Strategies on a Network of Workstations,” ICPADS 2001, pages: 157–164, 2001.
Phillip M. Dickens and Rajeev Thakur, “Evaluation of Collective I/O Implementations on Parallel Architectures,” Journal of Parallel and Distributed Computing, 61(8),(2001):1052–1076.
R. Ross, D. Nurmi, A. Cheng, and M. Zingale, “A Case Study in Application I/O on Linux Clusters,” Proceedings of SC2001, Denver, CO, November,(2001).
P. Carns, W. Ligon III, R. Ross, and R. Thakur, “PVFS: A Parallel File System For Linux Clusters,” Proceedings of the 4th Annual Linux Showcase and Conference, (Atlanta, GA, October 2000,) pp. 317–327.
Rajeev Thakur, William Gropp, and Ewing Lusk, “On Implementing MPI-IO Portably and with High Performance,” Proc. of the Sixth Workshop on I/O in Parallel and Distributed Systems, (May 1999), pp. 23–32.
Y. Cho, M. Winslett, S. Kuo, Y. Chen, J. Lee, and K. Motukuri, “Parallel I/O on Networks of Workstations: Performance Improvement by Careful Placement of I/O Servers,” In Proceedings of the HiPer’98, High Performance Computing on Hewlett-Packard Systems (Annual conference of the HP2EUG),Switzerland, October 1998, pages 104–111, Zurich.
Y. Cho, M. Winslett, S. Kuo, J. Lee, and Y. Chen, “Parallel I/O for Scientific Applications on Heterogeneous Clusters: A Resource-utilization Approach,” Proceedings of the 13th ACM International Conference on Supercomputing, Rhodes, Greece, June 1999.
P. Kwong, S. Majumdar, “Scheduling of I/O in Multiprogrammed Parallel Systems,” Informatica, 23(1),(April 1999):67–76.
Kent E. Seamons. “Panda: Fast Access to Persistent Arrays Using High Level Interfaces and Server Directed Input/Output,” PhD thesis, Dept. of Computer Science, University of Illinois at Urbana-Champaign, May 1996.
R. Arpaci-Dusseau, E. Anderson, N. Treuhaft, D. Culler, J. Hellerstein, D. Patterson, and K. Yelick. “Cluster I/O with River: Making the Fast Case Common,” In Proceedings of the Sixth Workshop on I/O in Parallel and Distributed Systems, Atlanta, Georgia, 1999.
Sandra Johnson Baylor, Caroline B. Benveniste, and Y. Hsu. “Performance evaluation of a parallel I/O architecture,” Technical Report RC 20049, IBM T. J. Watson Research Center, May 1995.
A. Acharya, M. Uysal, R. Bennett, A. Mendelson, M. Beynon, J. Hollingsworth, J. Saltz, and A. Sussman. “Tuning the Performance of I/O-Intensive Parallel Applications,” In Proceedings of the Fourth Annual Workshop on I/O in Parallel and Distributed Systems, pages 15–27, May 1996.
Ravi Jain, Kiran Somalwar, John Werth, J.c. Browne, “Heuristics for Scheduling I/O Operations,” IEEE Transactions on Parallel and Distributed Systems, March 1997 (Vol. 8, No. 3), pp. 310–320.
J. H. Abawajy, “An integrated resource scheduling approach on cluster computing systems,” In Proceedings of the PDSECA/IPDPS-03), Nice, France, April 22–26, 2003.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Abawajy, J.H. (2003). Parallel I/O Scheduling in Multiprogrammed Cluster Computing Systems. In: Sloot, P.M.A., Abramson, D., Bogdanov, A.V., Gorbachev, Y.E., Dongarra, J.J., Zomaya, A.Y. (eds) Computational Science — ICCS 2003. ICCS 2003. Lecture Notes in Computer Science, vol 2660. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44864-0_23
Download citation
DOI: https://doi.org/10.1007/3-540-44864-0_23
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40197-1
Online ISBN: 978-3-540-44864-8
eBook Packages: Springer Book Archive