DataStager: scalable data staging services for petascale applications

Abbasi, Hasan; Wolf, Matthew; Eisenhauer, Greg; Klasky, Scott; Schwan, Karsten; Zheng, Fang

doi:10.1007/s10586-010-0135-6

DataStager: scalable data staging services for petascale applications

Published: 15 June 2010

Volume 13, pages 277–290, (2010)
Cite this article

Cluster Computing Aims and scope Submit manuscript

Hasan Abbasi¹,
Matthew Wolf¹,
Greg Eisenhauer¹,
Scott Klasky²,
Karsten Schwan¹ &
…
Fang Zheng¹

436 Accesses
65 Citations
Explore all metrics

Abstract

Known challenges for petascale machines are that (1) the costs of I/O for high performance applications can be substantial, especially for output tasks like checkpointing, and (2) noise from I/O actions can inject undesirable delays into the runtimes of such codes on individual compute nodes. This paper introduces the flexible ‘DataStager’ framework for data staging and alternative services within that jointly address (1) and (2). Data staging services moving output data from compute nodes to staging or I/O nodes prior to storage are used to reduce I/O overheads on applications’ total processing times, and explicit management of data staging offers reduced perturbation when extracting output data from a petascale machine’s compute partition. Experimental evaluations of DataStager on the Cray XT machine at Oak Ridge National Laboratory establish both the necessity of intelligent data staging and the high performance of our approach, using the GTC fusion modeling code and benchmarks running on 1000+ processors.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Abbasi, H., Wolf, M., Schwan, K.: LIVE data workspace: a flexible, dynamic and extensible platform for petascale applications. In: Cluster Computing, Sept. 2007. IEEE International
Ali, N., Lauria, M.: Improving the performance of remote i/o using asynchronous primitives. In: 15th IEEE International Symposium on High Performance Distributed Computing (2006), pp. 218–228
Beckman, P., Coghlan, S.: ZeptoOS: the small Linux for big computers (2005)
Bell, K., Chien, A., Lauria, M.: A high-performance cluster storage server. In: Proceedings of 11th IEEE International Symposium on High Performance Distributed Computing. HPDC-11 2002, pp. 311–320 (2002)
Bianchini, R., Crovella, M., Kontothanassis, L., LeBlanc, T.: Alleviating memory contention in matrix computations on large-scale shared-memory multiprocessors. Technical report, DTIC (1993)
Borrill, J., Oliker, L., Shalf, J., Shan, H.: Investigation of leading HPC I/O performance using a scientific-application derived Benchmark. In: Proceedings of the Conference on SuperComputing, SC07 (2007)
Brightwell, R., Hudson, T., Riesen, R., Maccabe, A.B.: The Portals 3.0 message passing interface. Technical report SAND99-2959, Sandia National Laboratories, December 1999
Brightwell, R., Lawry, B., MacCabe, A.B., Riesen, R.: Portals 3.0: Protocol building blocks for low overhead communication. In: IPDPS ’02: Proceedings of the 16th International Parallel and Distributed Processing Symposium, p. 268. IEEE Comput. Soc., Washington (2002)
Google Scholar
Bustamante, F.E., Eisenhauer, G., Schwan, K., Widener, P.: Efficient wire formats for high performance computing. In: Proceedings of the ACM/IEEE Conference on Supercomputing (CDROM), p. 39. IEEE Comput. Soc., Los Alamitos (2000)
Google Scholar
Cai, Z., Eisenhauer, G., He, Q., Kumar, V., Schwan, K., Wolf, M.: Iq-services: network-aware middleware for interactive large-data applications. In: MGC ’04: Proceedings of the 2nd Workshop on Middleware for Grid Computing, pp. 11–16. ACM Press, New York (2004)
Chapter Google Scholar
Carns, P.H., Ligon, W.B. III, Ross, R.B., Thakur, R.: PVFS: A parallel file system for Linux clusters. In: Proceedings of the 4th Annual Linux Showcase and Conference, pp. 317–327, Atlanta, GA, 2000. USENIX Association
Cluster File Systems Inc. Lustre: a scalable, high-performance file system. White paper, version 1.0, November 2002. http://www.lustre.org/docs/whitepaper.pdf
Dandamudi, S.: Reducing hot-spot contention in shared-memory multiprocessor systems. IEEE Parallel Distrib. Technol. 7(1), 48–59 (1999)
Google Scholar
Ding, C., Dwarkadas, S., Huang, M., Shen, K., Carter, J.: Program phase detection and exploitation. In: 20th International Parallel and Distributed Processing Symposium. IPDPS 2006, 8 pp., 25–29 April 2006
Docan, C., Parashar, M., Klasky, S.: High speed asynchronous data transfers on the cray xt3. In: Cray User Group Conference (2007)
Eisenhauer, G.: The evpath library. http://www.cc.gatech.edu/systems/projects/EVPath
Eisenhauer, G.: Portable binary input/output. http://www.cc.gatech.edu/systems/projects/PBIO
Eisenhauer, G., Bustamente, F., Schwan, K.: Event services for high performance computing. In: Proceedings of High Performance Distributed Computing, HPDC-2000 (2000)
Gardner, M.K., Feng, W.-C., Archuleta, J.S., Lin, H., Ma, X.: Parallel genomic sequence-searching on an ad-hoc grid: experiences, lessons learned, and implications. In: ACM/IEEE SC—06: The International Conference on High-Performance Computing, Networking, Storage, and Analysis, Tampa, FL, November 2006. Best Paper Nominee
Golestani, S.: A stop-and-go queueing framework for congestion management. In: SIGCOMM’90 Symposium, September 1990, pp. 8–18. ACM
Jain, R., Ramakrishnan, K.K., Chiu, D.M.: Congestion avoidance in computer networks with a connectionless network layer. Technical Report DEC-TR-506, Digital Equipment Corporation, MA, Aug. 1987
Kotz, D.: Disk-directed I/O for MIMD multiprocessors. ACM Trans. Comput. Syst. 15(1), 41–47 (1997)
Article MathSciNet Google Scholar
Latham, R., Miller, N., Ross, R., Carns, P.: A next-generation parallel file system for Linux clusters. LinuxWorld, 2(1), January 2004
Lofstead, J., Schwan, K., Klasky, S., Podhorszki, N., Jin, C.: Flexible IO and integration for scientific codes through the adaptable IO system (ADIOS). In: Challenges of Large Applications in Distributed Environments, CLADE (2008)
Miller, E.L., Katz, R.H.: Input/output behavior of supercomputing applications. In: Supercomputing ’91: Proceedings of the 1991 ACM/IEEE Conference on Supercomputing, pp. 567–576. ACM, New York (1991)
Chapter Google Scholar
Nisar, A., Liao, K.W., Choudhary, A.: Scaling parallel I/O performance through I/O delegate and caching system. In: SC ’08: Proceedings of the ACM/IEEE Conference on Supercomputing, pp. 1–12, Piscataway, NJ, USA. IEEE Press, New York (2008)
Google Scholar
Oldfield, R.A., Maccabe, A.B., Arunagiri, S., Kordenbrock, T., Riesen, R., Ward, L., Widener, P.: Lightweight I/O for Scientific Applications. In: Proc. of IEEE Conference on Cluster Computing, Barcelona, Spain, September 2006
Oldfield, R.A., Widener, P., Maccabe, A.B., Ward, L., Kordenbrock, T.: Efficient data movement for lightweight I/O. In: Proc. 2006 Workshop on high-performance I/O Techniques and Deployment of Very-Large Scale I/O Systems (HiPerI/O 2006), Barcelona, Spain, September 2006
Oliker, L., Carter, J., Wehner, M., Canning, A., Ethier, S., Mirin, A., Bala, G., Parks, D., Shigemune Kitawaki, P.W., Tsuda, Y.: Leading computational methods on scalar and vector hec platforms. In: Proceedings of SuperComputing (2005)
Patrick, C.M., Son, S., Kandemir, M.: Comparative evaluation of overlap strategies with study of i/o overlap in mpi-io. SIGOPS Oper. Syst. Rev. 42(6), 43–49 (2008)
Article Google Scholar
Schmuck, F., Haskin, R.: GPFS: a shared-disk file system for large computing clusters. In: Proceedings of the 1st USENIX Conference on File and Storage Technologies (2002)
Seamons, K.E., Chen, Y., Jones, P., Jozwiak, J., Winslett, M.: Server-directed collective i/o in panda. In: Supercomputing ’95: Proceedings of the ACM/IEEE Conference on Supercomputing (CDROM), p. 57. ACM, New York (1995)
Chapter Google Scholar
Sinha, S., Parashar, M.: Adaptive system sensitive partitioning of amr applications on heterogeneous clusters. Cluster Comput. 5(4), 343–352 (2002)
Article Google Scholar
Stone, N., Balog, D., Gill, B., Johan-Son, B., Marsteller, J., Nowoczynski, P., Porter, D., Reddy, R., Scott, J., Simmel, D., et al.: PDIO: High-performance remote file I/O for portals enabled compute nodes. In: Proceedings of the Conference on Parallel and Distributed Processing Techniques and Applications, Las Vegas, NV, June 2006
Widener, P.M., Wolf, M., Abbasi, H., Barrick, M., Lofstead, J., Pullikottil, J., Eisenhauer, G., Gavrilovska, A., Klasky, S., Oldfield, R., Bridges, P.G., Maccabe, A.B., Schwan, K.: Structured streams: data services for petascale science environments. Technical Report TR-CS-2007-17, University of New Mexico, Albuquerque, NM, November 2007
Wolf, M., Abbasi, H., Collins, B., Spain, D., Schwan, K.: Service augmentation for high end interactive data services. In: IEEE International Conference on Cluster Computing, Cluster 2005, September 2005
Wolf, M., Cai, Z., Huang, W., Schwan, K.: Smartpointers: Personalized scientific data portals in your hand. In: Proceedings of the Conference IEEE/ACM SC2002, p. 20. IEEE Computer Society, Los Alamitos (2002)
Google Scholar

Download references

Author information

Authors and Affiliations

College of Computing, Georgia Institute of Technology, Atlanta, GA, USA
Hasan Abbasi, Matthew Wolf, Greg Eisenhauer, Karsten Schwan & Fang Zheng
Oak Ridge National Laboratory, Oak Ridge, TN, USA
Scott Klasky

Authors

Hasan Abbasi
View author publications
You can also search for this author in PubMed Google Scholar
Matthew Wolf
View author publications
You can also search for this author in PubMed Google Scholar
Greg Eisenhauer
View author publications
You can also search for this author in PubMed Google Scholar
Scott Klasky
View author publications
You can also search for this author in PubMed Google Scholar
Karsten Schwan
View author publications
You can also search for this author in PubMed Google Scholar
Fang Zheng
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hasan Abbasi.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Abbasi, H., Wolf, M., Eisenhauer, G. et al. DataStager: scalable data staging services for petascale applications. Cluster Comput 13, 277–290 (2010). https://doi.org/10.1007/s10586-010-0135-6

Download citation

Received: 09 November 2009
Accepted: 16 March 2010
Published: 15 June 2010
Issue Date: September 2010
DOI: https://doi.org/10.1007/s10586-010-0135-6

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

DataStager: scalable data staging services for petascale applications

Abstract

Access this article

Similar content being viewed by others

Big data analytics on Apache Spark

A survey on the evolution of stream processing systems

Containerization technologies: taxonomies, applications and challenges

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

DataStager: scalable data staging services for petascale applications

Abstract

Access this article

Similar content being viewed by others

Big data analytics on Apache Spark

A survey on the evolution of stream processing systems

Containerization technologies: taxonomies, applications and challenges

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation