Skip to main content
Log in

Gfarm Grid File System

  • Published:
New Generation Computing Aims and scope Submit manuscript

Abstract

Gfarm Grid file system is a global distributed file system to share data and to support distributed data-intensive computing. It federates local file systems on compute nodes to maximize distributed file I/O bandwidth, and allows to store multiple file replicas in any location to avoid read access concentration of hot files. Data location aware process scheduling improves the file I/O performance of distributed data-intensive computing. This paper discusses the design and implementation of the Gfarm Grid file system, and reports the performance.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Grid Datafarm, http://datafarm.apgrid.org/.

  2. Grid File System WG, https://forge.gridforum.org/projects/gfs-wg/.

  3. The Grid Security Infrastructure Working Group, http://www.gridforum.org/security/gsi/index.htm.

  4. InTrigger, http://www.intrigger.jp/.

  5. PVFS, http://www.pvfs.org/.

  6. Storage Resource Broker, http://www.sdsc.edu/srb/index.ph.

  7. Allcock, B., Bester, J., Bresnahan, J., Chervenak, A. L., Foster, I., Kesselman, C., Meder, S., Nefedova, V., Quesnel, D. and Tuecke, S., “Secure, efficient data transport and replica management for high-performance data-intensive computing,” in Proc. of IEEE Mass Storage Conference, 2001.

  8. Braam, P. J., Lustre, http://www.lustre.org/.

  9. Callaghan, B., Pawlowski, B. and Staubach, P., NFS Version 3 Protocol Specification, RFC 1813, 1995.

  10. Chervenak, A., Deelman, E., Foster, I., Guy, L., Hoschek, W., Iamnitchi, A., Kesselman, C., Kunszt, P., Ripeanu, M., Schwartzkopf, B., Stockinger, H., Stockinger, K. and Tierney, B., “Giggle: A framework for constructing scalable replica location services,” in Proc. of the 2002 ACM/IEEE conference on Supercomputing (SC2002), 2002.

  11. Czajkowski, K., Fitzgerald, S., Foster, I. and Kesselman, C., “Grid information services for distributed resource sharing,” in Proc. of the Tenth IEEE International Symposium on High Performance Distributed Computing (HPDC-10), pp.181–194, 2001.

  12. Ghemawat, S., Gobioff, H. and Leung, S.-T., “The google file system,” in Proc. of 19th ACM Symposium on Operating Systems Principles (SOSP-19), 2003.

  13. Hildebrand, D. and Honeyman, P., “Exporting storage systems in a scalable manner with pNFS,” in Proc. of the 22nd IEEE / 13th NASA Goddard Conference on Mass Storage Systems and Technologies (MSST'05), pp.18–27, 2005.

  14. Howard, J. H., Kazar, M. L., Menees, S. G., Nichols, D. A., Satyanarayanan, M., Sidebotham, R. N. and West, M. J., “Scale and performance in a distributed file system,” ACM Transactions on Computer Systems, 6, 1, pp.51–81, 1988.

    Article  Google Scholar 

  15. Kubiatowicz, J., Bindel, D., Chen, Y., Czerwinski, S., Eaton, P., Geels, D., Gummadi, R., Rhea, S., Weatherspoon, H., Wells, C. and Zhao, B., “OceanStore: an architecture for global-scale persistent storage,” in Proc. of the ninth international conference on Architectural Support for Programming Languages and Operating Systems, pp.190–201, 2000.

  16. Menon, J., Pease, D. A., Rees, R., Duyanovich, L. and Hillsberg, B., “Ibm storage tank - a heterogeneous scalable san file system,” IBM Systems Journal, 42, 2, pp.250–267, 2003.

    Article  Google Scholar 

  17. Schmuck, F. and Haskin, R., “GPFS: A shared-disk file system for large computing clusters,” in Proc. of the 2002 Conference on File and Storage Technologies (FAST), pp.231–244, 2002.

  18. Shepler, S., Callaghan, B., Robinson, D., Thurlow, R., Beame, C., Eisler, M. and Noveck, D., Network File System (NFS) version 4 Protocol, RFC 3530, 2003.

  19. Soltis, S. R., Ruwart, T. M. and O'Keefe, M. T., “The global file system,” in Proc. of the Fifth NASA Goddard Space Flight Center Conference on Mass Storage Systems and Technologies, 1996. http://www.globalfilesystem.org/.

  20. Szeredi, M., FUSE: Filesystem in Userspace, http://fuse.sourceforge.net/.

  21. Tatebe, O., Morita, Y., Matsuoka, S., Soda, N. and Sekiguchi, S., “Grid datafarm architecture for petascale data intensive computing,” in Proc. of the 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid 2002), pp.102–110, 2002.

  22. Tatebe, O., Soda, N., Morita, Y., Matsuoka, S., and Sekiguchi, S., “Gfarm v2: A grid file system that supports high-performance distributed and parallel data computing,” in Proc. of the 2004 Computing in High Energy and Nuclear Physics (CHEP04), 2004.

  23. Weil, S. A., Brandt, S. A., Miller, E. L. and Long, D. D. E., “Ceph: A scalable, high-performance distributed file system,” in Proc. of the 7th USENIX Symposium on Operating Systems Design and Implementation (OSDI'06), pp.307–320, 2006.

  24. Weil, S. A., Brandt, S. A., Miller, E. L. and Maltzahn, C., “CRUSH: Controlled, scalable, decentralized placement of replicated data,” in Proc. of the 2006 ACM/IEEE conference on Supercomputing (SC2006), 2006.

  25. Welch, B. and Gibson, G., “Managing scalability in object storage systems for HPC Linux clusters,” in Proc. of the 21th IEEE/12th NASA Goddard Conference on Mass Storage Systems and Technologies, pp.433–445, 2004.

  26. White, B. S., Walker, M., Humphrey, M. and A. Grimshaw, A. S., “LegionFS: a secure and scalable file system supporting cross-domain high-performance applications,” in Proc. of the 2001 ACM/IEEE conference on Supercomputing (SC2001), 2001.

  27. Xiaohui, W., Li, W. W., Tatebe, O., Gaochao, X., Liang, H. and Jiubin, J., “Integrating local job scheduler - LSF with Gfarm,” LNCS 3758, pp.196–204, 2005.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Osamu Tatebe.

About this article

Cite this article

Tatebe, O., Hiraga, K. & Soda, N. Gfarm Grid File System. New Gener. Comput. 28, 257–275 (2010). https://doi.org/10.1007/s00354-009-0089-5

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00354-009-0089-5

Keywords:

Navigation