New Generation Computing

Volume 28, Issue 3, pp 257–275

Gfarm Grid File System

  • Osamu Tatebe
  • Kohei Hiraga
  • Noriyuki Soda


The Gfarm Grid file system is a global distributed file system for sharing data and supporting distributed data-intensive computing. It federates the local file systems of compute nodes to maximize distributed file I/O bandwidth, and it allows multiple replicas of a file to be stored in any location to avoid concentrated read access to hot files. Data-location-aware process scheduling improves the file I/O performance of distributed data-intensive computing. This paper discusses the design and implementation of the Gfarm Grid file system and reports its performance.
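The idea of data-location-aware scheduling combined with file replicas can be illustrated with a minimal sketch. It is not Gfarm's API or its actual scheduler: the function `schedule_tasks`, the node names, and the replica map are hypothetical and assumed only for illustration. Each task is placed on the least-loaded node that holds a replica of its input file, and falls back to any node when no replica exists.

```python
from collections import defaultdict


def schedule_tasks(tasks, replicas, nodes):
    """Assign each task to a node, preferring nodes that hold its input file.

    tasks:    dict mapping task name -> input file name
    replicas: dict mapping file name -> set of nodes holding a replica
    nodes:    list of all compute nodes (hypothetical names)
    """
    load = defaultdict(int)   # number of tasks already placed on each node
    assignment = {}
    for task, path in tasks.items():
        # Nodes that can read the input from their local file system.
        local = [n for n in nodes if n in replicas.get(path, set())]
        # Fall back to every node if the file has no known replica.
        candidates = local or nodes
        # Pick the least-loaded candidate; ties go to the earliest node in the list.
        chosen = min(candidates, key=lambda n: load[n])
        load[chosen] += 1
        assignment[task] = chosen
    return assignment


nodes = ["node1", "node2", "node3"]
replicas = {"a.dat": {"node1", "node2"}, "b.dat": {"node3"}}
tasks = {"t1": "a.dat", "t2": "a.dat", "t3": "b.dat", "t4": "c.dat"}
assignment = schedule_tasks(tasks, replicas, nodes)
```

In this example the two tasks reading the hot file `a.dat` are spread over its two replicas rather than queued on one node, which is the effect the abstract attributes to storing multiple replicas.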


Distributed File System, Global File System, Distributed Data Intensive Computing





Copyright information

© Ohmsha and Springer Japan jointly hold copyright of the journal. 2010

Authors and Affiliations

  1. University of Tsukuba, Ibaraki, Japan
  2. Software Research Associates, Inc, Aichi, Japan
