PPMS: A Peer to Peer Metadata Management Strategy for Distributed File Systems

  • Di Yang
  • Weigang Wu
  • Zhansong Li
  • Jiongyu Yu
  • Yong Li
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8707)

Abstract

Distributed file systems are one of the key building blocks of cloud computing systems. With the rapid growth in the number of users and the volume of data, metadata management has become a crucial factor in the overall performance of a distributed file system. In this paper, we design and implement PPMS, a novel peer-to-peer metadata management strategy. Unlike existing metadata management methods, PPMS adopts a two-layer structure to achieve high scalability and low latency. The upper layer is a metadata index server, which stores the metadata of directories, while the lower layer consists of metadata servers that store the metadata of files. More importantly, the lower layer is organized in a peer-to-peer fashion to further improve scalability. We implement a prototype file system based on PPMS and evaluate its performance experimentally. The results show that our design achieves high performance in terms of latency and system throughput.
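The two-layer split described in the abstract can be illustrated with a small in-memory sketch. This is not the PPMS implementation. The class names (`IndexServer`, `MetadataServer`, `TwoLayerMetadata`) are hypothetical, and the placement rule (hashing the parent directory path to pick a lower-layer peer) is an assumption made only for illustration, since the abstract does not say how files are mapped to metadata servers.

```python
import hashlib
from dataclasses import dataclass, field


@dataclass
class MetadataServer:
    """Lower-layer peer: holds metadata for files only."""
    name: str
    files: dict = field(default_factory=dict)  # full path -> file metadata


@dataclass
class IndexServer:
    """Upper-layer index: holds metadata for directories only."""
    dirs: dict = field(default_factory=dict)  # directory path -> directory metadata


class TwoLayerMetadata:
    def __init__(self, peer_names):
        self.index = IndexServer()
        self.peers = [MetadataServer(n) for n in peer_names]

    def _peer_for(self, directory):
        # ASSUMPTION: peers are chosen by hashing the parent directory path.
        # The abstract does not specify the actual placement policy.
        digest = hashlib.sha256(directory.encode("utf-8")).digest()
        return self.peers[int.from_bytes(digest[:8], "big") % len(self.peers)]

    @staticmethod
    def _parent(path):
        # "/data/a.txt" -> "/data"; "/a.txt" -> "/"
        return path.rpartition("/")[0] or "/"

    def mkdir(self, directory, meta):
        # Directory metadata lives only in the upper layer.
        self.index.dirs[directory] = meta

    def create(self, path, meta):
        directory = self._parent(path)
        if directory not in self.index.dirs:
            raise FileNotFoundError(directory)
        # File metadata lives only on a lower-layer peer.
        self._peer_for(directory).files[path] = meta

    def stat(self, path):
        directory = self._parent(path)
        if directory not in self.index.dirs:
            raise FileNotFoundError(directory)
        return self._peer_for(directory).files[path]


fs = TwoLayerMetadata(["mds0", "mds1", "mds2"])
fs.mkdir("/", {"mode": 0o755})
fs.mkdir("/data", {"mode": 0o755})
fs.create("/data/a.txt", {"size": 10})
print(fs.stat("/data/a.txt"))
```

In this sketch a lookup touches the index once to validate the directory and then goes to a single peer, so adding peers spreads file metadata without growing the index. Whether PPMS routes requests this way is not stated in the abstract.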

Keywords

Distributed File System, Metadata Management, Scalability, Low Latency, Peer-to-Peer

Copyright information

© IFIP International Federation for Information Processing 2014

Authors and Affiliations

  • Di Yang (1)
  • Weigang Wu (1)
  • Zhansong Li (1)
  • Jiongyu Yu (1)
  • Yong Li (1)

  1. Department of Computer Science, Sun Yat-sen University, Guangzhou, China
