Skip to main content

Advertisement

Log in

Evaluation of redundant data storage in clusters based on multi-multicast and local storage

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

Cluster platforms have an important role in high performance computing (HPC). They execute cloud computing, data-intensive computing and data center applications, which are supported on distributed file systems. The implementation of data redundancy in these file systems provides a support for high availability and error tolerance. This work proposes an implementation of redundant data storage based on the storage included in the cluster nodes, instead of more expensive approaches with a dedicated storage and network, and on multi-multicast transfers, instead on unicast transfers, to perform the multiple simultaneous data diffusion required for implementing redundant data storage. The proposal applies a recently proposed congestion control scheme that adjusts the sender injection rate, taking into account control information from the receiver nodes and the storage technology available on the cluster nodes. The implementation takes full advantage of the switch diffusion hardware and of the IGMP snooping capability of current switches, which allows to multicast a packet just to the output links with receivers joined to a multicast group. It is made at the user level directly on the UDP interface. Evaluation tests with multiple simultaneous storage accesses were performed in a CentOS cluster. Test results show a more efficient use of the cluster storage. The global bandwidth improves by using hardware related to the storage (network and storage devices) more efficiently.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  1. Adamson B, Macker J, Bormann C, Handley M (2009) NACK-oriented reliable multicast (NORM) transport protocol. Technical report, RFC 5740

  2. Boru D, Kliazovich D, Granelli F, Bouvry P, Zomaya AY (2015) Energy-efficient data replication in cloud computing datacenters. Clust Comput 18(1):385–402

    Article  Google Scholar 

  3. Braam PJ (2002) The Lustre storage architecture

  4. Carns PH, Ligon III WB, Ross RB, Thakur R (2000) PVFS: A parallel file system for linux clusters. In: Proceedings of the 4th Annual Linux Showcase and Conference, pp 317–327. USENIX Association

  5. Chaturvedi N, Jain DC (2012) Analysis of replication and replication algorithms in distributed system. Int J Adv Res Comput Sci Softw Eng 2(5):261–266

    Google Scholar 

  6. Chowdhury S, Fatema K (2013) Analysing TCP performance when link experiencing packet loss. Master’s thesis, University of Gothenburg, Göteborg, Sweden

  7. Díaz AF, Anguita M, Camacho HE, Nieto E, Ortega J (2013) Two-level hash/table approach for metadata management in distributed file systems. J Supercomput 64(1):144–155

    Article  Google Scholar 

  8. Dickens PM (2010) A lightweight, high performance communication protocol for grid computing. Clust Comput 13(1):47–66

    Article  Google Scholar 

  9. Filgueira R, Singh DE, Carretero J, Calderón A, García F (2011) Adaptive-CoMPI: enhancing MPI-based applications-performance and scalability by using adaptive compression. Int J High Perform Comput Appl 25(1):93–114

    Article  Google Scholar 

  10. Gemmell J, Montgomery T, Speakman T, Crowcroft J (2003) The PGM reliable multicast protocol. IEEE Network 17(1):16–22

    Article  Google Scholar 

  11. Gu Y, Grossman R (2011) Toward efficient and simplified distributed data intensive computing. IEEE Trans Parallel Distrib Syst 22(6):974–984

    Article  Google Scholar 

  12. Gu Y, Grossman RL (2007) UDT: UDP-based data transfer for high-speed wide area networks. Comput Netw 51(7):1777–1799 (Protocols for fast, long-distance networks)

  13. Haddad I (2006) The HAS architecture: a highly available and scalable cluster architecture for web servers. PhD thesis, Concordia University, Concordia University Libraries

  14. Kadhum MM, Hassanein HS (2015) Congestion-aware TCP-friendly system for multimedia transmission based on UDP. Clust Comput 18(2):693–705

    Article  Google Scholar 

  15. Kasera SK, Hjalmtusson G, Towsley DF, Kurose JF (2000) Scalable reliable multicast using multiple multicast channels. IEEE/ACM Trans Netw 8(3):294–310

    Article  Google Scholar 

  16. Lane RG, Daniels S, Yuan X (2007) An empirical study of reliable multicast protocols over ethernet-connected networks. Perform Eval 64(3):210–228

    Article  Google Scholar 

  17. Li D, Xu M, Liu Y, Xie X, Cui Y, Wang J, Chen G (2013) Reliable multicast in data center networks. IEEE Trans Comput 63(8):2011–2024

    Article  MathSciNet  Google Scholar 

  18. Li J, Veeraraghavan M (2012) A reliable message multicast transport protocol for virtual circuits. In: 4th International conference on communications, mobility, and computing

  19. Palacios RH, Díaz AF, Anguita M, Ortega J, Rodríguez-Quintana C (2016) High-throughput multi-multicast transfers in data center networks. J Supercomput: 1–12

  20. Palacios RH, Díaz AF, Ortega J, Rodríguez-Quintana C, Anguita M (2015) Analyzing high-throughput multicast traffic in cluster computing. In: Proceedings of the 15th International Conference on Computational and Mathematical Methods in Science and Engineering, CMMSE 6–10 Jul 2015

  21. Rani LS, Sudhakar K, Kumar SV (2014) Distributed file systems: a survey. Int J Comput Sci Inf Technol 5(3):3716–3721

    Google Scholar 

  22. Sheng L (1999) Java native interface: programmer’s guide and reference, 1st edn. Addison-Wesley Longman Publishing, Boston

    Google Scholar 

  23. Shvachko K, Kuang H, Radia S, Chansler R (2010)The hadoop distributed file system. In: Proceedings of the 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST), IEEE Computer Society MSST ’10, pp 1–10, Washington, DC, USA

  24. Thanh TD, Mohan S, Choi E, Kim SB, Kim P (2008) A taxonomy and survey on distributed file systems. In: Fourth International Conference on Networked Computing and Advanced Information Management, 2008. NCM ’08, vol 1, pp 144–149

  25. Wang J, Xu Z (2002) Cluster file systems: a case study. Futur Gener Comput Syst 18(3):373–387 (Cluster Computing)

    Article  Google Scholar 

  26. Weil SA, Brandt SA, Miller EL, Long DDE, Maltzahn C (2006) Ceph: a scalable, high-performance distributed file system. In: Proceedings of the 7th Symposium on Operating Systems Design and Implementation OSDI ’06, pp 307–320, USENIX Association, Berkeley, CA, USA

  27. Xiong R, Luo J, Dong F (2015) Optimizing data placement in heterogeneous hadoop clusters. Clust Comput: 1–16

Download references

Acknowledgements

This work has been partially supported by the European Union FEDER and the Spanish Ministry of Economy and Competitiveness TIN2015-67020-P, FPA2015-65150-C3-3-P, and PROMEP/103.5/13/6475 UAEH-146.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Antonio F. Díaz.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Palacios, R.H., Rodríguez-Quintana, C., Díaz, A.F. et al. Evaluation of redundant data storage in clusters based on multi-multicast and local storage. J Supercomput 73, 576–590 (2017). https://doi.org/10.1007/s11227-016-1913-6

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11227-016-1913-6

Keywords

Navigation