Design and Performance Analysis of File Replication Strategy on Distributed File System Using GridSim
Distributed Computing Systems like Peer-to-Peer, Web and Cloud are becoming popular day by day and are being used in a wide variety of data intensive applications. Data replication is an important concept which increases the availability and reliability of data in these systems. This paper presents the design and performance analysis of simulated implementation of data replication strategy on a distributed file system using GridSim toolkit. The parameters taken for the performance analysis are aggregate bandwidth, successful execution rate and system byte effective rate. The results indicate that the distributed file system making use of data replication strategy performs better, with respect to the parameters mentioned above, than the distributed file system which is not making use of data replication strategy. The integration of data replication scheme with distributed file system can greatly improve the availability and reliability of data.
KeywordsDistributed File System Replication Strategy Web Cloud Peerto- Peer GridSim
Unable to display preview. Download preview PDF.
- 1.Goel, S., Buyya, R.: Data Replication Strategies in Wide Area Distributed Systems, http://www.buyya.com/papers/DataReplicationInDSChapter2006.pdf
- 3.Shvachko, K., Kuang, H., Radia, S., Chansler, R.: The Hadoop Distributed File System. In: Proceedings of the 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST), May 03-07, pp. 1–10 (2010)Google Scholar
- 4.Buyya, R., Murshed, M.: GridSim: A toolkit for the modeling and simulation of distributed resource management and scheduling for Grid Computing. Concurrency and Computation: Practice and Experience 14(13-15) (2002)Google Scholar
- 5.Borthakur, D.: HDFS Architecture, The Apache Software Foundation, http://hadoop.apache.org/docs/r0.20.0/hdfs_design.pdf
- 6.Rahman, R.M., Barker, K., Alhajj, R.: Replica placement design with static optimality and dynamic maintainability. In: Proceedings of the 6th IEEE International Symposium on Cluster Computing and the Grid, pp. 434–437 (2006)Google Scholar
- 13.Latip, R., Othman, M., Abdullah, A., Ibrahim, H., Sulaiman, M.N.: Quorum-based data replication in grid environment. International Journal of Computational Intelligence Systems 2(4), 386–397 (2009)Google Scholar
- 14.Wei, Q., Veeravalli, B., Gong, B., Zeng, L., Feng, D.: CDRM: A cost-effiective dynamic replication management scheme for cloud storage cluster. In: Proceedings of the 2010 IEEE International Conference on Cluster Computing, Heraklion, pp. 188–196 (2010)Google Scholar
- 15.Bonvin, N., Papaioannou, T.G., Aberer, K.: A self-organized, fault-tolerant and scalable replication scheme for cloud storage. In: Proceedings of the 1st ACM Symposium on Cloud Computing, Indianapolis, pp. 205–216 (2010)Google Scholar
- 16.Peter, B., Ishai, M., Mosharaf, C., Pradeepkumar, M., David, A.M., Ion, S.: Surviving failures in bandwidth-constrained datacenters. In: Proceedings of the ACM SIGCOMM 2012 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication, August 13-17 (2012)Google Scholar