Abstract
Cloud computing has a significant impact on information technology solutions for both organizations and researchers. Different users share critical data over the cloud where failures are normal rather than exceptional. Therefore, data fragmentation and data replication algorithms are useful to enhance data security. Three important questions need to be answered carefully: (1) Which files should be replicated; (2) how many appropriate new replicas should be placed; (3) where the new replicas should be stored. In this paper, we propose a CSO-based approach for secure data replication (SDR) that determines suitable data center for new replica by designing a smart fuzzy inference system with four inputs as centrality, energy, storage usage, and load. In addition, a high-quality knowledge base is designed to describe the fuzzy system of CSO algorithm. To obtain a higher level of security, we partition each popular file into several fragments with different sizes based on the ability of data centers. Then, these fragments are stored based on the T-coloring concept to prevent an attacker from determining the locations of the fragments. Consequently, SDR protects the data file without any encryption technique since each data center has a single fragment of a particular file and no meaningful data are achieved in a successful attack. We evaluate the proposed algorithm with CloudSim toolkit, and the experiments show that SDR strategy can reduce the total energy consumption and response time by 31% and 28% (on average) compared to other related algorithms, respectively. In terms of storage usage, effective network usage, hit ratio, mean latency, load variance, number of replications, efficiency, and bandwidth consumption, the obtained results indicate that our strategy outperforms previous replication methods by a significant margin. The main reason is that SDR successfully balances the trade-offs among objectives by the fuzzy system.
Similar content being viewed by others
References
Wei J, Zeng X (2019) Optimal computing resource allocation algorithm in cloud computing based on hybrid differential parallel scheduling. Clust Comput 22:7577–7583
Singh Gill S, Ouyang X, Garraghan P (2020) Tails in the cloud: a survey and taxonomy of straggler management within large-scale cloud data centres. J Supercomput 76:10050–10089
AliKhan A, Zakarya M, Khan R (2019) Energy-aware dynamic resource management in elastic cloud datacenters. Simul Model Pract Theory 92:82–99
Mansouri N, Javidi MM (2020) A review of data replication based on meta-heuristics approach in cloud computing and data grid. Soft Comput 24:14503–14530
Mansouri N, Javidi MM (2018a) A hybrid data replication strategy with fuzzy-based deletion for heterogeneous cloud data centers. J Supercomput 74(10):5349–5372
Liang B, Dong X, Wang Y, Zhang X (2020) Memory-aware resource management algorithm for low-energy cloud data centers. Future Gener Comput Syst 113:329–342
Ardagna D, Panicucci B, Trubian M, Zhang L (2012) Energy-aware autonomic resource allocation in multitier virtualized environments. IEEE Trans Serv Comput 5(1):2–19
Kelefouras V, Djemame K (2018) Workflow simulation aware and multi-threading effective task scheduling for heterogeneous computing. In: 25th International Conference on High Performance Computing (HiPC)
Mansouri N (2016) QDR: a QoS-aware data replication algorithm for Data Grids considering security factors. Clust Comput 19(3):1071–1087
Kang S, Veeravalli B, Aung KMM (2014) ESPRESSO: an encryption as a service for cloud storage systems. In: AIMS 2014, Brno, Czech Republic, pp 15–28
Bhattacherjee S, Das R, Khatua S, Roy S (2020) Energy-efficient migration techniques for cloud environment: a step toward green computing. J Supercomput 76:5192–5220
Mansouri N (2014) Network and data location aware approach for simultaneous job scheduling and data replication in large-scale data grid environments. Front Comput Sci 8:391–408
Mansouri N, Ghafari R, Mohammad Hasani Zade B (2020) Cloud computing simulators: a comprehensive review. Simul Model Pract Theory 104:102144
Li C, Zhang J, Tang H (2019) Replica-aware task scheduling and load balanced cache placement for delay reduction in multi-cloud environment. J Supercomput 75:2805–2836
Mansouri N, Mohammad Hasani Zade B, Javidi MM (2020) A multi-objective optimized replication using fuzzy based self-defense algorithm for cloud computing. J Netw Comput Appl 171:102811
Long SQ, Zhao YL, Chen W (2014) MORM: a multi-objective optimized replication management strategy for cloud storage cluster. J Syst Architect 60:234–244
Boru D, Kliazovich D, Granelli F, Bouvry P, Zomaya AY (2015) Energy-efficient data replication in cloud computing datacenters. Clust Comput 18:385–402
Kliazovich D, Bouvry P, Khan SU (2012) GreenCloud: a packet-level simulator of energy-aware cloud computing data centers. J Supercomput 62(3):1263–1283
Casas I, Taheri J, Ranjan R, Wang L, Zomaya AY (2017) A balanced scheduler with data reuse and replication for scientific workflows in cloud computing. Future Gener Comput Syst 74:1689–2178
Manjula S, Indra Devi M, Swathiya R (2016) Division of data in cloud environment for secure data storage. In: International Conference on Computing Technologies and Intelligent Data Engineering (ICCTIDE)
Nivetha NK, Vijayakumar D (2016) Modeling fuzzy based replication strategy to improve data availability in cloud datacenter. In: International Conference on Computing Technologies and Intelligent Data Engineering
Tos U, Mokadem R, Hameurlain A, Ayav T, Bora S (2016) A performance and profit oriented data replication strategy for cloud systems. In: International Conferences on Ubiquitous Intelligence & Computing, Advanced and Trusted Computing, Scalable Computing and Communications, Cloud and Big Data Computing, Internet of People, and Smart World Congress, pp 780–787
Mansouri N, Kuchaki Rafsanjani M, Javidi MM (2017) DPRS: a dynamic popularity aware replication strategy with parallel download scheme in cloud environments. Simul Model Pract Theory 77:177–196
Li B, Song SL, Bezakova I, Cameron KW (2013) EDR: an energy-aware runtime load distribution system for data-intensive applications in the cloud. In: IEEE International Conference on Cluster Computing, pp 1–8
Limam S, Mokadem R, Belalem G (2019) Data replication strategy with satisfaction of availability, performance and tenant budget requirements. Clust Comput 22:1–12
Mansouri N, Javidi MM (2018b) A new Prefetching-aware Data Replication to decrease access latency in cloud environment. J Syst Softw 144:197–215
Liang L, Xing L, Levitin G (2019) Optimizing dynamic survivability and security of replicated data in cloud systems under co-residence attacks. Reliab Eng Syst Saf 192:106265
Sun SY, Yao WB, Li XY (2018) DARS: a dynamic adaptive replica strategy under high load Cloud-P2P. Future Gener Comput Syst 78:31–40
He L, Qian Z, Shang F (2020) A novel predicted replication strategy in cloud storage. J Supercomput 76:4838–4856
Xue L, Ni J, Li Y, Shen J (2017) Provable data transfer from provable data possession and deletion in cloud storage. Comput Standards Interfaces 54:46–54
Ramanan M, Vivekanandan P (2019) Efficient data integrity and data replication in cloud using stochastic diffusion method. Clust Comput 22:14999–15006
Antonio Parejo J, Ruiz-Corte’s A, Lozano S, Fernandez P (2012) Metaheuristic optimization frameworks: a survey and benchmarking. Soft Comput 16(3):527–561
Mahdavi Jafari M, Khayati GR (2018) Prediction of hydroxyapatite crystallite size prepared by sol–gel route: gene expression programming approach. J Sol-Gel Sci Technol 86(1):112–125
Eberhart R, Kennedy J (1995) A new optimizer using particle swarm theory. In: Proceedings of the Sixth International Symposium on Micro Machine and Human Science, pp 39–43
Cheng R, Jin Y (2015) A competitive swarm optimizer for large scale optimization. IEEE Trans Cybern 45(2):191–204
Luo Y, Che X (2009) Chaos immune particle swarm optimization algorithm with hybrid discrete variables and its application to mechanical optimization. In: Third International Symposium on Intelligent Information Technology Application Workshops
Cheng R, Jin Y (2014) Demonstrator selection in a social learning particle swarm optimizer, In: IEEE Congress on Evolutionary Computation, pp 3103–3110
Wang H, Sun H, Li C, Rahnamayan S, Pan JS (2013) Diversity enhanced particle swarm optimization with neighborhood search. Inf Sci 223:119–135
Phan DH, Suzuki J, Carroll R (2012) Evolutionary multi objective optimization for green clouds. In: Annual Conference Companion on Genetic and Evolutionary Computation, pp 19–26
Jiang G (2009) Power and performance management of virtualized computing environments via look ahead control. Clust Comput 12(1):1–15
Moran MJ, Shapiro HN (1995) Fundamentals of engineering thermodynamics. Wiley, Hoboken
Lub L, Chena D, Rend XL, Ming Zhang Q, Cheng Y (2016) Vital nodes identification in complex networks. Phys Rep 650:1–63
Hale WK (1980) Frequency assignment: theory and applications. Proc IEEE 68(12):1497–1514
Wylie JJ, Bakkaloglu M, Pandurangan V, Bigrigg MW, Oguz S, Tew K, Williams C, Ganger GR, Khosla PK (2001) Selecting the right data distribution scheme for a survivable storage system, Carnegie Mellon University, Technical Report. CMU-CS-01-120
Saadat N, Rahmani AM (2012) PDDRA: a new pre-fetching based dynamic data replication algorithm in data grids. Future Gener Comput Syst 28:666–681
Jeffrey D, Sanjay G, MapReduce: simplified data processing on large clusters. In: Proceedings of the Conference on Operating System Design and Implementation, pp 137–150
Ghemawat S, Gobioff H, Leung ST (2003) The Google file system. ACM SIGOPS Oper Syst Rev 37(5):29–43
Shvachko K, Hairong K, Radia S, Chansler R (2010) The Hadoop distributed file system. In: Proceedings of the 26th Symposium on Mass Storage Systems and Technologies, pp 1–10
Jararweh Y, Alshara Z, Jarrah M, Kharbutli M, Alsaleh MN (2013) TeachCloud: a cloud computing educational toolkit. Int J Cloud Comput. 2(2):237–257
Gupta SKS, Robin Gilbert R, Banerjee A, Abbasi Z, Mukherjeey T, Varsamopoulos G (2011) GDCSim: a tool for analyzing green data center design and resource management techniques. In: International Green Computing Conference and Workshops
Nunez A, Vazquez-Poletti JL, Caminero AC, Castane GG, Carretero J, Llorente IM (2012) iCanCloud: a flexible and scalable cloud infrastructure simulator. J Grid Comput 10(1):185–209
Fittkau F, Frey S, Hasselbring W (2012) Cloud user-centric enhancements of the simulator CloudSim to improve cloud deployment option analysis. In: Proceedings of the 1st European Conference on Service-Oriented and Cloud Computing
Garg S, Buyya R (2011) Networkcloudsim: modeling parallel applications in cloud simulations. In: Proceedings of the 4th IEEE/ACM International Conference on Utility and Cloud Computing, pp 105–113
Lim S, Sharma B, Nam G, Kim E, Das C (2009) MDCSim: a multi-tier data center simulation, platform. In: Proceedings of IEEE International Conference on Cluster Computing and Workshops
Kecskemeti G (2015) DISSECT-CF: a simulator to foster energy-aware scheduling in infrastructure clouds. Simul Model Pract Theory 58:188–218
Teixeira T, Calheiros RN, Gomes DG (2014) CloudReports: an extensible simulation tool for energy-aware cloud computing environments. Cloud Computi, pp 127–142
Barroso LA, Clidaras J, Holzle U (2013) The datacenter as a computer: an introduction to the design of warehouse-scale machines, 2nd ed. Morgan and Claypool Publishers
Cameron DG, Carvajal-schiaffino R, Paul Millar A, Nicholson C, Stockinger K, Zini F (2003) UK Grid Simulation with OptorSim, UK e-Science All Hands Meeting
Wen Y, Xu H, Yang J (2011) A heuristic-based hybrid genetic-variable neighborhood search algorithm for task scheduling in heterogeneous multiprocessor system. Inf Sci 181:567–581
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Appendix
Rights and permissions
About this article
Cite this article
Mansouri, N., Javidi, M.M. & Mohammad Hasani Zade, B. A CSO-based approach for secure data replication in cloud computing environment. J Supercomput 77, 5882–5933 (2021). https://doi.org/10.1007/s11227-020-03497-3
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-020-03497-3