A Hybrid Genetic Algorithm for Privacy and Cost Aware Scheduling of Data Intensive Workflow in Cloud
- 1.5k Downloads
In the context of cloud computing and big data, the data of all walks of life has been obtained conveniently. Some information of users in the business process is in need of protection with the popularity of workflow applications, which will greatly affect the scheduling of workflow. Meanwhile, the amount of data is usually very large in workflow, the data privacy protection in workflow has also become an important research problem. In this paper, in order to satisfy the requirement of data privacy protection from user and minimize the total scheduling cost in workflow scheduling, we proposed a privacy and cost aware method based on genetic algorithm for data intensive workflow applications which takes into account computation cost, data transmission cost and data storage cost in cloud to solve this problem on finding the best scheduling solution. The proposed algorithm uses the summation of upward and downward rank values for prioritizing workflow tasks, then merges it to make an optimal initial population to obtain a good solution quickly. Besides, a series of operations like selection, crossover and mutation have been used to optimize the scheduling. In the workflow task scheduling, we assign the datacenter for tasks needing privacy protection, which data of these tasks cannot be moved or copied to other datacenter. Finally, we demonstrate the potential of proposed algorithm for optimizing economic cost with user privacy protection requirement. The experimental results show that proposed algorithm can help improve the scheduling and save the time and cost by an average of 3.6 % and 15.6 % respectively.
KeywordsPrivacy protection Cloud computing Workflow scheduling Genetic
This paper was supported by Nature Science Fund of China, under grant number 61272063, 61402167, 61202111, 61402168, 61300129, the Planned Science and Technology Project of Hunan Province under grant number 13FJ4048, 2014GK3004, and Scientific Research Fund of Hunan Provincial Education Department under grant number 13C160.
- 1.Pearson, S.: Taking account of privacy when designing cloud computing services In: Proceeding s of the 2009 ICSE Workshop on Software Engineering Challenges of Cloud Computing, pp. 44–52. IEEE Computer Society, Washington DC (2009)Google Scholar
- 2.Liu, Y., Zhang, T., Jin, X., Cheng, X.: Personal privacy protection in the era of big data. J. Comput. Res. Dev. 52(1), 229–247 (2015)Google Scholar
- 6.Chen, C., Liu, J., Wen, Y., Chen, J.: Research on workflow scheduling algorithms in the cloud. In: Cao, J., Wen, L., Liu, X. (eds.) PAS 2014. CCIS, vol. 495, pp. 35–48. Springer, Heidelberg (2015)Google Scholar
- 7.Zheng, M., Cao, J., Yao, Y.: Cloud workflow scheduling algorithm oriented to dynamic pric changes. Comput. Integr. Manuf. Syst. 19(8), 1849–1858 (2013)Google Scholar
- 8.Mukute, S., Hapanyengwi, G., Mapako, B., et al.: Scheduling in instance-intensive cost-constrained workflows in a cloud. Int. J. Sci. Eng. Res. 4, 755–760 (2013)Google Scholar
- 12.Pandey, S., Wu, L., Guru, S.M., Buyya, R.: A particle swarm optimization-based heuristic for scheduling workflow applications in cloud computing environments. In: Proceedings of the 24th IEEE International Conference on Advanced Information Networking and Applications (AINA), pp. 400–407 (2010)Google Scholar
- 14.Lu, G., Tan, W., Sun, Y., Zhang, Z., Tang, A.: QoS constraint based workflow scheduling for cloud computing services. J. Softw. 9(4), 926–930 (2014)Google Scholar