Abstract
Workflow scheduling has been excessively studied in different environments like clusters, grids, and clouds. Cloud is a scalable, cost-effective environment that allows users to access an unlimited amount of resources and offers a pay-as-you-go model. An increase in the users’ desire to run their workflow applications on clouds leads to the development of multi-tenant environments like workflow-as-a-service platforms (WaaS). By leveraging cloud features, WaaS offers an environment where users can submit their workflows for execution with different quality of service (QoS) attributes at different. The problem of finding an appropriate scheduling algorithm considering factors like resource heterogeneity and QoS requirements is an NP-complete problem. Most of the existing algorithms in the literature are designed to schedule a single instance of a workflow or have a static behavior. Using static scheduling in dynamic environments like WaaS can lead to a low planning success rate. Besides, while it is possible to share resources between users, for simplicity purposes a majority of proposed algorithms schedule at most one task on a computing resource at any given point in time. They also consider either the time or cost as a hard constraint during scheduling. To cover these limitations in this study, we propose DDBWS, a Dynamic, Deadline and Budget-aware, Workflow Scheduling algorithm that is designed specifically for the WaaS environments. DDBWS schedules workflows by solving a multi-resource packing problem. Unlike several existing algorithms, it considers both CPU and memory demands for tasks simultaneously. Also, it leverages containers to run multiple tasks on a VM concurrently. It uses a bi-factor to control the tradeoff between cost and resource utilization during the mapping of tasks to resources. Based on real-world workflow traces, we have generated 6 different datasets of synthetic workflows. To compare the performance of the proposed scheduling algorithm, we chose two of the state-of-the-art dynamic concurrent workflow scheduling algorithms called EPSM and MW-HBDCS. We have conducted several experiments on these datasets. The results of the performed experiments show that DDBWS achieves at least 96% planning success rate on 6 different workloads which is a comparable PSR. The proposed algorithm decreases the total leased VM numbers considerably. It also outperforms its rivals in terms of the total execution cost and decreases the overall execution cost by at least 8.03% and on average 32.08%. The 95% confidence interval for this decrease is 32.08 ± 14.1 based on 12 samples.
Similar content being viewed by others
References
Ullman JD (1975) NP-complete scheduling problems. J Comput Syst Sci 10(3):384–393
Deldari A, Naghibzadeh M, Abrishami S (2017) CCA: a deadline-constrained workflow scheduling algorithm for multicore resources on the cloud. J Supercomput 73(2):756–781
Sahni J, Vidyarthi DP (2015) A cost-effective deadline-constrained dynamic scheduling algorithm for scientific workflows in a cloud environment. IEEE Trans Cloud Comput 6(2):2–18
Alejandra Rodriguez M, Buyya R (2017) Budget-driven scheduling of scientific workflows in IaaS clouds with fine-grained billing periods. ACM Trans Auton Adapt Syst (TAAS) 12(2):1–22
Wu Q, Ishikawa F, Zhu Q, Xia Y, Wen J (2017) Deadline-constrained Cost optimization approaches for workflow scheduling in clouds. IEEE Trans Parallel Distrib Syst 28(12):3401–3412
Caniou Y, Caron E, Kong Win Chang A, Robert Y (2018) Budget-aware scheduling algorithms for scientific workflows with stochastic task weights on heterogeneous IaaS Cloud platforms. In: Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), Vancouver, BC, Canada, pp 15–26
Ghasemzadeh M, Arabnejad H, Barbosa JG (2017) Deadline-budget constrained scheduling algorithm for scientific workflows in a cloud environment. In: .
Arabnejad H, Barbosa JG, Prodan R (2016) Low-time complexity budget–deadline constrained workflow scheduling on heterogeneous resources. Future Gener Comput Syst 55:29–40
Zheng W, Sakellariou R (2013) Budget-deadline constrained workflow planning for admission control. J Grid Comput 11(4):633–651
Asghari A, Sohrabi MK, Yaghmaee F (2020) Online scheduling of dependent tasks of cloud’s workflows to enhance resource utilization and reduce the makespan using multiple reinforcement learning-based agents. Soft Comput 24(21):16177–16199
Hu Y, Laat CD, Zhao Z (2019) Multi-objective container deployment on heterogeneous clusters. In: Proceedings of the 2019 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID), Larnaca, Cyprus, pp 592–599
Wang Y, Liu H, Zheng W, Xia Y, Li Y, Chen P, Guo K, Xie H (2019) Multi-objective workflow scheduling with deep-Q-network-based multi-agent reinforcement learning. IEEE Access 7:39974–39982
Zheng W, Yan W, Bugingo E, Zhang D (2018) Online scheduling to maximize resource utilization of deadline-constrained workflows on the cloud. In: Proceedings of the 2018 IEEE 22nd International Conference on Computer Supported Cooperative Work in Design ((CSCWD), Nanjing, China, pp 98–103A
Li Z, Ge J, Hu H, Song W, Hu H, Luo B (2018) Cost and energy aware scheduling algorithm for scientific workflows with deadline constraint in clouds. IEEE Trans Serv Comput 11(4):713–726
Chen H, Zhu X, Liu G, Pedrycz W (2018) Uncertainty-aware online scheduling for real-time workflows in cloud service environment. IEEE Trans Serv Comput (Early Access ), pp 1–13
Chen H, Zhu J, Wu G, Huo L (2018) Cost-efficient reactive scheduling for real-time workflows in clouds. J Supercomput 74(11):6291–6309
Liu J, Ren J, Dai W, Zhang D, Zhou P, Zhang Y, Min G, Najjari N (2019) Online multi-workflow scheduling under uncertain task execution time in IaaS clouds. IEEE Trans Cloud Comput ( Early Access ), pp 1–15
Malawski M, Juve G, Deelman E, Nabrzyski J (2015) Algorithms for cost-and deadline-constrained provisioning for scientific workflow ensembles in IaaS clouds. J Internet Serv Appl 48:1–18
Rodriguez MA, Buyya R (2018) Scheduling dynamic workloads in multi-tenant scientific workflow as a service platforms. Future GenerComput Syst 79:739–750
Zhou N, Li F, Xu K, Qi D (2018) Concurrent workflow budget- and deadline-constrained scheduling in heterogeneous distributed environments. Soft Comput 22(23):7705–7718
Hu Y, Laat CD, Zhao Z (2019) Learning workflow scheduling on multi-resource clusters. In: Proceedings of the 2019 IEEE International Conference on Networking, Architecture and Storage (NAS), EnShi, China, pp 1–8
Zhu Z, Tang Z (2019) Deadline-constrained workflow scheduling in IaaS clouds with multi-resource packing. Future Gener Comput Syst 101:880–893
Rubab S, Hassan MF, Mahmood AK, Mehmood Shah SN (2019) QoS based multi constraints bin packing job scheduling heuristic for heterogeneous volunteer grid resources. Int Arab J Inf Technol 16(4):661–668
Topcuoglu H, Hariri S, Wu M-Y (2002) Performance-effective and low-complexity task scheduling for heterogeneous computing. IEEE Trans Parallel Distrib Syst 13(3):260–274
Verma A, Kaushal S (2015) Cost-time efficient scheduling plan for executing workflows in the cloud. J Grid Comput 13(4):495–506
Meng S, Wang S, Wu T, Li D, Huang T, Wu X, Xu X, Dou W (2016) An uncertainty-aware evolutionary scheduling method for cloud service provisioning. In: Proceedings of the 2016 IEEE International Conference on Web Services (ICWS), San Francisco, CA, USA, pp 506–5013
Anwar N, Deng H (2018) Elastic scheduling of scientific workflows under deadline constraints in cloud computing environments. Futur Internet 10(5):5
Poola D, Garg SK, Buyya R, Yang Y, Ramamohanarao K (2014) Robust scheduling of scientific workflows with deadline and budget constraints in clouds. In: Proceedings of the 2014 IEEE 28th International Conference on Advanced Information Networking and Applications, Victoria, BC, Canada, pp 858–865
Yana H, Zhuab X, Chena H, Guoc H, Zhoua W, Baoa W (2019) DEFT: dynamic fault-tolerant elastic scheduling for tasks with uncertain runtime in cloud. Inf Sci 477:30–46
Verma A, Kaushal S (2017) A hybrid multi-objective particle swarm optimization for scientific workflow scheduling. Parallel Comput 62:1–19
Arabnejad V, Bubendorfer K, Ng B (2019) Dynamic multi-workflow scheduling: a deadline and cost-aware approach for commercial clouds. Futur Gener Comput Syst 100:98–108
Deelman E, Vahi K, Juve G, Rynge M, Callaghan S, Maechling PJ, Mayani R, Chen W, da Silva RF, Livny M, Wenger K (2015) Pegasus, a workflow management system for science automation. Futur Gener Comput Syst 46:17–35
Arabnejad H, Barbosa JG (2017) Maximizing the completion rate of concurrent scientific applications under time and budget constraints. J Comput Sci 23:120–129
Yu J, Buyya R (2006) Scheduling scientific workflow applications with deadline and budget constraints using genetic algorithms. Sci Program 14(3):217–230
J Empowering App Development for Developers | Docker. https://www.docker.com/. Accessed 29 Nov 2020
Amazon EC2 Instance Types—Amazon Web Services. https://aws.amazon.com/ec2/instance-types/. Accessed 29 Nov 2019
V. Arabnejad, K. Bubendorfer, B. Ng, "Deadline Distribution Strategies for Scientific Workflow Scheduling in Commercial Clouds," in Proceedings of the 9th International Conference on Utility and Cloud Computing, December-2016, Shanghai, China, pp. 70-78.9.
Sharif S, Taheri J, Zomaya AY (2016) Online multiple workflow scheduling under privacy and deadline in hybrid cloud environment. In: Proceedings of the 2014 IEEE 6th International Conference on Cloud Computing Technology and Science, Singapore, Singapore, pp 455–462
Sun T, Xiao C, Xu X, Tian G (2017) An improved budget-deadline constrained workflow scheduling algorithm on heterogeneous resources. In: Proceedings of the 2017 IEEE 4th International Conference on Cyber Security and Cloud Computing (CSCloud), New York, NY, USA, pp 40–45
Rizvi N, Ramesh D (2020) Fair budget constrained workflow scheduling approach for heterogeneous clouds. Cluster Comput 23(4):1–17
Yu Z, Shi W (2008) A planner-guided scheduling strategy for multiple workflow applications. In: Proceedings of the 2008 International Conference on Parallel Processing-Workshops, Portland, OR, USA, pp 1–8
Arabnejad H, Barbosa JG (2012) Fairness resource sharing for dynamic workflow scheduling on heterogeneous systems. In: Proceedings of the 2012 IEEE 10th International Symposium on Parallel and Distributed Processing with Applications, Leganes, Spain, pp 633–639
Hsu CC, Huang KC, Wang FJ (2011) Online scheduling of workflow applications in grid environments. Futur Gener Comput Syst 27(6):860–870
Calheiros RN, Ranjan R, Beloglazov A, De Rose CA, Buyya R (2011) Cloudsim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms. Softw Pract Exp 41(1):23–50
Piraghaj SF, Dastjerdi AV, Calheiros RN, Buyya R (2017) Containercloudsim: An environment for modeling and simulation of containers in cloud data centers. Softw Pract Exp 47(4):505–521
Schad J, Dittrich J, Quiané-Ruiz JA (2010) Runtime measurements in the cloud: observing, analyzing, and reducing variance. In: Proceedings of the VLDB Endowment, Vol. 3, no.1–2, pp 460–471
Mao M, Humphrey M A performance study on the VMstartup time in the cloud. In: Proceedings of the 2012 IEEE 5th International Conference on Cloud Computing, June 2012, Honolulu, HI, USA, pp 423–430
Bharathi S, Chervenak A, Deelman E, Mehta G, Su M, Vahi K Characterization of scientific workflows. In: Proceedings of the 2008 Workshop on Workflows in Support of Large-Scale Science, December-2008, Austin, TX, USA, pp 1–10
Juve G, Chervenak A, Deelman E, Bharathi S, Mehta G, Vahi K (2013) Characterizing and profiling scientific workflows. Futur Gener Comput Syst 29(3):682–692
Singh V, Indrajeet G, Prasanta KJ (2018) A novel cost-efficient approach for deadline-constrained workflow scheduling by dynamic provisioning of resources. Futur Gener Comput Syst 79:95–110
da Silva RF, Chen W, Juve G, Vahi K, Deelman E Community Resources for enabling research in distributed scientific workflows. In:
da Silva RF, Pottier L, Coleman T, Deelman E, Casanova H (2020) WorkflowHub: community framework for enabling scientific workflow research and development. arXiv:2009.00250v1
pegasus-isi/1000genome-workflow. https://github.com/pegasus-isi/1000genome-workflow. Accessed 29 Nov 2020
Soybean Knowledge Base (SoyKB) Pipeline. https://pegasus.isi.edu/application-showcase/soykb. Accessed 29 Nov 2020
pegasus-traces/cycles. https://github.com/workflowhub/pegasus-traces/tree/master/cycles. Accessed 4 Feb 2021
Funding
This study has received no funding from any organization.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
All of the authors declare that they have no conflict of interest.
Ethical approval
This article does not contain any studies with human participants or animals performed by any of the authors.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Saeedizade, E., Ashtiani, M. DDBWS: a dynamic deadline and budget-aware workflow scheduling algorithm in workflow-as-a-service environments. J Supercomput 77, 14525–14564 (2021). https://doi.org/10.1007/s11227-021-03858-6
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-021-03858-6