1 Introduction

Grid computing has emerged as an efficient approach to solving large-scale problems in business, engineering, and science. In grid computing, several processing resources are integrated and connected to work together as one large computing facility to accomplish a common goal. These resources may be geographically distributed across the world, and they may have significantly different capabilities and specifications. To benefit from grid computing capabilities, effective scheduling algorithms are essential. Grid scheduling is the activity of assigning and managing the execution of related tasks on distributed resources. The main challenge in grid scheduling is how to distribute interdependent tasks to the available resources while taking into account the quality of service (QoS) constraints, time and cost, specified by the user. For workflow processing systems, time denotes the overall time needed to finish the workflow execution, while cost denotes the cost linked to the workflow execution, incorporating the grid resource usage charges for processing workflow tasks and the workflow system management cost. Algorithms for scheduling interdependent tasks use DAGs (directed acyclic graphs) to model task dependencies.

Grid scheduling is an NP-complete problem because the computational grid comprises heterogeneous resources that reside in different administrative domains, each employing its own management rules. GAs (genetic algorithms) [1] are metaheuristic algorithms that have been successfully applied to NP-complete problems.

This paper presents a GA-based approach for scheduling workflow tasks on grid services based on users’ QoS constraints in terms of time and cost, called the Grid Workflow Tasks Scheduling Algorithm (GWTSA). The input to GWTSA is the set of workflow tasks, the dependencies between them, and the time limit (deadline) stated by the user for the execution of the workflow. The output of GWTSA is an optimal schedule for all workflow tasks that minimizes the execution time and cost, such that the scheduled time is within the deadline imposed by the user. In this approach, a DAG is used to represent the dependencies between the workflow tasks. The DAG is divided, then the optimal sub-schedules of all task divisions are computed and used to obtain the execution schedule of the entire workflow. A GA-based technique is employed in GWTSA to compute the optimal execution sub-schedule for each branch division, which consists of a set of sequential tasks. In this technique, the fitness function is formulated as a multi-objective function of time and cost, and each chromosome represents the tasks included in a branch division, where each gene holds the id of the service provider chosen to execute the corresponding task in the branch.

The remainder of this paper is organized as follows: Sect. 2 presents related work; Sect. 3 presents the problem description; Sect. 4 describes the proposed QoS-based grid workflow tasks scheduling approach GWTSA; Sect. 5 presents a case study to illustrate the working of GWTSA; Sect. 6 exhibits the experimental results; and Sect. 7 presents the conclusion and future work.

2 Related work

Several research studies have been proposed that used heuristic and metaheuristic algorithms to address the problem of scheduling tasks in computational grids. This section gives a review of examples of such studies.

2.1 Metaheuristics-based approaches

Aggarwal et al. [2] have presented a GA-based scheduler for computational grids. It minimizes makespan, idle time of the available resources, and turn-around time while satisfying the deadlines specified by users. Yu and Buyya [3] have proposed a GA-based workflow scheduling approach with budget constraints. It aims to minimize execution time while satisfying a specified processing budget. Yu and Buyya [4] have presented a GA-based workflow scheduling approach with two QoS constraints, deadline and budget. Chen et al. [5] have proposed a grid scheduling approach that combines discrete PSO (particle swarm optimization) with the SA (simulated annealing) method, aiming to minimize the grid cost, which comprises communication and computing costs. Bouali et al. [6] have proposed a hybrid approach between the heterogeneous earliest finish time (HEFT) heuristic and PSO to minimize the overall completion time of all tasks in the DAG. Jiang and Chen [7] have presented the TSGA genetic algorithm for task scheduling, which divides the search space into random patterns to explore it and minimize the execution time. Gabaldon et al. [8] have proposed a PSO-based approach for scheduling parallel jobs containing cooperating tasks, which aims to minimize energy consumption. Gabaldon et al. [9] have proposed a hybrid PSO-GA metaheuristic approach for the resource matching and scheduling of parallel jobs with collaborative tasks in heterogeneous multi-cluster systems, which aims to minimize both makespan and energy consumption. Younis and Yang [10] have proposed two hybrid metaheuristic schedulers. The first combines ant colony optimization with variable neighborhood search (VNS), while the second merges the GA with VNS, with the aim of minimizing the makespan. Ghosh et al. [11] have proposed a hybrid GA-PSO algorithm for grid job scheduling, which aims to reduce the schedule makespan and flowtime. Chhabra et al. [12] have proposed a multi-objective hybrid scheduling algorithm that combines cuckoo search and the firefly algorithm for scheduling offline workloads of parallel jobs with collaborative tasks in high-performance computing grid systems, optimizing both energy efficiency and QoS-aware performance expectations. Ankita and Sahana [13] have used shortest-job-first to generate a guided initial population in their proposed GA-based scheduling approach. Bose et al. [14] have proposed a scheduling approach named Pro-GA for scheduling bag-of-tasks jobs in multi-core heterogeneous computing systems to optimize the makespan, resource utilization, and speedup ratio. Ankita and Sahana [15] introduced a PSO-based scheduling algorithm called Ba-PSO that allocates jobs to appropriate resources, resulting in a significant reduction in job completion time. Yousif [16] presented an approach that utilizes the firefly algorithm and the smallest position value (SPV) technique to improve scheduling efficiency, leading to reduced makespan compared to alternative methods such as Tabu search.

2.2 Heuristics-based approaches

Yu et al. [17] have proposed an algorithm for QoS-based workflow scheduling, which minimizes the execution cost while satisfying the deadline. This algorithm utilizes an approach based on a Markov decision process to schedule the execution of sequential workflow tasks. Benedict and Vasudevan [18] have proposed a grid scheduling approach that uses the Tabu search method to obtain better computational grid schedules, with two objectives: maximizing the job completion ratio and minimizing the grid scheduler overhead in choosing the precise workflow sequence. Meddeber and Yagoubi [19] have presented a dependent task allocation approach for grids, which divides a given task graph into a set of linked components to decrease, if possible, the average execution time of submitted tasks and to reduce communication costs while respecting task dependency constraints. Bahnasawy et al. [20] have presented an algorithm for scheduling distributed heterogeneous computing systems. The algorithm divides the given DAG into levels based on the precedence relationships and sorts the tasks of each level in descending order of their computation sizes; tasks are then selected from that level in order. Bidgoli and Nezad [21] have proposed a scheduling algorithm, GCDM, for grid computing to minimize the final cost of executing tasks, taking into account the data transfer cost between different tasks and their inter-dependencies, which are modeled as a DAG. Hossam et al. [22] have proposed the WS-GCDM (WorkStealing-Grid Cost Dependency Matrix) algorithm, which is an enhancement of GCDM [21]. It balances task scheduling among the available grid resources, whereas GCDM utilizes a fixed number of grid resources irrespective of the number of available resources. Rahman et al. [23] have presented a dynamic and adaptive workflow scheduling algorithm based on the critical path (CP) for grid computing, which dynamically and efficiently maps workflow tasks to grid resources by determining, at every step, the CP in the workflow graph. They also outlined a hybrid heuristic that merges the features of the presented adaptive scheduling technique with metaheuristics to obtain optimal execution time and cost while satisfying the users' requirements. Chauhan and Nitin [24] have proposed an entirely decentralized P2P algorithm for grid scheduling that schedules subtasks of DAG tasks, taking into account three factors: subtask computation cost, communication cost, and the subtask waiting time caused by predecessors and precedence constraints. Garg and Singh [25] have proposed an adaptive approach based on a rescheduling method for scheduling workflow-dependent tasks on dynamic grid resources. It initially performs static scheduling, followed by resource monitoring and then rescheduling, to minimize the execution time of workflow applications.

3 Problem description and system model

Workflow application tasks in a grid computing system can be modeled as a DAG, which is represented with two sets (T, E), where T = {Ti, i = 1... n} denotes a set of n tasks, while E denotes the set of directed edges between tasks, where an edge (Ti, Tk) represents the dependency of task Tk on task Ti, which means that task Ti must be completed before scheduling task Tk. Task Ti is referred to as task Tk’s parent, and task Tk is referred to as task Ti’s child. Assuming D is the user’s provided deadline (time constraint) for the workflow execution, then the workflow application can be expressed as G (T, E, D). In the DAG, an entry task is a task that has no parent tasks and is referred to as Tentry, and an exit task is a task that has no child tasks and is referred to as Texit.

In a grid computing system, there is a set of service types, where diverse service providers can support each service type. Let m be the number of available services. Each task Ti has a set of services \({S}_{i}^{j}\) (1 ≤ i ≤ n, 1 ≤ j ≤ mi, mi ≤ m) that can execute this task, but only one of these services is chosen to execute the task. The processing capabilities of services vary and are provided at different prices. In general, there is an inverse proportion between the service price and the processing time [17]. The service price and time for executing task Ti on service \({S}_{i}^{j}\) are denoted by \({c}_{i}^{j}\) and \({t}_{i}^{j}\), respectively.

The scheduling problem is to assign every task Ti to a service \({S}_{i}^{j}\) so as to minimize the execution time and cost, such that the execution of the workflow is completed within the user-provided deadline while respecting the task precedence constraints.
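
To make the model concrete, the following is a minimal Python sketch of the workflow model G(T, E, D) together with the per-task service offers; the class and attribute names (Service, Workflow, and so on) are illustrative choices for this sketch, not identifiers from the paper's implementation.

    from __future__ import annotations
    from dataclasses import dataclass, field

    @dataclass
    class Service:
        """One candidate service S_i^j for a task, with its QoS offer."""
        provider_id: int
        time: float   # t_i^j: processing time offered for the task
        cost: float   # c_i^j: price charged for processing the task

    @dataclass
    class Workflow:
        """Workflow application G(T, E, D)."""
        tasks: list[int]                      # T = {T_i, i = 1..n}, identified by task ids
        edges: list[tuple[int, int]]          # E: (parent, child) dependency pairs
        deadline: float                       # D: user-provided deadline
        services: dict[int, list[Service]] = field(default_factory=dict)  # candidate services per task

        def parents(self, task: int) -> list[int]:
            return [p for (p, c) in self.edges if c == task]

        def children(self, task: int) -> list[int]:
            return [c for (p, c) in self.edges if p == task]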

4 Scheduling methodology: the proposed QoS-based grid workflow tasks scheduling approach

In this work, the following steps are performed to solve the scheduling problem:

  • Step 1: Detect available services, and choose available service providers for each task according to the QoS parameters of services specified by the user.

  • Step 2: Cluster tasks of the workflow into task divisions.

  • Step 3: Distribute the user-provided deadline (referred to as the specified deadline) over the task divisions.

  • Step 4: Compute optimal sub-schedules for individual task divisions by using a GA-based strategy, then use these sub-schedules to generate an optimal schedule for the entire workflow.

The following subsections provide a detailed description of these steps.

4.1 Service detection and QoS request

Providing QoS details for every available service is important for efficient workflow task scheduling. As Fig. 1 illustrates, the WMS (Workflow Management System) first sends a query to the GIS (Grid Information Service), which has knowledge of all registered grid service providers, to detect the services suitable for processing each task of the user's workflow. Each query specifies the task parameters, its estimated execution time, and the workflow user. The GIS, in turn, replies with a list of available services for every task. Then, the WMS sends a QoS request to these services to obtain their processing price and time for providing the service at the required QoS level.

Fig. 1

Service detection and QoS request

4.2 Workflow DAG dividing

The workflow DAG dividing process starts by categorizing the workflow tasks in G into simple tasks and synchronization tasks [17]. A synchronization task is one that has more than one parent task and/or more than one child task, while a simple task has at most one parent task and at most one child task. For example, in Fig. 2a, the 1st, 10th, and 14th tasks are synchronization tasks, while the remaining tasks are simple tasks. The workflow tasks in G are then divided into independent branches B and synchronization tasks Y, which reduces the size of G, yielding a simpler graph with fewer nodes. Let P be a set of nodes representing a set of task divisions Pi, 1 ≤ i ≤ nY + nB, where nY and nB are the total numbers of synchronization tasks and workflow branches, respectively. Assume E' is the set of directed edges, where each edge takes the form (Pi, Pj) with Pi a parent of Pj. Then, the divided graph can be described as G'(P, E', D). Figure 2b shows the DAG of Fig. 2a after dividing. For example, in this figure, the sequence of tasks T2, T3, and T4 forms a branch division. A simple path in G' is a sequence of task divisions that includes a directed edge from each task division to its successor, where no task division is repeated. The DAG_Dividing algorithm is shown in Fig. 3.

Fig. 2

An example of workflow DAG dividing [17]

Fig. 3

DAG dividing algorithm
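
Since Fig. 3 is given only as a figure, the following is a hedged Python sketch of the dividing step: it classifies tasks as synchronization or simple and then collects maximal chains of simple tasks into branches. The function name divide_dag and the chain-walking details are assumptions of this sketch, not the exact DAG_Dividing procedure.

    from collections import defaultdict

    def divide_dag(tasks, edges):
        """Split workflow tasks into synchronization tasks and branches of simple tasks.

        tasks: iterable of task ids; edges: list of (parent, child) pairs.
        Returns (sync_tasks, branches), where each branch is an ordered list of task ids.
        """
        parents, children = defaultdict(list), defaultdict(list)
        for p, c in edges:
            parents[c].append(p)
            children[p].append(c)

        # A synchronization task has more than one parent and/or more than one child.
        sync = {t for t in tasks if len(parents[t]) > 1 or len(children[t]) > 1}
        simple = [t for t in tasks if t not in sync]

        branches, visited = [], set()
        for t in simple:
            if t in visited:
                continue
            head = t
            # Walk back to the first simple task of this chain.
            while parents[head] and parents[head][0] not in sync and parents[head][0] not in visited:
                head = parents[head][0]
            # Collect the chain of simple tasks forward from the head.
            branch, cur = [], head
            while cur is not None and cur not in sync and cur not in visited:
                branch.append(cur)
                visited.add(cur)
                nxt = children[cur]
                cur = nxt[0] if nxt else None
            branches.append(branch)
        return sync, branches

For the workflow of Fig. 2a, this sketch yields the synchronization tasks T1, T10, and T14, and branches such as (T2, T3, T4), matching Fig. 2b.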

Each task division Pi has four attributes: deadline (dl[Pi]), expected execution time (eet[Pi]), start time (start_time[Pi]), and minimum execution time (met[Pi]). If Pi is a branch, its earliest start time is the earliest start time of the first task in it and is calculated from the deadlines of its parent divisions as follows:

$${\text{start}}\_{\text{time}}\left[{P}_{i}\right]=\underset{{P}_{j}\in {\mathit{PP}}_{i}}{{\text{max}}}{\text{d}}l\left[{P}_{j}\right]$$
(1)

where PPi is the set of Pi’s parent task divisions. The Pi’s minimum execution time is calculated as follows:

$${\text{met}}\left[{P}_{i}\right]=\sum_{{T}_{x}\in {P}_{i}}\underset{1\le y\le {m}_{x}}{{\text{min}}}{t}_{x}^{y}$$
(2)

The expected execution time of Pi is calculated as follows:

$$ {\text{eet}}\left[ {P_{i} } \right] \, = {\text{ d}}l\left[ {P_{i} } \right] \, {-}{\text{ start}}\_{\text{time}}\left[ {P_{i} } \right] $$
(3)
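
As a small illustration of Eqs. (1)–(3), the sketch below computes a division's start time from its parents' deadlines, its minimum execution time from the fastest offer of each task, and its expected execution time from its sub-deadline; the function and parameter names are chosen for this sketch only.

    def division_start_time(parent_deadlines):
        """Eq. (1): earliest start time = latest deadline among the parent divisions."""
        return max(parent_deadlines, default=0.0)

    def division_met(division_tasks, services):
        """Eq. (2): minimum execution time = sum over tasks of their fastest offer.

        services maps a task id to a list of (time, cost) offers.
        """
        return sum(min(t for t, _ in services[task]) for task in division_tasks)

    def division_eet(deadline, start_time):
        """Eq. (3): eet[Pi] = dl[Pi] - start_time[Pi]."""
        return deadline - start_time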

4.3 Deadline distribution

After the workflow graph G has been divided, the specified deadline, D, is distributed over the task divisions of G′, such that the deadline dl[Pi] allocated to each task division Pi is a sub-deadline of D.

The deadline distribution rules are as follows [17]:

  • R1: The total sub-deadline of any path from a synchronization task Yi to another synchronization task Yj must be the same.

  • R2: Any path from Pi to Pj, where Tentry ∈ Pi and Texit ∈ Pj, has a total sub-deadline equal to D.

  • R3: A sub-deadline allocated to any task division Pi must be greater than or equal to met(Pi).

  • R4: The specified deadline, D, is distributed over task divisions in proportion to their met.

These deadline distribution rules are implemented on the task division graph by using the BFS (Breadth-First Search) and DFS (Depth-First Search) algorithms to calculate the start time and sub-deadline of each task division. The deadline distribution algorithm is shown in Fig. 4.

Fig. 4

Deadline distribution algorithm
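
The sketch below distributes the overall deadline over the divisions of a single entry-to-exit path in proportion to their minimum execution times (rules R2 and R4); it is a simplified illustration only and does not reproduce the BFS/DFS handling of rule R1 across converging paths in Fig. 4.

    def distribute_deadline_on_path(path_divisions, met, overall_deadline):
        """Assign start times and sub-deadlines along one entry-to-exit path.

        path_divisions: division ids in path order; met: dict of minimum execution times.
        Each division receives a share of the overall deadline proportional to its met,
        so the path's sub-deadlines sum to the overall deadline (rule R2); provided the
        deadline is at least the path's total met, rule R3 holds as well.
        """
        total_met = sum(met[p] for p in path_divisions)
        start_time, deadline = {}, {}
        current = 0.0
        for p in path_divisions:
            start_time[p] = current                                         # starts when its predecessor ends
            deadline[p] = current + overall_deadline * met[p] / total_met   # proportional share (rule R4)
            current = deadline[p]
        return start_time, deadline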

4.4 Generation of an optimal schedule and fitness function

Once the sub-deadline of a task division is determined, an optimal sub-schedule for this task division can be obtained. If the obtained optimal sub-schedules for all task divisions ensure that the execution of these task divisions can be completed within their sub-deadlines, the entire workflow execution will be finished within the specified deadline. Also, minimizing the costs for all task divisions leads to reaching an optimal cost for the whole workflow. Thus, by combining all optimal sub-schedules, an optimized workflow schedule can be easily obtained. The scheduling solutions for the two task division types: synchronization task and branch division, as well as the overall Grid Workflow Tasks Scheduling Algorithm (GWTSA), are described below.

4.4.1 Scheduling the synchronization tasks

Synchronization task scheduling (STS) is a single-task scheduling problem. The optimal solution to such a problem is obtained simply by selecting the lowest-cost service that can execute the synchronization task within its allocated sub-deadline. Thus, for scheduling a synchronization task Yi, the objective function is as follows:

$$ \min c_{i}^{k} ,\,{\text{where}}\,\,1 \le k \le m_{i} ,\,\,{\text{and}}\,\,t_{i}^{k} \le {\text{ eet}}(Y_{i} ) $$
(4)
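
A direct reading of Eq. (4) in code: among the providers able to finish the synchronization task within its expected execution time, pick the cheapest. The tuple layout (provider id, time, cost) is an assumption of this sketch.

    def schedule_sync_task(offers, eet):
        """Eq. (4): cheapest offer whose time fits within eet; None if no offer is feasible."""
        feasible = [o for o in offers if o[1] <= eet]
        return min(feasible, key=lambda o: o[2]) if feasible else None

    # Example: three providers offering (id, time, cost) for one synchronization task.
    print(schedule_sync_task([(1, 30, 9.0), (2, 20, 14.0), (3, 25, 11.0)], eet=26))  # -> (3, 25, 11.0)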

4.4.2 Branch division scheduling

If a branch division contains only one simple task, the solution for branch division scheduling (BDS) is the same as for STS. If a branch contains multiple tasks, however, a GA is used to obtain an optimal solution according to the evaluation of a fitness function. Here, the optimal solution minimizes the branch execution time and cost, with the condition that the optimized time is within the allocated sub-deadline. Thus, the objective function to be minimized to obtain an optimal sub-schedule for branch Bj can be represented as a weighted sum that combines the following two objectives:

$$ \min \,{\text{cost}}\left( {B_{j} } \right)\,{\text{and}}\,\min \,{\text{ time}}\left( {B_{j} } \right),{\text{such}}\,{\text{that}}\,{\text{time}}(B_{j} )\, \le \,{\text{eet}}(B_{j} ) $$

where

$${\text{cost}}({B}_{j}) = \sum_{{T}_{i}\in {B}_{j}}{c}_{i}^{k}$$
(5)
$${\text{time}}({B}_{j}) = \sum_{{T}_{i}\in {B}_{j}}{t}_{i}^{k}$$
(6)

and 1 \(\le k\le {m}_{i}\).

That is, the objective function is formulated as follows:

$$ F\left( {B_{j} } \right) = w_{1} \;time\left( {B_{j} } \right) + w_{2} \;cost\left( {B_{j} } \right) $$
(7)

where w1 and w2 are weighting coefficients in the range [0, 1] that satisfy the condition w1 + w2 = 1. This objective function F is the fitness function used by the proposed GA. It gives users the ability to control the workflow schedule by choosing the weighting coefficients to set an order of preference for the time and cost objectives according to their needs. For example, if a user prefers to accomplish the workflow tasks as quickly as possible, regardless of the cost, the optimal setting is w1 = 1 and w2 = 0, and vice versa. Intermediate values trade the two objectives off against each other; for example, w1 = 0.2 and w2 = 0.8 indicates that the user prefers reducing the cost over reducing the time.
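
The following sketch evaluates Eqs. (5)–(7) for a branch chromosome, following the convention used in the text that w1 weights the time objective and w2 weights the cost objective. Genes are taken here as indices into each task's provider list, and the penalty for violating the sub-deadline constraint is an assumption of this sketch (the paper only requires time(Bj) ≤ eet(Bj)).

    def branch_time_cost(chromosome, providers):
        """Eqs. (5)-(6): total time and cost of a branch under a given gene assignment.

        providers[i] is the list of (time, cost) offers for the i-th task of the branch;
        gene g of task i selects providers[i][g].
        """
        time = sum(providers[i][g][0] for i, g in enumerate(chromosome))
        cost = sum(providers[i][g][1] for i, g in enumerate(chromosome))
        return time, cost

    def fitness(chromosome, providers, eet, w1=0.5, w2=0.5, penalty=1e9):
        """Eq. (7): F = w1*time + w2*cost, penalising schedules that exceed eet(Bj)."""
        time, cost = branch_time_cost(chromosome, providers)
        if time > eet:
            return penalty            # infeasible: violates time(Bj) <= eet(Bj)
        return w1 * time + w2 * cost

Since F is minimized, a lower fitness value denotes a better chromosome.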

4.5 The proposed GA-based branch division scheduling (BDSGA) algorithm

A GA is a powerful search technique that applies the principle of evolution to derive, in polynomial time, a good solution from a large search space. A GA combines the exploration of new areas of the solution space with the exploitation of the best solutions obtained from previous searches.

Genetic algorithms offer several advantages for task scheduling in grid computing systems, including optimization, flexibility, scalability, robustness, and efficiency.

Optimization: Genetic algorithms can optimize task scheduling by finding an optimal or near-optimal assignment of tasks to resources. This can lead to better resource utilization and improved system performance.

Flexibility: Genetic algorithms are flexible and can be adapted to different scheduling scenarios. This means that they can be used in a variety of grid computing environments and can handle different types of tasks and resources.

Scalability: Genetic algorithms can handle large-scale scheduling problems [26]. This is important in grid computing systems, which typically involve a large number of tasks and resources.

Robustness: Genetic algorithms are robust and can handle noisy and incomplete data. This is important in real-world grid computing systems, which may have incomplete or inaccurate information about tasks and resources.

Efficiency: Genetic algorithms can be computationally efficient, especially when compared to exhaustive search methods [27]. This means that they can find good solutions to scheduling problems in a reasonable amount of time.

The problem search space solutions are represented by the population of chromosomes (individuals). A fitness function is used to determine the quality of a chromosome in the population. An individual’s fitness value signifies how good this individual is among other individuals in the population.

The main steps of any GA are as follows:

  1. Randomly generate pop_size solutions to form the initial population.

  2. Set current population = initial population.

  3. Calculate the fitness values of all chromosomes in the current population.

  4. Create a new population by repeating the following steps:

    • From the current population, select two chromosomes as parents.

    • With a certain probability, perform crossover on the parents to form a new offspring. In the case where no crossover is performed, offspring will be the same as their parents.

    • With a certain probability, perform mutation on the new offspring.

    • Add new offspring to the new population.

  5. Set current population = new population.

  6. Check the stopping condition. If it is not met, go to Step 3; otherwise, stop and return the best solution.
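
A compact Python skeleton of the loop above, written for a minimization problem; it accepts user-supplied callables for building a random solution (as a list), evaluating fitness, crossover, and mutation. Parent selection is simplified to random sampling here purely for brevity, whereas BDSGA uses roulette-wheel selection as described in Sect. 4.5.3.1.

    import random

    def genetic_algorithm(random_solution, fitness, crossover, mutate,
                          pop_size=10, max_gen=50, xr=0.7, mr=0.01):
        """Generic GA skeleton following the steps listed above (lower fitness is better)."""
        population = [random_solution() for _ in range(pop_size)]        # steps 1-2
        best = min(population, key=fitness)                              # step 3
        for _ in range(max_gen):                                         # stopping condition
            new_population = []
            while len(new_population) < pop_size:                        # step 4
                p1, p2 = random.sample(population, 2)                    # select two parents
                if random.random() < xr:
                    c1, c2 = crossover(p1, p2)
                else:
                    c1, c2 = list(p1), list(p2)                          # no crossover: copy parents
                new_population += [mutate(c1, mr), mutate(c2, mr)]
            population = new_population[:pop_size]                       # step 5
            best = min(population + [best], key=fitness)                 # keep the best-so-far
        return best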

Solving the BDS problem by using a GA requires the determination of the chromosome representation, the initial population, the genetic operators (selection, crossover, and mutation), and a suitable fitness function. The proposed GA’s components are presented below.

4.5.1 BDS problem representation and initial population

In the proposed GA-based branch division scheduling (BDSGA) algorithm, for each branch in G′, we build a population of individuals that represents possible solutions for branch division scheduling on available service providers that satisfy the QoS constraints and specified deadline.

Each branch consists of a number of tasks, and each task has its own service providers. The proposed branch chromosome representation is therefore a one-dimensional list consisting of a number of genes corresponding to the branch tasks, and each task Ti in the branch is accompanied by a list of service providers, spli, capable of executing this task. A gene gi that corresponds to a task Ti in the chromosome holds the id of a service provider, chosen from spli, to execute this task. Figure 5 shows an encoding for a branch B that consists of r tasks, T1, T2, …, Tk, …, Tr, where gk \(\in \) [1, mk], and mk is the number of services capable of executing task Tk.

Fig. 5

Chromosome representation of branch B

For example, the following chromosome represents the branch consisting of tasks T2, T3, and T4 in Fig. 2, and indicates that these tasks will be executed on the services with ids 1, 3, and 2, respectively:

Task:         T2   T3   T4
Provider id:   1    3    2

Each individual in the initial population for each branch consists of a random set of providers that are capable of executing each task in the branch. The chromosome representing branch B must satisfy the condition:

$$\sum_{{T}_{i}\in B}{t}_{i}^{k}\le {\text{eet}}(B)$$
(8)
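
A sketch of the chromosome encoding and initial population described above: each gene holds the index of the provider chosen for the corresponding branch task, and candidates are re-drawn until the condition of Eq. (8) is met. The rejection-sampling loop and its retry limit are assumptions of this sketch.

    import random

    def random_chromosome(provider_lists, eet, max_tries=1000):
        """One random branch chromosome; provider_lists[i] holds (time, cost) offers of task i."""
        for _ in range(max_tries):
            genes = [random.randrange(len(offers)) for offers in provider_lists]
            total_time = sum(provider_lists[i][g][0] for i, g in enumerate(genes))
            if total_time <= eet:          # Eq. (8): the schedule must fit within eet(B)
                return genes
        return None                        # no feasible chromosome found within max_tries

    def initial_population(provider_lists, eet, pop_size=10):
        """Generate pop_size random chromosomes that satisfy Eq. (8)."""
        chromosomes = (random_chromosome(provider_lists, eet) for _ in range(pop_size))
        return [c for c in chromosomes if c is not None]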

New offspring are generated by applying the genetic operators (selection, crossover, and mutation) in turn, and the fitness of each individual in the branch population is evaluated, until the algorithm converges. The best schedule found for each branch division in G′ is kept. These best schedules are then used, together with the best schedules of the synchronization tasks, by the proposed Grid Workflow Tasks Scheduling Algorithm (GWTSA) to obtain the best schedule for the whole DAG (workflow tasks).

4.5.2 The fitness function and selection

Based on the considered optimization objective, a fitness function is utilized to assess the quality of the population individuals. The scheduling goal here is to optimize the grid system performance in terms of cost and time for each division, as explained above. Therefore, in BDSGA, the fitness function is the multi-objective function defined by Eq. (7).

4.5.3 The genetic operators

Genetic operations are applied to the individuals of the current population to create new individuals. Individuals with better fitness values have a better chance of contributing one or more offspring to the following generations. The three genetic operators used in BDSGA, selection, crossover, and mutation, are described below.

4.5.3.1 Selection operation

In the selection stage of a GA, chromosomes are chosen from a population for crossover. In BDSGA, the roulette wheel selection method [1] is used to select the parents to be mated in the crossover operation.

4.5.3.2 Crossover

In the crossover operation, two parents (chromosomes) generate new offspring by swapping portions of their genetic information. Over time, better populations are generated by mixing genetic information from pairs of fitter individuals of the previous population. In BDSGA, a one-point crossover operator is used. The crossover operation is performed according to a specified crossover rate (Xr), which represents the proportion of offspring generated through crossover in each iteration relative to the population size. In one-point crossover, one random crossover point is chosen, and the genes (provider ids) between this point and the end of the chromosome are exchanged between the two parent chromosomes. Figure 6 shows an example of a one-point crossover operation. In this example, child1 takes the service id for task T1 from parent1 and the service ids for tasks T2, T3, and T4 from parent2, while child2 takes the service id for task T1 from parent2 and the service ids for tasks T2, T3, and T4 from parent1.

Fig. 6

Example of applying the crossover operator
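
A minimal sketch of the one-point crossover described above; the cut point is drawn uniformly, and the gene values in the usage comment are made up for illustration rather than taken from Fig. 6.

    import random

    def one_point_crossover(parent1, parent2):
        """Exchange the genes after a random cut point between two branch chromosomes."""
        point = random.randrange(1, len(parent1))     # requires a branch of at least two tasks
        child1 = parent1[:point] + parent2[point:]
        child2 = parent2[:point] + parent1[point:]
        return child1, child2

    # Usage (illustrative values): with parents [1, 3, 2, 2] and [2, 1, 1, 3] and cut point 1,
    # the children are [1, 1, 1, 3] and [2, 3, 2, 2].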

4.5.3.3 Mutation

Mutation aims to find new points in the search space in order to maintain population diversity. It is performed on a gene-by-gene basis. In BDSGA, mutation is performed by randomly choosing a gene with a certain mutation rate (Mr) and replacing its id value with another id randomly chosen from the remaining providers that can execute the corresponding task. An example illustrating the mutation operation is shown in Fig. 7.

Fig. 7

An example illustrating the mutation operation
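
A sketch of the gene-by-gene mutation described above: each gene is mutated with probability Mr by replacing its provider with a different provider from the corresponding task's list. The function name and copy-then-modify style are choices of this sketch.

    import random

    def mutate(chromosome, provider_lists, mr=0.01):
        """Replace, with probability mr, a gene's provider with another capable provider."""
        mutated = list(chromosome)
        for i, gene in enumerate(mutated):
            if random.random() < mr and len(provider_lists[i]) > 1:
                alternatives = [k for k in range(len(provider_lists[i])) if k != gene]
                mutated[i] = random.choice(alternatives)
        return mutated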

4.5.4 Overall BDSGA algorithm

The BDSGA algorithm is given in Fig. 8. The inputs to BDSGA are a branch B, the list of service providers spli of each task Ti in B (see Fig. 5), dl(B) (the deadline of branch B), pop_size (population size), Max_Gen (maximum number of generations), Xr (crossover rate), Mr (mutation rate), and the weights w1 and w2 of the fitness function. In steps 1–3, BDSGA sets the generation counter No_of_Gens to 0 and then generates the initial population of pop_size chromosomes, where the genes of each chromosome are populated with service provider ids randomly selected from the lists of service providers of the corresponding tasks. In steps 4–6, it calculates the schedule (time and cost) of each chromosome, according to the provider ids placed in its genes, using the procedure ComputeBranchSchedule(), shown in Fig. 9. In step 8, these schedules are used to evaluate the fitness of each chromosome, using Eq. (7), and the best chromosome is saved. Next, steps 9–18 form a while loop in which the steps of generating a new population, evaluating it, and keeping the best chromosome are repeated until No_of_Gens reaches Max_Gen. Finally, the best chromosome (the best schedule for branch B) is returned.

Fig. 8

The proposed GA-based branch division scheduling algorithm (BDSGA)

Fig. 9

The proposed procedure for computing a branch schedule

4.5.5 Decoding

After obtaining the best chromosome, which represents the best schedule for the given branch B, it is decoded in order to set the start and end times of each task in that chromosome. This decoding is performed by the procedure ComputeBranchSchedule(), shown in Fig. 9. It uses the provider id placed in each gene to obtain the time and price offered by that provider for the corresponding task. The procedure then calculates the best execution cost and time for the whole branch by summing the prices and times of all tasks in B, using Eq. (5) and Eq. (6), respectively.
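
The decoding step can be sketched as follows, assuming each gene indexes a (time, cost) offer and the tasks of a branch execute sequentially from the branch's start time; the dictionary layout of the returned schedule is an assumption of this sketch rather than the exact output of Fig. 9.

    def compute_branch_schedule(chromosome, provider_lists, branch_start_time=0.0):
        """Decode a branch chromosome into per-task start/end times and branch totals.

        Returns (per_task_schedule, branch_time, branch_cost), where branch_time and
        branch_cost correspond to Eqs. (6) and (5), respectively.
        """
        schedule, clock, total_cost = [], branch_start_time, 0.0
        for i, gene in enumerate(chromosome):
            t, c = provider_lists[i][gene]
            schedule.append({"task_index": i, "provider": gene,
                             "start": clock, "end": clock + t, "cost": c})
            clock += t                     # tasks within a branch run sequentially
            total_cost += c
        return schedule, clock - branch_start_time, total_cost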

4.6 Overall grid workflow tasks scheduling algorithm (GWTSA)

The proposed Grid Workflow Tasks Scheduling Algorithm (GWTSA) is described in this section. It schedules the tasks of a workflow on grid services based on users’ QoS constraints. Figure 10 shows the flowchart of the proposed GWTSA, and Fig. 11 shows its procedural details. As shown in Fig. 11, the input to GWTSA is the workflow graph G (T, E, D), where T = {Ti, i = 1... n} denotes the set of workflow tasks, E denotes the set of edges connecting tasks, and D denotes the user-provided deadline for the execution of the workflow. The output of GWTSA is an optimal schedule for all workflow tasks, which minimizes both the time and the cost of workflow execution, such that the optimized time is within the specified deadline. The algorithm starts by requesting processing times and prices from the available grid services for all workflow tasks. It then divides the workflow tasks into independent branches (sequences of simple tasks) and synchronization tasks, by using the DAG_Dividing algorithm shown in Fig. 3, generating a reduced graph G′(P, E', D), where P denotes the set of divisions (branches and synchronization tasks), and E' denotes the set of directed edges between divisions in G'. Next, it uses the deadline assignment algorithm, Deadline_Distributing, shown in Fig. 4, to distribute the overall deadline D over the divisions. Finally, GWTSA generates the execution schedule of the entire workflow from the optimal sub-schedules of the task divisions. If a task division is a branch, its optimal schedule is obtained using BDSGA, as described in Sect. 4.5; otherwise, STS is used, as described in Sect. 4.4.1. If a task division has one or more child divisions, the procedure HandleChildDivision() is called to compute their schedules.

Fig. 10

Flowchart of the proposed GWTSA

Fig. 11

The proposed GWTSA
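
The overall driver can be sketched as a queue-based traversal of the divided graph G′, scheduling each division once all of its parents have been scheduled and dispatching branches and synchronization tasks to their respective schedulers. This is a simplified outline under assumed data structures; the full GWTSA of Fig. 11 additionally performs service detection, DAG dividing, and deadline distribution before this loop.

    from collections import deque

    def schedule_divisions(divisions, edges, deadlines, schedule_branch, schedule_sync):
        """Schedule the divisions of G' in dependency order.

        divisions: dict id -> "B" (branch) or "Y" (synchronization task);
        edges: (parent, child) pairs over G'; deadlines: sub-deadline per division;
        schedule_branch / schedule_sync: callables returning a division's sub-schedule
        (playing the roles of BDSGA and STS).
        """
        parents = {d: [p for p, c in edges if c == d] for d in divisions}
        children = {d: [c for p, c in edges if p == d] for d in divisions}
        scheduled = {}
        queue = deque(d for d in divisions if not parents[d])     # entry divisions

        while queue:                                              # akin to HandleChildDivision()
            d = queue.popleft()
            if d in scheduled or any(p not in scheduled for p in parents[d]):
                continue                                          # wait until every parent is done
            scheduler = schedule_branch if divisions[d] == "B" else schedule_sync
            scheduled[d] = scheduler(d, deadlines[d])
            queue.extend(children[d])                             # enqueue child divisions
        return scheduled                                          # combined workflow schedule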

4.7 Complexity analysis

The complexity of the different parts of the GWTSA algorithm is analyzed as follows:

Task Graph Division (DAG_Dividing): The complexity of this part depends on the size of the original task graph. If there are N tasks in the graph, and each task is examined once, the complexity would be O(N).

Deadline Distribution (Deadline_Distributing): This part involves traversing the possible paths in the graph and performing calculations for each division. The complexity is influenced by the number of paths and divisions. If there are M divisions and R paths, the complexity is around O(M * R).

Division Scheduling (Queue Q and HandleChildDivision): The complexity of this part is dominated by the genetic algorithm (GA). The complexity of one GA run is O(G * P), where G is the number of generations and P is the population size. If there are K divisions to be scheduled, and each division undergoes GA-based scheduling, the overall complexity is O(K * G * P).

Queue Handling (while loop in HandleChildDivision): This part involves handling the queue of divisions to be scheduled. The complexity here depends on the number of divisions and their relationships. If there are L divisions in the queue, the complexity is O(L).

Overall, the complexity of the proposed GWTSA algorithm depends largely on the number of task divisions (K), the total number of generations (G), and the population size (P).

5 A case study using GWTSA

To illustrate the working of the proposed grid scheduling approach, GWTSA, it was applied to the workflow modeled by the DAG given in Fig. 2 (adapted from [17]), which consists of 14 tasks. So, 14 service types were simulated, with a number of diverse service providers supporting each service type. Table 1 shows, for each task, the QoS attributes of providers that will provide the same service type needed for processing this task. These attributes are: provider id, processing time (sec), and cost ($). We assumed that the required deadline (DL) is 350 s.

Table 1 QoS attributes (provider id, processing time in sec, and cost in $) of services of different providers for executing the tasks of the example workflow

The inputs to GWTSA were:

  • The specified deadline D;

  • The GA parameters: pop_size, Max_Gen, Mr, Xr, w1, and w2;

  • A file containing the edges of the DAG of the example workflow: 1–2, 1–5, 1–7, 1–8, 2–3, 3–4, 4–14, 5–6, 6–14, 7–10, 10–11, 11–14, 10–12, 12–13, 13–14, 8–9, 9–10; and

  • A file containing the service providers’ information shown in Table 1.

The outputs produced by GWTSA were:

  • A file containing the divisions of the example workflow DAG, with their types, as shown in Table 2.

  • A file containing the scheduled start and end times for each task with its service provider id, as shown in Table 3. Note that each provider id is prefixed with the corresponding task number to differentiate between providers of different tasks that have the same id. For example, 3:1 refers to the provider with id 1 of task 3, and 5:1 refers to the provider with id 1 of task 5.

  • A file containing the scheduled start and end times for each division, as shown in Table 4.

  • The resulting best schedule time was 161 s, and the best cost was $175.

Table 2 The divisions of the example workflow DAG, with their types (Y: synchronization and B: branch)
Table 3 The scheduled start and end times for each task with its service provider id
Table 4 The scheduled start and end times for each division

6 Experimental results

Two types of experiments have been carried out to assess GWTSA performance. In the first type of experiment, three workflows of 11, 14, and 25 tasks were used. For each task in each workflow, a different service type with diverse service providers was simulated.

6.1 Comparative scheduling algorithms

GWTSA was applied to the three workflows, with different deadlines, and the results were compared with three other scheduling algorithms, namely Greedy-Time (GT) [17], Greedy-Cost (GC) [17], and Modified Greedy-Cost (MGC). For processing each task, GC chooses the lowest-cost service, whereas GT chooses the quickest service. MGC searches for the lowest-cost service for processing each task within the required deadline. The evaluation criteria were the execution cost and the time constraint: the first shows the cost of scheduling the workflow tasks on the utilized service grid, while the second shows whether the scheduling approach has generated a schedule that satisfies the specified deadline. Table 5 shows some characteristics of the proposed algorithm and the other scheduling algorithms. For each deadline value, we ran GWTSA ten times and calculated the average of the best time and cost values generated. The GA parameters used were: pop_size = 10, Max_Gen = 50, Xr = 0.70, Mr = 0.01, and w1 = w2 = 0.5. The algorithms were implemented in C# and run on a TOSHIBA laptop with an Intel(R) Core™ i5-2430M CPU at 2.4 GHz and 4 GB of RAM.

Table 5 Characteristics of comparative baseline algorithms

6.2 Comparative results

Figures 12, 13, and 14 show comparisons between the results of applying the four scheduling algorithms to the three workflows in terms of execution times and costs. Figures 12a, 13a, and 14a show that the expected execution time of the three workflows using GWTSA and MGC increases as the deadline is relaxed. For the first workflow (11 tasks), the expected execution time using GWTSA is lower than that of MGC. For the second workflow (14 tasks), the expected execution time using GWTSA is lower than that of MGC, except for some deadline values in the middle of the range. For the third workflow (25 tasks), the expected execution times using GWTSA and MGC are very close for lower deadlines, but GWTSA generates lower execution times for higher deadlines. The workflow execution time using the GC algorithm is higher, and GC cannot meet the required deadline when it is low. GT generates lower execution times than the other three algorithms.

Fig. 12

Expected execution time (a) and cost (b) for the workflow of 11 tasks using the four scheduling algorithms

Fig. 13

Expected execution time (a) and cost (b) for the workflow of 14 tasks using the four scheduling algorithms

Fig. 14

Expected execution time (a) and cost (b) for the workflow of 25 tasks using the four scheduling algorithms

As shown in Figs. 12b, 13b, and 14b, for all three workflows, the execution cost using the GT algorithm is higher, but when using GWTSA and MGC it is reduced as the deadline is relaxed. For the first two workflows (11 and 14 tasks), the execution cost using MGC is lower than that of GWTSA, while for the third workflow (25 tasks), the execution costs using GWTSA and MGC are very close for lower deadlines, but MGC generates lower execution costs for higher deadlines. GC generates lower execution costs than the other three algorithms.

As can be seen from these results, GWTSA tries to optimize both the execution time and the cost; the MGC algorithm tries to minimize the cost while keeping the execution time within the required deadline; whereas the GT and GC algorithms try to minimize the execution time and the cost, respectively.

In the second type of experiment, we studied the effect of varying the weighting coefficients, w1 and w2, of the BDSGA objective function (Eq. 7). We applied GWTSA to the workflow of 14 tasks with DL = 350 s and different values of w1 and w2, where w1 + w2 = 1. Figure 15 shows the effect of varying w1 and w2 on the behavior of the objective function F and on the expected execution time and cost. F reaches its minimum at the two points (w1 = 0, w2 = 1) and (w1 = 1, w2 = 0), and its maximum at the point (w1 = 0.6, w2 = 0.4). The figure also shows that increasing the time weighting coefficient (w1) while decreasing the cost weighting coefficient (w2) gives more emphasis to time minimization, whereas increasing w2 while decreasing w1 gives more emphasis to cost minimization. This means that the optimal schedule varies with the weighting coefficients. Users can therefore control the workflow schedule by choosing weighting coefficients that assign a preference order to the time and cost objectives.

Fig. 15

The effect of varying the weighting coefficients, w1 and w2, of the BDSGA objective function, on the objective function F and expected execution time and cost

6.3 Reasons for optimal performance of the proposed GWTSA

  • The problem of scheduling workflow tasks on the grid is formulated as a multi-objective optimization problem, in which the execution time and cost are minimized, such that the optimized time is within the deadline imposed by the user.

  • It employs a GA-based technique to compute the optimal execution sub-schedule for each set of sequential tasks, represented by a branch division in the workflow DAG.

  • This technique uses a novel chromosome representation, in which the chromosome represents a branch division, where each gene holds the id of the service provider, chosen from the list of service providers capable of executing the corresponding task in the branch; and the fitness function is formulated as a multi-objective function of time and cost.

  • The optimal sub-schedules of all task divisions are used to obtain the execution schedule of the entire workflow.

7 Conclusion

This paper presented a GA-based approach, GWTSA, for scheduling workflow tasks on grid services based on users’ QoS constraints in terms of time and cost. For a given set of interdependent workflow tasks, GWTSA generates an optimal schedule that minimizes the execution time and cost, such that the optimized time is within the deadline imposed by the user. In this approach, a DAG is used to represent the dependencies between the workflow tasks. The DAG is divided, then the optimal sub-schedules of the task divisions are computed and used to generate the schedule for executing the entire workflow. A GA-based technique is employed in GWTSA to compute the optimal execution sub-schedule for each branch division, which consists of a set of sequential tasks.

Experiments have been carried out to assess GWTSA’s performance. The results were compared with three other scheduling algorithms: GC, GT, and MGC. The results indicated that GWTSA tries to optimize both the execution time and cost; the MGC algorithm tries to minimize the cost while keeping the execution time within the required deadline; whereas the GT and GC algorithms try to minimize the execution time and cost, respectively.

We have also studied the effect of varying the weighting coefficients, w1 and w2, of the BDSGA objective function. The results indicated that the optimal schedule varies with the weighting coefficients. So, users can control the workflow schedule by choosing weighting coefficients to assign a preference order to the time and cost objectives.

In future work, we intend to modify the proposed workflow scheduler GWTSA to consider resource dynamics, such that the schedule is adapted and updated during scheduling according to these dynamics. We also intend to augment BDSGA with another metaheuristic, such as simulated annealing (SA), to improve the optimal execution schedule it produces for each branch division.