Abstract
Given a group of heterogeneous blade servers in a cloud computing environment or a data center of a cloud computing provider, each having its own size and speed and its own amount of preloaded special tasks, we are facing the problem of optimal distribution of generic tasks over these blade servers, such that the average response time of generic tasks is minimized. Such performance optimization is important for a cloud computing provider to efficiently utilize all the available resources and to deliver the highest quality of service. We develop a queueing model for a group of heterogeneous blade servers, and formulate and solve the optimal load distribution problem of generic tasks for multiple heterogeneous blade servers in a cloud computing environment in two different situations, namely, special tasks with and without higher priority. Extensive numerical examples and data are demonstrated and some important observations are made. It is found that server sizes, server speeds, task execution requirement, and the arrival rates of special tasks all have significant impact on the average response time of generic tasks, especially when the total arrival rate of generic tasks is large. It is also found that the server size heterogeneity and the server speed heterogeneity do not have much impact on the average response time of generic tasks. Furthermore, larger (smaller, respectively) heterogeneity results in shorter (longer, respectively) average response time of generic tasks.
Similar content being viewed by others
References
Bonomi, F., Kumar, A.: Adaptive optimal load balancing in a nonhomogeneous multiserver system with a central job scheduler. IEEE Trans. Comput. 39(10), 1232–1250 (1990)
He, L., Jarvis, S.A., Spooner, D.P., Jiang, H., Dillenberger, D.N., Nudd, G.R.: Allocating non-real-time and soft real-time jobs in multiclusters. IEEE Trans. Parallel Distrib. Syst. 17(2), 99–112 (2006)
http://en.wikipedia.org/wiki/Cloud_computing. Accessed 17 May 2010
http://searchcloudcomputing.techtarget.com/sDefinition/0,,sid201_gci1287881,00.html. Accessed 17 May 2010
http://searchdatacenter.techtarget.com/sDefinition/0,,sid80_gci1070272,00.html. Accessed 17 May 2010
http://searchdatacenter.techtarget.com/sDefinition/0,,sid80_gci770169,00.html. Accessed 17 May 2010
Kameda, H., Li, J., Kim, C., Zhang, Y.: Optimal Load balancing in Distributed Computer Systems. Springer, London (1997)
Kleinrock, L.: Queueing Systems, Volume 1: Theory. Wiley, New York (1975)
Li, K.: Minimizing mean response time in heterogeneous multiple computer systems with a central stochastic job dispatcher. Int. J. Comput. Appl. 20(1), 32–39 (1998)
Li, K.: Optimizing average job response time via decentralized probabilistic job dispatching in heterogeneous multiple computer systems. Comput. J. 41(4), 223–230 (1998)
Li, K.: Minimizing the probability of load imbalance in heterogeneous distributed computer systems. Math. Comput. Model. 36(9–10), 1075–1084 (2002)
Li, K.: Optimal load distribution in nondedicated heterogeneous cluster and Grid computing environments. J. Syst. Architect. 54(1–2), 111–123 (2008)
Rommel, C.G.: The probability of load balancing success in a homogeneous network. IEEE Trans. Softw. Eng. 17(9), 922–933 (1991)
Ross, K.W., Yao, D.D.: Optimal load balancing and scheduling in a distributed computer system. J. ACM 38(3), 676–690 (1991)
Shirazi, B.A., Hurson, A.R., Kavi, K.M.: Scheduling and load balancing in parallel and distributed systems. In: IEEE Computer Society Press, Los Alamitos, California (1995)
Tang, X., Chanson, S.T.: Optimizing static job scheduling in a network of heterogeneous computers. In: Proceedings of International Conference on Parallel Processing, pp. 373–382. Toronto, Canada (2000)
Tantawi. A.N., Towsley, D.: Optimal static load balancing in distributed computer systems. J. ACM 32(2), 445–465 (1985)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Li, K. Optimal Load Distribution for Multiple Heterogeneous Blade Servers in a Cloud Computing Environment. J Grid Computing 11, 27–46 (2013). https://doi.org/10.1007/s10723-012-9239-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10723-012-9239-y