Skip to main content
Log in

Optimal Load Distribution for Multiple Heterogeneous Blade Servers in a Cloud Computing Environment

  • Published:
Journal of Grid Computing Aims and scope Submit manuscript

Abstract

Given a group of heterogeneous blade servers in a cloud computing environment or a data center of a cloud computing provider, each having its own size and speed and its own amount of preloaded special tasks, we are facing the problem of optimal distribution of generic tasks over these blade servers, such that the average response time of generic tasks is minimized. Such performance optimization is important for a cloud computing provider to efficiently utilize all the available resources and to deliver the highest quality of service. We develop a queueing model for a group of heterogeneous blade servers, and formulate and solve the optimal load distribution problem of generic tasks for multiple heterogeneous blade servers in a cloud computing environment in two different situations, namely, special tasks with and without higher priority. Extensive numerical examples and data are demonstrated and some important observations are made. It is found that server sizes, server speeds, task execution requirement, and the arrival rates of special tasks all have significant impact on the average response time of generic tasks, especially when the total arrival rate of generic tasks is large. It is also found that the server size heterogeneity and the server speed heterogeneity do not have much impact on the average response time of generic tasks. Furthermore, larger (smaller, respectively) heterogeneity results in shorter (longer, respectively) average response time of generic tasks.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Bonomi, F., Kumar, A.: Adaptive optimal load balancing in a nonhomogeneous multiserver system with a central job scheduler. IEEE Trans. Comput. 39(10), 1232–1250 (1990)

    Article  Google Scholar 

  2. He, L., Jarvis, S.A., Spooner, D.P., Jiang, H., Dillenberger, D.N., Nudd, G.R.: Allocating non-real-time and soft real-time jobs in multiclusters. IEEE Trans. Parallel Distrib. Syst. 17(2), 99–112 (2006)

    Article  Google Scholar 

  3. http://en.wikipedia.org/wiki/Cloud_computing. Accessed 17 May 2010

  4. http://searchcloudcomputing.techtarget.com/sDefinition/0,,sid201_gci1287881,00.html. Accessed 17 May 2010

  5. http://searchdatacenter.techtarget.com/sDefinition/0,,sid80_gci1070272,00.html. Accessed 17 May 2010

  6. http://searchdatacenter.techtarget.com/sDefinition/0,,sid80_gci770169,00.html. Accessed 17 May 2010

  7. Kameda, H., Li, J., Kim, C., Zhang, Y.: Optimal Load balancing in Distributed Computer Systems. Springer, London (1997)

    Book  MATH  Google Scholar 

  8. Kleinrock, L.: Queueing Systems, Volume 1: Theory. Wiley, New York (1975)

    Google Scholar 

  9. Li, K.: Minimizing mean response time in heterogeneous multiple computer systems with a central stochastic job dispatcher. Int. J. Comput. Appl. 20(1), 32–39 (1998)

    Google Scholar 

  10. Li, K.: Optimizing average job response time via decentralized probabilistic job dispatching in heterogeneous multiple computer systems. Comput. J. 41(4), 223–230 (1998)

    Article  MATH  Google Scholar 

  11. Li, K.: Minimizing the probability of load imbalance in heterogeneous distributed computer systems. Math. Comput. Model. 36(9–10), 1075–1084 (2002)

    Article  MATH  Google Scholar 

  12. Li, K.: Optimal load distribution in nondedicated heterogeneous cluster and Grid computing environments. J. Syst. Architect. 54(1–2), 111–123 (2008)

    Article  Google Scholar 

  13. Rommel, C.G.: The probability of load balancing success in a homogeneous network. IEEE Trans. Softw. Eng. 17(9), 922–933 (1991)

    Article  MathSciNet  Google Scholar 

  14. Ross, K.W., Yao, D.D.: Optimal load balancing and scheduling in a distributed computer system. J. ACM 38(3), 676–690 (1991)

    Article  MATH  Google Scholar 

  15. Shirazi, B.A., Hurson, A.R., Kavi, K.M.: Scheduling and load balancing in parallel and distributed systems. In: IEEE Computer Society Press, Los Alamitos, California (1995)

    Google Scholar 

  16. Tang, X., Chanson, S.T.: Optimizing static job scheduling in a network of heterogeneous computers. In: Proceedings of International Conference on Parallel Processing, pp. 373–382. Toronto, Canada (2000)

  17. Tantawi. A.N., Towsley, D.: Optimal static load balancing in distributed computer systems. J. ACM 32(2), 445–465 (1985)

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Keqin Li.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Li, K. Optimal Load Distribution for Multiple Heterogeneous Blade Servers in a Cloud Computing Environment. J Grid Computing 11, 27–46 (2013). https://doi.org/10.1007/s10723-012-9239-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10723-012-9239-y

Keywords

Navigation