An Analytical Model for Estimating Cloud Resources of Elastic Services

Salah, Khaled; Elbadawi, Khalid; Boutaba, Raouf

doi:10.1007/s10922-015-9352-x

An Analytical Model for Estimating Cloud Resources of Elastic Services

Published: 07 August 2015

Volume 24, pages 285–308, (2016)
Cite this article

Journal of Network and Systems Management Aims and scope Submit manuscript

Khaled Salah¹,
Khalid Elbadawi² &
Raouf Boutaba^3,4

911 Accesses
43 Citations
Explore all metrics

Abstract

In the cloud, ensuring proper elasticity for hosted applications and services is a challenging problem and far from being solved. To achieve proper elasticity, the minimal number of cloud resources that are needed to satisfy a particular service level objective (SLO) requirement has to be determined. In this paper, we present an analytical model based on Markov chains to predict the number of cloud instances or virtual machines (VMs) needed to satisfy a given SLO performance requirement such as response time, throughput, or request loss probability. For the estimation of these SLO performance metrics, our analytical model takes the offered workload, the number of VM instances as an input, and the capacity of each VM instance. The correctness of the model has been verified using discrete-event simulation. Our model has also been validated using experimental measurements conducted on the Amazon Web Services cloud platform.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

QoS-Aware Service VM Provisioning in Clouds: Experiences, Models, and Cost Analysis

Prediction-based VM provisioning and admission control for multi-tier web applications

Article Open access 27 September 2016

Computing Resources Scalability Performance Analysis in Cloud Computing Data Center

Article 31 October 2023

References

Azeez, A.: Auto-scaling web services on Amazon EC2 (2014). http://people.apache.org/~azeez/autoscaling-web-services-azeez.pdf
Amazon Inc.: Amazon web services auto scaling (2014). http://aws.amazon.com/autoscaling
Aceto, G., Botta, A., de Donato, W., Pescape, A.: Cloud monitoring: a survey. J. Comput. Netw. 57(9), 2093–2115 (2013)
Article Google Scholar
Amazon Inc.: AWS web services (2014). http://aws.amazon.com/
Google Inc.: Google compute engine (2014). https://cloud.google.com/products/compute-engine/
Google Inc.: Google App Engine (2014). http://appengine.google.com/
Lorido-Botran, T., Miguel-Alonso, J., Lozano, J.A.: A review of auto-scaling techniques for elastic applications in cloud environments. J. Grid Comput. 12(4), 559–592 (2014)
Article Google Scholar
Lagar-Cavilla, H, Whitney, J., Scannell, A., Patchin, P., Rumble, S., Lara, E., Brudno, M., Satyanarayanan, M., SnowFlock: rapid virtual machine cloning for cloud computing. In: Proceedings of the 4th ACM European Conference on Computer Systems, EuroSys’09, Nuremberg, Germany, March 2009, pp. 1–12
Mao, M., Humphrey, M.: A performance study on the MV startup time in the cloud. In: Proceedings of the 5^th IEEE International Conference on Cloud Computing (CLOUD2012), June 2012, pp. 423–430
Iqbal, W., Dailey, M., Carrera, D., Janecek, P.: Adaptive resource provisioning for read intensive multi-tier applications in the cloud. J. Future Gener. Comput. Syst. 27(6), 871–879 (2011)
Article Google Scholar
Liu, H., Wee, S.: Web server farm in the cloud: performance evaluation and dynamic architecture. In: Proceedings of the 1st 2009 International Conference on Cloud Computing, Springer, Berlin, pp. 369–380 (2009)
Wang, Z., Chen, Y., Gmach, D., Singhal, S., Watson, B., Rivera, W., Zhu, X., Hyser, C.: AppRAISE: application-level performance management in virtualized server environments. IEEE Trans. Netw. Serv. Manag. 6(4), 240–254 (2008)
Article Google Scholar
Urgaonkar, B., Shenoy, P., Chandra, A., Goyal, P., Wood, T.: Agile dynamic provisioning of mult-tier internet applications. ACM Trans. Auton. Adapt. Syst. 3, 1–39 (2008)
Article Google Scholar
Urgaonkar, B., Pacifici, G., Shenoy, P., Spreitzer, M., Tantawi, A.: An analytical model for multi-tier internet services and its applications. In: Proceedings of the 2005 ACM SIGMETRICS International Conference, vol. 33, Alberta, Canada, pp. 291–302
Khazaei, H., Misic, J., Misic, V.: Performance analysis of cloud computing centers using M/G/m/m + r queueing systems. IEEE Trans. Parallel Distrib. Syst. 23(5), 936–943 (2012)
Article Google Scholar
Kikuchi, S., Matsumoto, Y.: Performance modeling of concurrent live migration operations in cloud computing systems using PRISM probabilistic model checker. In: Proceedings of the 4th IEEE International Conference on Cloud Computing, Melbourne, Australia, pp. 49–56 (2011)
Firdhous, M., Ghazali, O., Hassan, S.: Modeling of cloud system using Erlang formulas. In: Proceedings of the 2011 7th Asia-Pacific Conference on Communications (APCC), Saba, Malaysia, October, pp. 411–416 (2011)
Xiong, K., Perros, H.: Service performance and analysis in cloud computing. In: Proceedings of the 2009 IEEE Congress on Services, Los Angeles, Californian, July 2009, pp. 693–700
Wuhib, F., Yanggratoke, R., Stadler, R.: Allocating compute and network resources under management objectives in large-scale clouds. J. Netw. Syst. Manag. 23, 111–136 (2015)
Article Google Scholar
Jennings, B., Stadler, R.: Resource management in clouds: survey and research challenges. J. Netw. Syst. Manag. 23, 567–619 (2015)
Article Google Scholar
Chunlin, L., Layuan, L.: Multi-layer resource management in cloud computing. J. Netw. Syst. Manag. 22(1), 100–120 (2014)
Article Google Scholar
Salah, K., Boutaba, R.: Estimating service response time for elastic cloud applications. In: Proceedings of the 1st IEEE International Conference on Cloud Networking (CloudNet 2012), Paris, France, 28–30 November 2012, pp. 12–16
Cockcroft, A.: Utilization is virtually useless as a metric. In: Proceedings of CMG 2006 Conference, December 2006
Salah, K.: Implementation and experimental evaluation of a simple packet rate estimator. AEU Int. J. Electron. Commun. 63(11), 977–985 (2009)
Article Google Scholar
Salah, K., Haidari, F.: Performance evaluation and comparison of four network packet rate estimators. AEU Int. J. Electron. Commun. 64(11), 1015–1023 (2010)
Article Google Scholar
Salah, K., Haidari, F.: On the performance of a simple packet rate estimator. In: IEEE/ACS International Conference on Computer Systems and Applications, 2008. AICCSA 2008 (2008)
Andersson, M., Bengtsson, A., Host, M., Nyberg, C.: Web server traffic in crisis conditions. In: Proceedings of the rd Swedish national computer networking workshop. Nov 2005
Leland, W., Taqqu, M., Willinger, W., Wilson, D.: On the self-similar nature of ethernet traffic. IEEE/ACM Trans. Netw. 2(1), 1–15 (1994)
Article Google Scholar
Paxson, V., Floyd, S.: Wide-area traffic: the failure of poisson modeling. IEEE/ACM Trans. Netw. 3(3), 226–244 (1995)
Article Google Scholar
Willinger, W., Taqqu, M., Sherman, R., Wilson, D.: Self-similarity through high-variability: statistical analysis of ethernet LAN traffic at the source level. In: Proceedings of ACM SIGCOMM, Cambridge, Massachusetts, pp. 100–113, Aug 1995
Salah, K., Elbadawi, K., Boutaba, R.: Performance modeling and analysis of network firewalls. IEEE Trans. Netw. Serv. Manag. 9(1), 12–21 (2012)
Article Google Scholar
Van Der Mei, R.D., Hariharan, R., Reeser, P.K.: Web server performance modeling. J. Telecommun. Syst. 16(3–4), 361–378 (2001)
MATH Google Scholar
Chandy, K.M., Sauer, C.H.: Approximate methods for analyzing queueing network models of computing systems. J. ACM Comput. Surv. 10(3), 281–317 (1978)
Article MATH Google Scholar
Vaquero, L., Rodero-Merino, L., Buyya, R.: Dynamically scaling applications in the cloud. ACM SIGCOMM Comput. Commun. Rev. 41(1), 45–52 (2011)
Article Google Scholar
Gross, D., Harris, C.: Fundamentals of Queueing Theory. Wiley, New York (1998)
MATH Google Scholar
Salah, K.: To coalesce or not to coalesce. Int. J. Electron. Commun. 61(4), 215–225 (2007)
Article Google Scholar
Jain, R.: The art of computer systems performance analysis: techniques for experimental design, measurement, simulation, and modeling. Wiley, New York (1991)
MATH Google Scholar
Amazon Inc.: Amazon Elastic Load Balancing (2014). http://aws.amazon.com/elasticloadbalancing/
Kleinrock, L.: Power and deterministic rules of thump for probabilistic problems in computer communications. In: Proceeding of the IEEE ICC’79, Boston, Massachusetts, June 1979
Law, A., Kelton, W.: Simulation Modeling and Analysis, 2nd edn. McGraw-Hill, New York (1991)
MATH Google Scholar
White, J.: An effective truncation heuristic for bias reduction in simulation output. Simul. J. 69(6), 323–334 (1997)
Article Google Scholar
Amazon Inc.: Amazon EC2 instances (2014). https://aws.amazon.com//ec2/instance-types/
Apache JMeter: Apache.org. http://jmeter.apache.org/
Custom Plugins for Apache JMeter: JMeter-Plugins.org. http://jmeter-plugins.org/
HAProxy: 2014. http://haproxy.1wt.eu/
AWS Documents: HAProxy layer (2014). http://docs.aws.amazon.com/opsworks/latest/userguide/workinglayers-load.html
Amazon Web Services: Amazon Virtual Private Cloud Route Tables. http://aws.amazon.com/documentation/vpc/
Botta, A., Dainotti, A., Pescapè, A.: A tool for the generation of realistic network workload for emerging networking scenarios. Comput. Netw. 56(15), 3531–3547 (2012)
Article Google Scholar
Distributed Internet Traffic Generator (2014). http://traffic.comics.unina.it/software/ITG/
Dainotti, A., Pescape, A., Ventre, G.: A packet-level characterization of network traffic. Proceedings of the 11th IEEE Workshop on Computer-Aided Modeling, Analysis and Design of Communication Links and Networks, pp. 38–45 (2006)
Salah, K., Hamawi, M.: Comparative packet-forwarding measurement of three popular operating systems. Int. J. Netw. Comput. Appl. 32(4), 1039–1048 (2009)
Article Google Scholar
Dejun, J., Pierre, G., Chi, C.-H.: EC2 performance analysis for resource provisioning of service-oriented applications. In: Proceedings of the 3rd Workshop on Non-functional Properties and SLA Management in Service-Oriented Computing, Nov 2009
Islam, S., Lee, K., Fekete, A., Liu, A.: How a consumer can measure elasticity for cloud platforms. In: Proceedings of the 3rd International Conference on Performance Engineering, Boston, MA, 22–25 April 2012
Mello, J.P.: Netflix rates broadband provided by bandwidth. In: PCWorld Magazine. 27 Jan 2011
Ward, N.: How to improve Netflix streaming (2014). http://www.helium.com/items/2067366-how-to-improve-netflix-streaming
Amazon Inc.: Amazon AWS Education Grants (2014). http://aws.amazon.com/education

Download references

Acknowledgments

We would like to acknowledge the reviewers for their invaluable comments and feedback that tremendously enhanced the quality of our work. Moreover, the experimental work in this paper was supported by a generous research Grant provided by Amazon AWS in Education [56].

Author information

Authors and Affiliations

Electrical and Computer Engineering Department, Khalifa University of Science, Technology and Research (KUSTAR), Abu Dhabi, UAE
Khaled Salah
School of Computing, DePaul University, Chicago, IL, USA
Khalid Elbadawi
David R. Cheriton School of CS, University of Waterloo, Waterloo, Canada
Raouf Boutaba
Division of IT Convergence Engineering, POSTECH, Pohang, Korea
Raouf Boutaba

Authors

Khaled Salah
View author publications
You can also search for this author in PubMed Google Scholar
Khalid Elbadawi
View author publications
You can also search for this author in PubMed Google Scholar
Raouf Boutaba
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Khaled Salah.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Salah, K., Elbadawi, K. & Boutaba, R. An Analytical Model for Estimating Cloud Resources of Elastic Services. J Netw Syst Manage 24, 285–308 (2016). https://doi.org/10.1007/s10922-015-9352-x

Download citation

Received: 10 November 2014
Accepted: 03 August 2015
Published: 07 August 2015
Issue Date: April 2016
DOI: https://doi.org/10.1007/s10922-015-9352-x

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An Analytical Model for Estimating Cloud Resources of Elastic Services

Abstract

Access this article

Similar content being viewed by others

QoS-Aware Service VM Provisioning in Clouds: Experiences, Models, and Cost Analysis

Prediction-based VM provisioning and admission control for multi-tier web applications

Computing Resources Scalability Performance Analysis in Cloud Computing Data Center

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

An Analytical Model for Estimating Cloud Resources of Elastic Services

Abstract

Access this article

Similar content being viewed by others

QoS-Aware Service VM Provisioning in Clouds: Experiences, Models, and Cost Analysis

Prediction-based VM provisioning and admission control for multi-tier web applications

Computing Resources Scalability Performance Analysis in Cloud Computing Data Center

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation