A cost-aware auto-scaling approach using the workload prediction in service clouds

Yang, Jingqi; Liu, Chuanchang; Shang, Yanlei; Cheng, Bo; Mao, Zexiang; Liu, Chunhong; Niu, Lisha; Chen, Junliang

doi:10.1007/s10796-013-9459-0

A cost-aware auto-scaling approach using the workload prediction in service clouds

Published: 06 October 2013

Volume 16, pages 7–18, (2014)
Cite this article

Information Systems Frontiers Aims and scope Submit manuscript

Jingqi Yang¹,
Chuanchang Liu¹,
Yanlei Shang¹,
Bo Cheng¹,
Zexiang Mao¹,
Chunhong Liu¹,
Lisha Niu¹ &
…
Junliang Chen¹

1719 Accesses
83 Citations
Explore all metrics

Abstract

Service clouds are distributed infrastructures which deploys communication services in clouds. The scalability is an important characteristic of service clouds. With the scalability, the service cloud can offer on-demand computing power and storage capacities to different services. In order to achieve the scalability, we need to know when and how to scale virtual resources assigned to different services. In this paper, a novel service cloud architecture is presented, and a linear regression model is used to predict the workload. Based on this predicted workload, an auto-scaling mechanism is proposed to scale virtual resources at different resource levels in service clouds. The auto-scaling mechanism combines the real-time scaling and the pre-scaling. Finally experimental results are provided to demonstrate that our approach can satisfy the user Service Level Agreement (SLA) while keeping scaling costs low.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Dynamic Provisioning of Cloud Resources Based on Workload Prediction

Cost-Availability Aware Scaling: Towards Optimal Scaling of Cloud Services

Article Open access 07 December 2023

Model-driven optimal resource scaling in cloud

Article 06 February 2017

References

Armbrust, M., Fox, A., Griffith, R., Joseph, A., Katz, R., Konwinski, A., Lee, G., Patterson, D., Rabkin, A., Stoica, I. (2009). Above the clouds: a berkeley view of cloud computing. EECS Department, University of Califonia, Berkeley, Technical report, UCB/EECS-2009-28.
Baltagi, B. H. (1998). Econometrics (pp. 4169). Berlin: Springer.
Calheiros, R., Ranjan, R., Beloglazov, A., De Rose, C., Buyya, R. (2011). Cloudsim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisoning algorithms. Software: Pratice and Experience, 41(1), 23–50.
Google Scholar
Caron, E., Desprez, F., Muresan, A. (2010). Forecasting for grid and cloud computing on-demand resources based on pattern matching. In Proceedings of 2nd IEEE international conference on cloud computing technology and science (pp. 456–463).
Dilley, J. A. (1996). Web Server Workload Characterization, Hewlett-Packard Laboratories Report, HPL-96-160. http://www.hpl.hp.com/techreports/96/HPL-96-160.html.
Dutta, S., Gera, S., Verma, A., Viswanathan, B. (2012). SmartScale: automatic application scaling in enterprise clouds. In Proceedings of the 2012 IEEE 5th international conference on cloud computing (pp. 221–228).
Gong, Z., Gu, X., Wilkes, J. (2010). PRESS: PRedictive elastic ReSource scaling for cloud systems. In Proceedings of the 2010 international conference on network and service management (pp. 9–16).
Han, R., Guo, L., Ghanem, M. M., Guo, Y. (2012). Lightweight resource scaling for cloud applications. In Proceedings of the 12th IEEE/ACM international symposium on cluster, cloud and grid computing (pp. 644–651).
Lin, C.-C., Wu, J.-J., Lin, J.-A., Song, L.-C., Liu, P. (2012). Automatic resource scaling based on application service requirements. In Proceedings of the 2012 IEEE 5th international conference on cloud computing (pp. 941–942).
Lin, C.-C., Liu, P., Wu, J.-J. (2011). Energy-aware virtual machine dynamic provision and scheduling for cloud computing. In Proceedings of the 2011 IEEE 4th international conference on cloud computing (pp. 736–737).
Mohagheghi, P., & Sther, T. (2011). Software engineering challenges for migration to the service cloud paradigm: ongoing work in the REMICS project. In Proceedings of 2011 IEEE world congress on services (pp. 507–514).
Roy, N., Dubey, A., Gokhale, A. (2011). Efficient autoscaling in the cloud using predictive models for workload forecasting. In Proceedings of the 2011 IEEE 4th international conference on cloud computing (pp. 500–507).
Samimi, F. A., McKinley, P. K., Sadjadi, S. M., Tang, C., Shapiro, J. K., Zhou, Z. (2007). Service clouds: distributed infrastructure for adaptive communication services. IEEE Transactions on Network and Service Management, 4(2), 84–95.
Article Google Scholar
Saripalli, P., Kiran, G., Shankar, R., Narware, H., Bindal, N. (2011). Load prediction and hot spot detection models for autonomic cloud computing. In Proceedings of the 2011 4th IEEE international conference on utility and cloud computing (pp. 397–402).
Wang, W., Chen, H., Chen, X. (2012). An availability-aware approach to resource placement of dynamic scaling in clouds. In Proceedings of the 2012 IEEE 5th international conference on cloud computing (pp. 930–931).

Download references

Acknowledgments

The work is supported by 973 program of National Basic Research Program of China (Grant No. 2011CB302506, 2011CB302704), National Natural Science Foundation of China (Grant No. 61132001, 61001118, 61171102), Program for New Century Excellent Talents in University (Grant No. NCET-11-0592), the National Key Technology Research and Development Program of China “Research on the mobile community cultural service aggregation supporting technology” (2012BAH94F02), and the Novel Mobile Service Control Network Architecture and Key Technologies (Grant No.2010ZX03004-001-01).

Author information

Authors and Affiliations

State Key Lab of Networking and Switching Technology, Beijing University of Posts & Telecommunications, Beijing, 100876, China
Jingqi Yang, Chuanchang Liu, Yanlei Shang, Bo Cheng, Zexiang Mao, Chunhong Liu, Lisha Niu & Junliang Chen

Authors

Jingqi Yang
View author publications
You can also search for this author in PubMed Google Scholar
Chuanchang Liu
View author publications
You can also search for this author in PubMed Google Scholar
Yanlei Shang
View author publications
You can also search for this author in PubMed Google Scholar
Bo Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Zexiang Mao
View author publications
You can also search for this author in PubMed Google Scholar
Chunhong Liu
View author publications
You can also search for this author in PubMed Google Scholar
Lisha Niu
View author publications
You can also search for this author in PubMed Google Scholar
Junliang Chen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jingqi Yang.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Yang, J., Liu, C., Shang, Y. et al. A cost-aware auto-scaling approach using the workload prediction in service clouds. Inf Syst Front 16, 7–18 (2014). https://doi.org/10.1007/s10796-013-9459-0

Download citation

Published: 06 October 2013
Issue Date: March 2014
DOI: https://doi.org/10.1007/s10796-013-9459-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A cost-aware auto-scaling approach using the workload prediction in service clouds

Abstract

Access this article

Similar content being viewed by others

Dynamic Provisioning of Cloud Resources Based on Workload Prediction

Cost-Availability Aware Scaling: Towards Optimal Scaling of Cloud Services

Model-driven optimal resource scaling in cloud

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A cost-aware auto-scaling approach using the workload prediction in service clouds

Abstract

Access this article

Similar content being viewed by others

Dynamic Provisioning of Cloud Resources Based on Workload Prediction

Cost-Availability Aware Scaling: Towards Optimal Scaling of Cloud Services

Model-driven optimal resource scaling in cloud

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation