Metrics-Based Auto Scaling Module for Amazon Web Services Cloud Platform

Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 716)


One of the key benefits of moving an application to the cloud is the ability to easy scale horizontally when the workload increases. Many cloud providers offer a mechanism of auto scaling which dynamically adjusts the number of virtual server instances, on which given system is running, according to some basic resource-based metrics like CPU utilization. In this work, we propose a model of auto scaling which is based on timing statistics: a high order quantile and a mean value, which are calculated from custom metrics, like execution time of a user request, gathered on application level. Inputs to the model are user defined values of those custom metrics. We developed software module that controls a number of virtual server instances according to both auto scaling models and conducted experiments that show our model based on custom metrics can perform better, while it uses less instances and still maintains assumed time constraints.


Cloud computing Scalability Auto scaling Custom metrics Load balancing 



This work was supported by NCBiR of Poland (No. INNOTECH-K3/IN3/46/229379/NCBR/14).


  1. 1.
    Andrikopoulos, V., Binz, T., Leymann, F., Strauch, S.: How to adapt applications for the cloud environment. Computing, 1–43 (2012).
  2. 2.
    Balalaie, A., Heydarnoori, A., Jamshidi, P.: Migrating to cloud-native architectures using microservices: an experience report. In: Celesti, A., Leitner, P. (eds.) ESOCC 2015. Communications in Computer and Information Science, vol. 567, pp. 201–215. Springer International Publishing, Cham (2016). doi: 10.1007/978-3-319-33313-7_15 Google Scholar
  3. 3.
    Calheiros, R.N., Masoumi, E., Ranjan, R., Buyya, R.: Workload prediction using ARIMA model and its impact on cloud applications’ QoS. IEEE Trans. Cloud Comput. 3(4), 449–458 (2015). CrossRefGoogle Scholar
  4. 4.
    Del Ra III, W.: Service design patterns: fundamental design solutions for SOAP/WSDL and RESTful web services by robert daigneau. ACM SIGSOFT Softw. Eng. Notes 37(4), 40 (2012). Google Scholar
  5. 5.
    Dias De Assuncao, M., Cardonha, C., Netto, M., Cunha, R.: Impact of user patience on auto-scaling resource capacity for cloud services. Future Gener. Comput. Syst. 1–10 (2015).
  6. 6.
    Evans, E.: Domain-Driven Design: Tackling Complexity in the Heart of Software. Addison-Wesley, Boston (2004).
  7. 7.
  8. 8.
    Jiang, J., Lu, J., Zhang, G., Long, G.: Optimal cloud resource auto-scaling for web applications. In: 13th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, CCGrid 2013, Delft, Netherlands, 13–16 May 2013, pp. 58–65 (2013).
  9. 9.
    Pahl, C.: Containerization and the PaaS cloud. IEEE Cloud Comput. 2(03), 24–31 (2015)CrossRefGoogle Scholar
  10. 10.
    Qu, C., Calheiros, R.N., Buyya, R.: Auto-scaling web applications in clouds: a taxonomy and survey. CoRR abs/1609.09224 (2016).
  11. 11.
    Roy, N., Dubey, A., Gokhale, A.: Efficient autoscaling in the cloud using predictive models for workload forecasting. In: Proceedings of the 2011 IEEE 4th International Conference on Cloud Computing, Cloud 2011, pp. 500–507 (2011).
  12. 12.
    Wilcox, R.: Chapter 3 - estimating measures of location and scale. In: Wilcox, R. (ed.) Introduction to Robust Estimation and Hypothesis Testing. Statistical Modeling and Decision Science, pp. 43–101. Academic Press, Boston (2012). CrossRefGoogle Scholar

Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  1. 1.Institute of InformaticsSilesian University of TechnologyGliwicePoland

Personalised recommendations