Skip to main content
Log in

Combining time series prediction models using genetic algorithm to autoscaling Web applications hosted in the cloud infrastructure

  • Predictive Analytics Using Machine Learning
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

In a cloud computing environment, companies have the ability to allocate resources according to demand. However, there is a delay that may take minutes between the request for a new resource and it being ready for using. This causes the reactive techniques, which request a new resource only when the system reaches a certain load threshold, to be not suitable for the resource allocation process. To address this problem, it is necessary to predict requests that arrive at the system in the next period of time to allocate the necessary resources, before the system becomes overloaded. There are several time series forecasting models to calculate the workload predictions based on history of monitoring data. However, it is difficult to know which is the best time series forecasting model to be used in each case. The work becomes even more complicated when the user does not have much historical data to be analyzed. Most related work considers only single methods to evaluate the results of the forecast. Other works propose an approach that selects suitable forecasting methods for a given context. But in this case, it is necessary to have a significant amount of data to train the classifier. Moreover, the best solution may not be a specific model, but rather a combination of models. In this paper we propose an adaptive prediction method using genetic algorithms to combine time series forecasting models. Our method does not require a previous phase of training, because it constantly adapts the extent to which the data are coming. To evaluate our proposal, we use three logs extracted from real Web servers. The results show that our proposal often brings the best result and is generic enough to adapt to various types of time series.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20
Fig. 21
Fig. 22
Fig. 23
Fig. 24
Fig. 25

Similar content being viewed by others

References

  1. 1998 world cup web site access logs. http://ita.ee.lbl.gov/html/contrib/WorldCup.html. Accessed 15 Oct 2014

  2. Clarknet-http-two weeks of http logs from the clarknet www server. http://ita.ee.lbl.gov/html/contrib/ClarkNet-HTTP.html. Accessed 15 Oct 2014

  3. Nasa-http- two months of http logs from the ksc-nasa www server. http://ita.ee.lbl.gov/html/contrib/NASA-HTTP.html. Accessed 15 Oct 2014

  4. Rightscale. set up autoscaling using voting tags. https://support.rightscale.com/. Accessed 1 Oct 2014

  5. Ali-Eldin A, Tordsson J, Elmroth E (2012) An adaptive hybrid elasticity controller for cloud infrastructures. In: Network operations and management symposium (NOMS), 2012 IEEE, pp 204–212. IEEE

  6. Arlitt M, Jin T (2000) A workload characterization study of the 1998 world cup web site. IEEE Network 14(3):30–37

    Article  Google Scholar 

  7. Armbrust M, Fox O, Griffith R, Joseph AD, Katz Y, Konwinski A, Lee G, Patterson D, Rabkin A, Stoica I et al (2009) Above the clouds: a Berkeley view of cloud computing

  8. Balaji M, Rao G, Kumar C et al (2014) A comparitive study of predictive models for cloud infrastructure management. In: 14th IEEE/ACM international symposium on cluster, cloud and grid computing (CCGrid), 2014, pp 923–926. IEEE

  9. Beloglazov A, Buyya R (2010) Adaptive threshold-based approach for energy-efficient consolidation of virtual machines in cloud data centers. In: Proceedings of the 8th international workshop on middleware for grids, clouds and e-science, p 4. ACM

  10. Box GE, Jenkins GM, Reinsel GC (2013) Time series analysis: forecasting and control. Wiley, New York

    MATH  Google Scholar 

  11. Braun J, Murdoch DJ (2007) A first course in statistical programming with R, vol 25. Cambridge University Press, Cambridge

    Book  MATH  Google Scholar 

  12. Chieu TC, Mohindra A, Karve AA (2011) Scalability and performance of web applications in a compute cloud. In: 2011 IEEE 8th International Conference on e-business engineering (ICEBE), pp 317–323. IEEE

  13. Chieu TC, Mohindra A, Karve AA, Segal A (2009) Dynamic scaling of web applications in a virtualized cloud computing environment. In: IEEE International Conference on e-business engineering, 2009. ICEBE’09. pp 281–286. IEEE

  14. Clark T (2010) Quantifying the benefits of the rightscale cloud management platform. Fact point group whitepaper, funded by Rightscale

  15. Dantzig GB (1998) Linear programming and extensions. Princeton University Press, Princeton

    MATH  Google Scholar 

  16. Di Penta M, Casazza G, Antoniol G, Merlo E (2001) Modeling web maintenance centers through queue models. In: 5th European conference on software maintenance and reengineering, 2001, pp 131–138. IEEE

  17. Fernandez H, Pierre G, Kielmann T et al (2014) Autoscaling web applications in heterogeneous cloud infrastructures. In: IEEE international conference on cloud engineering

  18. Gross D, Shortle JF, Thompson JM, Harris CM (2013) Fundamentals of queueing theory. Wiley, New York

    MATH  Google Scholar 

  19. Herbst NR, Huber N, Kounev S, Amrehn E (2014) Self-adaptive workload classification and forecasting for proactive resource provisioning. Concurrency and computation: practice and experience

  20. Hyndman RJ, Athanasopoulos G (2014) Forecasting: principles and practice. OTexts, New York

    Google Scholar 

  21. Hyndman RJ, Khandakar Y (2007) Automatic time series for forecasting: the forecast package for R. Tech. rep., MonashUniversity, Department of Econometrics and Business Statistics

  22. Jiang J, Lu J, Zhang G, Long G (2013) Optimal cloud resource auto-scaling for web applications. In: 13th IEEE/ACM international symposium on cluster, cloud and grid computing (CCGrid), 2013, pp 58–65. IEEE

  23. Kalyvianaki E, Charalambous T, Hand S (2009) Self-adaptive and self-configured cpu resource provisioning for virtualized servers using kalman filters. In: Proceedings of the 6th international conference on autonomic computing, pp 117–126. ACM

  24. Kleinberg J, Tardos É (2006) Algorithm design. Pearson Education India, India

    Google Scholar 

  25. Lim HC, Babu S, Chase JS, Parekh SS (2009) Automated control in cloud computing: challenges and opportunities. In: Proceedings of the 1st workshop on automated control for datacenters and clouds, pp 13–18. ACM

  26. Lorido-Botrán T, Miguel-Alonso J, Lozano JA (2012) Auto-scaling techniques for elastic applications in cloud environments. Department of Computer Architecture and Technology, University of Basque Country, Tech. Rep. EHU-KAT-IK-09 12

  27. Lorido-Botran T, Miguel-Alonso J, Lozano JA (2014) A review of auto-scaling techniques for elastic applications in cloud environments. J Grid Comput 12(4):559–592

    Article  Google Scholar 

  28. Math C (2014) The apache commons mathematics library

  29. Miller M (2008) Cloud computing: web-based applications that change the way you work and collaborate online. Que publishing, New York

    Google Scholar 

  30. Padala P, Hou KY, Shin KG, Zhu X, Uysal M, Wang Z, Singhal S, Merchant A (2009) Automated control of multiple virtualized resources. In: Proceedings of the 4th ACM European conference on computer systems, pp 13–26. ACM

  31. Roy N, Dubey A, Gokhale A (2011) Efficient autoscaling in the cloud using predictive models for workload forecasting. In: 2011 IEEE International Conference on cloud computing (CLOUD), pp 500–507. IEEE

  32. Wang L, Xu J, Zhao M, Tu Y, Fortes JA (2011) Fuzzy modeling based resource management for virtualized database systems. In: 2011 IEEE 19th International Symposium on modeling, analysis & simulation of computer and telecommunication systems (MASCOTS), pp 32–42. IEEE

  33. Xu J, Zhao M, Fortes J, Carpenter R, Yousif M (2007) On the use of fuzzy modeling in virtualized data center management. In: 4th International conference on autonomic computing, 2007. ICAC’07, pp 25–25. IEEE

Download references

Acknowledgments

We would like to thank the Federal Institute of Sao Paulo (IFSP) and the Higher Education Personnel Training Coordination (CAPES) for financial support. The logs used in this work were obtained in http://ita.ee.lbl.gov/html/traces.html.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Valter Rogério Messias.

Ethics declarations

Conflict of interest

There are no conflicts of interest with reviewers, funding agencies, etc. and neither financial support for the development of this work that could have direct or potential influence or impart bias on the work.

Ethical standard

The content of this paper are according with of ethical and professional conduct described by the Springer.

Additional information

The reviewers are given the following countries: Spain, Italy, Germany, Australia and the Netherlands. None of the reviewers are from the same country or the same institution of the authors.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Messias, V.R., Estrella, J.C., Ehlers, R. et al. Combining time series prediction models using genetic algorithm to autoscaling Web applications hosted in the cloud infrastructure. Neural Comput & Applic 27, 2383–2406 (2016). https://doi.org/10.1007/s00521-015-2133-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-015-2133-3

Keywords

Navigation