Abstract
In a cloud computing environment, companies have the ability to allocate resources according to demand. However, there is a delay that may take minutes between the request for a new resource and it being ready for using. This causes the reactive techniques, which request a new resource only when the system reaches a certain load threshold, to be not suitable for the resource allocation process. To address this problem, it is necessary to predict requests that arrive at the system in the next period of time to allocate the necessary resources, before the system becomes overloaded. There are several time series forecasting models to calculate the workload predictions based on history of monitoring data. However, it is difficult to know which is the best time series forecasting model to be used in each case. The work becomes even more complicated when the user does not have much historical data to be analyzed. Most related work considers only single methods to evaluate the results of the forecast. Other works propose an approach that selects suitable forecasting methods for a given context. But in this case, it is necessary to have a significant amount of data to train the classifier. Moreover, the best solution may not be a specific model, but rather a combination of models. In this paper we propose an adaptive prediction method using genetic algorithms to combine time series forecasting models. Our method does not require a previous phase of training, because it constantly adapts the extent to which the data are coming. To evaluate our proposal, we use three logs extracted from real Web servers. The results show that our proposal often brings the best result and is generic enough to adapt to various types of time series.
Similar content being viewed by others
References
1998 world cup web site access logs. http://ita.ee.lbl.gov/html/contrib/WorldCup.html. Accessed 15 Oct 2014
Clarknet-http-two weeks of http logs from the clarknet www server. http://ita.ee.lbl.gov/html/contrib/ClarkNet-HTTP.html. Accessed 15 Oct 2014
Nasa-http- two months of http logs from the ksc-nasa www server. http://ita.ee.lbl.gov/html/contrib/NASA-HTTP.html. Accessed 15 Oct 2014
Rightscale. set up autoscaling using voting tags. https://support.rightscale.com/. Accessed 1 Oct 2014
Ali-Eldin A, Tordsson J, Elmroth E (2012) An adaptive hybrid elasticity controller for cloud infrastructures. In: Network operations and management symposium (NOMS), 2012 IEEE, pp 204–212. IEEE
Arlitt M, Jin T (2000) A workload characterization study of the 1998 world cup web site. IEEE Network 14(3):30–37
Armbrust M, Fox O, Griffith R, Joseph AD, Katz Y, Konwinski A, Lee G, Patterson D, Rabkin A, Stoica I et al (2009) Above the clouds: a Berkeley view of cloud computing
Balaji M, Rao G, Kumar C et al (2014) A comparitive study of predictive models for cloud infrastructure management. In: 14th IEEE/ACM international symposium on cluster, cloud and grid computing (CCGrid), 2014, pp 923–926. IEEE
Beloglazov A, Buyya R (2010) Adaptive threshold-based approach for energy-efficient consolidation of virtual machines in cloud data centers. In: Proceedings of the 8th international workshop on middleware for grids, clouds and e-science, p 4. ACM
Box GE, Jenkins GM, Reinsel GC (2013) Time series analysis: forecasting and control. Wiley, New York
Braun J, Murdoch DJ (2007) A first course in statistical programming with R, vol 25. Cambridge University Press, Cambridge
Chieu TC, Mohindra A, Karve AA (2011) Scalability and performance of web applications in a compute cloud. In: 2011 IEEE 8th International Conference on e-business engineering (ICEBE), pp 317–323. IEEE
Chieu TC, Mohindra A, Karve AA, Segal A (2009) Dynamic scaling of web applications in a virtualized cloud computing environment. In: IEEE International Conference on e-business engineering, 2009. ICEBE’09. pp 281–286. IEEE
Clark T (2010) Quantifying the benefits of the rightscale cloud management platform. Fact point group whitepaper, funded by Rightscale
Dantzig GB (1998) Linear programming and extensions. Princeton University Press, Princeton
Di Penta M, Casazza G, Antoniol G, Merlo E (2001) Modeling web maintenance centers through queue models. In: 5th European conference on software maintenance and reengineering, 2001, pp 131–138. IEEE
Fernandez H, Pierre G, Kielmann T et al (2014) Autoscaling web applications in heterogeneous cloud infrastructures. In: IEEE international conference on cloud engineering
Gross D, Shortle JF, Thompson JM, Harris CM (2013) Fundamentals of queueing theory. Wiley, New York
Herbst NR, Huber N, Kounev S, Amrehn E (2014) Self-adaptive workload classification and forecasting for proactive resource provisioning. Concurrency and computation: practice and experience
Hyndman RJ, Athanasopoulos G (2014) Forecasting: principles and practice. OTexts, New York
Hyndman RJ, Khandakar Y (2007) Automatic time series for forecasting: the forecast package for R. Tech. rep., MonashUniversity, Department of Econometrics and Business Statistics
Jiang J, Lu J, Zhang G, Long G (2013) Optimal cloud resource auto-scaling for web applications. In: 13th IEEE/ACM international symposium on cluster, cloud and grid computing (CCGrid), 2013, pp 58–65. IEEE
Kalyvianaki E, Charalambous T, Hand S (2009) Self-adaptive and self-configured cpu resource provisioning for virtualized servers using kalman filters. In: Proceedings of the 6th international conference on autonomic computing, pp 117–126. ACM
Kleinberg J, Tardos É (2006) Algorithm design. Pearson Education India, India
Lim HC, Babu S, Chase JS, Parekh SS (2009) Automated control in cloud computing: challenges and opportunities. In: Proceedings of the 1st workshop on automated control for datacenters and clouds, pp 13–18. ACM
Lorido-Botrán T, Miguel-Alonso J, Lozano JA (2012) Auto-scaling techniques for elastic applications in cloud environments. Department of Computer Architecture and Technology, University of Basque Country, Tech. Rep. EHU-KAT-IK-09 12
Lorido-Botran T, Miguel-Alonso J, Lozano JA (2014) A review of auto-scaling techniques for elastic applications in cloud environments. J Grid Comput 12(4):559–592
Math C (2014) The apache commons mathematics library
Miller M (2008) Cloud computing: web-based applications that change the way you work and collaborate online. Que publishing, New York
Padala P, Hou KY, Shin KG, Zhu X, Uysal M, Wang Z, Singhal S, Merchant A (2009) Automated control of multiple virtualized resources. In: Proceedings of the 4th ACM European conference on computer systems, pp 13–26. ACM
Roy N, Dubey A, Gokhale A (2011) Efficient autoscaling in the cloud using predictive models for workload forecasting. In: 2011 IEEE International Conference on cloud computing (CLOUD), pp 500–507. IEEE
Wang L, Xu J, Zhao M, Tu Y, Fortes JA (2011) Fuzzy modeling based resource management for virtualized database systems. In: 2011 IEEE 19th International Symposium on modeling, analysis & simulation of computer and telecommunication systems (MASCOTS), pp 32–42. IEEE
Xu J, Zhao M, Fortes J, Carpenter R, Yousif M (2007) On the use of fuzzy modeling in virtualized data center management. In: 4th International conference on autonomic computing, 2007. ICAC’07, pp 25–25. IEEE
Acknowledgments
We would like to thank the Federal Institute of Sao Paulo (IFSP) and the Higher Education Personnel Training Coordination (CAPES) for financial support. The logs used in this work were obtained in http://ita.ee.lbl.gov/html/traces.html.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
There are no conflicts of interest with reviewers, funding agencies, etc. and neither financial support for the development of this work that could have direct or potential influence or impart bias on the work.
Ethical standard
The content of this paper are according with of ethical and professional conduct described by the Springer.
Additional information
The reviewers are given the following countries: Spain, Italy, Germany, Australia and the Netherlands. None of the reviewers are from the same country or the same institution of the authors.
Rights and permissions
About this article
Cite this article
Messias, V.R., Estrella, J.C., Ehlers, R. et al. Combining time series prediction models using genetic algorithm to autoscaling Web applications hosted in the cloud infrastructure. Neural Comput & Applic 27, 2383–2406 (2016). https://doi.org/10.1007/s00521-015-2133-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-015-2133-3