Abstract
Host load prediction is one of the most effective measures for improving resource utilization in cloud computing systems. Due to the drastic fluctuation of the host load in the Cloud, accurately predicting the host load remains a challenge. In this paper, we propose a new prediction method that combines the Phase Space Reconstruction method and the Group Method of Data Handling based on an Evolutionary Algorithm. The performance of our proposed method is evaluated using two real-world load traces. The first is the load trace in a traditional distributed system, whereas the second is in a Google data center. The results show that the proposed method achieves a better prediction performance than some state-of-the-art methods.
Similar content being viewed by others
References
Load traces on unix systems (1997) http://www.cs.northwestern.edu/pdinda/LoadTraces/
More google cluster data. Google reach blog (2011) http://googleresearch.blogspot.com/2011/11/more-google-cluster-data.html
Akioka S, Muraoka Y (2004) Extended forecast of cpu and network load on computational grid. In: IEEE international symposium on cluster computing and the grid, 2004. CCGrid 2004. IEEE, pp 765–772
Bobroff N, Kochut A, Beaty K (2007) Dynamic placement of virtual machines for managing sla violations. In: 10th IFIP/IEEE international symposium on integrated network management, 2007. IM’07. IEEE, pp 119–128
Cleveland WS, Devlin SJ (1988) Locally weighted regression: an approach to regression analysis by local fitting. J Am Stat Assoc 83(403):596–610
Dabrowski C, Hunt F (2009) Using markov chain analysis to study dynamic behaviour in large-scale grid systems. In: Proceedings of the seventh Australasian symposium on grid computing and e-research, vol 99. Australian Computer Society, Inc., pp 29–40
Di S, Kondo D, Cirne W (2012) Characterization and comparison of cloud versus grid workloads. In: 2012 IEEE international conference on cluster computing (CLUSTER). IEEE, pp 230–238
Di S, Kondo D, Cirne W (2012) Host load prediction in a google compute cloud with a bayesian model. In: Proceedings of the international conference on high performance computing, networking, storage and analysis. IEEE Computer Society Press, p 21
Dinda PA, O’Hallaron DR (2000) Host load prediction using linear models. Clust Comput 3(4):265–280
Duy TVT, Sato Y, Inoguchi Y (2011) Improving accuracy of host load predictions on computational grids by artificial neural networks. Int J Parallel Emerg Distrib Syst 26(4):275–290
Guenter B, Jain N, Williams C (2011) Managing cost, performance, and reliability tradeoffs for energy-aware server provisioning. In: INFOCOM, 2011 Proceedings IEEE. IEEE, pp 1332–1340
Han M, Xi J, Xu S, Yin FL (2004) Prediction of chaotic time series based on the recurrent predictor neural network. IEEE Trans Signal Process 52(12):3409–3416
Kim H, Eykholt R, Salas J (1999) Nonlinear dynamics, delay times, and embedding windows. Phys D Nonlinear Phenom 127(1):48–60
Nariman-Zadeh N, Darvizeh A, Jamali A, Moeini A (2005) Evolutionary design of generalized polynomial neural networks for modelling and prediction of explosive forming process. J Mater Process Technol 164:1561–1571
Osman S, Subhraveti D, Su G, Nieh J (2002) The design and implementation of zap: a system for migrating computing environments. ACM SIGOPS Oper Syst Rev 36(SI):361–376
Packard NH, Crutchfield JP, Farmer JD, Shaw RS (1980) Geometry from a time series. Phys Rev Lett 45:712–716
Takens F (1981) Detecting strange attractors in turbulence. In: Dynamical systems and turbulence, Warwick 1980. Springer, Berlin, pp 366–381
Urgaonkar B, Shenoy P, Chandra A, Goyal P (2005) Dynamic provisioning of multi-tier internet applications. In: Proceedings, second international conference on autonomic computing, 2005. ICAC 2005. IEEE, pp 217–228
Wolf A, Swift JB, Swinney HL, Vastano JA (1985) Determining lyapunov exponents from a time series. Phys D Nonlinear Phenom 16(3):285–317
Wu Y, Yuan Y, Yang G, Zheng W (2007) Load prediction using hybrid model for computational grid. In: 2007 8th IEEE/ACM international conference on grid computing. IEEE, pp 235–242
Yang D, Cao J, Yu C, Xiao J (2012) A multi-step-ahead cpu load prediction approach in distributed system. In: 2012 second international conference on cloud and green computing (CGC). IEEE, pp 206–213
Zhang Q, Cheng L, Boutaba R (2010) Cloud computing: state-of-the-art and research challenges. J Internet Serv Appl 1(1):7–18
Zhang Q, Zhani MF, Zhang S, Zhu Q, Boutaba R, Hellerstein JL (2012) Dynamic energy-aware capacity provisioning for cloud computing environments. In: Proceedings of the 9th international conference on Autonomic computing. ACM, pp 145–154
Acknowledgments
This work was partially supported by Grant Nos. BE2011169, BK2011563 from the Natural Science Foundation of Jiangsu Province and Grant Nos. 61100111, 61300157, 61201425, 61271231 from the Natural Science Foundation of China.
Author information
Authors and Affiliations
Corresponding author
Additional information
Q. Yang and C. Peng are joint first authors.
Rights and permissions
About this article
Cite this article
Yang, Q., Peng, C., Zhao, H. et al. A new method based on PSR and EA-GMDH for host load prediction in cloud computing system. J Supercomput 68, 1402–1417 (2014). https://doi.org/10.1007/s11227-014-1097-x
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-014-1097-x