Abstract
Nowadays, Cloud services are gaining importance among users due to their cost-effectiveness and highly scalable resources. To meet the user’s demands, several data centres are built across the globe, which has severe environmental as well as economical concerns. Energy consumption is one of the most significant issue faced by cloud service providers. Prediction of accurate resource usage of the physical machine helps in effective utilization of resources in a data centre, resulting in minimizing an active number of physical machines, which helps to minimize the energy consumption of a data centre. Although several models till date focus on virtual machine consolidation with a notion of reducing energy consumption, the reduction of operational physical machines has not gathered enough attention. In this paper, we propose a prediction model to predict resource utilization of physical machines, which enables to effectively utilize the entire data centre’s resources to reduce energy consumption. First, the raw time series workload is processed to enhance the value of its features for better training and prediction of mean resource utilization in the cloud data centre using the proposed Sum Average (SA) algorithm. Afterward, Gaussian Mixture Model (GMM) is employed to cluster heterogeneous machines of data centre based on its resource usage which helps to analyze the prediction for each kind of configured machine available in a data centre. In addition, the Long Short Term Memory model (LSTM) is employed to predict the mean resource usage of physical machines for every clustered machine. Furthermore, the effectiveness of our proposed model is evaluated using the Google cluster trace usage dataset. Lastly, the proposed model is compared with Linear Regression, Moving Average, and Auto Regression Integrated Moving Average model. Root Mean Square Error (RMSE) analysis states that our proposed model outperforms the other compared techniques
Similar content being viewed by others
Data Availibility
Publicly available dataset is used for this research.
References
Alam, T.: Cloud computing and its role in the information technology. IAIC Transact. Sust. Digital Innovat. (ITSDI) 1, 108–115 (2021)
Ullah, Q.Z., Khan, G.M., Hassan, S., Iqbal, A., Ullah, F., Kwak, K.S.: A cartesian genetic programming based parallel neuroevolutionary model for cloud server’s cpu usage prediction. Electronics 10(1), 67 (2021)
Hsieh, S.-Y., Liu, C.-S., Buyya, R., Zomaya, A.Y.: Utilization-prediction-aware virtual machine consolidation approach for energy-efficient cloud data centers. J. Parallel Distribut. Comput. 139, 99–109 (2020)
Hussain, M., Wei, L.-F., Lakhan, A., Wali, S., Ali, S., Hussain, A.: Energy and performance-efficient task scheduling in heterogeneous virtualized cloud computing. Sust. Comput.: Informat. Syst. 30, 100517 (2021)
Osman, A., Sagahyroon, A., Aburukba, R., Aloul, F.: Optimization of energy consumption in cloud computing datacenters. Int. J. Elec. Comput. Eng. 11(1), 2088–8708 (2021)
Kumar, J., Goomer, R., Singh, A.K.: Long short term memory recurrent neural network (lstm-rnn) based workload forecasting model for cloud datacenters. Procedia Comput. Sci. 125, 676–682 (2018)
Ismaeel, S., Miri, A.: Using elm techniques to predict data centre vm requests. In: 2015 IEEE 2nd International Conference on Cyber Security and Cloud Computing, pp. 80–86 (2015). IEEE
Patel, E., Kushwaha, D.S.: Clustering cloud workloads: k-means vs gaussian mixture model. Procedia Comput. Sci. 171, 158–167 (2020)
Reiss, C., Wilkes, J., Hellerstein, J.L.: Google cluster-usage traces: format+ schema, pp. 1–14. Google Inc., White Paper (2011)
Zhang, Y., Liu, F., Wang, B., Lin, W., Zhong, G., Xu, M., Li, K.: A multi-output prediction model for physical machine resource usage in cloud data centers. Future Generation Computer Systems (2022)
Xu, M., Song, C., Wu, H., Gill, S.S., Ye, K., Xu, C.: esdnn: Deep neural network based multivariate workload prediction in cloud computing environments. ACM Transactions on Internet Technology (TOIT) (2022)
Mrhari, A., Hadi, Y.: Workload prediction using vmd and tcn in cloud computing. J. Adv. Informat. Technol. 13(3), 12 (2022)
Malik, S., Tahir, M., Sardaraz, M., Alourani, A.: A resource utilization prediction model for cloud data centers using evolutionary algorithms and machine learning techniques. Appl. Sci. 12(4), 2160 (2022)
Sood, S.K.: Function points-based resource prediction in cloud computing. Concurr. Comput.: Practice and Exp. 28(10), 2781–2794 (2016)
Mazumdar, S., Kumar, A.S.: Forecasting data center resource usage: An experimental comparison with time-series methods. In: International Conference on Soft Computing and Pattern Recognition, pp. 151–165 (2016). Springer
Dabbagh, M., Hamdaoui, B., Guizani, M., Rayes, A.: An energy-efficient vm prediction and migration framework for overcommitted clouds. IEEE Transac. Cloud Comput. 6(4), 955–966 (2016)
Kumar, N., Misra, S., Obaidat, M.S.: Collaborative learning automata-based routing for rescue operations in dense urban regions using vehicular sensor networks. IEEE Systems Journal 9(3), 1081–1090 (2014)
Tseng, F.-H., Wang, X., Chou, L.-D., Chao, H.-C., Leung, V.C.: Dynamic resource prediction and allocation for cloud data center using the multiobjective genetic algorithm. IEEE Systems Journal 12(2), 1688–1699 (2017)
Chaudhary, R., Aujla, G.S., Kumar, N., Rodrigues, J.J.: Optimized big data management across multi-cloud data centers: Software-defined-network-based analysis. IEEE Communications Magazine 56(2), 118–126 (2018)
Chen, J., Wang, Y.: A resource demand prediction method based on eemd in cloud computing. Procedia Comput. Sci. 131, 116–123 (2018)
Song, B., Yu, Y., Zhou, Y., Wang, Z., Du, S.: Host load prediction with long short-term memory in cloud computing. J. Supercomput. 74(12), 6554–6568 (2018)
Kumar, N., Rodrigues, J.J., Chilamkurti, N.: Bayesian coalition game as-a-service for content distribution in internet of vehicles. IEEE Internet of Things J. 1(6), 544–555 (2014)
Deepika, T., Prakash, P.: Power consumption prediction in cloud data center using machine learning. Int. J. Electr. Comput. Eng. (IJECE) 10(2), 1524–1532 (2020)
Hieu, N.T., Di Francesco, M., Ylä-Jääski, A.: Virtual machine consolidation with multiple usage prediction for energy-efficient cloud data centers. IEEE Transac. Serv. Comput. 13(1), 186–199 (2017)
Miglani, A., Kumar, N.: Deep learning models for traffic flow prediction in autonomous vehicles: A review, solutions, and challenges. Vehicular Commun. 20, 100184 (2019)
Karim, M.E., Maswood, M.M.S., Das, S., Alharbi, A.G.: Bhyprec: A novel bi-lstm based hybrid recurrent neural network model to predict the cpu workload of cloud virtual machine. IEEE Access 9, 131476–131495 (2021)
Banerjee, A., Dave, R.N.: Validating clusters using the hopkins statistic. In: 2004 IEEE International Conference on Fuzzy Systems (IEEE Cat. No. 04CH37542), vol. 1, pp. 149–153 (2004). IEEE
Ogbuabor, G., Ugwoke, F.: Clustering algorithm for a healthcare dataset using silhouette score value. Int. J. Comput. Sci. Informat. Technol. (IJCSIT) 10(2), 27–37 (2018)
Jupyter Notebook. https://jupyter.org/. Last accessed in December, 2021
Funding
No funding involved for this research.
Author information
Authors and Affiliations
Contributions
SG: Conceptualization, Implementation, Software, Validation, Writing - Original Draft. RA: Resources, Writing - Original Draft, Supervision.RS: Data Curation, Visualization, Resources, Supervision. IP: Supervision, Resources.
Corresponding author
Ethics declarations
Conflict of interest
There is no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Garg, S., Ahuja, R., Singh, R. et al. GMM-LSTM: a component driven resource utilization prediction model leveraging LSTM and gaussian mixture model. Cluster Comput 26, 3547–3563 (2023). https://doi.org/10.1007/s10586-022-03747-4
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10586-022-03747-4