Skip to main content

Advertisement

Log in

GMM-LSTM: a component driven resource utilization prediction model leveraging LSTM and gaussian mixture model

  • Published:
Cluster Computing Aims and scope Submit manuscript

Abstract

Nowadays, Cloud services are gaining importance among users due to their cost-effectiveness and highly scalable resources. To meet the user’s demands, several data centres are built across the globe, which has severe environmental as well as economical concerns. Energy consumption is one of the most significant issue faced by cloud service providers. Prediction of accurate resource usage of the physical machine helps in effective utilization of resources in a data centre, resulting in minimizing an active number of physical machines, which helps to minimize the energy consumption of a data centre. Although several models till date focus on virtual machine consolidation with a notion of reducing energy consumption, the reduction of operational physical machines has not gathered enough attention. In this paper, we propose a prediction model to predict resource utilization of physical machines, which enables to effectively utilize the entire data centre’s resources to reduce energy consumption. First, the raw time series workload is processed to enhance the value of its features for better training and prediction of mean resource utilization in the cloud data centre using the proposed Sum Average (SA) algorithm. Afterward, Gaussian Mixture Model (GMM) is employed to cluster heterogeneous machines of data centre based on its resource usage which helps to analyze the prediction for each kind of configured machine available in a data centre. In addition, the Long Short Term Memory model (LSTM) is employed to predict the mean resource usage of physical machines for every clustered machine. Furthermore, the effectiveness of our proposed model is evaluated using the Google cluster trace usage dataset. Lastly, the proposed model is compared with Linear Regression, Moving Average, and Auto Regression Integrated Moving Average model. Root Mean Square Error (RMSE) analysis states that our proposed model outperforms the other compared techniques

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

Data Availibility

Publicly available dataset is used for this research.

References

  1. Alam, T.: Cloud computing and its role in the information technology. IAIC Transact. Sust. Digital Innovat. (ITSDI) 1, 108–115 (2021)

    Article  Google Scholar 

  2. Ullah, Q.Z., Khan, G.M., Hassan, S., Iqbal, A., Ullah, F., Kwak, K.S.: A cartesian genetic programming based parallel neuroevolutionary model for cloud server’s cpu usage prediction. Electronics 10(1), 67 (2021)

    Article  Google Scholar 

  3. Hsieh, S.-Y., Liu, C.-S., Buyya, R., Zomaya, A.Y.: Utilization-prediction-aware virtual machine consolidation approach for energy-efficient cloud data centers. J. Parallel Distribut. Comput. 139, 99–109 (2020)

    Article  Google Scholar 

  4. Hussain, M., Wei, L.-F., Lakhan, A., Wali, S., Ali, S., Hussain, A.: Energy and performance-efficient task scheduling in heterogeneous virtualized cloud computing. Sust. Comput.: Informat. Syst. 30, 100517 (2021)

    Google Scholar 

  5. Osman, A., Sagahyroon, A., Aburukba, R., Aloul, F.: Optimization of energy consumption in cloud computing datacenters. Int. J. Elec. Comput. Eng. 11(1), 2088–8708 (2021)

    Google Scholar 

  6. Kumar, J., Goomer, R., Singh, A.K.: Long short term memory recurrent neural network (lstm-rnn) based workload forecasting model for cloud datacenters. Procedia Comput. Sci. 125, 676–682 (2018)

    Article  Google Scholar 

  7. Ismaeel, S., Miri, A.: Using elm techniques to predict data centre vm requests. In: 2015 IEEE 2nd International Conference on Cyber Security and Cloud Computing, pp. 80–86 (2015). IEEE

  8. Patel, E., Kushwaha, D.S.: Clustering cloud workloads: k-means vs gaussian mixture model. Procedia Comput. Sci. 171, 158–167 (2020)

    Article  Google Scholar 

  9. Reiss, C., Wilkes, J., Hellerstein, J.L.: Google cluster-usage traces: format+ schema, pp. 1–14. Google Inc., White Paper (2011)

    Google Scholar 

  10. Zhang, Y., Liu, F., Wang, B., Lin, W., Zhong, G., Xu, M., Li, K.: A multi-output prediction model for physical machine resource usage in cloud data centers. Future Generation Computer Systems (2022)

  11. Xu, M., Song, C., Wu, H., Gill, S.S., Ye, K., Xu, C.: esdnn: Deep neural network based multivariate workload prediction in cloud computing environments. ACM Transactions on Internet Technology (TOIT) (2022)

  12. Mrhari, A., Hadi, Y.: Workload prediction using vmd and tcn in cloud computing. J. Adv. Informat. Technol. 13(3), 12 (2022)

    Article  Google Scholar 

  13. Malik, S., Tahir, M., Sardaraz, M., Alourani, A.: A resource utilization prediction model for cloud data centers using evolutionary algorithms and machine learning techniques. Appl. Sci. 12(4), 2160 (2022)

    Article  Google Scholar 

  14. Sood, S.K.: Function points-based resource prediction in cloud computing. Concurr. Comput.: Practice and Exp. 28(10), 2781–2794 (2016)

    Article  Google Scholar 

  15. Mazumdar, S., Kumar, A.S.: Forecasting data center resource usage: An experimental comparison with time-series methods. In: International Conference on Soft Computing and Pattern Recognition, pp. 151–165 (2016). Springer

  16. Dabbagh, M., Hamdaoui, B., Guizani, M., Rayes, A.: An energy-efficient vm prediction and migration framework for overcommitted clouds. IEEE Transac. Cloud Comput. 6(4), 955–966 (2016)

    Article  Google Scholar 

  17. Kumar, N., Misra, S., Obaidat, M.S.: Collaborative learning automata-based routing for rescue operations in dense urban regions using vehicular sensor networks. IEEE Systems Journal 9(3), 1081–1090 (2014)

    Article  Google Scholar 

  18. Tseng, F.-H., Wang, X., Chou, L.-D., Chao, H.-C., Leung, V.C.: Dynamic resource prediction and allocation for cloud data center using the multiobjective genetic algorithm. IEEE Systems Journal 12(2), 1688–1699 (2017)

    Article  Google Scholar 

  19. Chaudhary, R., Aujla, G.S., Kumar, N., Rodrigues, J.J.: Optimized big data management across multi-cloud data centers: Software-defined-network-based analysis. IEEE Communications Magazine 56(2), 118–126 (2018)

    Article  Google Scholar 

  20. Chen, J., Wang, Y.: A resource demand prediction method based on eemd in cloud computing. Procedia Comput. Sci. 131, 116–123 (2018)

    Article  Google Scholar 

  21. Song, B., Yu, Y., Zhou, Y., Wang, Z., Du, S.: Host load prediction with long short-term memory in cloud computing. J. Supercomput. 74(12), 6554–6568 (2018)

    Article  Google Scholar 

  22. Kumar, N., Rodrigues, J.J., Chilamkurti, N.: Bayesian coalition game as-a-service for content distribution in internet of vehicles. IEEE Internet of Things J. 1(6), 544–555 (2014)

    Article  Google Scholar 

  23. Deepika, T., Prakash, P.: Power consumption prediction in cloud data center using machine learning. Int. J. Electr. Comput. Eng. (IJECE) 10(2), 1524–1532 (2020)

    Article  Google Scholar 

  24. Hieu, N.T., Di Francesco, M., Ylä-Jääski, A.: Virtual machine consolidation with multiple usage prediction for energy-efficient cloud data centers. IEEE Transac. Serv. Comput. 13(1), 186–199 (2017)

    Article  Google Scholar 

  25. Miglani, A., Kumar, N.: Deep learning models for traffic flow prediction in autonomous vehicles: A review, solutions, and challenges. Vehicular Commun. 20, 100184 (2019)

    Article  Google Scholar 

  26. Karim, M.E., Maswood, M.M.S., Das, S., Alharbi, A.G.: Bhyprec: A novel bi-lstm based hybrid recurrent neural network model to predict the cpu workload of cloud virtual machine. IEEE Access 9, 131476–131495 (2021)

    Article  Google Scholar 

  27. Banerjee, A., Dave, R.N.: Validating clusters using the hopkins statistic. In: 2004 IEEE International Conference on Fuzzy Systems (IEEE Cat. No. 04CH37542), vol. 1, pp. 149–153 (2004). IEEE

  28. Ogbuabor, G., Ugwoke, F.: Clustering algorithm for a healthcare dataset using silhouette score value. Int. J. Comput. Sci. Informat. Technol. (IJCSIT) 10(2), 27–37 (2018)

    Google Scholar 

  29. Jupyter Notebook. https://jupyter.org/. Last accessed in December, 2021

Download references

Funding

No funding involved for this research.

Author information

Authors and Affiliations

Authors

Contributions

SG: Conceptualization, Implementation, Software, Validation, Writing - Original Draft. RA: Resources, Writing - Original Draft, Supervision.RS: Data Curation, Visualization, Resources, Supervision. IP: Supervision, Resources.

Corresponding author

Correspondence to Sheetal Garg.

Ethics declarations

Conflict of interest

There is no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Garg, S., Ahuja, R., Singh, R. et al. GMM-LSTM: a component driven resource utilization prediction model leveraging LSTM and gaussian mixture model. Cluster Comput 26, 3547–3563 (2023). https://doi.org/10.1007/s10586-022-03747-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10586-022-03747-4

Keywords

Navigation