Abstract
Energy consumption in data centers grows rapidly in recent years. As a widely-applied energy-efficient method, workload consolidation also has its own limitations that may bring some negative effects, such as performance degradation, QoS violation, localized hotspots and so on, which is especially true when optimal objectives are inherently conflict. In this paper, we present a power and thermal-aware VM management framework called PTM-ML, which relies on machine learning technique to find optimal host configuration based on workload characteristics and cooling system’s working state. Based on such an optimal host configuration, it then makes VM migration and consolidation decisions by enforcing an efficient load-balancing policy, with aiming at achieving a better trade-off between energy efficiency and performance. The prototype of PTM-ML framework is deployed and evaluated in a real-world cloud data center. Extensive experiments are conducted by using different workload traces with distinctive characteristics, and the results are compared with four similar approaches in terms of total energy consumption, real-time power consumption, average latency and etc. Experimental results show that the proposed PTM-ML outperforms the existing approaches in terms of multiple metrics, and it also exhibits better robustness and adaptability in presence of dynamic workloads.
Similar content being viewed by others
References
Singh, S., Chana, I.: Resource provisioning and scheduling in clouds: QoS perspective. J. Supercomput. 72(3), 926–960 (2016)
Lu, Y., Sun, N.: An effective task scheduling algorithm based on dynamic energy management and efficient resource utilization in green cloud computing environment. Cluster Comput. 22(1), 513–520 (2019)
Abdessamia, F., Zhang, W.-Z., Tian, Y.-C.: Energy-efficiency virtual machine placement based on binary gravitational search algorithm. Cluster Comput. 23(3), 1577–1588 (2020)
Sharma, Y., Si, W., Sun, D., et al.: Failure-aware energy-efficient VM consolidation in cloud computing systems. Future Gener. Comput. Syst. 94, 620–633 (2019)
He, K., Li, Z., Deng, D., et al.: Energy-efficient framework for virtual machine consolidation in cloud data centers. China Commun. 14(10), 192–201 (2017)
Arif, M., Mahmood, T.: Cloud computing and its environmental effects. Int. J. Grid Distrib. Comput. 8(1), 279–286 (2015)
Shehabi, A., Smith, S., Sartor, D., et al.: United states data center energy usage report. Technical Report, Lawrence Berkeley National Laboratory. https://eta.lbl.gov/publications/united-states-data-center-energy (2016)
Brown, R.: Report to congress on server and data center energy efficiency: public law, 109–431. Technical Report, Lawrence Berkeley National Laboratory. https://escholarship.org/uc/item/74g2r0vg (2008)
Le, T.N., Liang, J., Liu, Z., et al.: Optimal energy procurement for geo-distributed data centers in multi-timescale electricity markets. SIGMETRICS 45(3), 185–197 (2018)
Akhter, N., Othman, M.: Energy aware resource allocation of cloud data center: review and open issues. Cluster Comput. 19(3), 1163–1182 (2016)
Aksanli, B.: Data center peak power management with energy storage devices. IEEE Internet Comput. 21(4), 26–33 (2017)
Grange, L., Costa, G.D., Stolf, P.: Green IT scheduling for data center powered with renewable energy. Future Gener. Comput. Syst. 86, 99–120 (2018)
Gaggero, M., Caviglione, L.: Predictive control for energy-aware consolidation in cloud datacenters. IEEE Trans. Control Syst. Technol. 24(2), 461–474 (2016)
Khoshkholghi, M.A., Derahman, M.N., Abdullah, A., et al.: Energy-efficient algorithms for dynamic virtual machine consolidation in cloud data centers. IEEE Access 5, 10709–10722 (2017)
Teng, F., Yu, L., Li, T., et al.: Energy efficiency of VM consolidation in IaaS clouds. J. Supercomput. 73(2), 782–809 (2017)
Yao, J., Guan, H., Luo, J., et al.: Adaptive power management through thermal aware workload balancing in internet data centers. IEEE Trans. Parallel Distrib. Syst. 26(9), 2400–2409 (2015)
Skach, M., Arora, M., Hsu, C.H., et al.: Thermal time shifting: decreasing data center cooling costs with phase-change materials. IEEE Internet Comput. 21(4), 34–43 (2017)
Ilager, S., Ramamohanarao, K., Buyya, R.: ETAS: energy and thermal-aware dynamic virtual machine consolidation in cloud data center with proactive hotspot mitigation. Concurr. Comput. Pract. Exp. 31(17), 1–15 (2019)
Li, X., Garraghan, P., Jiang, X., et al.: Holistic virtual machine scheduling in cloud datacenters towards minimizing total energy. IEEE Trans. Parallel Distrib. Syst. 29(6), 1317–1331 (2018)
Goiri, I., Nguyen, T.D., Bianchini, R.: Temperature- and variation-aware management for free-cooled datacenters. Proceeding International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2015), pp. 1–12. ACM Press, Istanbul (2015)
Oxley, M.A., Jonardi, E., Pasricha, S., et al.: Rate-based thermal, power, and co-location aware resource management for heterogeneous data centers. J. Parallel Distrib Comput. 112, 126–139 (2018)
Al-Qawasmeh, A.M., Pasricha, S., Maciejewski, A.A., et al.: Power and thermal-aware workload allocation in heterogeneous data centers. IEEE Trans. Comput. 64(2), 447–461 (2015)
Damme, T.V., Persis, C.D., Tesi, P.: Optimized thermal-aware job scheduling and control of data centers. IEEE Trans. Control Syst. Technol. 27(2), 760–771 (2019)
Arianyan, E., Taheri, H., Sharifian, S.: Novel energy and SLA efficient resource management heuristics for consolidation of virtual machines in cloud data centers. Comput. Electr. Eng. 47, 222–240 (2015)
Justafort, V.D., Beaubrun, R., Pierre, S.: On the carbon footprint optimization in an InterCloud environment. IEEE Trans. Cloud Comput. 6(3), 829–842 (2018)
Farahnakian, F., Pahikkala, T., Liljeberg, P., et al.: Energy-aware VM consolidation in cloud data centers using utilization prediction model. IEEE Trans. Cloud Comput. 7(2), 524–536 (2019)
Hussain, W., Hussain, F.K., Saberi, M., et al.: Comparing time series with machine learning-based prediction approaches for violation management in cloud SLAs. Future Gener. Comput. Syst. 89, 464–477 (2018)
Zhang, W., Wulan, G., Zhai, J., et al.: An intelligent power distribution service architecture using cloud computing and deep learning techniques. J. Network Comput. Appl. 103, 239–248 (2018)
Abusitta, A., Bellaiche, M., Dagenais, M., et al.: A deep learning approach for proactive multi-cloud cooperative intrusion detection system. Future Gener. Comput. Syst. 98, 308–318 (2019)
Garg, S., Kaur, K., Kumar, N., et al.: A hybrid deep learning-based model for anomaly detection in cloud datacenter networks. IEEE Trans. Netw. Serv. Manag. 16(3), 924–935 (2019)
Oh, J., Kim, Y.: Job placement using reinforcement learning in GPU virtualization environment. Cluster Comput. 23(3), 2219–2234 (2020)
Gong, Y., Li, B., Liang, B., et al.: Chic: experience-driven scheduling in machine learning clusters. Proceeding of International Symposium on Quality of Service (IWQoS 2019), pp. 1–10. ACM Press, Phoenix (2019)
Madireddy, S., Balaprakash, P., Carns, P., et al.: Adaptive learning for concept drift in application performance modeling. Proceeding of International Conference on Parallel Processing (ICPP 2019), pp. 1–11. ACM Press, Kyoto (2019)
Gschwind, M., Kaldewey, T., Tam, D.K.: Optimizing the efficiency of deep learning through accelerator virtualization. IBM J. Res. Dev. 61(4/5), 1–11 (2017)
Kim, Y., Choi, H., Lee, J., et al.: Towards an optimized distributed deep learning framework for a heterogeneous multi-GPU cluster. Cluster Comput. 23(3), 2287–2300 (2020)
Liu, D.: Design and implementation of performance monitor and evaluation middleware in virtual clouds. Int. J. Networking Virtual Organ. 15(1), 48–64 (2015)
Diao, L., Liu, Y., Nan, D., et al.: Units and layer’s effects on deep boltzman machines. Proceedings of the International Conference on Computer Science and Application Engineering (CASE 2019), pp. 1–5. ACM Press, Sanya (2019)
Beloglazov, A., Abawajy, J., Buyya, R.: Energy-aware resource allocation heuristics for efficient management of data centers for cloud computing. Future Gener. Comput. Syst. 28(5), 755–768 (2012)
Acknowledgements
This work is supported by the National Natural Science Foundation of China (No. 61402163), and the Hunan Provincial Natural Science Foundation of China (No. 2020JJ6022).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Xiao, P., Ni, Z., Liu, D. et al. A power and thermal-aware virtual machine management framework based on machine learning. Cluster Comput 24, 2231–2248 (2021). https://doi.org/10.1007/s10586-020-03228-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10586-020-03228-6