Abstract
In the cloud-edge collaborative environment, the edge server manager will divide the physical resources based on virtualization technology, so as to deploy multiple applications on the same server. However, due to the imperfect virtualization technology and the complexity and dynamics of the applications deployed on virtual machines (VMs), it is difficult for cloud service providers to evaluate the performance of VMs and thus cannot implement dynamic resource management effectively. To address this problem, this paper proposes an adaptive resource allocation approach. Firstly, we use the profiling tools to collect hardware counters and corresponding performance that reflect the resource usage in real time. Then, we select the data instances that contribute more to the performance prediction based on Gradient-based One Side Sampling (GOSS) to build a VM performance prediction model. When the prediction results indicate the performance cannot meet users’ requirements, we further apply one of the reinforcement learning framework-Deep Deterministic Policy Gradient (DDPG) to optimize the allocation of fine-grained resources. Our proposed method enables adaptive allocation of fine-grained resources in cloud environment, and the extensive experiments demonstrate that the average accuracy of performance prediction by our proposed method surpasses 95%, whereas the metrics derived from the others ranges only between 75 and 97.5%. Furthermore, the average accuracy by our proposed method on the several benchmark applications is 88.4%, gaining a performance improvement of 9.1% compared to the suboptimal baseline.
Similar content being viewed by others
Data availability
The data involved in this paper are openly available in a public repository. The data that support the findings of this study are openly available at https://www.kaggle.com/datasets/jiahaoynu/espl-based-dataset.
References
Zhou, Z., Chen, X., Li, E., et al.: Edge intelligence: paving the last mile of artificial intelligence with edge computing [J]. Proc. IEEE 107(8), 1738–1762 (2019)
China academy of information and communications technology, research report on key technologies situation of cloud-edge collaboration. [EB/OL]. http://ww w.caict.ac.cn/ 2023
Abaker, I., Badrul, N.: The rise of “Big Data” on cloud computing: review and open research issues [J]. Inf. Syst. 47, 98–115 (2015)
Keshavarzi, A., Haghighat, A., Bohlouli, M.: Enhanced time-aware QoS prediction in multi-cloud: a hybird k-medoids and lazy learning approach (QoPC) [J]. Computing 102(4), 923–949 (2020)
Youpeng, Tu., Haiming, C., Linjie, Y.: Edge computing offloading decision-making problem in Internet of Things systems: modeling, solution and classification [J]. Small Microcomput. Syst. 42(10), 2145–2152 (2021)
Hao, J., Yue, K., Zhang, B., et al.: Transfer learning of bayesian network for measuring QoS of virtual machines. Appl. Intell. [J] 51(12), 8641–8660 (2021)
Baek, J., Kaddoum, G.: Heterogeneous task offloading and resource allocations via deep recurrent reinforcement learning in partial observable multifog networks [J]. IEEE Internet Things J. 8(2), 1041–1056 (2021)
Abedi, S., Mostafa, G., Khorami, E., Musa, M.: Dynamic resource allocation using improved firefly optimization algorithm in cloud environment [J]. Appl. Artif. Intell. (2022). https://doi.org/10.1080/08839514.2022.2055394
Ke, G., Meng, Q., Finley, T.: LightGBM: A highly efficient gradient boosting decision tree [C]// conference on neural information processing systems (NIPS), 2017, 3146–3154
Chen, S., GalOn, S., Delimitrou, C., Manne, S., Martinez, J.: Workload characterization of interactive cloud services on big and small server platforms [C]. In: International Symposium on Workload Characterization (IISWC), 2017, 125- 134
Arulkumaran, K., Peter, M., Brundage, M., Bharath, A.: A brief survey of deep reinforcement learning [EB/OL]. arXiv preprint: CoRR abs/1708.05866, 2017
Yan, L., Chen, X., Zhou, J., Chen, Y., Wen, J.: Deep reinforcement learning for continuous electric vehicles charging control with dynamic user behaviors [J]. IEEE Trans. Smart Grid 12(6), 5124–5134 (2021)
Liu, J., Wang, X., Shen, S., et al.: A Bayesian Q-learning game for dependable task offloading against DDoS attacks in sensor edge cloud [J]. IEEE Internet Things J. 8(9), 7546–7561 (2021)
Ramanathan, R., Latha, B.: Towards optimal resource provisioning for Hadoop-MapReduce jobs using scale-out strategy and its performance analysis in private cloud environment [J]. Clust. Comput. 22, 14061–14071 (2019)
Qiu, H., Banerjee, S., Jha, S. et.al.: FIRM: An intelligent fine-grained resource management framework for SLO-oriented microservices [C]. In: Symposium on Operating Systems Design and Implementation (OSDI), 2020, 805-825
Zhang, J., Xu, S., Li, X., Wang, C., Xu, J.: Cloud computing task scheduling based on orthogonal adaptive whale optimization [J]. Comput. Appl. 42(5), 1516–1523 (2022)
Chen, K., Powers, J., Guo, S., Tian, F.: CRESP: towards optimal resource provisioning for MapReduce computing in public clouds. IEEE Trans. Parallel Distrib. Syst. 25(6), 1403–1412 (2014). https://doi.org/10.1109/TPDS.2013.297
Kavitha, J., Thirupathi, K.: Dynamic resource allocation in cloud infrastructure using ant lion-based auto-regression model. Int. J. Commun. Syst. 35(6), e5071 (2022)
Alizadeh, S., Ghaemi, R., Naeen, H.: An autonomous architecture based on reinforcement deep neural network for resource allocation in cloud computing. Computing 106(2), 371–403 (2024)
Chen, Y., Zhao, J., Wu, Y., Huang, J., et al.: QoE-aware decentralized task offloading and resource allocation for end-edge-cloud systems: a game-theoretical approach. IEEE Trans. Mobile Comput. 23(1), 769–784 (2024)
Fan, W., Zhao, L., Liu, X., et al.: Collaborative service placement, task scheduling, and resource allocation for task offloading with edge-cloud cooperation. IEEE Trans. Mob. Comput. 23(1), 238–256 (2024)
Zhang, Y., Hua, W., Zhou, Z., et al.: Sinan: ML-based and QoS-aware resource management for cloud microservices[C]. Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems(ASPLOS), 2021: 167–181
Gan, Y., Zhang, Y., Hu, K., et al.: Seer: Leveraging big data to navigate the complexity of performance debugging in cloud microservices[C]. Proceedings of the 24th International Conference on Architectural Support for Programming Languages and Operating Systems(ASPLOS), 2019: 19–33
Kannan, R.S., Subramanian, L., Raju, A., et al.: Grandslam: Guaranteeing SLAs for jobs in microservices execution frameworks[C]. Proceedings of the 14th EuroSys Conference 2019(EuroSys), 2019: 1–16
Al-lessa, S., Sheibani, R., Veisi, G.: A resource allocation and scheduling model for hierarchical distributed services in cloud environment using game theory. Int. J. Commun. Syst. 35(6), e5075 (2022)
Hua, W., Liu, P., Huang, L.: Energy-efficient resource allocation for heterogeneous edge–cloud computing. IEEE Internet of Things J. 11(2), 2808–2818 (2024)
Habib, A., Khan, M.: Reinforcement learning based autonomic virtual machine management in clouds, 2016 5th International Conference on Informatics, Electronics and Vision, 2016, pp. 1083–1088, https://doi.org/10.1109/ICIEV.2016.7760166
John, I., Sreekantan, A., Bhatnagar, S.: Auto-scaling resources for cloud applications using reinforcement learning, 2019, pp. 1–5
Schuler, L., Jamil, S., Kühl, N.: AI-based Resource Allocation: Reinforcement Learning for Adaptive Auto-scaling in Serverless Environments, 2021 IEEE/ACM 21st International Symposium on Cluster, Cloud and Internet Computing (CCGrid), 2021, pp. 804-811, https://doi.org/10.1109/CCGrid51090.2021.00098
Park, J., Choi, B., Lee, C., et al.: Graf: a graph neural network based proactive resource allocation framework for slo-oriented microservices [C]. In: Proceedings of the 17th International Conference on emerging Networking Experiments and Technologies. 2021: 154-167
Bahreini, T., Badri, H., Grosu, D.: Mechanisms for resource allocation and pricing in mobile edge computing systems. IEEE Trans. Parallel Distrib. Syst. 33(3), 667–682 (2022)
Li, C., Zhu, L., Luo, Y.: Joint edge caching and dynamic service migration in SDN based mobile edge computing. J. Netw. Comput. Appl. 177, 102966 (2021)
Kumar, R., Garg, S., Chan, A., et al.: Deadline-based dynamic resource allocation and provisioning algorithms in fog-cloud environment. Futur. Gener. Comput. Syst. 104, 131–141 (2020)
Yadav, M., Rohit, D., Yadav, D.K.: Maintaining container sustainability through machine learning. Cluster Comput. 24, 3725–3750 (2021)
Belgacem, A., Beghdad, K., Nacer, H., et al.: Efficient dynamic resource allocation method for cloud computing environment. Clust. Comput. 23, 2871–2889 (2020)
Yadav, M., Pal, N., Yadav, D.: Resource provisioning for containerized applications. Clust. Comput. 24, 2819–2840 (2021)
Chouliaras, S., Sotiriadis, S.: An adaptive auto-scaling framework for cloud resource provisioning. Future Gener. Comput. Syst. 148, 173–183 (2023)
Acknowledgements
We will thank the editor-in-chief and all the reviewers for their patience, and this work is supported by National Natural Science Foundation of China (No.61862068), Youth Project of Applied Basic Research Program of Yunnan Province (NO.202201AU070050), Key Project of Applied Basic Research Program of Yunnan Province (NO. 202201AS070021).
Funding
This research was funded by Youth Project of Applied Basic Research Program of Yunnan Province (Grant No. 202201AU070050), National Natural Science Foundation of China (Grant No. 61862068), Key Project of Applied Basic Research Program of Yunnan Province, (Grant no. 202201AS070021).
Author information
Authors and Affiliations
Contributions
H.J. and P.Y. authored the main manuscript text, while C.Y. prepared the figures and charts. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Peng, Y., Hao, J., Chen, Y. et al. Fine-grained resource adjustment of edge server in cloud-edge collaborative environment. Cluster Comput (2024). https://doi.org/10.1007/s10586-024-04380-z
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10586-024-04380-z