Fine-grained resource adjustment of edge server in cloud-edge collaborative environment

Peng, Yu; Hao, Jia; Chen, Yang; Gan, Jianhou

doi:10.1007/s10586-024-04380-z

Fine-grained resource adjustment of edge server in cloud-edge collaborative environment

Published: 27 March 2024

(2024)
Cite this article

Cluster Computing Aims and scope Submit manuscript

Yu Peng^1,2,
Jia Hao ORCID: orcid.org/0000-0001-8099-9771^1,2,
Yang Chen^1,2 &
…
Jianhou Gan^1,2

66 Accesses
Explore all metrics

Abstract

In the cloud-edge collaborative environment, the edge server manager will divide the physical resources based on virtualization technology, so as to deploy multiple applications on the same server. However, due to the imperfect virtualization technology and the complexity and dynamics of the applications deployed on virtual machines (VMs), it is difficult for cloud service providers to evaluate the performance of VMs and thus cannot implement dynamic resource management effectively. To address this problem, this paper proposes an adaptive resource allocation approach. Firstly, we use the profiling tools to collect hardware counters and corresponding performance that reflect the resource usage in real time. Then, we select the data instances that contribute more to the performance prediction based on Gradient-based One Side Sampling (GOSS) to build a VM performance prediction model. When the prediction results indicate the performance cannot meet users’ requirements, we further apply one of the reinforcement learning framework-Deep Deterministic Policy Gradient (DDPG) to optimize the allocation of fine-grained resources. Our proposed method enables adaptive allocation of fine-grained resources in cloud environment, and the extensive experiments demonstrate that the average accuracy of performance prediction by our proposed method surpasses 95%, whereas the metrics derived from the others ranges only between 75 and 97.5%. Furthermore, the average accuracy by our proposed method on the several benchmark applications is 88.4%, gaining a performance improvement of 9.1% compared to the suboptimal baseline.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Energy efficiency in cloud computing data centers: a survey on software technologies

Article 30 August 2022

Efficient Virtual Machine Placement Strategy Based on Enhanced Genetic Approach

Article 20 April 2024

Edge artificial intelligence for big data: a systematic review

Article 16 April 2024

Data availability

The data involved in this paper are openly available in a public repository. The data that support the findings of this study are openly available at https://www.kaggle.com/datasets/jiahaoynu/espl-based-dataset.

References

Zhou, Z., Chen, X., Li, E., et al.: Edge intelligence: paving the last mile of artificial intelligence with edge computing [J]. Proc. IEEE 107(8), 1738–1762 (2019)
Article Google Scholar
China academy of information and communications technology, research report on key technologies situation of cloud-edge collaboration. [EB/OL]. http://ww w.caict.ac.cn/ 2023
Abaker, I., Badrul, N.: The rise of “Big Data” on cloud computing: review and open research issues [J]. Inf. Syst. 47, 98–115 (2015)
Article Google Scholar
Keshavarzi, A., Haghighat, A., Bohlouli, M.: Enhanced time-aware QoS prediction in multi-cloud: a hybird k-medoids and lazy learning approach (QoPC) [J]. Computing 102(4), 923–949 (2020)
Article MathSciNet Google Scholar
Youpeng, Tu., Haiming, C., Linjie, Y.: Edge computing offloading decision-making problem in Internet of Things systems: modeling, solution and classification [J]. Small Microcomput. Syst. 42(10), 2145–2152 (2021)
Google Scholar
Hao, J., Yue, K., Zhang, B., et al.: Transfer learning of bayesian network for measuring QoS of virtual machines. Appl. Intell. [J] 51(12), 8641–8660 (2021)
Article Google Scholar
Baek, J., Kaddoum, G.: Heterogeneous task offloading and resource allocations via deep recurrent reinforcement learning in partial observable multifog networks [J]. IEEE Internet Things J. 8(2), 1041–1056 (2021)
Article Google Scholar
Abedi, S., Mostafa, G., Khorami, E., Musa, M.: Dynamic resource allocation using improved firefly optimization algorithm in cloud environment [J]. Appl. Artif. Intell. (2022). https://doi.org/10.1080/08839514.2022.2055394
Article Google Scholar
Ke, G., Meng, Q., Finley, T.: LightGBM: A highly efficient gradient boosting decision tree [C]// conference on neural information processing systems (NIPS), 2017, 3146–3154
Chen, S., GalOn, S., Delimitrou, C., Manne, S., Martinez, J.: Workload characterization of interactive cloud services on big and small server platforms [C]. In: International Symposium on Workload Characterization (IISWC), 2017, 125- 134
Arulkumaran, K., Peter, M., Brundage, M., Bharath, A.: A brief survey of deep reinforcement learning [EB/OL]. arXiv preprint: CoRR abs/1708.05866, 2017
Yan, L., Chen, X., Zhou, J., Chen, Y., Wen, J.: Deep reinforcement learning for continuous electric vehicles charging control with dynamic user behaviors [J]. IEEE Trans. Smart Grid 12(6), 5124–5134 (2021)
Article Google Scholar
Liu, J., Wang, X., Shen, S., et al.: A Bayesian Q-learning game for dependable task offloading against DDoS attacks in sensor edge cloud [J]. IEEE Internet Things J. 8(9), 7546–7561 (2021)
Article Google Scholar
Ramanathan, R., Latha, B.: Towards optimal resource provisioning for Hadoop-MapReduce jobs using scale-out strategy and its performance analysis in private cloud environment [J]. Clust. Comput. 22, 14061–14071 (2019)
Article Google Scholar
Qiu, H., Banerjee, S., Jha, S. et.al.: FIRM: An intelligent fine-grained resource management framework for SLO-oriented microservices [C]. In: Symposium on Operating Systems Design and Implementation (OSDI), 2020, 805-825
Zhang, J., Xu, S., Li, X., Wang, C., Xu, J.: Cloud computing task scheduling based on orthogonal adaptive whale optimization [J]. Comput. Appl. 42(5), 1516–1523 (2022)
Google Scholar
Chen, K., Powers, J., Guo, S., Tian, F.: CRESP: towards optimal resource provisioning for MapReduce computing in public clouds. IEEE Trans. Parallel Distrib. Syst. 25(6), 1403–1412 (2014). https://doi.org/10.1109/TPDS.2013.297
Article Google Scholar
Kavitha, J., Thirupathi, K.: Dynamic resource allocation in cloud infrastructure using ant lion-based auto-regression model. Int. J. Commun. Syst. 35(6), e5071 (2022)
Article Google Scholar
Alizadeh, S., Ghaemi, R., Naeen, H.: An autonomous architecture based on reinforcement deep neural network for resource allocation in cloud computing. Computing 106(2), 371–403 (2024)
Article Google Scholar
Chen, Y., Zhao, J., Wu, Y., Huang, J., et al.: QoE-aware decentralized task offloading and resource allocation for end-edge-cloud systems: a game-theoretical approach. IEEE Trans. Mobile Comput. 23(1), 769–784 (2024)
Article Google Scholar
Fan, W., Zhao, L., Liu, X., et al.: Collaborative service placement, task scheduling, and resource allocation for task offloading with edge-cloud cooperation. IEEE Trans. Mob. Comput. 23(1), 238–256 (2024)
Article Google Scholar
Zhang, Y., Hua, W., Zhou, Z., et al.: Sinan: ML-based and QoS-aware resource management for cloud microservices[C]. Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems(ASPLOS), 2021: 167–181
Gan, Y., Zhang, Y., Hu, K., et al.: Seer: Leveraging big data to navigate the complexity of performance debugging in cloud microservices[C]. Proceedings of the 24th International Conference on Architectural Support for Programming Languages and Operating Systems(ASPLOS), 2019: 19–33
Kannan, R.S., Subramanian, L., Raju, A., et al.: Grandslam: Guaranteeing SLAs for jobs in microservices execution frameworks[C]. Proceedings of the 14th EuroSys Conference 2019(EuroSys), 2019: 1–16
Al-lessa, S., Sheibani, R., Veisi, G.: A resource allocation and scheduling model for hierarchical distributed services in cloud environment using game theory. Int. J. Commun. Syst. 35(6), e5075 (2022)
Article Google Scholar
Hua, W., Liu, P., Huang, L.: Energy-efficient resource allocation for heterogeneous edge–cloud computing. IEEE Internet of Things J. 11(2), 2808–2818 (2024)
Article Google Scholar
Habib, A., Khan, M.: Reinforcement learning based autonomic virtual machine management in clouds, 2016 5th International Conference on Informatics, Electronics and Vision, 2016, pp. 1083–1088, https://doi.org/10.1109/ICIEV.2016.7760166
John, I., Sreekantan, A., Bhatnagar, S.: Auto-scaling resources for cloud applications using reinforcement learning, 2019, pp. 1–5
Schuler, L., Jamil, S., Kühl, N.: AI-based Resource Allocation: Reinforcement Learning for Adaptive Auto-scaling in Serverless Environments, 2021 IEEE/ACM 21st International Symposium on Cluster, Cloud and Internet Computing (CCGrid), 2021, pp. 804-811, https://doi.org/10.1109/CCGrid51090.2021.00098
Park, J., Choi, B., Lee, C., et al.: Graf: a graph neural network based proactive resource allocation framework for slo-oriented microservices [C]. In: Proceedings of the 17th International Conference on emerging Networking Experiments and Technologies. 2021: 154-167
Bahreini, T., Badri, H., Grosu, D.: Mechanisms for resource allocation and pricing in mobile edge computing systems. IEEE Trans. Parallel Distrib. Syst. 33(3), 667–682 (2022)
Article Google Scholar
Li, C., Zhu, L., Luo, Y.: Joint edge caching and dynamic service migration in SDN based mobile edge computing. J. Netw. Comput. Appl. 177, 102966 (2021)
Article Google Scholar
Kumar, R., Garg, S., Chan, A., et al.: Deadline-based dynamic resource allocation and provisioning algorithms in fog-cloud environment. Futur. Gener. Comput. Syst. 104, 131–141 (2020)
Article Google Scholar
Yadav, M., Rohit, D., Yadav, D.K.: Maintaining container sustainability through machine learning. Cluster Comput. 24, 3725–3750 (2021)
Article Google Scholar
Belgacem, A., Beghdad, K., Nacer, H., et al.: Efficient dynamic resource allocation method for cloud computing environment. Clust. Comput. 23, 2871–2889 (2020)
Article Google Scholar
Yadav, M., Pal, N., Yadav, D.: Resource provisioning for containerized applications. Clust. Comput. 24, 2819–2840 (2021)
Article Google Scholar
Chouliaras, S., Sotiriadis, S.: An adaptive auto-scaling framework for cloud resource provisioning. Future Gener. Comput. Syst. 148, 173–183 (2023)
Article Google Scholar

Download references

Acknowledgements

We will thank the editor-in-chief and all the reviewers for their patience, and this work is supported by National Natural Science Foundation of China (No.61862068), Youth Project of Applied Basic Research Program of Yunnan Province (NO.202201AU070050), Key Project of Applied Basic Research Program of Yunnan Province (NO. 202201AS070021).

Funding

This research was funded by Youth Project of Applied Basic Research Program of Yunnan Province (Grant No. 202201AU070050), National Natural Science Foundation of China (Grant No. 61862068), Key Project of Applied Basic Research Program of Yunnan Province, (Grant no. 202201AS070021).

Author information

Authors and Affiliations

Key Laboratory of Education Informatization for Nationalities, Ministry of Education, Yunnan Normal University, Kunming, 650500, China
Yu Peng, Jia Hao, Yang Chen & Jianhou Gan
Yunnan Key Laboratory of Smart Education, Yunnan Normal University, Kunming, 650500, China
Yu Peng, Jia Hao, Yang Chen & Jianhou Gan

Authors

Yu Peng
View author publications
You can also search for this author in PubMed Google Scholar
Jia Hao
View author publications
You can also search for this author in PubMed Google Scholar
Yang Chen
View author publications
You can also search for this author in PubMed Google Scholar
Jianhou Gan
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.J. and P.Y. authored the main manuscript text, while C.Y. prepared the figures and charts. All authors reviewed the manuscript.

Corresponding author

Correspondence to Jia Hao.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Peng, Y., Hao, J., Chen, Y. et al. Fine-grained resource adjustment of edge server in cloud-edge collaborative environment. Cluster Comput (2024). https://doi.org/10.1007/s10586-024-04380-z

Download citation

Received: 29 December 2023
Revised: 21 February 2024
Accepted: 22 February 2024
Published: 27 March 2024
DOI: https://doi.org/10.1007/s10586-024-04380-z

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fine-grained resource adjustment of edge server in cloud-edge collaborative environment

Abstract

Access this article

Similar content being viewed by others

Energy efficiency in cloud computing data centers: a survey on software technologies

Efficient Virtual Machine Placement Strategy Based on Enhanced Genetic Approach

Edge artificial intelligence for big data: a systematic review

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Fine-grained resource adjustment of edge server in cloud-edge collaborative environment

Abstract

Access this article

Similar content being viewed by others

Energy efficiency in cloud computing data centers: a survey on software technologies

Efficient Virtual Machine Placement Strategy Based on Enhanced Genetic Approach

Edge artificial intelligence for big data: a systematic review

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation