Skip to main content
Log in

Fine-grained resource adjustment of edge server in cloud-edge collaborative environment

  • Published:
Cluster Computing Aims and scope Submit manuscript

Abstract

In the cloud-edge collaborative environment, the edge server manager will divide the physical resources based on virtualization technology, so as to deploy multiple applications on the same server. However, due to the imperfect virtualization technology and the complexity and dynamics of the applications deployed on virtual machines (VMs), it is difficult for cloud service providers to evaluate the performance of VMs and thus cannot implement dynamic resource management effectively. To address this problem, this paper proposes an adaptive resource allocation approach. Firstly, we use the profiling tools to collect hardware counters and corresponding performance that reflect the resource usage in real time. Then, we select the data instances that contribute more to the performance prediction based on Gradient-based One Side Sampling (GOSS) to build a VM performance prediction model. When the prediction results indicate the performance cannot meet users’ requirements, we further apply one of the reinforcement learning framework-Deep Deterministic Policy Gradient (DDPG) to optimize the allocation of fine-grained resources. Our proposed method enables adaptive allocation of fine-grained resources in cloud environment, and the extensive experiments demonstrate that the average accuracy of performance prediction by our proposed method surpasses 95%, whereas the metrics derived from the others ranges only between 75 and 97.5%. Furthermore, the average accuracy by our proposed method on the several benchmark applications is 88.4%, gaining a performance improvement of 9.1% compared to the suboptimal baseline.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Algorithm 1
Algorithm 2
Algorithm 3
Fig. 3
Algorithm 4
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Data availability

The data involved in this paper are openly available in a public repository. The data that support the findings of this study are openly available at https://www.kaggle.com/datasets/jiahaoynu/espl-based-dataset.

References

  1. Zhou, Z., Chen, X., Li, E., et al.: Edge intelligence: paving the last mile of artificial intelligence with edge computing [J]. Proc. IEEE 107(8), 1738–1762 (2019)

    Article  Google Scholar 

  2. China academy of information and communications technology, research report on key technologies situation of cloud-edge collaboration. [EB/OL]. http://ww w.caict.ac.cn/ 2023

  3. Abaker, I., Badrul, N.: The rise of “Big Data” on cloud computing: review and open research issues [J]. Inf. Syst. 47, 98–115 (2015)

    Article  Google Scholar 

  4. Keshavarzi, A., Haghighat, A., Bohlouli, M.: Enhanced time-aware QoS prediction in multi-cloud: a hybird k-medoids and lazy learning approach (QoPC) [J]. Computing 102(4), 923–949 (2020)

    Article  MathSciNet  Google Scholar 

  5. Youpeng, Tu., Haiming, C., Linjie, Y.: Edge computing offloading decision-making problem in Internet of Things systems: modeling, solution and classification [J]. Small Microcomput. Syst. 42(10), 2145–2152 (2021)

    Google Scholar 

  6. Hao, J., Yue, K., Zhang, B., et al.: Transfer learning of bayesian network for measuring QoS of virtual machines. Appl. Intell. [J] 51(12), 8641–8660 (2021)

    Article  Google Scholar 

  7. Baek, J., Kaddoum, G.: Heterogeneous task offloading and resource allocations via deep recurrent reinforcement learning in partial observable multifog networks [J]. IEEE Internet Things J. 8(2), 1041–1056 (2021)

    Article  Google Scholar 

  8. Abedi, S., Mostafa, G., Khorami, E., Musa, M.: Dynamic resource allocation using improved firefly optimization algorithm in cloud environment [J]. Appl. Artif. Intell. (2022). https://doi.org/10.1080/08839514.2022.2055394

    Article  Google Scholar 

  9. Ke, G., Meng, Q., Finley, T.: LightGBM: A highly efficient gradient boosting decision tree [C]// conference on neural information processing systems (NIPS), 2017, 3146–3154

  10. Chen, S., GalOn, S., Delimitrou, C., Manne, S., Martinez, J.: Workload characterization of interactive cloud services on big and small server platforms [C]. In: International Symposium on Workload Characterization (IISWC), 2017, 125- 134

  11. Arulkumaran, K., Peter, M., Brundage, M., Bharath, A.: A brief survey of deep reinforcement learning [EB/OL]. arXiv preprint: CoRR abs/1708.05866, 2017

  12. Yan, L., Chen, X., Zhou, J., Chen, Y., Wen, J.: Deep reinforcement learning for continuous electric vehicles charging control with dynamic user behaviors [J]. IEEE Trans. Smart Grid 12(6), 5124–5134 (2021)

    Article  Google Scholar 

  13. Liu, J., Wang, X., Shen, S., et al.: A Bayesian Q-learning game for dependable task offloading against DDoS attacks in sensor edge cloud [J]. IEEE Internet Things J. 8(9), 7546–7561 (2021)

    Article  Google Scholar 

  14. Ramanathan, R., Latha, B.: Towards optimal resource provisioning for Hadoop-MapReduce jobs using scale-out strategy and its performance analysis in private cloud environment [J]. Clust. Comput. 22, 14061–14071 (2019)

    Article  Google Scholar 

  15. Qiu, H., Banerjee, S., Jha, S. et.al.: FIRM: An intelligent fine-grained resource management framework for SLO-oriented microservices [C]. In: Symposium on Operating Systems Design and Implementation (OSDI), 2020, 805-825

  16. Zhang, J., Xu, S., Li, X., Wang, C., Xu, J.: Cloud computing task scheduling based on orthogonal adaptive whale optimization [J]. Comput. Appl. 42(5), 1516–1523 (2022)

    Google Scholar 

  17. Chen, K., Powers, J., Guo, S., Tian, F.: CRESP: towards optimal resource provisioning for MapReduce computing in public clouds. IEEE Trans. Parallel Distrib. Syst. 25(6), 1403–1412 (2014). https://doi.org/10.1109/TPDS.2013.297

    Article  Google Scholar 

  18. Kavitha, J., Thirupathi, K.: Dynamic resource allocation in cloud infrastructure using ant lion-based auto-regression model. Int. J. Commun. Syst. 35(6), e5071 (2022)

    Article  Google Scholar 

  19. Alizadeh, S., Ghaemi, R., Naeen, H.: An autonomous architecture based on reinforcement deep neural network for resource allocation in cloud computing. Computing 106(2), 371–403 (2024)

    Article  Google Scholar 

  20. Chen, Y., Zhao, J., Wu, Y., Huang, J., et al.: QoE-aware decentralized task offloading and resource allocation for end-edge-cloud systems: a game-theoretical approach. IEEE Trans. Mobile Comput. 23(1), 769–784 (2024)

    Article  Google Scholar 

  21. Fan, W., Zhao, L., Liu, X., et al.: Collaborative service placement, task scheduling, and resource allocation for task offloading with edge-cloud cooperation. IEEE Trans. Mob. Comput. 23(1), 238–256 (2024)

    Article  Google Scholar 

  22. Zhang, Y., Hua, W., Zhou, Z., et al.: Sinan: ML-based and QoS-aware resource management for cloud microservices[C]. Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems(ASPLOS), 2021: 167–181

  23. Gan, Y., Zhang, Y., Hu, K., et al.: Seer: Leveraging big data to navigate the complexity of performance debugging in cloud microservices[C]. Proceedings of the 24th International Conference on Architectural Support for Programming Languages and Operating Systems(ASPLOS), 2019: 19–33

  24. Kannan, R.S., Subramanian, L., Raju, A., et al.: Grandslam: Guaranteeing SLAs for jobs in microservices execution frameworks[C]. Proceedings of the 14th EuroSys Conference 2019(EuroSys), 2019: 1–16

  25. Al-lessa, S., Sheibani, R., Veisi, G.: A resource allocation and scheduling model for hierarchical distributed services in cloud environment using game theory. Int. J. Commun. Syst. 35(6), e5075 (2022)

    Article  Google Scholar 

  26. Hua, W., Liu, P., Huang, L.: Energy-efficient resource allocation for heterogeneous edge–cloud computing. IEEE Internet of Things J. 11(2), 2808–2818 (2024)

    Article  Google Scholar 

  27. Habib, A., Khan, M.: Reinforcement learning based autonomic virtual machine management in clouds, 2016 5th International Conference on Informatics, Electronics and Vision, 2016, pp. 1083–1088, https://doi.org/10.1109/ICIEV.2016.7760166

  28. John, I., Sreekantan, A., Bhatnagar, S.: Auto-scaling resources for cloud applications using reinforcement learning, 2019, pp. 1–5

  29. Schuler, L., Jamil, S., Kühl, N.: AI-based Resource Allocation: Reinforcement Learning for Adaptive Auto-scaling in Serverless Environments, 2021 IEEE/ACM 21st International Symposium on Cluster, Cloud and Internet Computing (CCGrid), 2021, pp. 804-811, https://doi.org/10.1109/CCGrid51090.2021.00098

  30. Park, J., Choi, B., Lee, C., et al.: Graf: a graph neural network based proactive resource allocation framework for slo-oriented microservices [C]. In: Proceedings of the 17th International Conference on emerging Networking Experiments and Technologies. 2021: 154-167

  31. Bahreini, T., Badri, H., Grosu, D.: Mechanisms for resource allocation and pricing in mobile edge computing systems. IEEE Trans. Parallel Distrib. Syst. 33(3), 667–682 (2022)

    Article  Google Scholar 

  32. Li, C., Zhu, L., Luo, Y.: Joint edge caching and dynamic service migration in SDN based mobile edge computing. J. Netw. Comput. Appl. 177, 102966 (2021)

    Article  Google Scholar 

  33. Kumar, R., Garg, S., Chan, A., et al.: Deadline-based dynamic resource allocation and provisioning algorithms in fog-cloud environment. Futur. Gener. Comput. Syst. 104, 131–141 (2020)

    Article  Google Scholar 

  34. Yadav, M., Rohit, D., Yadav, D.K.: Maintaining container sustainability through machine learning. Cluster Comput. 24, 3725–3750 (2021)

    Article  Google Scholar 

  35. Belgacem, A., Beghdad, K., Nacer, H., et al.: Efficient dynamic resource allocation method for cloud computing environment. Clust. Comput. 23, 2871–2889 (2020)

    Article  Google Scholar 

  36. Yadav, M., Pal, N., Yadav, D.: Resource provisioning for containerized applications. Clust. Comput. 24, 2819–2840 (2021)

    Article  Google Scholar 

  37. Chouliaras, S., Sotiriadis, S.: An adaptive auto-scaling framework for cloud resource provisioning. Future Gener. Comput. Syst. 148, 173–183 (2023)

    Article  Google Scholar 

Download references

Acknowledgements

We will thank the editor-in-chief and all the reviewers for their patience, and this work is supported by National Natural Science Foundation of China (No.61862068), Youth Project of Applied Basic Research Program of Yunnan Province (NO.202201AU070050), Key Project of Applied Basic Research Program of Yunnan Province (NO. 202201AS070021).

Funding

This research was funded by Youth Project of Applied Basic Research Program of Yunnan Province (Grant No. 202201AU070050), National Natural Science Foundation of China (Grant No. 61862068), Key Project of Applied Basic Research Program of Yunnan Province, (Grant no. 202201AS070021).

Author information

Authors and Affiliations

Authors

Contributions

H.J. and P.Y. authored the main manuscript text, while C.Y. prepared the figures and charts. All authors reviewed the manuscript.

Corresponding author

Correspondence to Jia Hao.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Peng, Y., Hao, J., Chen, Y. et al. Fine-grained resource adjustment of edge server in cloud-edge collaborative environment. Cluster Comput (2024). https://doi.org/10.1007/s10586-024-04380-z

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10586-024-04380-z

Keywords

Navigation