Abstract
Future applications to be supported by 6G networks are envisaged to be realized by loosely-coupled and independent microservices. In order to achieve an optimal deployment of applications, smart resource management strategies will be required, working in a cost-effective and resource-efficient manner. Current cloud computing services are challenged to meet the explosive growth and demand of future use cases such as virtual/augmented/mixed reality (VR/AR/MR). The purpose of edge computing (EC) is to better address latency and transmission requirements of those future stringent applications. However, a high flexibility and a rapid decision-making will be required since EC suffers from limited resources availability. For this reason, this work proposes an artificial intelligence (AI) technique, based on reinforcement learning (RL), to make intelligent decisions on the optimal tier and edge-site selection to serve any request according to the application’s category, constraints, and conflicting costs. In addition, when deployed at the edge-network, a heuristic has been proposed for the mapping of microservices within the selected edge-site. That heuristic will exploit a ranking methodology based on the network topology and available network and compute resources while preserving the revenue of the mobile network operator (MNO). Simulation results show that the performance of the proposed RL approach is close to the optimal solution by reaching the cost minimization objective within a 8.3% margin; moreover, RL outperforms considered benchmark algorithms in most of the conducted experiments.
Similar content being viewed by others
Data availability
The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.
References
Luévano E, de Lara EL, Castro JE (2015) Use of telepresence and holographic projection mobile device for Colege degree level. Proc Comput Sci 75:339–347
Clemm A, Vega MT, Ravuri HK, Wauters T, De Turck F (2020) Toward truly immersive holographic-type communication: challenges and solutions. IEEE Commun Magaz 58(1):93–99
Yastrebova A, Kirichek R, Koucheryavy Y, Borodin A, Koucheryavy A (2018) Future networks 2030: Architecture & requirements, In: 2018 10th International Congress on Ultra Modern Telecommunications and Control Systems and Workshops (ICUMT), IEEE, pp. 1–8
Huang C, Hu S, Alexandropoulos GC, Zappone A, Yuen C, Zhang R, Di Renzo M, Debbah M (2020) Holographic mimo surfaces for 6g wireless networks: Opportunities, challenges, and trends. IEEE Wireless Commun 27(5):118–125
Liu G, Huang Y, Li N, Dong J, Jin J, Wang Q, Li N (2020) Vision, requirements and network architecture of 6g mobile network beyond 2030. China Commun 17(9):92–104
Srinivas J, Reddy KVS, Qyser AM (2012) Cloud computing basics. Int J Adv Res Comput Commun Eng 1(5):343–347
Merluzzi M, Di Lorenzo P, Barbarossa S, Frascolla V (2020) Dynamic computation offloading in multi-access edge computing via ultra-reliable and low-latency communications. IEEE Trans Signal Inf Process Over Netw 6:342–356
Chakareski J, Gupta S (2020) Multi-connectivity and edge computing for ultra-low-latency lifelike virtual reality, In: 2020 IEEE International Conference on Multimedia and Expo (ICME), IEEE, pp. 1–6
Karimi A, Pedersen Km, Mahmood NH, Berardinelli G, Mogensen P (2020) On Teh multiplexing of data and metadata for ultra-reliable low-latency communications in 5g. IEEE Trans Vehicul Technol 69(10):12136–12147
Deng M, Tian H, Fan B (2016) Fine-granularity based application offloading policy in cloud-enhanced small cell networks, In: 2016 IEEE International Conference on Communications Workshops (ICC), IEEE, pp. 638–643
FG-NET2030 I (2019) New services and capabilities for network 2030: description, technical gap and performance target analysis, FG-NET2030 document NET2030-O-027
Clemm A, Zhani MF, Boutaba R (2020) Network management 2030: operations and control of network 2030 services. J Netw Syst Manage 28(4):721–750
Cuervo E, Balasubramanian A, Cho D-k, Wolman A, Saroiu S, Chandra R, Bahl P (2010) Maui: making smartphones last longer with code offload, In: Proceedings of teh 8th international conference on Mobile systems, applications, and services, pp. 49–62
De Maio V, Brandic I (2018) First hop mobile offloading of dag computations. 2018 18th IEEE/ACM International Symposium on Cluster. Cloud and Grid Computing (CCGRID), IEEE, pp 83–92
Avasalcai C, Tsigkanos C, Dustdar S (2021) Resource management for latency-sensitive IoT applications WiF satisfiability. IEEE Trans Serv Comput 15(5):2982–2993
Hassan SR, Ahmad I, Rehman AU, Hussen S, Hamam H (2022) Design of resource-aware load allocation for heterogeneous fog computing environments. Wireless Commun Mobile Comput 2022:1–11
Lefèvre S, Bajcsy R, Laugier C (2013) Probabilistic decision making for collision avoidance systems: Postponing decisions, In: 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, IEEE, pp. 4370–4375
Rauschnabel PA (2021) Augmented reality is eating Teh real-world! Teh substitution of physical products by holograms. Int J Inf Manage 57:102279
Lu L, Wang H, Liu P, Liu R, Zhang J, Xie Y, Liu S, Huo T, Xie M, Wu X, et al (2022) Applications of mixed reality technology in orthopedics surgery: A pilot study, Frontiers in Bioengineering and Biotechnology 10
Jia M, Cao J, Liang W (2015) Optimal cloudlet placement and user to cloudlet allocation in wireless metropolitan area networks. IEEE Trans Cloud Comput 5(4):725–737
Li B, He Q, Cui G, Xia X, Chen F, Jin H, Yang Y (2020) Read: Robustness-oriented edge application deployment in edge computing environment. IEEE Trans Serv Comput 15(3):1746–1759
Chen Q, Wang Z, Leng J, Li C, Zheng W, Guo M (2019) Avalon: towards qos awareness and improved utilization through multi-resource management in datacenters, In: Proceedings of the ACM International Conference on Supercomputing, pp. 272–283
Zhang W, Cui W, Fu K, Chen Q, Mawhirter D E, Wu B, Li C, Guo M (2019) Laius: Towards latency awareness and improved utilization of spatial multitasking accelerators in datacenters, In: Proceedings of teh ACM international conference on supercomputing, pp. 58–68
Zhou X, Peng X, Xie T, Sun J, Ji C, Li W, Ding D (2018) Fault analysis and debugging of microservice systems: industrial survey, benchmark system, and empirical study. IEEE Trans Softw Eng 47(2):243–260
Filip I-D, Pop F, Serbanescu C, Choi C (2018) Microservices scheduling model over heterogeneous cloud-edge environments as support for IoT applications. IEEE Int Things J 5(4):2672–2681
Yang H, Chen Q, Riaz M, Luan Z, Tang L, Mars J (2017) Powerchief: Intelligent power allocation for multi-stage applications to improve responsiveness on power constrained cmp, In: Proceedings of teh 44th Annual International Symposium on Computer Architecture, pp. 133–146
Patel T, Tiwari D (2020) Clite: Efficient and qos-aware co-location of multiple latency-critical jobs for warehouse scale computers, In: 2020 IEEE International Symposium on High Performance Computer Architecture (HPCA), IEEE, pp. 193–206
Hou X, Li C, Liu J, Zhang L, Ren S, Leng J, Chen Q, Guo M (2021) Alphar: Learning-powered resource management for irregular, dynamic microservice graph, In: 2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS), IEEE, pp. 797–806
Liu J, Zhang Q (2019) Reliability and latency aware code-partitioning offloading in mobile edge computing, In: 2019 IEEE Wireless Communications and Networking Conference (WCNC), IEEE, pp. 1–7
Gan Y, Zhang Y, Cheng D, Shetty A, Rathi P, Katarki N, Bruno A, Hu J, Ritchken B, Jackson B et al (2019) An open-source benchmark suite for microservices and their hardware-software implications for cloud & edge systems, In: Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, pp. 3–18
Park S, Kim H (2022) Dagmap: Multi-drone slam via a dag-based distributed ledger. Drones 6(2):34
Callieri M, Ponchio F, Cignoni P, Scopigno R (2008) Virtual inspector: a flexible visualizer for dense 3d scanned models. IEEE Comput Graph Appl 28(1):44–54
Chaturvedi V, Singh A K, Zhang W, Srikanthan T (2014) Thermal-aware task scheduling for peak temperature minimization under periodic constraint for 3d-mpsocs, In: 2014 25nd IEEE International Symposium on Rapid System Prototyping, IEEE, pp. 107–113
Kwan A, Wong J, Jacobsen H-A, Muthusamy V (2019) Hyscale: Hybrid and network scaling of dockerized microservices in cloud data centres, In: 2019 IEEE 39th International Conference on Distributed Computing Systems (ICDCS), IEEE, pp. 80–90
Gias A U, Casale G, Woodside M (2019) Atom: Model-driven autoscaling for microservices, In: 2019 IEEE 39th International Conference on Distributed Computing Systems (ICDCS), IEEE, pp. 1994–2004
Bao L, Wu C, Bu X, Ren N, Shen M (2019) Performance modeling and workflow scheduling of microservice-based applications in clouds. IEEE Trans Parallel Distribut Syst 30(9):2114–2129
Liang Y, Ge J, Zhang S, Wu J, Pan L, Zhang T, Luo B (2019) Interaction-oriented service entity placement in edge computing. IEEE Trans Mobile Comput 20(3):1064–1075
Mnih V, Kavukcuoglu K, Silver D, Rusu AA, Veness J, Bellemare MG, Graves A, Riedmiller M, Fidjeland AK, Ostrovski G et al (2015) Human-level control through deep reinforcement learning. Nature 518(7540):529–533
Gan Y, Liang M, Dev S, Lo D, Delimitrou C (2021) Sage: practical and scalable ml-driven performance debugging in microservices, In: Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, pp. 135–151
Gan Y, Zhang Y, Hu K, Cheng D, He Y, Pancholi M, Delimitrou C (2019) Seer: Leveraging big data to navigate teh complexity of performance debugging in cloud microservices, In: Proceedings of teh twenty-fourth international conference on architectural support for programming languages and operating systems, pp. 19–33
Zhang Y, Hua W, Zhou Z, Suh G E, Delimitrou C (2021) Sinan: Ml-based and qos-aware resource management for cloud microservices, In: Proceedings of teh 26th ACM international conference on architectural support for programming languages and operating systems, pp. 167–181
Wu Q, Wu Z, Zhuang Y, Cheng Y, (2018) Adaptive dag tasks scheduling wif deep reinforcement learning, In: International Conference on Algorithms and Architectures for Parallel Processing, Springer, pp. 477–490
Chen Y, Deng S, Zhao H, He Q, Li Y, Gao H (2019) Data-intensive application deployment at edge: A deep reinforcement learning approach, In: 2019 IEEE International Conference on Web Services (ICWS), IEEE, pp. 355–359
Wang S, Urgaonkar R, Zafer M, He T, Chan K, Leung K K (2015) Dynamic service migration in mobile edge-clouds, In: 2015 IFIP networking conference (IFIP networking), IEEE, pp. 1–9
Wang S, Guo Y, Zhang N, Yang P, Zhou A, Shen X (2019) Delay-aware microservice coordination in mobile edge computing: a reinforcement learning approach. IEEE Trans Mobile Comput 20(3):939–951
Kannan RS, Subramanian L, Raju A, Ahn J, Mars J, Tang L (2019) Grandslam: guaranteeing slas for jobs in microservices execution frameworks. Proc Fourteenth EuroSys Conf 2019:1–16
Kibalya G, Serrat J, Gorricho J-L, Bujjingo D G, Sserugunda J, Zhang P (2021) A reinforcement learning approach for placement of stateful virtualized network functions, In: 2021 IFIP/IEEE International Symposium on Integrated Network Management (IM), IEEE, pp. 672–676
Kibalya G, Serrat J, Gorricho J-L, Okello D, Zhang P (2020) A deep reinforcement learning-based algorithm for reliability-aware multi-domain service deployment in smart ecosystems, Neural Computing and Applications 1–23
Cao B, Zhang L, Li Y, Feng D, Cao W (2019) Intelligent offloading in multi-access edge computing: a state-of-the-art review and framework. IEEE Commun Magaz 57(3):56–62
Kibalya G, Serrat J, Gorricho J-L, Pasquini R, Yao H, Zhang P (2019) A reinforcement learning based approach for 5g network slicing across multiple domains, In: 2019 15th International Conference on Network and Service Management (CNSM), IEEE, pp. 1–5
Mao Y, You C, Zhang J, Huang K, Letaief KB (2017) A survey on mobile edge computing: the communication perspective. IEEE Commun Surveys Tutor 19(4):2322–2358
Maier M (2021) 6g as if people mattered: From industry 4.0 toward society 5.0, In: 2021 International Conference on Computer Communications and Networks (ICCCN), IEEE, pp. 1–10
Giordani M, Polese M, Mezzavilla M, Rangan S, Zorzi M (2020) Toward 6g networks: use cases and technologies. IEEE Commun Magaz 58(3):55–61
Zheng T, Wan J, Zhang J, Jiang C (2022) Deep reinforcement learning-based workload scheduling for edge computing. J Cloud Comput 11(1):3
Gu L, Chen Z, Xu H, Zeng D, Li B, Jin H (2022) Layer-aware collaborative microservice deployment toward maximal edge throughput, In: IEEE INFOCOM 2022-IEEE Conference on Computer Communications, IEEE, pp. 71–79
Amiri R, Mehrpouyan H, Fridman L, Mallik R K, Nallanathan A, Matolak D (2018) A machine learning approach for power allocation in hetnets considering qos, In: 2018 IEEE international conference on communications (ICC), IEEE, pp. 1–7
Wang J, Hu J, Min G, Zomaya AY, Georgalas N (2020) Fast adaptive task offloading in edge computing based on meta reinforcement learning. IEEE Trans Parallel Distribut Syst 32(1):242–253
Toyoda H, Nishimura S, Okuno M, Yamaoka R, Nishi H (2005) A 100-gb-ethernet subsystem for next-generation metro-area network, In: IEEE International Conference on Communications, 2005. ICC 2005. 2005, Vol. 2, pp. 1036–1042 Vol. 2
Pekar A, Mocnej J, Seah WK, Zolotova I (2020) Application domain-based overview of IoT network traffic characteristics. ACM Comput Surv (CSUR) 53(4):1–33
Chaccour C, Amer R, Zhou B, Saad W (2019) On the reliability of wireless virtual reality at terahertz (thz) frequencies, In: 2019 10th IFIP International Conference on New Technologies, Mobility and Security (NTMS), pp. 1–5
Qiuping L, Junhui Z, Yi G (2019) Computation offloading and resource management scheme in mobile edge computing. Telecommun Sci 35(3):36
Goudarzi M, Wu H, Palaniswami M, Buyya R (2020) An application placement technique for concurrent IoT applications in edge and fog computing environments. IEEE Trans Mobile Comput 20(4):1298–1311
Kibalya G, Serrat J, Gorricho J-L, Yao H, Zhang P (2020) A novel dynamic programming inspired algorithm for embedding of virtual networks in future networks. Comput Netw 179:107349
Acknowledgements
This work has been partially funded by the project "UNICO-5 G I+D-OPTIMAIX - TSI -063000-2021-34." This work is also supported by the research group "Management, Pricing and Services in Next Generation Networks" (MAPS) of the Network Engineering Department of the Universitat Politècnica de Catalunya.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Ssemakula, J.B., Gorricho, JL., Kibalya, G. et al. An artificial intelligence strategy for the deployment of future microservice-based applications in 6G networks. Neural Comput & Applic (2024). https://doi.org/10.1007/s00521-024-09643-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00521-024-09643-9