Skip to main content
Log in

An artificial intelligence strategy for the deployment of future microservice-based applications in 6G networks

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

Future applications to be supported by 6G networks are envisaged to be realized by loosely-coupled and independent microservices. In order to achieve an optimal deployment of applications, smart resource management strategies will be required, working in a cost-effective and resource-efficient manner. Current cloud computing services are challenged to meet the explosive growth and demand of future use cases such as virtual/augmented/mixed reality (VR/AR/MR). The purpose of edge computing (EC) is to better address latency and transmission requirements of those future stringent applications. However, a high flexibility and a rapid decision-making will be required since EC suffers from limited resources availability. For this reason, this work proposes an artificial intelligence (AI) technique, based on reinforcement learning (RL), to make intelligent decisions on the optimal tier and edge-site selection to serve any request according to the application’s category, constraints, and conflicting costs. In addition, when deployed at the edge-network, a heuristic has been proposed for the mapping of microservices within the selected edge-site. That heuristic will exploit a ranking methodology based on the network topology and available network and compute resources while preserving the revenue of the mobile network operator (MNO). Simulation results show that the performance of the proposed RL approach is close to the optimal solution by reaching the cost minimization objective within a 8.3% margin; moreover, RL outperforms considered benchmark algorithms in most of the conducted experiments.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Algorithm 1
Algorithm 2
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

Data availability

The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.

References

  1. Luévano E, de Lara EL, Castro JE (2015) Use of telepresence and holographic projection mobile device for Colege degree level. Proc Comput Sci 75:339–347

    Article  Google Scholar 

  2. Clemm A, Vega MT, Ravuri HK, Wauters T, De Turck F (2020) Toward truly immersive holographic-type communication: challenges and solutions. IEEE Commun Magaz 58(1):93–99

    Article  Google Scholar 

  3. Yastrebova A, Kirichek R, Koucheryavy Y, Borodin A, Koucheryavy A (2018) Future networks 2030: Architecture & requirements, In: 2018 10th International Congress on Ultra Modern Telecommunications and Control Systems and Workshops (ICUMT), IEEE, pp. 1–8

  4. Huang C, Hu S, Alexandropoulos GC, Zappone A, Yuen C, Zhang R, Di Renzo M, Debbah M (2020) Holographic mimo surfaces for 6g wireless networks: Opportunities, challenges, and trends. IEEE Wireless Commun 27(5):118–125

    Article  Google Scholar 

  5. Liu G, Huang Y, Li N, Dong J, Jin J, Wang Q, Li N (2020) Vision, requirements and network architecture of 6g mobile network beyond 2030. China Commun 17(9):92–104

    Article  Google Scholar 

  6. Srinivas J, Reddy KVS, Qyser AM (2012) Cloud computing basics. Int J Adv Res Comput Commun Eng 1(5):343–347

    Google Scholar 

  7. Merluzzi M, Di Lorenzo P, Barbarossa S, Frascolla V (2020) Dynamic computation offloading in multi-access edge computing via ultra-reliable and low-latency communications. IEEE Trans Signal Inf Process Over Netw 6:342–356

    Article  MathSciNet  Google Scholar 

  8. Chakareski J, Gupta S (2020) Multi-connectivity and edge computing for ultra-low-latency lifelike virtual reality, In: 2020 IEEE International Conference on Multimedia and Expo (ICME), IEEE, pp. 1–6

  9. Karimi A, Pedersen Km, Mahmood NH, Berardinelli G, Mogensen P (2020) On Teh multiplexing of data and metadata for ultra-reliable low-latency communications in 5g. IEEE Trans Vehicul Technol 69(10):12136–12147

    Article  Google Scholar 

  10. Deng M, Tian H, Fan B (2016) Fine-granularity based application offloading policy in cloud-enhanced small cell networks, In: 2016 IEEE International Conference on Communications Workshops (ICC), IEEE, pp. 638–643

  11. FG-NET2030 I (2019) New services and capabilities for network 2030: description, technical gap and performance target analysis, FG-NET2030 document NET2030-O-027

  12. Clemm A, Zhani MF, Boutaba R (2020) Network management 2030: operations and control of network 2030 services. J Netw Syst Manage 28(4):721–750

    Article  Google Scholar 

  13. Cuervo E, Balasubramanian A, Cho D-k, Wolman A, Saroiu S, Chandra R, Bahl P (2010) Maui: making smartphones last longer with code offload, In: Proceedings of teh 8th international conference on Mobile systems, applications, and services, pp. 49–62

  14. De Maio V, Brandic I (2018) First hop mobile offloading of dag computations. 2018 18th IEEE/ACM International Symposium on Cluster. Cloud and Grid Computing (CCGRID), IEEE, pp 83–92

    Google Scholar 

  15. Avasalcai C, Tsigkanos C, Dustdar S (2021) Resource management for latency-sensitive IoT applications WiF satisfiability. IEEE Trans Serv Comput 15(5):2982–2993

    Article  Google Scholar 

  16. Hassan SR, Ahmad I, Rehman AU, Hussen S, Hamam H (2022) Design of resource-aware load allocation for heterogeneous fog computing environments. Wireless Commun Mobile Comput 2022:1–11

    Google Scholar 

  17. Lefèvre S, Bajcsy R, Laugier C (2013) Probabilistic decision making for collision avoidance systems: Postponing decisions, In: 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, IEEE, pp. 4370–4375

  18. Rauschnabel PA (2021) Augmented reality is eating Teh real-world! Teh substitution of physical products by holograms. Int J Inf Manage 57:102279

    Article  Google Scholar 

  19. Lu L, Wang H, Liu P, Liu R, Zhang J, Xie Y, Liu S, Huo T, Xie M, Wu X, et al (2022) Applications of mixed reality technology in orthopedics surgery: A pilot study, Frontiers in Bioengineering and Biotechnology 10

  20. Jia M, Cao J, Liang W (2015) Optimal cloudlet placement and user to cloudlet allocation in wireless metropolitan area networks. IEEE Trans Cloud Comput 5(4):725–737

    Article  Google Scholar 

  21. Li B, He Q, Cui G, Xia X, Chen F, Jin H, Yang Y (2020) Read: Robustness-oriented edge application deployment in edge computing environment. IEEE Trans Serv Comput 15(3):1746–1759

    Article  Google Scholar 

  22. Chen Q, Wang Z, Leng J, Li C, Zheng W, Guo M (2019) Avalon: towards qos awareness and improved utilization through multi-resource management in datacenters, In: Proceedings of the ACM International Conference on Supercomputing, pp. 272–283

  23. Zhang W, Cui W, Fu K, Chen Q, Mawhirter D E, Wu B, Li C, Guo M (2019) Laius: Towards latency awareness and improved utilization of spatial multitasking accelerators in datacenters, In: Proceedings of teh ACM international conference on supercomputing, pp. 58–68

  24. Zhou X, Peng X, Xie T, Sun J, Ji C, Li W, Ding D (2018) Fault analysis and debugging of microservice systems: industrial survey, benchmark system, and empirical study. IEEE Trans Softw Eng 47(2):243–260

    Article  Google Scholar 

  25. Filip I-D, Pop F, Serbanescu C, Choi C (2018) Microservices scheduling model over heterogeneous cloud-edge environments as support for IoT applications. IEEE Int Things J 5(4):2672–2681

    Article  Google Scholar 

  26. Yang H, Chen Q, Riaz M, Luan Z, Tang L, Mars J (2017) Powerchief: Intelligent power allocation for multi-stage applications to improve responsiveness on power constrained cmp, In: Proceedings of teh 44th Annual International Symposium on Computer Architecture, pp. 133–146

  27. Patel T, Tiwari D (2020) Clite: Efficient and qos-aware co-location of multiple latency-critical jobs for warehouse scale computers, In: 2020 IEEE International Symposium on High Performance Computer Architecture (HPCA), IEEE, pp. 193–206

  28. Hou X, Li C, Liu J, Zhang L, Ren S, Leng J, Chen Q, Guo M (2021) Alphar: Learning-powered resource management for irregular, dynamic microservice graph, In: 2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS), IEEE, pp. 797–806

  29. Liu J, Zhang Q (2019) Reliability and latency aware code-partitioning offloading in mobile edge computing, In: 2019 IEEE Wireless Communications and Networking Conference (WCNC), IEEE, pp. 1–7

  30. Gan Y, Zhang Y, Cheng D, Shetty A, Rathi P, Katarki N, Bruno A, Hu J, Ritchken B, Jackson B et al (2019) An open-source benchmark suite for microservices and their hardware-software implications for cloud & edge systems, In: Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, pp. 3–18

  31. Park S, Kim H (2022) Dagmap: Multi-drone slam via a dag-based distributed ledger. Drones 6(2):34

    Article  Google Scholar 

  32. Callieri M, Ponchio F, Cignoni P, Scopigno R (2008) Virtual inspector: a flexible visualizer for dense 3d scanned models. IEEE Comput Graph Appl 28(1):44–54

    Article  Google Scholar 

  33. Chaturvedi V, Singh A K, Zhang W, Srikanthan T (2014) Thermal-aware task scheduling for peak temperature minimization under periodic constraint for 3d-mpsocs, In: 2014 25nd IEEE International Symposium on Rapid System Prototyping, IEEE, pp. 107–113

  34. Kwan A, Wong J, Jacobsen H-A, Muthusamy V (2019) Hyscale: Hybrid and network scaling of dockerized microservices in cloud data centres, In: 2019 IEEE 39th International Conference on Distributed Computing Systems (ICDCS), IEEE, pp. 80–90

  35. Gias A U, Casale G, Woodside M (2019) Atom: Model-driven autoscaling for microservices, In: 2019 IEEE 39th International Conference on Distributed Computing Systems (ICDCS), IEEE, pp. 1994–2004

  36. Bao L, Wu C, Bu X, Ren N, Shen M (2019) Performance modeling and workflow scheduling of microservice-based applications in clouds. IEEE Trans Parallel Distribut Syst 30(9):2114–2129

    Article  Google Scholar 

  37. Liang Y, Ge J, Zhang S, Wu J, Pan L, Zhang T, Luo B (2019) Interaction-oriented service entity placement in edge computing. IEEE Trans Mobile Comput 20(3):1064–1075

    Article  Google Scholar 

  38. Mnih V, Kavukcuoglu K, Silver D, Rusu AA, Veness J, Bellemare MG, Graves A, Riedmiller M, Fidjeland AK, Ostrovski G et al (2015) Human-level control through deep reinforcement learning. Nature 518(7540):529–533

    Article  Google Scholar 

  39. Gan Y, Liang M, Dev S, Lo D, Delimitrou C (2021) Sage: practical and scalable ml-driven performance debugging in microservices, In: Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, pp. 135–151

  40. Gan Y, Zhang Y, Hu K, Cheng D, He Y, Pancholi M, Delimitrou C (2019) Seer: Leveraging big data to navigate teh complexity of performance debugging in cloud microservices, In: Proceedings of teh twenty-fourth international conference on architectural support for programming languages and operating systems, pp. 19–33

  41. Zhang Y, Hua W, Zhou Z, Suh G E, Delimitrou C (2021) Sinan: Ml-based and qos-aware resource management for cloud microservices, In: Proceedings of teh 26th ACM international conference on architectural support for programming languages and operating systems, pp. 167–181

  42. Wu Q, Wu Z, Zhuang Y, Cheng Y, (2018) Adaptive dag tasks scheduling wif deep reinforcement learning, In: International Conference on Algorithms and Architectures for Parallel Processing, Springer, pp. 477–490

  43. Chen Y, Deng S, Zhao H, He Q, Li Y, Gao H (2019) Data-intensive application deployment at edge: A deep reinforcement learning approach, In: 2019 IEEE International Conference on Web Services (ICWS), IEEE, pp. 355–359

  44. Wang S, Urgaonkar R, Zafer M, He T, Chan K, Leung K K (2015) Dynamic service migration in mobile edge-clouds, In: 2015 IFIP networking conference (IFIP networking), IEEE, pp. 1–9

  45. Wang S, Guo Y, Zhang N, Yang P, Zhou A, Shen X (2019) Delay-aware microservice coordination in mobile edge computing: a reinforcement learning approach. IEEE Trans Mobile Comput 20(3):939–951

    Article  Google Scholar 

  46. Kannan RS, Subramanian L, Raju A, Ahn J, Mars J, Tang L (2019) Grandslam: guaranteeing slas for jobs in microservices execution frameworks. Proc Fourteenth EuroSys Conf 2019:1–16

    Google Scholar 

  47. Kibalya G, Serrat J, Gorricho J-L, Bujjingo D G, Sserugunda J, Zhang P (2021) A reinforcement learning approach for placement of stateful virtualized network functions, In: 2021 IFIP/IEEE International Symposium on Integrated Network Management (IM), IEEE, pp. 672–676

  48. Kibalya G, Serrat J, Gorricho J-L, Okello D, Zhang P (2020) A deep reinforcement learning-based algorithm for reliability-aware multi-domain service deployment in smart ecosystems, Neural Computing and Applications 1–23

  49. Cao B, Zhang L, Li Y, Feng D, Cao W (2019) Intelligent offloading in multi-access edge computing: a state-of-the-art review and framework. IEEE Commun Magaz 57(3):56–62

    Article  Google Scholar 

  50. Kibalya G, Serrat J, Gorricho J-L, Pasquini R, Yao H, Zhang P (2019) A reinforcement learning based approach for 5g network slicing across multiple domains, In: 2019 15th International Conference on Network and Service Management (CNSM), IEEE, pp. 1–5

  51. Mao Y, You C, Zhang J, Huang K, Letaief KB (2017) A survey on mobile edge computing: the communication perspective. IEEE Commun Surveys Tutor 19(4):2322–2358

    Article  Google Scholar 

  52. Maier M (2021) 6g as if people mattered: From industry 4.0 toward society 5.0, In: 2021 International Conference on Computer Communications and Networks (ICCCN), IEEE, pp. 1–10

  53. Giordani M, Polese M, Mezzavilla M, Rangan S, Zorzi M (2020) Toward 6g networks: use cases and technologies. IEEE Commun Magaz 58(3):55–61

    Article  Google Scholar 

  54. Zheng T, Wan J, Zhang J, Jiang C (2022) Deep reinforcement learning-based workload scheduling for edge computing. J Cloud Comput 11(1):3

    Article  Google Scholar 

  55. Gu L, Chen Z, Xu H, Zeng D, Li B, Jin H (2022) Layer-aware collaborative microservice deployment toward maximal edge throughput, In: IEEE INFOCOM 2022-IEEE Conference on Computer Communications, IEEE, pp. 71–79

  56. Amiri R, Mehrpouyan H, Fridman L, Mallik R K, Nallanathan A, Matolak D (2018) A machine learning approach for power allocation in hetnets considering qos, In: 2018 IEEE international conference on communications (ICC), IEEE, pp. 1–7

  57. Wang J, Hu J, Min G, Zomaya AY, Georgalas N (2020) Fast adaptive task offloading in edge computing based on meta reinforcement learning. IEEE Trans Parallel Distribut Syst 32(1):242–253

    Article  Google Scholar 

  58. Toyoda H, Nishimura S, Okuno M, Yamaoka R, Nishi H (2005) A 100-gb-ethernet subsystem for next-generation metro-area network, In: IEEE International Conference on Communications, 2005. ICC 2005. 2005, Vol. 2, pp. 1036–1042 Vol. 2

  59. Pekar A, Mocnej J, Seah WK, Zolotova I (2020) Application domain-based overview of IoT network traffic characteristics. ACM Comput Surv (CSUR) 53(4):1–33

    Article  Google Scholar 

  60. Chaccour C, Amer R, Zhou B, Saad W (2019) On the reliability of wireless virtual reality at terahertz (thz) frequencies, In: 2019 10th IFIP International Conference on New Technologies, Mobility and Security (NTMS), pp. 1–5

  61. Qiuping L, Junhui Z, Yi G (2019) Computation offloading and resource management scheme in mobile edge computing. Telecommun Sci 35(3):36

    Google Scholar 

  62. Goudarzi M, Wu H, Palaniswami M, Buyya R (2020) An application placement technique for concurrent IoT applications in edge and fog computing environments. IEEE Trans Mobile Comput 20(4):1298–1311

    Article  Google Scholar 

  63. Kibalya G, Serrat J, Gorricho J-L, Yao H, Zhang P (2020) A novel dynamic programming inspired algorithm for embedding of virtual networks in future networks. Comput Netw 179:107349

    Article  Google Scholar 

Download references

Acknowledgements

This work has been partially funded by the project "UNICO-5 G I+D-OPTIMAIX - TSI -063000-2021-34." This work is also supported by the research group "Management, Pricing and Services in Next Generation Networks" (MAPS) of the Network Engineering Department of the Universitat Politècnica de Catalunya.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to John Bosco Ssemakula.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Ssemakula, J.B., Gorricho, JL., Kibalya, G. et al. An artificial intelligence strategy for the deployment of future microservice-based applications in 6G networks. Neural Comput & Applic (2024). https://doi.org/10.1007/s00521-024-09643-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s00521-024-09643-9

Keywords

Navigation