DeepMECagent: multi-agent computing resource allocation for UAV-assisted mobile edge computing in distributed IoT system

Zhang, Xiangxiang; Wang, Yichao

doi:10.1007/s10489-022-03482-8

Published: 26 April 2022

Applied Intelligence, Volume 53, pages 1180–1191 (2023)

Abstract

The proliferation of Internet-of-Things (IoT) devices provides a promising platform for intelligent applications such as virtual reality. However, because IoT devices have limited onboard computation resources, their computation tasks suffer from long latency. Mobile edge computing (MEC), a critical technology, allows computation tasks to be offloaded to an edge server, which alleviates the shortage of computation resources and accelerates the tasks of IoT devices. Owing to the flexibility and mobility of unmanned aerial vehicles (UAVs), UAV-assisted MEC has been widely researched. However, existing studies mostly explore centralized offloading strategies, whose performance degrades as the number of IoT devices increases. The present study instead explores an intelligent, distributed strategy for minimizing computation latency. We develop a distributed algorithm named DeepMECagent, based on deep reinforcement learning, to optimize computation offloading for minimum computation latency. In the considered scenario, a UAV functions as an aerial edge server that collects and processes the computation tasks offloaded by ground IoT devices. The simulation results demonstrate the effectiveness of the proposed approach: compared with the centralized-DQN, Q-table, and random algorithms, it improves computation latency by 39.71%, 87.42%, and 88.55%, respectively. With an expiration time of 1 second, DeepMECagent completes around 2× and 1.25× as many tasks within the expiration time as the random algorithm and the Q-table algorithm, respectively.
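The abstract does not spell out the algorithm, so the following is only a rough sketch of the distributed idea it describes: each ground device acts as an independent agent that learns whether to compute locally or offload to the UAV, with negative latency as the reward. It is not the authors' DeepMECagent. It uses a tabular value estimate rather than a deep network (closer in spirit to the Q-table baseline), and the latency model, all constants, and the names `latencies`, `DeviceAgent`, and `train` are assumptions made for illustration.

```python
import random

# All constants below are illustrative assumptions, not values from the paper.
LOCAL_HZ = 0.5e9     # CPU frequency of one IoT device (cycles/s)
UAV_HZ = 10e9        # CPU frequency of the UAV edge server, shared equally
UPLINK_BPS = 10e6    # uplink rate from a device to the UAV (bits/s)
TASK_BITS = 1e6      # input size of one task (bits)
TASK_CYCLES = 1e9    # CPU cycles needed by one task

LOCAL, OFFLOAD = 0, 1


def latencies(actions):
    """Per-device latency in seconds for a joint action (toy model)."""
    n_offload = sum(1 for a in actions if a == OFFLOAD)
    result = []
    for a in actions:
        if a == LOCAL:
            result.append(TASK_CYCLES / LOCAL_HZ)
        else:
            # Offloading pays for transmission plus a share of the UAV CPU.
            transmit = TASK_BITS / UPLINK_BPS
            compute = TASK_CYCLES / (UAV_HZ / n_offload)
            result.append(transmit + compute)
    return result


class DeviceAgent:
    """Independent learner: one value per action, no shared parameters."""

    def __init__(self, rng, epsilon=0.1, alpha=0.1):
        self.q = [0.0, 0.0]
        self.rng = rng
        self.epsilon = epsilon
        self.alpha = alpha

    def greedy(self):
        return max((LOCAL, OFFLOAD), key=lambda a: self.q[a])

    def act(self):
        # Epsilon-greedy: explore occasionally, otherwise exploit.
        if self.rng.random() < self.epsilon:
            return self.rng.choice((LOCAL, OFFLOAD))
        return self.greedy()

    def learn(self, action, latency):
        # Reward is negative latency, so lower latency means higher value.
        self.q[action] += self.alpha * (-latency - self.q[action])


def train(n_devices=4, steps=2000, seed=0):
    rng = random.Random(seed)
    agents = [DeviceAgent(rng) for _ in range(n_devices)]
    for _ in range(steps):
        actions = [agent.act() for agent in agents]
        # Each agent sees only its own action and its own latency.
        for agent, action, latency in zip(agents, actions, latencies(actions)):
            agent.learn(action, latency)
    return agents


agents = train()
policy = [agent.greedy() for agent in agents]
print(policy, latencies(policy))
```

With these assumed constants, offloading is faster than local execution even when all four devices share the UAV, so the example mainly shows the decentralized structure: no agent needs the others' state, and adding devices only adds independent learners rather than enlarging one centralized decision problem.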

Author information

Authors and Affiliations

Qisibo Educational Technology, Beijing, 100084, China
Xiangxiang Zhang
The University of New South Wales, Sydney, 2052, Australia
Yichao Wang

Corresponding author

Correspondence to Xiangxiang Zhang.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Zhang, X., Wang, Y. DeepMECagent: multi-agent computing resource allocation for UAV-assisted mobile edge computing in distributed IoT system. Appl Intell 53, 1180–1191 (2023). https://doi.org/10.1007/s10489-022-03482-8

Accepted: 05 March 2022
Published: 26 April 2022
Issue Date: January 2023
DOI: https://doi.org/10.1007/s10489-022-03482-8
