Abstract
The proliferation of Internet-of-Things (IoTs) devices provides a promising platform for various intelligent applications such as virtual reality. However, because of the limited onboard computation resource of IoT devices, the computation task suffers from long latency. Mobile edge computing (MEC), which is a critical technology, allows offloading the computation tasks to an edge server to alleviate the shortage of computation resources and accelerate the computation tasks of IoT devices. Owing to the flexibility and mobility advantages of unmanned aerial vehicles (UAVs), UAV-assisted MEC has been widely researched. However, existing studies mostly explore a centralized offloading strategy. Therefore, when the number of IoT devices increases, the performance of the centralized strategy is reduced. The present study explores an intelligent strategy to minimize computation latency using a distributed algorithm. We develop a distributed algorithm named DeepMECagent based on deep reinforcement learning to optimize the computation offloading with minimum computation latency. In the considered scenario, a UAV functions as an aerial edge server to collect and process the computation tasks offloaded by ground IoT devices.The simulation results demonstrate the effective-ness of the proposed approach for minimizing the computation latency, where the computation latency of the proposed algorithm improves 39.71%,87.42%, and 88.55%, respectively, while compared with the centralized-DQN, Q-table, and the random algorithm. Given the expiration time as 1 second, the number of completed tasks within the expiration time of the proposed DeepMECagent is around 2 × and 1.25 × compared with the random algorithm and the Q-table algorithm, respectively.
Similar content being viewed by others
References
The future of iot miniguide: The burgeoning iot market continues. https://www.cisco.com/c/en/us/solutions/internet-of-things/future-of-iot.html
Abbas N, Zhang Y, Taherkordi A, Skeie T (2017) Mobile edge computing: a survey. IEEE Internet of Things Journal 5(1):450–465
Arulkumaran K, Deisenroth MP, Brundage M, Bharath AA (2017) Deep reinforcement learning: a brief survey. IEEE Signal Proc Mag 34(6):26–38
Dan L, Xin C, Chongwei H, Liangliang J (2015) Intelligent agriculture greenhouse environment monitoring system based on iot technology. In: 2015 International conference on intelligent transportation, big data and smart city. IEEE, pp 487–490
Diao X, Zheng J, Cai Y, Wu Y, Anpalagan A (2019) Fair data allocation and trajectory optimization for uav-assisted mobile edge computing. IEEE Commun Lett 23(12):2357–2361
Erdelj M, Król M, Natalizio E (2017) Wireless sensor networks and multi-uav systems for natural disaster management. Comput Netw 124:72–86
Guo H, Liu J (2019) Uav-enhanced intelligent offloading for internet of things at the edge. IEEE Transactions on Industrial Informatics 16(4):2737–2746
Hu W, Zhou Y, Zhang Z, Fujita H (2021) Model predictive control for hybrid levitation systems of maglev trains with state constraints. IEEE Trans Veh Technol 70(10):9972–9985
Hua M, Wang Y, Li C, Huang Y, Yang L (2019) Uav-aided mobile edge computing systems with one by one access scheme. IEEE Transactions on Green Communications and Networking 3(3):664–678
Ji J, Zhu K, Yi C, Niyato D (2020) Energy consumption minimization in uav-assisted mobile-edge computing systems: Joint resource allocation and trajectory design. IEEE Internet of Things Journal 8(10):8570–8584
Li B, Fei Z, Zhang Y (2018) Uav communications for 5g and beyond: Recent advances and future trends. IEEE Internet of Things Journal 6(2):2241–2263
Li M, Cheng N, Gao J, Wang Y, Zhao L, Shen X (2020) Energy-efficient uav-assisted mobile edge computing: Resource allocation and trajectory optimization. IEEE Trans Veh Technol 69(3):3424–3438
Li Y (2017) Deep reinforcement learning: An overview. arXiv:1701.07274
Liu Y, Qiu M, Hu J, Yu H (2020) Incentive uav-enabled mobile edge computing based on microwave power transmission. IEEE Access 8:28,584–28,593
Lyu J, Zeng Y, Zhang R (2016) Cyclical multiple access in uav-aided communications: a throughput-delay tradeoff. IEEE Wireless Communications Letters 5(6):600–603
Mach P, Becvar Z (2017) Mobile edge computing: a survey on architecture and computation offloading. IEEE Communications Surveys & Tutorials 19(3):1628–1656
Mao Y, You C, Zhang J, Huang K, Letaief KB (2017) A survey on mobile edge computing: The communication perspective. IEEE Communications Surveys & Tutorials 19(4):2322–2358
Miraz MH, Ali M, Excell PS, Picking R (2018) Internet of nano-things, things and everything: future growth trends. Future Internet 10(8):68
Mnih V, Kavukcuoglu K, Silver D, Rusu AA, Veness J, Bellemare MG, Graves A, Riedmiller M, Fidjeland AK, Ostrovski G et al (2015) Human-level control through deep reinforcement learning. Nature 518(7540):529–533
Mohammed A, Nahom H, Tewodros A, Habtamu Y, Hayelom G (2020) Deep reinforcement learning for computation offloading and resource allocation in blockchain-based multi-uav-enabled mobile edge computing. In: 2020 17Th international computer conference on wavelet active media technology and information processing (ICCWAMTIP). IEEE, pp 295–299
Morales M (2020) Grokking deep reinforcement learning manning publications
Mozaffari M, Kasgari ATZ, Saad W, Bennis M, Debbah M (2018) Beyond 5g with uavs: Foundations of a 3d wireless cellular network. IEEE Trans Wirel Commun 18(1):357–372
Mukherjee M, Kumar V, Lat A, Guo M, Matam R, Lv Y (2020) Distributed deep learning-based task offloading for uav-enabled mobile edge computing. In: IEEE INFOCOM 2020-IEEE Conference on computer communications workshops (INFOCOM WKSHPS). IEEE, pp 1208–1212
Nguyen V, Khanh TT, Van Nam P, Thu NT, Hong CS, Huh EN (2020) Towards flying mobile edge computing. In: 2020 International conference on information networking (ICOIN). IEEE, pp 723–725
Poongodi T, Krishnamurthi R, Indrakumari R, Suresh P, Balusamy B (2020) Wearable devices and iot. In: A handbook of internet of things in biomedical and cyber physical system. Springer, pp 245–273
Ren T, Niu J, Dai B, Liu X, Hu Z, Xu M, Guizani M (2021) Enabling efficient scheduling in large-scale uav-assisted mobile edge computing via hierarchical reinforcement learning. IEEE Internet of Things Journal
Ryu JW, Pham QV, Luan HN, Hwang WJ, Kim JD, Lee JT (2019) Multi-access edge computing empowered heterogeneous networks: a novel architecture and potential works. Symmetry 11(7):842
Tokic M, Palm G (2011) Value-difference based exploration: adaptive control between epsilon-greedy and softmax. In: Annual conference on artificial intelligence. Springer, pp 335–346
Wang L, Huang P, Wang K, Zhang G, Zhang L, Aslam N, Yang K (2019) Rl-based user association and resource allocation for multi-uav enabled mec. In: 2019 15Th international wireless communications & mobile computing conference (IWCMC). IEEE, pp 741–746
Wang L, Wang K, Pan C, Xu W, Aslam N, Hanzo L (2020) Multi-agent deep reinforcement learning-based trajectory planning for multi-uav assisted mobile edge computing. IEEE Transactions on Cognitive Communications and Networking 7(1):73–84
Watkins CJ, Dayan P (1992) Q-learning. Machine Learning 8(3-4):279–292
Yang D, Wu Q, Zeng Y, Zhang R (2018) Energy tradeoff in ground-to-uav communication via trajectory design. IEEE Trans Veh Technol 67(7):6721–6726
Zeng Y, Zhang R (2017) Energy-efficient uav communication with trajectory optimization. IEEE Trans Wirel Commun 16(6):3747–3760
Zhang J, Zhou L, Tang Q, Ngai ECH, Hu X, Zhao H, Wei J (2018) Stochastic computation offloading and trajectory scheduling for uav-assisted mobile edge computing. IEEE Internet of Things Journal 6(2):3688–3699
Zhang K, Gui X, Ren D, Li D (2020) Energy–latency tradeoff for computation offloading in uav-assisted multiaccess edge computing system. IEEE Internet of Things Journal 8(8):6709–6719
Zhang L, Zhang ZY, Min L, Tang C, Zhang HY, Wang YH, Cai P (2021) Task offloading and trajectory control for uav-assisted mobile edge computing using deep reinforcement learning. IEEE Access 9:53,708–53,719
Zhang L, Zhao Z, Wu Q, Zhao H, Xu H, Wu X (2018) Energy-aware dynamic resource allocation in uav assisted mobile edge computing over social internet of vehicles. IEEE Access 6:56,700–56,715
Zhang S, Zeng Y, Zhang R (2018) Cellular-enabled uav communication: Trajectory optimization under connectivity constraint. In: 2018 IEEE International conference on communications (ICC). IEEE, pp 1–6
Zhang T, Xu Y, Loo J, Yang D, Xiao L (2019) Joint computation and communication design for uav-assisted mobile edge computing in iot. IEEE Transactions on Industrial Informatics 16(8):5505–5516
Zhang Y, Zhou Y, Lu H, Fujita H (2021) Cooperative multi-agent actor–critic control of traffic network flow based on edge computing. Futur Gener Comput Syst 123:128–141
Zhang Y, Zhou Y, Lu H, Fujita H (2021) Spark cloud-based parallel computing for traffic network flow predictive control using non-analytical predictive model. IEEE Transactions on Intelligent Transportation Systems
Zhao L, Zhou Y, Lu H, Fujita H (2019) Parallel computing method of deep belief networks and its application to traffic flow prediction. Knowl-Based Syst 163:972–987
Zhou Y, Tao X, Yu Z, Fujita H (2019) Train-movement situation recognition for safety justification using moving-horizon tbm-based multisensor data fusion. Knowl-Based Syst 177:117–126
Zhou Z, Feng J, Tan L, He Y, Gong J (2018) An air-ground integration approach for mobile edge computing in iot. IEEE Commun Mag 56(8):40–47
Zhu S, Gui L, Zhao D, Cheng N, Zhang Q, Lang X (2021) Learning-based computation offloading approaches in uavs-assisted edge computing. IEEE Trans Veh Technol 70(1):928–944
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Zhang, X., Wang, Y. DeepMECagent: multi-agent computing resource allocation for UAV-assisted mobile edge computing in distributed IoT system. Appl Intell 53, 1180–1191 (2023). https://doi.org/10.1007/s10489-022-03482-8
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-022-03482-8