Intelligent Transmission Scheduling Based on Deep Reinforcement Learning

Yang, Tingting; Shen, Xuemin (Sherman)

doi:10.1007/978-981-15-4412-5_3

Part of the book series: SpringerBriefs in Computer Science ((BRIEFSCOMPUTER))

250 Accesses

Abstract

With the increasing diversification of ship users’ communication services, the QoS of data transmission has become the limitation of the development of maritime communication. The software-defined maritime communication networks are proposed to solve the problem of communication mode obstacles in heterogeneous networks. Based on this framework, we propose a transmission scheduling scheme based on improved deep Q learning algorithm which combines the deep Q network with softmax classifier (also known as S-DQN algorithm) to improve throughput, balance delay and energy consumption. First of all, the Markov decision process (MDP) is used to realize the optimal scheduling strategy. In addition, the mapping relationship between the optimal policy and the obtained information is established by using the deep Q network in the system. When the input data arrives, after the amounts of data self-learning, the optimal strategy is made as quickly and accurately as possible. The simulation results show that the scheme is better than other traditional schemes under the different quality of service, which verifies the effectiveness of the scheme.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

eBook: USD 16.99; Price excludes VAT (USA)

Softcover Book: USD 16.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Wang, H., Yu, F.R., Zhu, L., Tang, T., Ning, B.: Finite-state Markov modeling for wireless channels in tunnel communication-based train control systems. IEEE Trans. Intell. Transport. Syst. 15(3), 1083–1090 (2014)
Article Google Scholar
Lin, S., et al.: Finite-state Markov modeling for high-speed railway fading channels. IEEE Antennas Wirel. Propagat. Lett. 14, 954–957 (2015)
Article Google Scholar
Svensson, A.: An introduction to adaptive QAM modulation schemes for known and predicted channels. Proc. IEEE 95(12), 2322–2336 (2007)
Article Google Scholar
Pappi, K.N., Lioumpas, A.S., Karagiannidis, G.K.: \(\theta \)-QAM: a parametric quadrature amplitude modulation family and its performance in AWGN and fading channels. IEEE Trans. Commun. 58(4), 1014–1019 (2010)
Article Google Scholar
Li, Q., Zhao, L., Gao, J., Liang, H., Zhao, L., Tang, X.: SMDP-based coordinated virtual machine allocations in cloud-fog computing systems. IEEE Internet Things J. 5(3), 1977–1988 (2018)
Article Google Scholar
Li, M., Zhao, L., Liang, H.: An SMDP-based prioritized channel allocation scheme in cognitive enabled vehicular ad hoc networks. IEEE Trans. Veh. Technol. 66(9), 7925–7933 (2017)
Article Google Scholar
Qin, M., et al.: Learning-aided multiple time-scale SON function coordination in ultra-dense small-cell networks. IEEE Trans. Wireless Commun. 18(4), 2080–2092 (2019)
Google Scholar
Mnih, V,Kavukcuoglu, K,Silver, D., et al.: Playing Atari with deep reinforcement learning. Computer Science (2013). arXiv:1312.5602
Silver, D., Huang, A., et al.: Mastering the game of Go with deep neural networks and tree search. Nature 529(7587), 484 (2016)
Article Google Scholar
Van Hasselt, H., Guez, A., Silver, D.: Deep reinforcement learning with double q-learning. In: Proceedings of the 30th AAAI Conference on Artificial Intelligence, pp. 2094–2100 (2016)
Google Scholar
Mnih, V., Kavukcuoglu, K., Silver, D., et al.: Human-level control through deep reinforcement learning. Nature 518(7540), 529 (2015)
Article Google Scholar
Ahmed, N.: Data-Free/Data-sparse softmax parameter estimation with structured class geometries. IEEE Signal Processing Lett. 25(9), 1408–1412 (2018)
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Electrical Engineering and Intelligentization, Dongguan University of Technology, Dongguan, Guangdong, China
Tingting Yang
Department of Electrical and Computer Engineering, University of Waterloo, Waterloo, ON, Canada
Xuemin (Sherman) Shen

Authors

Tingting Yang
View author publications
You can also search for this author in PubMed Google Scholar
Xuemin (Sherman) Shen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Tingting Yang .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Yang, T., Shen, X. (2020). Intelligent Transmission Scheduling Based on Deep Reinforcement Learning. In: Mission-Critical Application Driven Intelligent Maritime Networks. SpringerBriefs in Computer Science. Springer, Singapore. https://doi.org/10.1007/978-981-15-4412-5_3

Download citation

DOI: https://doi.org/10.1007/978-981-15-4412-5_3
Published: 14 April 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-4411-8
Online ISBN: 978-981-15-4412-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics