Abstract
With the increasing diversification of ship users’ communication services, the QoS of data transmission has become the limitation of the development of maritime communication. The software-defined maritime communication networks are proposed to solve the problem of communication mode obstacles in heterogeneous networks. Based on this framework, we propose a transmission scheduling scheme based on improved deep Q learning algorithm which combines the deep Q network with softmax classifier (also known as S-DQN algorithm) to improve throughput, balance delay and energy consumption. First of all, the Markov decision process (MDP) is used to realize the optimal scheduling strategy. In addition, the mapping relationship between the optimal policy and the obtained information is established by using the deep Q network in the system. When the input data arrives, after the amounts of data self-learning, the optimal strategy is made as quickly and accurately as possible. The simulation results show that the scheme is better than other traditional schemes under the different quality of service, which verifies the effectiveness of the scheme.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Wang, H., Yu, F.R., Zhu, L., Tang, T., Ning, B.: Finite-state Markov modeling for wireless channels in tunnel communication-based train control systems. IEEE Trans. Intell. Transport. Syst. 15(3), 1083–1090 (2014)
Lin, S., et al.: Finite-state Markov modeling for high-speed railway fading channels. IEEE Antennas Wirel. Propagat. Lett. 14, 954–957 (2015)
Svensson, A.: An introduction to adaptive QAM modulation schemes for known and predicted channels. Proc. IEEE 95(12), 2322–2336 (2007)
Pappi, K.N., Lioumpas, A.S., Karagiannidis, G.K.: \(\theta \)-QAM: a parametric quadrature amplitude modulation family and its performance in AWGN and fading channels. IEEE Trans. Commun. 58(4), 1014–1019 (2010)
Li, Q., Zhao, L., Gao, J., Liang, H., Zhao, L., Tang, X.: SMDP-based coordinated virtual machine allocations in cloud-fog computing systems. IEEE Internet Things J. 5(3), 1977–1988 (2018)
Li, M., Zhao, L., Liang, H.: An SMDP-based prioritized channel allocation scheme in cognitive enabled vehicular ad hoc networks. IEEE Trans. Veh. Technol. 66(9), 7925–7933 (2017)
Qin, M., et al.: Learning-aided multiple time-scale SON function coordination in ultra-dense small-cell networks. IEEE Trans. Wireless Commun. 18(4), 2080–2092 (2019)
Mnih, V,Kavukcuoglu, K,Silver, D., et al.: Playing Atari with deep reinforcement learning. Computer Science (2013). arXiv:1312.5602
Silver, D., Huang, A., et al.: Mastering the game of Go with deep neural networks and tree search. Nature 529(7587), 484 (2016)
Van Hasselt, H., Guez, A., Silver, D.: Deep reinforcement learning with double q-learning. In: Proceedings of the 30th AAAI Conference on Artificial Intelligence, pp. 2094–2100 (2016)
Mnih, V., Kavukcuoglu, K., Silver, D., et al.: Human-level control through deep reinforcement learning. Nature 518(7540), 529 (2015)
Ahmed, N.: Data-Free/Data-sparse softmax parameter estimation with structured class geometries. IEEE Signal Processing Lett. 25(9), 1408–1412 (2018)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2020 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Yang, T., Shen, X. (2020). Intelligent Transmission Scheduling Based on Deep Reinforcement Learning. In: Mission-Critical Application Driven Intelligent Maritime Networks. SpringerBriefs in Computer Science. Springer, Singapore. https://doi.org/10.1007/978-981-15-4412-5_3
Download citation
DOI: https://doi.org/10.1007/978-981-15-4412-5_3
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-4411-8
Online ISBN: 978-981-15-4412-5
eBook Packages: Computer ScienceComputer Science (R0)