Skip to main content

Intelligent Transmission Scheduling Based on Deep Reinforcement Learning

  • Chapter
  • First Online:
Mission-Critical Application Driven Intelligent Maritime Networks

Part of the book series: SpringerBriefs in Computer Science ((BRIEFSCOMPUTER))

  • 250 Accesses

Abstract

With the increasing diversification of ship users’ communication services, the QoS of data transmission has become the limitation of the development of maritime communication. The software-defined maritime communication networks are proposed to solve the problem of communication mode obstacles in heterogeneous networks. Based on this framework, we propose a transmission scheduling scheme based on improved deep Q learning algorithm which combines the deep Q network with softmax classifier (also known as S-DQN algorithm) to improve throughput, balance delay and energy consumption. First of all, the Markov decision process (MDP) is used to realize the optimal scheduling strategy. In addition, the mapping relationship between the optimal policy and the obtained information is established by using the deep Q network in the system. When the input data arrives, after the amounts of data self-learning, the optimal strategy is made as quickly and accurately as possible. The simulation results show that the scheme is better than other traditional schemes under the different quality of service, which verifies the effectiveness of the scheme.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 16.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Wang, H., Yu, F.R., Zhu, L., Tang, T., Ning, B.: Finite-state Markov modeling for wireless channels in tunnel communication-based train control systems. IEEE Trans. Intell. Transport. Syst. 15(3), 1083–1090 (2014)

    Article  Google Scholar 

  2. Lin, S., et al.: Finite-state Markov modeling for high-speed railway fading channels. IEEE Antennas Wirel. Propagat. Lett. 14, 954–957 (2015)

    Article  Google Scholar 

  3. Svensson, A.: An introduction to adaptive QAM modulation schemes for known and predicted channels. Proc. IEEE 95(12), 2322–2336 (2007)

    Article  Google Scholar 

  4. Pappi, K.N., Lioumpas, A.S., Karagiannidis, G.K.: \(\theta \)-QAM: a parametric quadrature amplitude modulation family and its performance in AWGN and fading channels. IEEE Trans. Commun. 58(4), 1014–1019 (2010)

    Article  Google Scholar 

  5. Li, Q., Zhao, L., Gao, J., Liang, H., Zhao, L., Tang, X.: SMDP-based coordinated virtual machine allocations in cloud-fog computing systems. IEEE Internet Things J. 5(3), 1977–1988 (2018)

    Article  Google Scholar 

  6. Li, M., Zhao, L., Liang, H.: An SMDP-based prioritized channel allocation scheme in cognitive enabled vehicular ad hoc networks. IEEE Trans. Veh. Technol. 66(9), 7925–7933 (2017)

    Article  Google Scholar 

  7. Qin, M., et al.: Learning-aided multiple time-scale SON function coordination in ultra-dense small-cell networks. IEEE Trans. Wireless Commun. 18(4), 2080–2092 (2019)

    Google Scholar 

  8. Mnih, V,Kavukcuoglu, K,Silver, D., et al.: Playing Atari with deep reinforcement learning. Computer Science (2013). arXiv:1312.5602

  9. Silver, D., Huang, A., et al.: Mastering the game of Go with deep neural networks and tree search. Nature 529(7587), 484 (2016)

    Article  Google Scholar 

  10. Van Hasselt, H., Guez, A., Silver, D.: Deep reinforcement learning with double q-learning. In: Proceedings of the 30th AAAI Conference on Artificial Intelligence, pp. 2094–2100 (2016)

    Google Scholar 

  11. Mnih, V., Kavukcuoglu, K., Silver, D., et al.: Human-level control through deep reinforcement learning. Nature 518(7540), 529 (2015)

    Article  Google Scholar 

  12. Ahmed, N.: Data-Free/Data-sparse softmax parameter estimation with structured class geometries. IEEE Signal Processing Lett. 25(9), 1408–1412 (2018)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tingting Yang .

Rights and permissions

Reprints and permissions

Copyright information

© 2020 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Yang, T., Shen, X. (2020). Intelligent Transmission Scheduling Based on Deep Reinforcement Learning. In: Mission-Critical Application Driven Intelligent Maritime Networks. SpringerBriefs in Computer Science. Springer, Singapore. https://doi.org/10.1007/978-981-15-4412-5_3

Download citation

  • DOI: https://doi.org/10.1007/978-981-15-4412-5_3

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-15-4411-8

  • Online ISBN: 978-981-15-4412-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics