Deep Reinforcement Learning

Lei, Chen

doi:10.1007/978-981-16-2233-5_10

Chen Lei⁵

Part of the book series: Cognitive Intelligence and Robotics ((CIR))

3585 Accesses
14 Citations

Abstract

This chapter starts by covering the basic concepts involved in reinforcement learning and then describes how to solve reinforcement learning tasks by using basic and deep learning-based solutions. It also provides a brief overview of the typical algorithms central to the deep learning-based solutions, namely DQN, DDPG, and A3C.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 149.00; Price excludes VAT (USA)

Softcover Book: USD 199.99; Price excludes VAT (USA)

Hardcover Book: USD 199.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

V. Mnih, K. Kavukcuoglu, D. Silver et al., Playing atari with deep reinforcement learning, (2013). [2019–11–10] https://arxiv.org/pdf/1312.5602.pdf
T.P.Lillicrap, J.J. Hunt, A. Pritzel et al., Continuous control with deep reinforcement learning, (2015). [2019–11–10] https://arxiv.org/pdf/1509.02971.pdf
R.S. Sutton, D.A. McAllester, S.P.Singh et al., Policy gradient methods for reinforcement learning with function approximation. in Advances in Neural Information Processing Systems (2000), pp. 1057–1063
Google Scholar
D. Silver, G. Lever, N. Heess et al., Deterministic policy gradient algorithms, (2014). [2019–11–10] http://xueshu.baidu.com/usercenter/paper/show?paperid=43a8642b81092513eb6bad1f3f5231e2&site=xueshu_se
V. Mnih, A.P. Badia, M. Mirza et al., Asynchronous methods for deep reinforcement learning. in International Conference on Machine Learning (2016), pp. 1928–1937
Google Scholar
X. Zhao, L. Zhang, Z. Ding et al., Deep reinforcement learning for list-wise recommendations, (2017). [2019–11–10] https://arxiv.org/pdf/1801.00209.pdf
G. Zheng, F. Zhang, Z. Zheng et al., DRN: a deep reinforcement learning framework for news recommendation. in Proceedings of the 2018 World Wide Web Conference. International World Wide Web Conferences Steering Committee, (2018), pp. 167–176
Google Scholar
X. Chen, S. Li, H. Li et al., Generative adversarial user model for reinforcement learning based recommendation system. in International Conference on Machine Learning (2019), pp. 1052–1061
Google Scholar
B. Wu, Q. Fu, J. Liang et al., Hierarchical macro strategy model for MOBA game AI, (2018) [2019–11–10] https://arxiv.org/pdf/1812.07887.pdf
P. Sun, X. Sun, L. Han et al., TStarBots: defeating the cheating level builtin AI in starCraft II in the full game, (2018). [2019–11–10] https://arxiv.org/pdf/1809.07193.pdf
J.X. Wang, Z. Kurth-Nelson, D. Kumaran et al., Prefrontal cortex as a meta-reinforcement learning system. Nat. Neurosci. 21(6), 860 (2018)
Article Google Scholar
D. Silver, A. Huang, C.J. Maddison et al., Mastering the game of go with deep neural networks and tree search. Nature 529(7587), 484 (2016)
Article Google Scholar
D. Silver, J. Schrittwieser, K. Simonyan et al., Mastering the game of go without human knowledge. Nature 550(7676), 354 (2017)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Kowloon, Hong Kong
Chen Lei

Authors

Chen Lei
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Lei, C. (2021). Deep Reinforcement Learning. In: Deep Learning and Practice with MindSpore. Cognitive Intelligence and Robotics. Springer, Singapore. https://doi.org/10.1007/978-981-16-2233-5_10

Download citation

DOI: https://doi.org/10.1007/978-981-16-2233-5_10
Published: 18 August 2021
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-2232-8
Online ISBN: 978-981-16-2233-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics