Abstract
This chapter starts by covering the basic concepts involved in reinforcement learning and then describes how to solve reinforcement learning tasks by using basic and deep learning-based solutions. It also provides a brief overview of the typical algorithms central to the deep learning-based solutions, namely DQN, DDPG, and A3C.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
V. Mnih, K. Kavukcuoglu, D. Silver et al., Playing atari with deep reinforcement learning, (2013). [2019–11–10] https://arxiv.org/pdf/1312.5602.pdf
T.P.Lillicrap, J.J. Hunt, A. Pritzel et al., Continuous control with deep reinforcement learning, (2015). [2019–11–10] https://arxiv.org/pdf/1509.02971.pdf
R.S. Sutton, D.A. McAllester, S.P.Singh et al., Policy gradient methods for reinforcement learning with function approximation. in Advances in Neural Information Processing Systems (2000), pp. 1057–1063
D. Silver, G. Lever, N. Heess et al., Deterministic policy gradient algorithms, (2014). [2019–11–10] http://xueshu.baidu.com/usercenter/paper/show?paperid=43a8642b81092513eb6bad1f3f5231e2&site=xueshu_se
V. Mnih, A.P. Badia, M. Mirza et al., Asynchronous methods for deep reinforcement learning. in International Conference on Machine Learning (2016), pp. 1928–1937
X. Zhao, L. Zhang, Z. Ding et al., Deep reinforcement learning for list-wise recommendations, (2017). [2019–11–10] https://arxiv.org/pdf/1801.00209.pdf
G. Zheng, F. Zhang, Z. Zheng et al., DRN: a deep reinforcement learning framework for news recommendation. in Proceedings of the 2018 World Wide Web Conference. International World Wide Web Conferences Steering Committee, (2018), pp. 167–176
X. Chen, S. Li, H. Li et al., Generative adversarial user model for reinforcement learning based recommendation system. in International Conference on Machine Learning (2019), pp. 1052–1061
B. Wu, Q. Fu, J. Liang et al., Hierarchical macro strategy model for MOBA game AI, (2018) [2019–11–10] https://arxiv.org/pdf/1812.07887.pdf
P. Sun, X. Sun, L. Han et al., TStarBots: defeating the cheating level builtin AI in starCraft II in the full game, (2018). [2019–11–10] https://arxiv.org/pdf/1809.07193.pdf
J.X. Wang, Z. Kurth-Nelson, D. Kumaran et al., Prefrontal cortex as a meta-reinforcement learning system. Nat. Neurosci. 21(6), 860 (2018)
D. Silver, A. Huang, C.J. Maddison et al., Mastering the game of go with deep neural networks and tree search. Nature 529(7587), 484 (2016)
D. Silver, J. Schrittwieser, K. Simonyan et al., Mastering the game of go without human knowledge. Nature 550(7676), 354 (2017)
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2021 Tsinghua University Press
About this chapter
Cite this chapter
Lei, C. (2021). Deep Reinforcement Learning. In: Deep Learning and Practice with MindSpore. Cognitive Intelligence and Robotics. Springer, Singapore. https://doi.org/10.1007/978-981-16-2233-5_10
Download citation
DOI: https://doi.org/10.1007/978-981-16-2233-5_10
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-2232-8
Online ISBN: 978-981-16-2233-5
eBook Packages: Computer ScienceComputer Science (R0)