Skip to main content

Deep Reinforcement Learning

  • Chapter
  • First Online:
Deep Learning and Practice with MindSpore

Part of the book series: Cognitive Intelligence and Robotics ((CIR))

Abstract

This chapter starts by covering the basic concepts involved in reinforcement learning and then describes how to solve reinforcement learning tasks by using basic and deep learning-based solutions. It also provides a brief overview of the typical algorithms central to the deep learning-based solutions, namely DQN, DDPG, and A3C.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. V. Mnih, K. Kavukcuoglu, D. Silver et al., Playing atari with deep reinforcement learning, (2013). [2019–11–10] https://arxiv.org/pdf/1312.5602.pdf

  2. T.P.Lillicrap, J.J. Hunt, A. Pritzel et al., Continuous control with deep reinforcement learning, (2015). [2019–11–10] https://arxiv.org/pdf/1509.02971.pdf

  3. R.S. Sutton, D.A. McAllester, S.P.Singh et al., Policy gradient methods for reinforcement learning with function approximation. in Advances in Neural Information Processing Systems (2000), pp. 1057–1063

    Google Scholar 

  4. D. Silver, G. Lever, N. Heess et al., Deterministic policy gradient algorithms, (2014). [2019–11–10] http://xueshu.baidu.com/usercenter/paper/show?paperid=43a8642b81092513eb6bad1f3f5231e2&site=xueshu_se

  5. V. Mnih, A.P. Badia, M. Mirza et al., Asynchronous methods for deep reinforcement learning. in International Conference on Machine Learning (2016), pp. 1928–1937

    Google Scholar 

  6. X. Zhao, L. Zhang, Z. Ding et al., Deep reinforcement learning for list-wise recommendations, (2017). [2019–11–10] https://arxiv.org/pdf/1801.00209.pdf

  7. G. Zheng, F. Zhang, Z. Zheng et al., DRN: a deep reinforcement learning framework for news recommendation. in Proceedings of the 2018 World Wide Web Conference. International World Wide Web Conferences Steering Committee, (2018), pp. 167–176

    Google Scholar 

  8. X. Chen, S. Li, H. Li et al., Generative adversarial user model for reinforcement learning based recommendation system. in International Conference on Machine Learning (2019), pp. 1052–1061

    Google Scholar 

  9. B. Wu, Q. Fu, J. Liang et al., Hierarchical macro strategy model for MOBA game AI, (2018) [2019–11–10] https://arxiv.org/pdf/1812.07887.pdf

  10. P. Sun, X. Sun, L. Han et al., TStarBots: defeating the cheating level builtin AI in starCraft II in the full game, (2018). [2019–11–10] https://arxiv.org/pdf/1809.07193.pdf

  11. J.X. Wang, Z. Kurth-Nelson, D. Kumaran et al., Prefrontal cortex as a meta-reinforcement learning system. Nat. Neurosci. 21(6), 860 (2018)

    Article  Google Scholar 

  12. D. Silver, A. Huang, C.J. Maddison et al., Mastering the game of go with deep neural networks and tree search. Nature 529(7587), 484 (2016)

    Article  Google Scholar 

  13. D. Silver, J. Schrittwieser, K. Simonyan et al., Mastering the game of go without human knowledge. Nature 550(7676), 354 (2017)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Tsinghua University Press

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Lei, C. (2021). Deep Reinforcement Learning. In: Deep Learning and Practice with MindSpore. Cognitive Intelligence and Robotics. Springer, Singapore. https://doi.org/10.1007/978-981-16-2233-5_10

Download citation

  • DOI: https://doi.org/10.1007/978-981-16-2233-5_10

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-16-2232-8

  • Online ISBN: 978-981-16-2233-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics