Skip to main content

Deep Reinforcement Learning for Text and Speech

  • Chapter
  • First Online:
Deep Learning for NLP and Speech Recognition

Abstract

In this chapter, we investigate deep reinforcement learning for text and speech applications. Reinforcement learning is a branch of machine learning that deals with how agents learn a set of actions that can maximize expected cumulative reward. In past research, reinforcement learning has focused on game play. Recent advances in deep learning have opened up reinforcement learning to wider applications for real-world problems, and the field of deep reinforcement learning was spawned. In the first part of this chapter, we introduce the fundamental concepts of reinforcement learning and their extension through the use of deep neural networks. In the latter part of the chapter, we investigate several popular deep reinforcement learning algorithms and their application to text and speech NLP tasks.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 89.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 119.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Dzmitry Bahdanau et al. “An Actor-Critic Algorithm for Sequence Prediction.” In: CoRR abs/1607.07086 (2016).

    Google Scholar 

  2. Mehdi Fatemi et al. “Policy Networks with Two-Stage Training for Dialogue Systems.” In: CoRR abs/1606.03152 (2016).

    Google Scholar 

  3. Wenfeng Feng, Hankz Hankui Zhuo, and Subbarao Kambhampati. “Extracting Action Sequences from Texts Based on Deep Reinforcement Learning.” In: IJCAI. ijcai.org, 2018, pp. 4064–4070.

    Google Scholar 

  4. Yuntian Feng et al. “Joint Extraction of Entities and Relations Using Reinforcement Learning and Deep Learning.” In: Comp. Int. and Neurosc. 2017 (2017), 7643065:1–7643065:11.

    Google Scholar 

  5. Jianfeng Gao, Michel Galley, and Lihong Li. “Neural Approaches to Conversational AI.” In: CoRR abs/1809.08267 (2018).

    Google Scholar 

  6. Tomas Gogar, Ondrej Hubácek, and Jan Sedivý. “Deep Neural Networks for Web Page Information Extraction.” In: AIAI. Vol. 475. Springer, 2016, pp. 154–163.

    Google Scholar 

  7. Hado van Hasselt, Arthur Guez, and David Silver “Deep Reinforcement Learning with Double Q-learning.” In: CoRR abs/1509.06461 (2015).

    Google Scholar 

  8. Yaser Keneshloo et al. “Deep Reinforcement Learning For Se quence to Sequence Models.” In: CoRR abs/1805.09461 (2018).

    Google Scholar 

  9. Gyoung Ho Lee and Kong Joo Lee. “Automatic Text Summarization Using Reinforcement Learning with Embedding Features.” In: IJCNLP(2). Asian Federation of Natural Language Processing, 2017, pp. 193–197.

    Google Scholar 

  10. Jiwei Li et al. “Deep Reinforcement Learning for Dialogue Gener ation”. In: CoRR abs/1606.01541 (2016).

    Google Scholar 

  11. Mike Mintz et al. “Distant supervision for relation extraction without labeled data.” In: ACL/IJCNLP. The Association for Computer Linguistics, 2009, pp. 1003–1011.

    Google Scholar 

  12. Volodymyr Mnih et al. “Playing Atari with Deep Reinforcement Learning.” In: CoRR abs/1312.5602 (2013).

    Google Scholar 

  13. Karthik Narasimhan, Adam Yala, and Regina Barzilay “Improving Information Extraction by Acquiring External Evidence with Reinforcement Learning.” In: CoRR abs/1603.07954 (2016).

    Google Scholar 

  14. Romain Paulus, Caiming Xiong, and Richard Socher. “A Deep Reinforced Model for Abstractive Summarization.” In: CoRR abs/1705.04304 (2017).

    Google Scholar 

  15. Yanjun Qi et al. “Deep Learning for Character-Based Information Extraction.” In: ECIR. Vol. 8416. Springer, 2014, pp. 668–674.

    Google Scholar 

  16. Tom Schaul et al. “Prioritized Experience Replay.” In: CoRR abs/1511.05952 (2015).

    Google Scholar 

  17. Abigail See, Peter J. Liu, and Christopher D. Manning. “Get To The Point: Summarization with Pointer-Generator Networks.” In: CoRR abs/1704.04368 (2017).

    Google Scholar 

  18. Andros Tjandra, Sakriani Sakti, and Satoshi Nakamura. “Sequence-to-Sequence ASR Optimization via Reinforcement Learning.” In: CoRR abs/1710.10774 (2017).

    Google Scholar 

  19. Dong Yu and Jinyu Li. “Recent Progresses in Deep Learning based Acoustic Models (Updated).” In: CoRR (2018). http://arxiv.org/abs/1804.09298

  20. Xiangrong Zeng et al. “Large Scaled Relation Extraction With Reinforcement Learning.” In: AAAI AAAI Press, 2018.

    Google Scholar 

  21. Tianyang Zhang, Minlie Huang, and Li Zhao. “Learning Structured Representation for Text Classification via Reinforcement Learning.” In: AAAI. AAAI Press, 2018.

    Google Scholar 

  22. Tiancheng Zhao and Maxine Eskénazi. “Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning.” In: SIGDIAL Conference. The Association for Computer Linguistics, 2016, pp. 1–10.

    Google Scholar 

  23. Yingbo Zhou, Caiming Xiong, and Richard Socher. “Improving End-to-End Speech Recognition with Policy Learning.” In: CoRR abs/1712.07101 (2017).

    Google Scholar 

  24. Asli Çelikyilmaz et al. “Deep Communicating Agents for Abstractive Summarization.” In: NAACL-HLT. Association for Computational Linguistics, 2018, pp. 1662–1675.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Kamath, U., Liu, J., Whitaker, J. (2019). Deep Reinforcement Learning for Text and Speech. In: Deep Learning for NLP and Speech Recognition . Springer, Cham. https://doi.org/10.1007/978-3-030-14596-5_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-14596-5_13

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-14595-8

  • Online ISBN: 978-3-030-14596-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics