Deep Reinforcement Learning for Text and Speech

Kamath, Uday; Liu, John; Whitaker, James

doi:10.1007/978-3-030-14596-5_13

Uday Kamath⁴,
John Liu⁵ &
James Whitaker⁴

9117 Accesses

Abstract

In this chapter, we investigate deep reinforcement learning for text and speech applications. Reinforcement learning is a branch of machine learning that deals with how agents learn a set of actions that can maximize expected cumulative reward. In past research, reinforcement learning has focused on game play. Recent advances in deep learning have opened up reinforcement learning to wider applications for real-world problems, and the field of deep reinforcement learning was spawned. In the first part of this chapter, we introduce the fundamental concepts of reinforcement learning and their extension through the use of deep neural networks. In the latter part of the chapter, we investigate several popular deep reinforcement learning algorithms and their application to text and speech NLP tasks.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

eBook: USD 16.99; Price excludes VAT (USA)

Softcover Book: USD 89.99; Price excludes VAT (USA)

Hardcover Book: USD 119.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Dzmitry Bahdanau et al. “An Actor-Critic Algorithm for Sequence Prediction.” In: CoRR abs/1607.07086 (2016).
Google Scholar
Mehdi Fatemi et al. “Policy Networks with Two-Stage Training for Dialogue Systems.” In: CoRR abs/1606.03152 (2016).
Google Scholar
Wenfeng Feng, Hankz Hankui Zhuo, and Subbarao Kambhampati. “Extracting Action Sequences from Texts Based on Deep Reinforcement Learning.” In: IJCAI. ijcai.org, 2018, pp. 4064–4070.
Google Scholar
Yuntian Feng et al. “Joint Extraction of Entities and Relations Using Reinforcement Learning and Deep Learning.” In: Comp. Int. and Neurosc. 2017 (2017), 7643065:1–7643065:11.
Google Scholar
Jianfeng Gao, Michel Galley, and Lihong Li. “Neural Approaches to Conversational AI.” In: CoRR abs/1809.08267 (2018).
Google Scholar
Tomas Gogar, Ondrej Hubácek, and Jan Sedivý. “Deep Neural Networks for Web Page Information Extraction.” In: AIAI. Vol. 475. Springer, 2016, pp. 154–163.
Google Scholar
Hado van Hasselt, Arthur Guez, and David Silver “Deep Reinforcement Learning with Double Q-learning.” In: CoRR abs/1509.06461 (2015).
Google Scholar
Yaser Keneshloo et al. “Deep Reinforcement Learning For Se quence to Sequence Models.” In: CoRR abs/1805.09461 (2018).
Google Scholar
Gyoung Ho Lee and Kong Joo Lee. “Automatic Text Summarization Using Reinforcement Learning with Embedding Features.” In: IJCNLP(2). Asian Federation of Natural Language Processing, 2017, pp. 193–197.
Google Scholar
Jiwei Li et al. “Deep Reinforcement Learning for Dialogue Gener ation”. In: CoRR abs/1606.01541 (2016).
Google Scholar
Mike Mintz et al. “Distant supervision for relation extraction without labeled data.” In: ACL/IJCNLP. The Association for Computer Linguistics, 2009, pp. 1003–1011.
Google Scholar
Volodymyr Mnih et al. “Playing Atari with Deep Reinforcement Learning.” In: CoRR abs/1312.5602 (2013).
Google Scholar
Karthik Narasimhan, Adam Yala, and Regina Barzilay “Improving Information Extraction by Acquiring External Evidence with Reinforcement Learning.” In: CoRR abs/1603.07954 (2016).
Google Scholar
Romain Paulus, Caiming Xiong, and Richard Socher. “A Deep Reinforced Model for Abstractive Summarization.” In: CoRR abs/1705.04304 (2017).
Google Scholar
Yanjun Qi et al. “Deep Learning for Character-Based Information Extraction.” In: ECIR. Vol. 8416. Springer, 2014, pp. 668–674.
Google Scholar
Tom Schaul et al. “Prioritized Experience Replay.” In: CoRR abs/1511.05952 (2015).
Google Scholar
Abigail See, Peter J. Liu, and Christopher D. Manning. “Get To The Point: Summarization with Pointer-Generator Networks.” In: CoRR abs/1704.04368 (2017).
Google Scholar
Andros Tjandra, Sakriani Sakti, and Satoshi Nakamura. “Sequence-to-Sequence ASR Optimization via Reinforcement Learning.” In: CoRR abs/1710.10774 (2017).
Google Scholar
Dong Yu and Jinyu Li. “Recent Progresses in Deep Learning based Acoustic Models (Updated).” In: CoRR (2018). http://arxiv.org/abs/1804.09298
Xiangrong Zeng et al. “Large Scaled Relation Extraction With Reinforcement Learning.” In: AAAI AAAI Press, 2018.
Google Scholar
Tianyang Zhang, Minlie Huang, and Li Zhao. “Learning Structured Representation for Text Classification via Reinforcement Learning.” In: AAAI. AAAI Press, 2018.
Google Scholar
Tiancheng Zhao and Maxine Eskénazi. “Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning.” In: SIGDIAL Conference. The Association for Computer Linguistics, 2016, pp. 1–10.
Google Scholar
Yingbo Zhou, Caiming Xiong, and Richard Socher. “Improving End-to-End Speech Recognition with Policy Learning.” In: CoRR abs/1712.07101 (2017).
Google Scholar
Asli Çelikyilmaz et al. “Deep Communicating Agents for Abstractive Summarization.” In: NAACL-HLT. Association for Computational Linguistics, 2018, pp. 1662–1675.
Google Scholar

Download references

Author information

Authors and Affiliations

Digital Reasoning Systems Inc., McLean, VA, USA
Uday Kamath & James Whitaker
Intelluron Corporation, Nashville, TN, USA
John Liu

Authors

Uday Kamath
View author publications
You can also search for this author in PubMed Google Scholar
John Liu
View author publications
You can also search for this author in PubMed Google Scholar
James Whitaker
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Kamath, U., Liu, J., Whitaker, J. (2019). Deep Reinforcement Learning for Text and Speech. In: Deep Learning for NLP and Speech Recognition . Springer, Cham. https://doi.org/10.1007/978-3-030-14596-5_13

Download citation

DOI: https://doi.org/10.1007/978-3-030-14596-5_13
Published: 11 June 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-14595-8
Online ISBN: 978-3-030-14596-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics