Abstract
In this chapter, we investigate deep reinforcement learning for text and speech applications. Reinforcement learning is a branch of machine learning that deals with how agents learn a set of actions that can maximize expected cumulative reward. In past research, reinforcement learning has focused on game play. Recent advances in deep learning have opened up reinforcement learning to wider applications for real-world problems, and the field of deep reinforcement learning was spawned. In the first part of this chapter, we introduce the fundamental concepts of reinforcement learning and their extension through the use of deep neural networks. In the latter part of the chapter, we investigate several popular deep reinforcement learning algorithms and their application to text and speech NLP tasks.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Dzmitry Bahdanau et al. “An Actor-Critic Algorithm for Sequence Prediction.” In: CoRR abs/1607.07086 (2016).
Mehdi Fatemi et al. “Policy Networks with Two-Stage Training for Dialogue Systems.” In: CoRR abs/1606.03152 (2016).
Wenfeng Feng, Hankz Hankui Zhuo, and Subbarao Kambhampati. “Extracting Action Sequences from Texts Based on Deep Reinforcement Learning.” In: IJCAI. ijcai.org, 2018, pp. 4064–4070.
Yuntian Feng et al. “Joint Extraction of Entities and Relations Using Reinforcement Learning and Deep Learning.” In: Comp. Int. and Neurosc. 2017 (2017), 7643065:1–7643065:11.
Jianfeng Gao, Michel Galley, and Lihong Li. “Neural Approaches to Conversational AI.” In: CoRR abs/1809.08267 (2018).
Tomas Gogar, Ondrej Hubácek, and Jan Sedivý. “Deep Neural Networks for Web Page Information Extraction.” In: AIAI. Vol. 475. Springer, 2016, pp. 154–163.
Hado van Hasselt, Arthur Guez, and David Silver “Deep Reinforcement Learning with Double Q-learning.” In: CoRR abs/1509.06461 (2015).
Yaser Keneshloo et al. “Deep Reinforcement Learning For Se quence to Sequence Models.” In: CoRR abs/1805.09461 (2018).
Gyoung Ho Lee and Kong Joo Lee. “Automatic Text Summarization Using Reinforcement Learning with Embedding Features.” In: IJCNLP(2). Asian Federation of Natural Language Processing, 2017, pp. 193–197.
Jiwei Li et al. “Deep Reinforcement Learning for Dialogue Gener ation”. In: CoRR abs/1606.01541 (2016).
Mike Mintz et al. “Distant supervision for relation extraction without labeled data.” In: ACL/IJCNLP. The Association for Computer Linguistics, 2009, pp. 1003–1011.
Volodymyr Mnih et al. “Playing Atari with Deep Reinforcement Learning.” In: CoRR abs/1312.5602 (2013).
Karthik Narasimhan, Adam Yala, and Regina Barzilay “Improving Information Extraction by Acquiring External Evidence with Reinforcement Learning.” In: CoRR abs/1603.07954 (2016).
Romain Paulus, Caiming Xiong, and Richard Socher. “A Deep Reinforced Model for Abstractive Summarization.” In: CoRR abs/1705.04304 (2017).
Yanjun Qi et al. “Deep Learning for Character-Based Information Extraction.” In: ECIR. Vol. 8416. Springer, 2014, pp. 668–674.
Tom Schaul et al. “Prioritized Experience Replay.” In: CoRR abs/1511.05952 (2015).
Abigail See, Peter J. Liu, and Christopher D. Manning. “Get To The Point: Summarization with Pointer-Generator Networks.” In: CoRR abs/1704.04368 (2017).
Andros Tjandra, Sakriani Sakti, and Satoshi Nakamura. “Sequence-to-Sequence ASR Optimization via Reinforcement Learning.” In: CoRR abs/1710.10774 (2017).
Dong Yu and Jinyu Li. “Recent Progresses in Deep Learning based Acoustic Models (Updated).” In: CoRR (2018). http://arxiv.org/abs/1804.09298
Xiangrong Zeng et al. “Large Scaled Relation Extraction With Reinforcement Learning.” In: AAAI AAAI Press, 2018.
Tianyang Zhang, Minlie Huang, and Li Zhao. “Learning Structured Representation for Text Classification via Reinforcement Learning.” In: AAAI. AAAI Press, 2018.
Tiancheng Zhao and Maxine Eskénazi. “Towards End-to-End Learning for Dialog State Tracking and Management using Deep Reinforcement Learning.” In: SIGDIAL Conference. The Association for Computer Linguistics, 2016, pp. 1–10.
Yingbo Zhou, Caiming Xiong, and Richard Socher. “Improving End-to-End Speech Recognition with Policy Learning.” In: CoRR abs/1712.07101 (2017).
Asli Çelikyilmaz et al. “Deep Communicating Agents for Abstractive Summarization.” In: NAACL-HLT. Association for Computational Linguistics, 2018, pp. 1662–1675.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Kamath, U., Liu, J., Whitaker, J. (2019). Deep Reinforcement Learning for Text and Speech. In: Deep Learning for NLP and Speech Recognition . Springer, Cham. https://doi.org/10.1007/978-3-030-14596-5_13
Download citation
DOI: https://doi.org/10.1007/978-3-030-14596-5_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-14595-8
Online ISBN: 978-3-030-14596-5
eBook Packages: Computer ScienceComputer Science (R0)