Abstract
In the following Chapter we provide a “proof-of-concept” that RL outperforms hand-coded strategies which are manually tuned to the same reward function. We also show this for a wide range of application scenarios. We use simulation-based optimisation, i.e. we adapt different strategies to different simulated environments.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Rieser, V., Lemon, O. (2011). Proof-of-Concept: Information Seeking Strategies. In: Reinforcement Learning for Adaptive Dialogue Systems. Theory and Applications of Natural Language Processing. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24942-6_4
Download citation
DOI: https://doi.org/10.1007/978-3-642-24942-6_4
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24941-9
Online ISBN: 978-3-642-24942-6
eBook Packages: Computer ScienceComputer Science (R0)