Optimizing Dialogue Strategy in Large-Scale Spoken Dialogue System: A Learning Automaton Based Approach
Application of statistical methodology to model dialogue strategy in spoken dialogue system is a growing research area. Reinforcement learning is a promising technique for creating a dialogue management component that accepts semantic of the current dialogue state and seeks to find the best action given those features. In practice, increase in the number of dialogue states, much use of memory and processing is needed and the use of exhaustive search techniques like dynamic programming leads to sub-optimal solution. Hence, this paper investigates an adaptive policy iterative method using learning automata that cover large state-action space by hierarchical organization of automaton to learn optimal dialogue strategy. The proposed approach has clear advantages over baseline reinforcement learning algorithms in terms of faster learning with good exploitation in its update and scalability to larger problems.
KeywordsHuman–computer interaction Reinforcement learning Learning automata Spoken dialogue system
- 1.McTear M (2004) Spoken dialog technology: toward the conversational user interface. Springer, New YorkGoogle Scholar
- 2.Sutton RS, Barto AG (1998) Reinforcement learning an introduction. MIT Press, CambridgeGoogle Scholar
- 4.Singh S, Litman D, Walker M (2002) Optimizing dialogue management with reinforcement leaning: experiments with the NJFun system. J Artif Intell 16:105–133Google Scholar
- 9.Toney D, Moore J, Lemon O (2006) Evolving optimal inspectable strategies for spoken dialogue systems. In: Proceedings of HLT, pp 173–176Google Scholar
- 11.Thathachar MAL, Sastry PS (2004) Networks of learning automata: techniques for online stochastic optimization. Kluwer, NorwellGoogle Scholar