Evolvable dialogue state tracking for statistical dialogue management
Abstract
Statistical dialogue management is the core of cognitive spoken dialogue systems (SDS) and has attracted great research interest. In recent years, SDS with the ability to evolve have been of particular interest and have become the cutting edge of SDS research. Dialogue state tracking (DST) is the process of estimating the distribution over dialogue states at each dialogue turn, given the previous interaction history. It plays an important role in statistical dialogue management. To provide a common testbed for advancing DST research, international DST challenges (DSTC) have been organised and have been well attended by major SDS groups around the world. This paper reviews recent progress on rule-based and statistical approaches made during the challenges. In particular, it focuses on evolvable DST approaches for dialogue domain extension. The two primary aspects of evolution, semantic parsing and the tracker, are discussed. Semantic enhancement and a DST framework that bridges rule-based and statistical models are introduced in detail. By effectively combining prior knowledge of dialogue state transitions with the ability to learn from data, the new framework supports reliable domain extension with little data and can continuously improve as more data become available. This makes it an excellent candidate for DST evolution. Experiments show that the evolvable DST approaches achieve state-of-the-art performance and outperform all previously submitted trackers in the third DSTC.
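To make the definition of DST above concrete, the following is a minimal sketch of a hand-written, rule-based belief update for a single slot. It is not the framework proposed in the paper: the update rule, the function names (`update_belief`, `track`), the slot values, and the confidence scores are all assumptions chosen for illustration. At each turn, the scores from the semantic parser are added to the previous belief, which is first scaled down by the probability mass the parser did not assign to any value.

```python
from typing import Dict, List

# A belief maps each slot value to a probability. Any mass not assigned
# to a value is read as "the user has not specified this slot yet".
Belief = Dict[str, float]


def update_belief(prev: Belief, slu_scores: Belief) -> Belief:
    """Apply one turn of an illustrative hand-written update rule.

    slu_scores holds the parser's confidence for each value mentioned in
    the current turn; the scores for one slot are assumed to sum to <= 1.
    """
    observed = sum(slu_scores.values())
    if not 0.0 <= observed <= 1.0 + 1e-9:
        raise ValueError("SLU scores for a slot must sum to at most 1")
    keep = 1.0 - observed  # weight given to the belief from earlier turns
    values = set(prev) | set(slu_scores)
    return {v: prev.get(v, 0.0) * keep + slu_scores.get(v, 0.0) for v in values}


def track(turns: List[Belief]) -> Belief:
    """Run the update over a whole dialogue, starting from an empty belief."""
    belief: Belief = {}
    for slu_scores in turns:
        belief = update_belief(belief, slu_scores)
    return belief


# Hypothetical two-turn dialogue for a "food" slot.
turns = [{"italian": 0.6, "indian": 0.2}, {"italian": 0.5}]
final = track(turns)
```

Because the rule has no trainable parameters, it needs no training data, but it also cannot improve as data accumulate. That gap between rule-based and data-driven trackers is the one the abstract says the proposed framework is designed to bridge.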
Keywords
dialogue management, domain extension, evolvable dialogue state tracking, parser, tracker