Frontiers of Computer Science

, Volume 10, Issue 2, pp 201–215 | Cite as

Evolvable dialogue state tracking for statistical dialogue management

Review Article
  • 86 Downloads

Abstract

Statistical dialogue management is the core of cognitive spoken dialogue systems (SDS) and has attracted great research interest. In recent years, SDS with the ability of evolution is of particular interest and becomes the cuttingedge of SDS research. Dialogue state tracking (DST) is a process to estimate the distribution of the dialogue states at each dialogue turn, given the previous interaction history. It plays an important role in statistical dialogue management. To provide a common testbed for advancing the research of DST, international DST challenges (DSTC) have been organised and well-attended by major SDS groups in the world. This paper reviews recent progresses on rule-based and statistical approaches during the challenges. In particular, this paper is focused on evolvable DST approaches for dialogue domain extension. The two primary aspects for evolution, semantic parsing and tracker, are discussed. Semantic enhancement and a DST framework which bridges rule-based and statistical models are introduced in detail. By effectively incorporating prior knowledge of dialogue state transition and the ability of being data-driven, the new framework supports reliable domain extension with little data and can continuously improve with more data available. Thismakes it excellent candidate for DST evolution. Experiments show that the evolvable DST approaches can achieve the state-of-the-art performance and outperform all previously submitted trackers in the third DSTC.

Keywords

dialogue management domain extension evolvable dialogue state tracking parser tracker 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Supplementary material

11704_2015_5209_MOESM1_ESM.ppt (265 kb)
Supplementary material, approximately 265 KB.

References

  1. 1.
    Williams J D, Young S. Partially observable Markov decision processes for spoken dialog systems. Computer Speech & Language, 2007, 21(2): 393–422CrossRefGoogle Scholar
  2. 2.
    Thomson B, Young S. Bayesian update of dialogue state: a POMDP framework for spoken dialogue systems. Computer Speech & Language, 2010, 24(4): 562–588CrossRefGoogle Scholar
  3. 3.
    Young S, Gašic M, Keizer S, Mairesse F, Schatzmann J, Thomson B, Yu K. The hidden information state model: a practical framework for POMDP-based spoken dialogue management. Computer Speech & Language, 2010, 24(2): 150–174CrossRefGoogle Scholar
  4. 4.
    Young S, Gasic M, Thomson B, Williams J D. POMDP-based statistical spoken dialog systems: a review. Proceedings of the IEEE, 2013, 101(5): 1160–1179CrossRefGoogle Scholar
  5. 5.
    Williams J D. Challenges and opportunities for state tracking in statistical spoken dialog systems: results from two public deployments. IEEE Journal of Selected Topics in Signal Processing, 2012, 6(8): 959–970CrossRefGoogle Scholar
  6. 6.
    Williams J D. A belief tracking challenge task for spoken dialog systems. In: Proceedings of NAACL-HLTWorkshop on Future Directions and Needs in the Spoken Dialog Community: Tools and Data. 2012, 23–24Google Scholar
  7. 7.
    Williams J, Raux A, Ramachandran D, Black A. The dialog state tracking challenge. In: Proceedings of the SIGDIAL 2013 Conference., 2013, 404–413Google Scholar
  8. 8.
    Henderson M, Thomson B, Williams J D. The second dialog state tracking challenge. In: Proceedings of the 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL)., 2014, 263–272CrossRefGoogle Scholar
  9. 9.
    Henderson M, Thomson B, Williams J D. The third dialog state tracking challenge. In: Proceedings of IEEE Spoken Language Technology Workshop (SLT)., 2014Google Scholar
  10. 10.
    Yu K, Sun K, Chen L, Zhu S. Constrained markov bayesian polynomial for efficient dialogue state tracking. IEEE/ACM Transactions on Audio, Speech and Language Processing, 2015, 23(12): 2177–2188CrossRefGoogle Scholar
  11. 11.
    Sun K, Xie Q, Yu K. Recurrent polynomial network for dialogue state tracking. 2015, arXiv preprint arXiv: 1507.03934Google Scholar
  12. 12.
    Kim D, Henderson M, Gasic M, Tsiakoulis P, Young S J. The use of discriminative belief tracking in pomdp-based dialogue systems. In: Proceedings of IEEE Spoken Language Technology Workshop (SLT)., 2014, 354–359Google Scholar
  13. 13.
    Henderson M, Thomson B, Williams J. The second dialog state tracking challenge. In: Proceedings of the 15th Annual Meeting of the Special Interest Group on Discowrse and Dialogue., 2014, 263CrossRefGoogle Scholar
  14. 14.
    Young S. CUED standard dialogue acts. Report, Cambridge University, Engineering Department,, 2007Google Scholar
  15. 15.
    He Y, Young S. Spoken language understanding using the hidden vector state model. Speech Communication, 2006, 48(3): 262–275CrossRefGoogle Scholar
  16. 16.
    Wong YW, Mooney R J. Learning synchronous grammars for semantic parsing with lambda calculus. Annual Meeting-Association for Computational Linguistics., 2007, 45(1): 960Google Scholar
  17. 17.
    Zettlemoyer L S, Collins M. Online learning of relaxed CCG grammars for parsing to logical form. In: Proceedings of the Joint Conference on Emperical Methods in Natural Language Processing and Computational Natural Language Learning., 2007, 878–887Google Scholar
  18. 18.
    Sha F, Pereira F. Shallow parsing with conditional random fields. In: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology., 2003, 134–141Google Scholar
  19. 19.
    Jurcicek F, Mairesse F, Gašic M, Keizer S, Thomson B, Yu K, Young S. Transformation-based learning for semantic parsing. In: Proceedings of INTERSPEECH., 2009, 2719–2722Google Scholar
  20. 20.
    Yao K, Zweig G, Hwang M Y, Shi Y, Yu D. Recurrent neural networks for language understanding. In: Proceedings of INTERSPEECH., 2013, 2524–2528Google Scholar
  21. 21.
    Yao K, Peng B, Zhang Y, Yu D, Zweig G, Shi Y. Spoken language understanding using long short-term memory neural networks. In: Proceedings of IEEE Spoken Language Technology Workshop., 2014, 189–194Google Scholar
  22. 22.
    Guo D Z, Tur G, Yih W T, Zweig G. Joint semantic utterance classification and slot filling with recursive neural networks. In: Proceedings of IEEE Spoken Language Technology Workshop., 2014, 554–559Google Scholar
  23. 23.
    Mairesse F, Gasic M, Jurcícek F, Keizer S, Thomson B, Yu K, Young S. Spoken language understanding from unaligned data using discriminative classification models. In: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing., 2009Google Scholar
  24. 24.
    Henderson M, GasicM, Thomson B, Tsiakoulis P, Yu K, Young S. Discriminative spoken language understanding using word confusion networks. In: Proceedings of IEEE Spoken Language Technology Workshop., 2012, 176–181Google Scholar
  25. 25.
    Raymond C, Riccardi G. Generative and discriminative algorithms for spoken language understanding. In: Proceedings of INTERSPEECH., 2007, 1605–1608Google Scholar
  26. 26.
    Zhu S, Chen L, Sun K, Zheng D, Yu K. Semantic parser enhancement for dialogue domain extension with little data. In: Proceedings of IEEE Spoken Language Technology Workshop (SLT)., 2014Google Scholar
  27. 27.
    Lee S, Eskenazi M. Recipe for building robust spoken dialog state trackers: dialog state tracking challenge system description. In: Proceedings of the SIGDIAL 2013 Conference., 2013, 414–422Google Scholar
  28. 28.
    Sun K, Chen L, Zhu S, Yu K. The SJTU system for dialog state tracking challenge 2. In: Proceedings of the 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL)., 2014, 318–326CrossRefGoogle Scholar
  29. 29.
    Lee S. Structured discriminative model for dialog state tracking. In: Proceedings of the SIGDIAL 2013 Conference., 2013, 442–451Google Scholar
  30. 30.
    Kim S, Banchs R. Sequential labeling for tracking dynamic dialog states. In: Proceedings of the 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue., 2014, 332CrossRefGoogle Scholar
  31. 31.
    Henderson M, Thomson B, Young S. Deep neural network approach for the dialog state tracking challenge. In: Proceedings of the SIGDIAL 2013 Conference., 2013, 467–471Google Scholar
  32. 32.
    Henderson M, Thomson B, Young S. Word-based dialog state tracking with recurrent neural networks. In: Proceedings of the 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL)., 2014, 292–299CrossRefGoogle Scholar
  33. 33.
    Williams J D. Web-style ranking and SLU combination for dialog state tracking. In: Proceedings of the 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL). 2014, 282–291CrossRefGoogle Scholar
  34. 34.
    Zilka L, Marek D, Korvas M, Jurcicek F. Comparison of Bayesian discriminative and generative models for dialogue state tracking. In: Proceedings of the SIGDIAL 2013 Conference., 2013, 452–456Google Scholar
  35. 35.
    Wang Z, Lemon O. A simple and generic belief tracking mechanism for the dialog state tracking challenge: on the believability of observed information. In: Proceedings of the SIGDIAL 2013 Conference., 2013, 423–432Google Scholar
  36. 36.
    Sun K, Chen L, Zhu S, Yu K. A generalized rule based tracker for dialogue state tracking. In: Proceedings of IEEE Spoken Language Technology Workshop (SLT)., 2014, 330–335Google Scholar
  37. 37.
    Achterberg T. SCIP: Solving constraint integer programs. Mathematical Programming Computation,, 2009, 1(1): 1–41MathSciNetCrossRefMATHGoogle Scholar
  38. 38.
    Wang Z. HWU baseline belief tracker for dstc 2 & 3. Technical Report., 2013Google Scholar
  39. 39.
    Henderson M, Thomson B, Young S. Robust dialog state tracking using delexicalised recurrent neural networks and unsupervised adaptation. In: Proceedings of IEEE Spoken Language Technology Workshop (SLT)., 2014, 360–365Google Scholar
  40. 40.
    Kadlec R, Vodolan M, Libovicky J, Macek J, Kleindienst J. Knowledge-based dialog state tracking. In: Proceedings of IEEE Spoken Language Technology Workshop., 2014, 348–353Google Scholar

Copyright information

© Higher Education Press and Springer-Verlag Berlin Heidelberg 2016

Authors and Affiliations

  1. 1.Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering SpeechLab, Department of Computer Science and EngineeringShanghai Jiao Tong UniversityShanghaiChina
  2. 2.Department of Computer ScienceCornell UniversityIthacaUSA

Personalised recommendations