Skip to main content

Optimisation for POMDP-Based Spoken Dialogue Systems

  • Chapter
  • First Online:
Data-Driven Methods for Adaptive Spoken Dialogue Systems

Abstract

Spoken dialogue systems (SDS) allow users to interact with a wide variety of information systems using speech as the primary, and often the only, communication medium. The principal elements of an SDS are a speech understanding component which converts each spoken input into an abstract semantic representation called a user dialogue act (see Chap. 3), a dialogue manager which responds to the user’s input and generates a system act a t in response, and a message generator which converts each system act back into speech (see Chap. 6). At each turn t, the system updates its state s t , and based on a policy π, it determines the next system act a t = π(s t ). The state consists of the variables needed to track the progress of the dialogue and the attribute values (often called slots) that determine the user’s requirements. In conventional systems, as discussed in Chap. 8, the policy is usually defined by a flow chart with nodes representing states and actions and arcs representing user inputs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Amari, S.: Natural gradient works efficiently in learning. Neural. Comput., 10(2), 251–276 (1998)

    Article  MathSciNet  Google Scholar 

  2. Bishop, C.: Pattern Recognition and Machine Learning, Springer (2006)

    Google Scholar 

  3. Cassandra, A.R.: POMDP solver [Computer Software] (2005). Available from http://www.cassandra.org/pomdp/code/

  4. Cohn, D., Atlas, L., Ladner, R.: Improving Generalization with Active Learning. Mach. Learn. 15, 201–221 (1994)

    Google Scholar 

  5. Deisenroth, M.P., Rasmussen, C.E., Peters, J.: Gaussian Process Dynamic Programming. Neurocomputing 72(7-9), 1508–1524 (2009)

    Google Scholar 

  6. Engel, Y.: Algorithms and Representations for Reinforcement Learning. PhD thesis, Hebrew University (2005)

    Google Scholar 

  7. Engel, Y., Mannor, S., Meir, R.: Reinforcement learning with Gaussian processes. In: Proceedings of ICML (2005)

    Google Scholar 

  8. Engel, E., Mannor, S., Meir, R.: Reinforcement learning with Gaussian processes. In: ICML 2005, Bonn, Germany (2005)

    Google Scholar 

  9. Gašić, M., Keizer, S., Mairesse, F., Schatzmann, J., Thomson, B., Yu, K., Young, S.: Training and evaluation of the HIS-POMDP dialogue system in noise. In: Proceedings of SIGDIAL (2008)

    Google Scholar 

  10. Gašić, M., Jurčíček, F., Keizer, S., Mairesse, F., Thomson, B., Yu, K., Young, S.J.: Gaussian Processes for Fast Policy Optimisation of a POMDP Dialogue Manager for a Real-world Task. In: SigDial 2010, Tokyo, Japan (2010)

    Google Scholar 

  11. Gašić, M., Young, S.: Effective Handling of Dialogue State in the Hidden Information State POMDP Dialogue Manager. ACM Transactions on Speech and Language Processing 7(3), 2011

    Google Scholar 

  12. Gašić, M.: Statistical dialogue modelling. PhD thesis, University of Cambridge (2011)

    Google Scholar 

  13. Jurčíček, F., Thomson, B., Keizer, S., Mairesse, F., Gašić, M., Yu, K., Young, S.J.: Natural Belief-Critic: a reinforcement algorithm for parameter estimation in statistical spoken dialogue systems. In Takao Kobayashi, Keikichi Hirose, and Satoshi Nakamura, editors, Proc. Interspeech, 90–93, ISCA (2010)

    Google Scholar 

  14. Jurčíček, F., Thomson, B., Young, S.J.: Natural Actor and Belief Critic: Reinforcement algorithm for learning parameters of dialogue systems modelled as POMDPs. ACM Transactions on Speech and Language Processing 7(3), 2011

    Google Scholar 

  15. Keizer, S., Gašić, M., Mairesse, F., Thomson, B., Yu, K., Young, S.: Modelling user behaviour in the HIS-POMDP dialogue manager. In: Proceedings of SLT, pp. 121–124, 2008

    Google Scholar 

  16. MacKay, D.J.C.: Information-based objective functions for active data selection. Neural. Comput. 4(4), 590–604 (1992)

    Article  Google Scholar 

  17. Minka, T.P.: Expectation Propagation for Approximate Bayesian Inference. In: Proc 17th Conf in Uncertainty in Artificial Intelligence, pp. 362–369. Seattle, Morgan-Kaufmann (2001)

    Google Scholar 

  18. Paquet, U.: Bayesian inference for latent variable models. PhD thesis, University of Cambridge (2007)

    Google Scholar 

  19. Peters, J., Schaal, S.: Natural Actor-Critic. m Neurocomputing 71(7-9), 1180–1190 (2008)

    Google Scholar 

  20. Rasmussen, C.E., Williams, C.K.I.: Gaussian Processes for Machine Learning. MIT Press (2006)

    Google Scholar 

  21. Roy, N., Pineau, J., Thrun, S: Spoken Dialogue Management Using Probabilistic Reasoning. In: Proceedings of the ACL 2000, 2000

    Google Scholar 

  22. Schatzmann, J., Stuttle, M.N., Weilhammer, K., Young, S.: Effects of the user model on simulation-based learning of dialogue strategies. In: IEEE ASRU ’05: Proc. IEEE Workshop Automatic Speech Recognition and Understanding (2005)

    Google Scholar 

  23. Schatzmann, J.: Statistical User and Error Modelling for Spoken Dialogue Systems. PhD thesis, University of Cambridge (2008)

    Google Scholar 

  24. Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. Adaptive Computation and Machine Learning. MIT Press, Cambridge, Massachusetts (1998)

    Google Scholar 

  25. Thomson, B.: Statistical methods for spoken dialogue management. PhD thesis, University of Cambridge (2009)

    Google Scholar 

  26. Thomson, B., Schatzmann, J., Young, S.J.: Bayesian Update of Dialogue State for Robust Dialogue Systems. In: Int Conf Acoustics Speech and Signal Processing ICASSP, Las Vegas (2008)

    Google Scholar 

  27. Thomson, B., Young, S.J.: Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems. Computer Speech and Language 24(4), 562–588 (2010)

    Article  Google Scholar 

  28. Thomson, B., Jurčíček, F., Gašić, M., Keizer, S., Mairesse, F., Yu, K., Young, S.J.: Parameter learning for POMDP spoken dialogue models. In: IEEE Workshop on Spoken Language Technology (SLT 2010), Berkeley, CA (2010)

    Google Scholar 

  29. Williams, J.D.: Partially Observable Markov Decision Processes for Spoken Dialogue Management. PhD thesis, University of Cambridge (2006)

    Google Scholar 

  30. Williams, R.J.: Simple statistical gradient-following algorithms for connectionist reinforcement learning. Mach. Learn. 8, 229–256 (1992)

    MATH  Google Scholar 

  31. Williams, J.D., Poupart, P., Young, S.: Factored partially observable Markov decision processes for dialogue management. In: Proceedings of the IJCAI Workshop on Knowledge and Reasoning in Practical Dialog Systems, 2005

    Google Scholar 

  32. Williams, J.D., Young, S.J.: Partially Observable Markov Decision Processes for Spoken Dialog Systems. Computer Speech and Language 21(2), 393–422 (2007)

    Article  Google Scholar 

  33. Young, S.J.: Talking to Machines (Statistically Speaking). In: Int Conf Spoken Language Processing, Denver, Colorado (2002)

    Google Scholar 

  34. Young, S.J., Gašić, M., Keizer, S., Mairesse, F., Schatzmann, J., Thomson, B., Yu, K.: The Hidden Information State Model: a practical framework for POMDP-based spoken dialogue management. Computer Speech and Language 24(2), 150–174 (2010)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Steve Young .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer Science+Business Media New York

About this chapter

Cite this chapter

Gašić, M., Jurčíček, F., Thomson, B., Young, S. (2012). Optimisation for POMDP-Based Spoken Dialogue Systems. In: Lemon, O., Pietquin, O. (eds) Data-Driven Methods for Adaptive Spoken Dialogue Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-4803-7_5

Download citation

  • DOI: https://doi.org/10.1007/978-1-4614-4803-7_5

  • Published:

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-1-4614-4802-0

  • Online ISBN: 978-1-4614-4803-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics