Feature Functions for Tree-Based Dialogue Course Management

  • Chapter
  • Book: Spoken Multimodal Human-Computer Dialogue in Mobile Environments

Part of the book series: Text, Speech and Language Technology (TLTB, volume 28)

Abstract

We propose a set of feature functions for dialogue course management and investigate their effect on how the system chooses the subsequent dialogue action during a dialogue session. In particular, we investigate whether the system is able to detect and resolve ambiguities, and whether it always chooses the state that leads as quickly as possible to a final state that is likely to meet the user's request. The criteria and data structures used are independent of the underlying domain and can therefore be employed for different applications of spoken dialogue systems. Experiments were performed on a German in-house corpus that covers the domain of German telephone directory assistance.

Copyright information

© 2005 Springer

About this chapter

Cite this chapter

Macherey, K., Ney, H. (2005). Feature Functions for Tree-Based Dialogue Course Management. In: Minker, W., Bühler, D., Dybkjær, L. (eds) Spoken Multimodal Human-Computer Dialogue in Mobile Environments. Text, Speech and Language Technology, vol 28. Springer, Dordrecht. https://doi.org/10.1007/1-4020-3075-4_4

  • DOI: https://doi.org/10.1007/1-4020-3075-4_4

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-1-4020-3073-4

  • Online ISBN: 978-1-4020-3075-8

  • eBook Packages: Computer Science, Computer Science (R0)
