Skip to main content

Dialogue Strategies to Overcome Speech Recognition Errors in Form-Filling Dialogue

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5459))

Abstract

In a spoken dialogue system, the speech recognition performance accounts for the largest part of the overall system performance. Yet spontaneous speech recognition has an unstable performance. The proposed postprocessing method solves this problem. The state of a legacy DB can be used as an important factor for recognizing a user’s intention because form-filling dialogues tend to depend on the legacy DB. Our system uses the legacy DB and ASR result to infer the user’s intention, and the validity of the current user’s intention is verified using the inferred user’s intention. With a plan-based dialogue model, the proposed system corrected 27% of the incomplete tasks, and achieved an 89% overall task completion rate.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Hain, T.: Implicit modelling of pronunciation variation in automatic speech recognition. Speech Communication 46, 171–188 (2005)

    Article  Google Scholar 

  2. Kim, K., Lee, C., Jung, S., Lee, G.G.: A Frame-Based Probabilistic Framework for Spoken Dialog Management Using Dialog Examples. In: 9th SIGdial Workshop on Discourse and Dialogue, pp. 120–127. Association for Computational Linguistics, USA (2008)

    Chapter  Google Scholar 

  3. Cavazza, M.: An Empirical Study of Speech Recognition Errors in a Task-oriented Dialogue System. In: 2th SIGdial Workshop on Discourse and Dialogue, pp. 98–105. Association for Computational Linguistics, Denmark (2001)

    Google Scholar 

  4. Gorrell, G.: Language Modelling and Error Handling in Spoken Dialogue System. Licentiate thesis, Linköping University, Sweden (2003)

    Google Scholar 

  5. Goddeau, D., Meng, H., Polifroni, J., Seneff, S., Busayapongchai, S.: A Form-Based Dialog Manager for Spoken Language Applications. In: 4th International Conference on Spoken Language, pp. 701–705. IEEE Press, USA (1996)

    Google Scholar 

  6. Litman, D., Allen, J.: A Plan Recognition Model for Subdialogue in Conversations. Cognitive Science 11, 163–200 (1987)

    Article  Google Scholar 

  7. Chu-Carroll, J., Carberry, S.: Generating information-sharing sub- dialogues in expert-user consultation. In: 14th International Conference on Artificial Intelligence, pp. 1234–1250. AAAI, UK (1995)

    Google Scholar 

  8. Oh, J.: The Design of Plan-based Dialogue System In Task Execution Domain. M.S. thesis, Sogang University, Korea (1999)

    Google Scholar 

  9. Walker, M., Passonneau, R., Boland, J.: Quantitative and Qualitative Evaluation of Darpa Communicator Spoken Dialogue Systems. In: 39th Annual Meeting of the Association for Computational Linguistics, pp. 512–522. Association for Computational Linguistics, France (2001)

    Google Scholar 

  10. Ahn, D., Chung, M.: One-pass Semi-dynamic Network Decoding Using a Subnetwork Caching Model for Large Vocabulary Continuous Speech Recognition. IEICE Transaction. Information and Systems E87-D(5) 5, 1164–1174 (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kang, S., Lee, S., Seo, J. (2009). Dialogue Strategies to Overcome Speech Recognition Errors in Form-Filling Dialogue. In: Li, W., Mollá-Aliod, D. (eds) Computer Processing of Oriental Languages. Language Technology for the Knowledge-based Economy. ICCPOL 2009. Lecture Notes in Computer Science(), vol 5459. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00831-3_26

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-00831-3_26

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-00830-6

  • Online ISBN: 978-3-642-00831-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics