Dialogue Strategies to Overcome Speech Recognition Errors in Form-Filling Dialogue

Kang, Sangwoo; Lee, Songwook; Seo, Jungyun

doi:10.1007/978-3-642-00831-3_26

Dialogue Strategies to Overcome Speech Recognition Errors in Form-Filling Dialogue

Sangwoo Kang²¹,
Songwook Lee²² &
Jungyun Seo²³

Conference paper

832 Accesses
3 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5459))

Abstract

In a spoken dialogue system, the speech recognition performance accounts for the largest part of the overall system performance. Yet spontaneous speech recognition has an unstable performance. The proposed postprocessing method solves this problem. The state of a legacy DB can be used as an important factor for recognizing a user’s intention because form-filling dialogues tend to depend on the legacy DB. Our system uses the legacy DB and ASR result to infer the user’s intention, and the validity of the current user’s intention is verified using the inferred user’s intention. With a plan-based dialogue model, the proposed system corrected 27% of the incomplete tasks, and achieved an 89% overall task completion rate.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Hain, T.: Implicit modelling of pronunciation variation in automatic speech recognition. Speech Communication 46, 171–188 (2005)
Article Google Scholar
Kim, K., Lee, C., Jung, S., Lee, G.G.: A Frame-Based Probabilistic Framework for Spoken Dialog Management Using Dialog Examples. In: 9th SIGdial Workshop on Discourse and Dialogue, pp. 120–127. Association for Computational Linguistics, USA (2008)
Chapter Google Scholar
Cavazza, M.: An Empirical Study of Speech Recognition Errors in a Task-oriented Dialogue System. In: 2th SIGdial Workshop on Discourse and Dialogue, pp. 98–105. Association for Computational Linguistics, Denmark (2001)
Google Scholar
Gorrell, G.: Language Modelling and Error Handling in Spoken Dialogue System. Licentiate thesis, Linköping University, Sweden (2003)
Google Scholar
Goddeau, D., Meng, H., Polifroni, J., Seneff, S., Busayapongchai, S.: A Form-Based Dialog Manager for Spoken Language Applications. In: 4th International Conference on Spoken Language, pp. 701–705. IEEE Press, USA (1996)
Google Scholar
Litman, D., Allen, J.: A Plan Recognition Model for Subdialogue in Conversations. Cognitive Science 11, 163–200 (1987)
Article Google Scholar
Chu-Carroll, J., Carberry, S.: Generating information-sharing sub- dialogues in expert-user consultation. In: 14th International Conference on Artificial Intelligence, pp. 1234–1250. AAAI, UK (1995)
Google Scholar
Oh, J.: The Design of Plan-based Dialogue System In Task Execution Domain. M.S. thesis, Sogang University, Korea (1999)
Google Scholar
Walker, M., Passonneau, R., Boland, J.: Quantitative and Qualitative Evaluation of Darpa Communicator Spoken Dialogue Systems. In: 39th Annual Meeting of the Association for Computational Linguistics, pp. 512–522. Association for Computational Linguistics, France (2001)
Google Scholar
Ahn, D., Chung, M.: One-pass Semi-dynamic Network Decoding Using a Subnetwork Caching Model for Large Vocabulary Continuous Speech Recognition. IEICE Transaction. Information and Systems E87-D(5) 5, 1164–1174 (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Sogang University, Korea
Sangwoo Kang
Department of Computer Science, Chungju National University, Korea
Songwook Lee
Department of Computer Science & Interdisciplinary Program of Integrated Biotechnology, Sogang University, Korea
Jungyun Seo

Authors

Sangwoo Kang
View author publications
You can also search for this author in PubMed Google Scholar
Songwook Lee
View author publications
You can also search for this author in PubMed Google Scholar
Jungyun Seo
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computing, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong
Wenjie Li
Division of Information and Communication Sciences, Macquarie University, NSW 2109, Sydney, Australia
Diego Mollá-Aliod

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kang, S., Lee, S., Seo, J. (2009). Dialogue Strategies to Overcome Speech Recognition Errors in Form-Filling Dialogue. In: Li, W., Mollá-Aliod, D. (eds) Computer Processing of Oriental Languages. Language Technology for the Knowledge-based Economy. ICCPOL 2009. Lecture Notes in Computer Science(), vol 5459. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00831-3_26

Download citation

DOI: https://doi.org/10.1007/978-3-642-00831-3_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-00830-6
Online ISBN: 978-3-642-00831-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics