An Empirical Study on Users’ Acceptance of Speech Recognition Errors in Text-Messaging

  • Shuang Xu
  • Santosh Basapur
  • Mark Ahlenius
  • Deborah Matteo
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4552)

Abstract

Although speech recognition technology and voice synthesis systems have become readily available, recognition accuracy remain a serious problem in the design and implementation of voice-based user interfaces. Error correction becomes particularly difficult on mobile devices due to the limited system resources and constrained input methods. This research is aimed to investigate users’ acceptance of speech recognition errors in mobile text messaging. Our results show that even though the audio presentation of the text messages does help users understand the speech recognition errors, users indicate low satisfaction when sending or receiving text messages with errors. Specifically, senders show significantly lower acceptance than the receivers due to the concerns of follow-up clarifications and the reflection of the sender’s personality. We also find that different types of recognition errors greatly affect users’ overall acceptance of the received message.

Keywords

Mobile Device Speech Recognition Cell Phone Automatic Speech Recognition Recognition Error 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Selected References (full references are available at: http://www.geocities.com/shuangxu/HCII2007_Xu.pdf)
  2. 2.
    Alewine, N., Ruback, H., Deligne, S.: Pervasive Speech Recognition. IEEE Pervasive Computing 3(4), 78–81 (2004)CrossRefGoogle Scholar
  3. 3.
    Bell, N.: Gestalt imagery: A critical factor in language comprehension. Annals of Dyslexia 41, 246–260 (1991)Google Scholar
  4. 4.
    Frost, R., Kampf, M.: Phonetic recoding of phonological ambiguous printed words. Journal of Experimental Psychology: Learning, Memory, and Cognition 19, 23–33 (1993)CrossRefGoogle Scholar
  5. 5.
    Larson, K., Mowatt, D.: Speech error correction: the story of the alternates list. International Journal of Speech Technology, 6(2), 183–194. Memory, and Cognition, 24, 573–592 (2003)Google Scholar
  6. 6.
    Lieberman, H., Faaborg, A., Daher, W., Espinosa, J.: How to wreck a nice beach you sing calm incense. In: Proceedings of Intelligent User Interface 2005, pp. 278–280 (2005)Google Scholar
  7. 7.
    MacKenzie, I.S., Soukoreff, R.W.: Text entry for mobile computing: Models and methods, theory and practice. Human-Computer Interaction 17, 147–198 (2002)CrossRefGoogle Scholar
  8. 8.
    Oviatt, S., VanGent, R.: Error resolution during multimodal human-computer interaction. In: Proceedings of International Conference on Spoken Language Process (ICSLP 1996), pp. 204–207 (1996)Google Scholar
  9. 9.
    Suhm, B., Myers, B., Waibel, A.: Multimodal error correction for speech user interfaces. ACM Transactions on Computer-Human Interaction 8(1), 60–98 (2001)CrossRefGoogle Scholar
  10. 10.
    Van Orden, G.C.: A ROWS is a ROSE: Spelling, sound, and reading. Memory and Cognition 15, 181–198 (1987)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2007

Authors and Affiliations

  • Shuang Xu
    • 1
  • Santosh Basapur
    • 1
  • Mark Ahlenius
    • 1
  • Deborah Matteo
    • 1
  1. 1.Human Interaction Research, Motorola Labs, Schaumburg, IL 60196USA

Personalised recommendations