Skip to main content
Log in

Speech Error Correction: The Story of the Alternates List

  • Published:
International Journal of Speech Technology Aims and scope Submit manuscript

Abstract

Error correction with speech recognition products is extraordinarily difficult for users. Users spend much more time correcting errors than they spend dictating new text. In order to find ways to improve users' error correction experience, we examined the use of four different error correction mechanisms. The two error correction methods that users were most successful with were redictation and selection of a list of alternatives (“the alternates list”). Users rated the latter as the more satisfying method. User satisfaction with the alternates list was surprising as it was not a terribly accurate error correction method. On the Tablet PC we made several interface enhancements to facilitate the use of the alternates which included the use of (1) strong modes, (2) a push-to-talk model for microphone control, (3) a lighter weight alternates list which was easier to open and dismiss. Users performed transcription tasks with this new interface and we examined which error correction methods people preferred. Users of the new interface no longer compounded error upon error and were far more likely to use the alternates list than was the case for users of pre-existing interfaces. Users were very likely to switch modes from the alternates list to redictation when the alternates list did not contain the target word.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Ainsworth,W.A. and Pratt, S.R. (1992). Feedback strategies for error correction in speech recognition systems. International Journal of Man-Machine Studies, 36(6):833-842.

    Google Scholar 

  • Baber, C. and Hone, K.S. (1993). Modeling error recovery and repair in automatic speech recognition. International Journal of Man-Machine Studies, 39(3):495-515.

    Google Scholar 

  • Halverson, C., Horn, D., Karat, C., and Karat, J. (1999). The beauty of errors: Patterns of error correction in desktop speech systems. Proceedings of INTERACT'99. Amsterdam: IOS Press, pp. 133-140.

    Google Scholar 

  • Huang, X., Acero, A., Chelba, C., Deng, L., Duchene, D., Goodman, J., Hon, H., Jacoby, D., Jiang, L., Loynd, R., Mahajan, M., Mau, P., Meredith, S., Mughal, S., Neto, S., Plumpe, M., Steury, K., Venolia, G., Wang, K., and Wang, Y. (2001). MIPAD: A Multimodal Interactive Prototype. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, pp. 9-12.

    Google Scholar 

  • Karat, C., Halverson, C., Horn, and Karat, J. (1999). Patterns of entry and correction in large vocabulary continuous speech recognition systems. CHI'99: Proceedings of the CHI 99 Conference on Human Factors in Computing Systems: The CHI is the Limit. New York: ACM Press, pp. 568-575.

    Google Scholar 

  • Lai, J. and Vergo, J. (1997). MedSpeak. CHI'97: Proceedings of the CHI 97 Conference on Human Factors in Computing Systems. New York: ACM Press, pp. 431-438.

    Google Scholar 

  • MacKenzie, I.S., Zhang, S.X., and Soukoreff, R.W. (1999). Text entry using soft keyboards. Behaviour & Information Technology, 18:235-244.

    Google Scholar 

  • Mankoff, J. and Abowd, G.D. (1999). Error correction techniques for handwriting, speech, and other ambiguous or error prone systems. Georgia Tech GVU Center Technical Report, GIT-GVU-99-18.

  • Oviatt, S. (1999). Mutual disambiguation of recognition errors in a multimodel architecture. CHI'99: Proceedings of the CHI 99 Conference on Human Factors in Computing Systems: The CHI is the Limit. New York: ACM Press, pp. 576-583.

    Google Scholar 

  • Oviatt, S. and Van Gent, R. (1996). Error resolution during multimodal human-computer interaction. Proceedings of the Fourth International Conference on Spoken Language Processing (ICSLP 96), pp. 204-207.

  • Suhm, B., Myers, B., and Waibel, A. (1999). Model-based and empirical evaluation of multimodal interactive error correction. CHI'99: Proceedings of the CHI 99 Conference on Human Factors in Computing Systems: The CHI is the Limit. New York: ACM Press, pp. 584-591.

    Google Scholar 

  • Suhm, B., Myers, B., and Waibel, A. (2001). Multimodal error correction for speech user interfaces. ACM Transactions on Computer-Human Interaction (TOCHI), 8(1):60-98.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Larson, K., Mowatt, D. Speech Error Correction: The Story of the Alternates List. International Journal of Speech Technology 6, 183–194 (2003). https://doi.org/10.1023/A:1022342732234

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1022342732234

Navigation