Abstract
Error correction with speech recognition products is extraordinarily difficult for users. Users spend much more time correcting errors than they spend dictating new text. In order to find ways to improve users' error correction experience, we examined the use of four different error correction mechanisms. The two error correction methods that users were most successful with were redictation and selection of a list of alternatives (“the alternates list”). Users rated the latter as the more satisfying method. User satisfaction with the alternates list was surprising as it was not a terribly accurate error correction method. On the Tablet PC we made several interface enhancements to facilitate the use of the alternates which included the use of (1) strong modes, (2) a push-to-talk model for microphone control, (3) a lighter weight alternates list which was easier to open and dismiss. Users performed transcription tasks with this new interface and we examined which error correction methods people preferred. Users of the new interface no longer compounded error upon error and were far more likely to use the alternates list than was the case for users of pre-existing interfaces. Users were very likely to switch modes from the alternates list to redictation when the alternates list did not contain the target word.
Similar content being viewed by others
References
Ainsworth,W.A. and Pratt, S.R. (1992). Feedback strategies for error correction in speech recognition systems. International Journal of Man-Machine Studies, 36(6):833-842.
Baber, C. and Hone, K.S. (1993). Modeling error recovery and repair in automatic speech recognition. International Journal of Man-Machine Studies, 39(3):495-515.
Halverson, C., Horn, D., Karat, C., and Karat, J. (1999). The beauty of errors: Patterns of error correction in desktop speech systems. Proceedings of INTERACT'99. Amsterdam: IOS Press, pp. 133-140.
Huang, X., Acero, A., Chelba, C., Deng, L., Duchene, D., Goodman, J., Hon, H., Jacoby, D., Jiang, L., Loynd, R., Mahajan, M., Mau, P., Meredith, S., Mughal, S., Neto, S., Plumpe, M., Steury, K., Venolia, G., Wang, K., and Wang, Y. (2001). MIPAD: A Multimodal Interactive Prototype. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, pp. 9-12.
Karat, C., Halverson, C., Horn, and Karat, J. (1999). Patterns of entry and correction in large vocabulary continuous speech recognition systems. CHI'99: Proceedings of the CHI 99 Conference on Human Factors in Computing Systems: The CHI is the Limit. New York: ACM Press, pp. 568-575.
Lai, J. and Vergo, J. (1997). MedSpeak. CHI'97: Proceedings of the CHI 97 Conference on Human Factors in Computing Systems. New York: ACM Press, pp. 431-438.
MacKenzie, I.S., Zhang, S.X., and Soukoreff, R.W. (1999). Text entry using soft keyboards. Behaviour & Information Technology, 18:235-244.
Mankoff, J. and Abowd, G.D. (1999). Error correction techniques for handwriting, speech, and other ambiguous or error prone systems. Georgia Tech GVU Center Technical Report, GIT-GVU-99-18.
Oviatt, S. (1999). Mutual disambiguation of recognition errors in a multimodel architecture. CHI'99: Proceedings of the CHI 99 Conference on Human Factors in Computing Systems: The CHI is the Limit. New York: ACM Press, pp. 576-583.
Oviatt, S. and Van Gent, R. (1996). Error resolution during multimodal human-computer interaction. Proceedings of the Fourth International Conference on Spoken Language Processing (ICSLP 96), pp. 204-207.
Suhm, B., Myers, B., and Waibel, A. (1999). Model-based and empirical evaluation of multimodal interactive error correction. CHI'99: Proceedings of the CHI 99 Conference on Human Factors in Computing Systems: The CHI is the Limit. New York: ACM Press, pp. 584-591.
Suhm, B., Myers, B., and Waibel, A. (2001). Multimodal error correction for speech user interfaces. ACM Transactions on Computer-Human Interaction (TOCHI), 8(1):60-98.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Larson, K., Mowatt, D. Speech Error Correction: The Story of the Alternates List. International Journal of Speech Technology 6, 183–194 (2003). https://doi.org/10.1023/A:1022342732234
Issue Date:
DOI: https://doi.org/10.1023/A:1022342732234