Abstract
This paper describes the underlying aspects of networked voice portal applications with a focus on a natural language voice user interface. The paper provides a detailed description of a representative commercially deployed personal voice portal solution and reviews results from speech recognition testing of one specific voice portal application—voice-activated dialing. It offers several suggestions on network-based voice portal design in general, and specifically, voice-activated dialing (VAD) applications.
Similar content being viewed by others
References
Balentine, Bruce and Morgan, David P. (1999). How to Build a Speech Recognition Application: A Style Guide for Telephony Dialogues. San Ramon, CA: New York, Enterprise Integration Group, Incorporated.
Boyce, Susan J. (2000). Natural spoken dialogue systems for telephony applications. Communications of the ACM, 43(9):36-43.
Gaver, William W., Smith, Randall B., and O'Shea, Tim. (1991). Effective sounds in complex systems: The Arkola simulation. In Human Factors in Computing Systems Conference Proceedings on Reaching Through Technology. LA, USA, pp. 85-90.
Glass, James, Polifroni, Joseph, Seneff, Stephanie, and Zue, Victor. (2000). Data collection and performance evaluation of spoken dialogue systems: The MIT experience. In Proceedings of the Sixth International Conference on Spoken Language Processing. October, Beijing, China.
Lamel, Lori, Rosset, Sophie, and Gauvain, Jean-Luc. (2000). Considerations in the design and evaluation of spoken language dialogue systems. In Proceedings of the Sixth International Conference on Spoken Language Processing. October, Beijing, China.
Lucas, Bruce. (2000). VoiceXML for web-based distributed conversational applications. Communications of the ACM, New York, 43(9):53-57.
Leplatre, Grégory and Brewster, Stephen. (2000). Designing nonspeech sounds to support navigation in mobile phone menus. In Proceedings of ICAD'2000. Atlanta.
Lucente, Mark. (2000). Conversational interfaces for E-commerce applications. Communications of the ACM, New York, 43(9):59-65.
Nass, Clifford and Gong, Li. (2000). Speech interfaces from an evolutionary perspective. Communications of the ACM, 43(9):36-43.
Oviatt, Sharon. (2000). Taming recognition errors with a multimodal interface. Communications of the ACM, 43(9):45-51.
Schmandt, C. (1994). Voice Communication with Computers: Conversational Systems. Van Nostrand Reinhold: New York.
Shattuck-Hufnagel, Stefanie. (1982). Three kinds of speech error evidence for the role of grammatical elements in processing. In L.K. Obler and L. Menn (Eds.), Exceptional Language and Linguistics. New York: Academic Press.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Goldman, E.L., Panttaja, E., Wojcikowski, A. et al. Voice Portals—Where Theory Meets Practice. International Journal of Speech Technology 4, 227–240 (2001). https://doi.org/10.1023/A:1011356507824
Issue Date:
DOI: https://doi.org/10.1023/A:1011356507824