Abstract
One of the first speech recognition systems was built in 1952 at Bell Laboratories [1]. The system could recognize sequences of digits spoken with pauses between them.
References
Davis, K., Biddulph, R., Balashek, S. (1952). Automatic recognition of spoken digits. J. Acoust. Soc. Am., 637–642.
Flanagan, J. L., Levinson, S. E., Rabiner, L. R., Rosenberg, A. E. (1980). Techniques for expanding the capabilities of practical speech recognizers. In: Trends in Speech Recognition, Prentice Hall, Englewood Cliffs, NJ.
Price, P., Fisher, W. M., Bernstein, J., Pallett, D. S. (1988). The DARPA 1000 word Resource Management database for continuous speech recognition. In: IEEE Conf. on Acoustics, Speech and Signal Processing.
Hirschman, L. (1992). Multi-site data collection for a spoken language corpus. In: Proc. 5th DARPA Speech and Natural Language Workshop. Defense Advanced Research Projects Agency.
Walker, M., Rudnicky, A., Aberdeen, J., Bratt, E. O., Garofolo, J., Hastie, H., Le, A., Pellom, B., Potamianos, A., Passonneau, R., Prasad, R., Roukos, S., Sanders, G., Seneff, S., Stallard, D. (2002). DARPA communicator: Cross system results for the 2001 evaluation. In: ICSLP 2002.
Barnard, E., Halberstadt, A., Kotelly, C., Phillips, M. (1999). A consistent approach to designing spoken-dialogue systems. In: IEEE Workshop. Keystone, CO.
Zue, V. (1997). Conversational interfaces: Advances and challenges. In: Eurospeech 97. Rhodes, Greece.
Pieraccini, R., Huerta, J. (2005). Where do we go from here? Research and commercial spoken dialogue systems. SIGdial, 1–10.
Gorin, A. L., Riccardi, G., Wright, J. H. (1997). How may I help you? Speech Commun., 113–127.
Chu-Carroll, J., Carpenter, B. (1999). Vector-based natural language call routing. Comput. Linguist., 361–388.
Oviatt, S. L. (1995). Predicting spoken disfluencies during human–computer interaction. Comp. Speech Lang., 19–35.
Voice Extensible Markup Language (VoiceXML) 2.1. (2005). W3C Candidate Recommendation 13 June 2005.
Media Resource Control Protocol (MRCP) Introduction.
Standard ECMA-262 (1999). ECMAScript Language Specification, 3rd Edition.
Speech Recognition Grammar Specification Version 1.0. (2004). W3C Recommendation.
Voice Browser Call Control: CCXML Version 1.0. (2005). W3C Working Draft.
Speech Synthesis Markup Language (SSML), Version 1.0. (2004).
State Chart XML (SCXML) State Machine Notation for Control Abstraction. (2006). W3C Working Draft.
Harel, D., Politi, M. (1998). Modeling Reactive Systems with Statecharts: The STATEMATE Approach. McGraw-Hill, New York, NY.
O’Reilly, T. (2004). What is Web 2.0: Design Patterns and Business Models for the Next Generation of Software.
Acomb, K., Bloom, J., Dayanidhi, K., Hunter, P., Krogh, P., Levin, E., Pieraccini, R. (2007). Technical support dialog systems, issues, problems, and solutions. In: Bridging the Gap, Academic and Industrial Research in Dialog Technology. Rochester, NY.
Copyright information
© 2010 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
Pieraccini, R. (2010). The Industry of Spoken-Dialog Systems and the Third Generation of Interactive Applications. In: Chen, F., Jokinen, K. (eds) Speech Technology. Springer, New York, NY. https://doi.org/10.1007/978-0-387-73819-2_4
DOI: https://doi.org/10.1007/978-0-387-73819-2_4
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-73818-5
Online ISBN: 978-0-387-73819-2
eBook Packages: Engineering, Engineering (R0)