Skip to main content

The Industry of Spoken-Dialog Systems and the Third Generation of Interactive Applications

  • Chapter
  • First Online:
Speech Technology
  • 1372 Accesses

Abstract

One of the first speech recognition systems was built in 1952 at Bell Laboratories [1]. The system could recognize sequences of digits spoken with pauses between them.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Davis, K., Biddulph, R., Balashek, S. (1952). Automatic recognition of spoken digits. Soc. Am., 637–642.

    Google Scholar 

  2. Flanagan, J. L., Levinson, S. E., Rabiner, L. R., Rosenberg, A. E. (1980). Techniques for expanding the capabilities of practical speech recognizers. In: Trends in Speech Recognition, Prentice Hall, Englewood Cliffs, NJ.

    Google Scholar 

  3. Price, P., Fisher, W. M., Bernstein, J., Pallet, D. S. (1988). The DARPA 1000 word Resource Management database for continuous speech recognition. In: IEEE Conf. on Acoustics Speech and Signal Processing.

    Book  Google Scholar 

  4. Hirschmann, L. (1992). Multi-site data collection for a spoken language corpus. In: Proc. 5th DARPA Speech and Natural Language Workshop. Defense Advanced Research Projects Agency.

    Google Scholar 

  5. Walker, M., Rudnicky, A., Aberdeen, J., Bratt, E. O., Carofolo, J., Hastie, H., Le Audrey Pellom, B., Potamianos, A., Passonneau, R., Prasad, R., Roukos, S., Sanders, G., Seneff, S., Stallard, D. (2002). DARPA communicator: Cross system results for the 2001 evaluation. In: ICSLP 2002.

    Google Scholar 

  6. Barnard, E., Halberstadt, A., Kotelly, C., Phillips, M. (1999). A consistent approach to designing spoken-dialogue systems. In: IEEE Workshop. Keystone, CO.

    Google Scholar 

  7. Zue, V. (1997). Conversational interfaces: Advances and challenges. In: Eurospeech 97. Rhodes, Greece.

    Google Scholar 

  8. Pieraccini, R., Huerta, J. (2005).Where do we go from here? Research and commercial spoken dialogue systems. SIGdial, 1–10.

    Google Scholar 

  9. Gorin, A. L„ Riccardi, G., Wright, J. H. (1997). How may i help you? Speech Commun., 113–127.

    Google Scholar 

  10. Chu-Carroll, J., Carpenter, B. (1999). Vector-based natural language call routing. Comput. Linguist., 361–388.

    Google Scholar 

  11. Oviatt, S. L. (1995). Predicting spoken disfluencies during human–computer interaction. Comp. Speech Lang., 19–35.

    Google Scholar 

  12. Voice Extensible Markup Language (VoiceXML) 2.1. (2005). W3C Candidate Recommendation 13 June 2005.

    Google Scholar 

  13. Media Resource Control Protocol (MRCP) Introduction.

    Google Scholar 

  14. Standard ECMA-262 (1999). ECMAScript Language Specification, 3rd Edition.

    Google Scholar 

  15. Speech Recognition Grammar Specification Version 1.0. (2004). W3C Recommendation.

    Google Scholar 

  16. Voice Browser Call Control: CCXML Version 1.0. (2005). W3C Working Draft.

    Google Scholar 

  17. Speech Synthesis Markup Language (SSML), Version 1.0. (2004).

    Google Scholar 

  18. State Chart XML (SCXML) State Machine Notation for Control Abstraction. (2006). W3C Working Draft.

    Google Scholar 

  19. Harel, D., Politi, M. (1998). Modeling Reactive Systems with Statecharts: The STATEMATE Approach. McGraw-Hill, New York, NY.

    Google Scholar 

  20. O’Reilly, T. (2004). What is Web 2.0. In: Design Patterns and Business Models for the Next Generation of Software. W3C Recommendation.

    Google Scholar 

  21. Acomb, K., Bloom, J., Dayanidhi, K., Hunter, P., Krogh, P., Levin, E., Pieraccini, R. (2007). Technical support dialog systems, issues, problems, and solutions. In: Bridging the Gap, Academic and Industrial Research in Dialog Technology. Rochester, NY.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Roberto Pieraccini .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer Science+Business Media, LLC

About this chapter

Cite this chapter

Pieraccini, R. (2010). The Industry of Spoken-Dialog Systems and the Third Generation of Interactive Applications. In: Chen, F., Jokinen, K. (eds) Speech Technology. Springer, New York, NY. https://doi.org/10.1007/978-0-387-73819-2_4

Download citation

  • DOI: https://doi.org/10.1007/978-0-387-73819-2_4

  • Published:

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-0-387-73818-5

  • Online ISBN: 978-0-387-73819-2

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics