Automatic Speech Recognition

The Application of Artificial Intelligence

Abstract

This chapter is dedicated entirely to automatic speech recognition (ASR), one of the most complex fields of machine learning. It covers signal processing and the properties of the acoustic signal, acoustic and language modeling, pronunciation modeling, and performance analysis, all explained in an easily comprehensible manner. After reading this chapter you will also understand how VoiceBridge, the open-source software package in the AI-TOOLKIT, works.

Copyright information

© 2021 Springer Nature Switzerland AG

About this chapter

Cite this chapter

Somogyi, Z. (2021). Automatic Speech Recognition. In: The Application of Artificial Intelligence. Springer, Cham. https://doi.org/10.1007/978-3-030-60032-7_5

  • DOI: https://doi.org/10.1007/978-3-030-60032-7_5

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-60031-0

  • Online ISBN: 978-3-030-60032-7

  • eBook Packages: Computer Science, Computer Science (R0)
