Automatic Speech Recognition

Somogyi, Zoltán

doi:10.1007/978-3-030-60032-7_5

Zoltán Somogyi²

1545 Accesses

Abstract

This chapter is entirely dedicated to automatic speech recognition (ASR) which is one of the most complex fields of machine learning. Topics from signal processing and the properties of the acoustic signal to acoustic and language modeling, pronunciation modeling and performance analysis will all be explained in an easily comprehensible manner. After reading this chapter you will also understand how the open source software package in the AI-TOOLKIT, called VoiceBridge, works.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 69.99; Price excludes VAT (USA)

Softcover Book: USD 89.99; Price excludes VAT (USA)

Hardcover Book: USD 119.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Indurkhya, N., Damerau, F.J.: Handbook of Natural Language Processing, 2nd edn. Chapman & Hall/CRC, New York (2010)
Book Google Scholar
Everest, F.A.: The Master Handbook of Acoustics, 4th edn. McGraw-Hill, New York (2001)
Google Scholar
Gelfand, S.A.: Hearing: An Introduction to Psychological and Physiological Acoustics, 5th edn. Taylor and Francis, London (2010)
Google Scholar
Haeb-Umbach, R., Ney, H.: Linear Discriminant Analysis for Improved Large Vocabulary Continuous Speech Recognition, Proceedings ICASSP, pp. 13–16. IEEE, San Francisco, CA (1992)
Google Scholar
Saon, G., Padmanabhan, M., Gopinath, R., Chen, S.: Maximum likelihood discriminant feature spaces. IEEE International Conference on Acoustics Speech and Signal Processing. 2(2000), II-1129–II-1132 (2000)
Google Scholar
Povey, D., Kuo, H-K. J., Soltau, H.: Fast Speaker Adaptive Training for Speech Recognition (2008)
Google Scholar

Download references

Author information

Authors and Affiliations

Antwerp, Belgium
Zoltán Somogyi

Authors

Zoltán Somogyi
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Somogyi, Z. (2021). Automatic Speech Recognition. In: The Application of Artificial Intelligence. Springer, Cham. https://doi.org/10.1007/978-3-030-60032-7_5

Download citation

DOI: https://doi.org/10.1007/978-3-030-60032-7_5
Published: 12 March 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-60031-0
Online ISBN: 978-3-030-60032-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics