Abstract
This chapter is entirely dedicated to automatic speech recognition (ASR) which is one of the most complex fields of machine learning. Topics from signal processing and the properties of the acoustic signal to acoustic and language modeling, pronunciation modeling and performance analysis will all be explained in an easily comprehensible manner. After reading this chapter you will also understand how the open source software package in the AI-TOOLKIT, called VoiceBridge, works.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Indurkhya, N., Damerau, F.J.: Handbook of Natural Language Processing, 2nd edn. Chapman & Hall/CRC, New York (2010)
Everest, F.A.: The Master Handbook of Acoustics, 4th edn. McGraw-Hill, New York (2001)
Gelfand, S.A.: Hearing: An Introduction to Psychological and Physiological Acoustics, 5th edn. Taylor and Francis, London (2010)
Haeb-Umbach, R., Ney, H.: Linear Discriminant Analysis for Improved Large Vocabulary Continuous Speech Recognition, Proceedings ICASSP, pp. 13–16. IEEE, San Francisco, CA (1992)
Saon, G., Padmanabhan, M., Gopinath, R., Chen, S.: Maximum likelihood discriminant feature spaces. IEEE International Conference on Acoustics Speech and Signal Processing. 2(2000), II-1129–II-1132 (2000)
Povey, D., Kuo, H-K. J., Soltau, H.: Fast Speaker Adaptive Training for Speech Recognition (2008)
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Somogyi, Z. (2021). Automatic Speech Recognition. In: The Application of Artificial Intelligence. Springer, Cham. https://doi.org/10.1007/978-3-030-60032-7_5
Download citation
DOI: https://doi.org/10.1007/978-3-030-60032-7_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-60031-0
Online ISBN: 978-3-030-60032-7
eBook Packages: Computer ScienceComputer Science (R0)