Automotive Speech Recognition

Höge, Harald; Hohenner, Sascha; Kämmerer, Bernhard; Kunstmann, Niels; Schachtl, Stefanie; Schönle, Martin; Setiawan, Panji

doi:10.1007/978-1-84800-143-5_16

Harald Höge³,
Sascha Hohenner³,
Bernhard Kämmerer³,
Niels Kunstmann³,
Stefanie Schachtl³,
Martin Schönle³ &
…
Panji Setiawan⁴

Part of the book series: Advances in Pattern Recognition ((ACVPR))

In the coming years speech recognition will be a commodity feature in car. Control of communication systems integrated in the car infotainment system including telephony, audio devices and destination inputs for navigation can be done via voice. Concerning speech recognition technology biggest the challenge is the recognition of large vocabularies in noisy environments using cost sensitive hardware platforms. Further intuitive dialog design coupled with natural sounding text to speech systems has to be provided to achieve a smooth man-machine interaction. This chapter describes commercial driven activities to develop and produce speech technology components for various automotive applications including the used speech recognition, speaker characterization, speech synthesis and dialog technology, the used platforms, and a methodology for the evaluation of recognition performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Andrassy, B., Hilger, F. and Beaugeant, C. (2001) Investigations on the combination of four algorithms to increase the noise robustness of a DSR front-end for real world car data. In Proceedings of Automatic Speech Recognition and Understanding Workshop.
Google Scholar
Automotive Electronic Council (2003) Stress Test Qualification for Integrated Circuits, AEC— Q100—Rev-F.2, 2003-07-18, Automotive Electronics Council, Component Technical Committee.
Google Scholar
Bauer, J.G. (1997) Enhanced control and estimation of parameters for a telephone based isolated digit recognizer. In Proceedings of IEEE International Conference of Acoustics, Speech, and Signal Processing (ICASSP), pp. 1531-1534.
Google Scholar
Beaugeant, C., Gilg, V., Schönle, M., Jax, P. and Martin, R. (2002) Computationally efficient speech enhancement using RLS and psycho-acoustic motivated algorithm. In Proceedings of World Multi-Conference on Systemics, Cybernetics and Informatics.
Google Scholar
Berton, A., Regel-Brietzmann, P., Block, H.U. and Schachtl, S. (2006) Integration of Scalable Dialog Systems in Cars. In Proceedings of ESSV, Freiberg.
Google Scholar
Block, H.-U., Caspari, R. and Schachtl, S. (2004) Callable Manuals - Access to Product Docu-mentation via Voice. “it” Information Technology, Vol. 46, Oldenburg Verlag, München, pp. 299-305.
Google Scholar
Ephraim, Y. and Malah, D. (1984) Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator. IEEE Transaction on Acoustics, Speech and Signal Processing, Vol. 32, no. 6, pp. 1109-1121.
Article Google Scholar
Höge, H. (2000) Speech database technology for commercially used recognizers-status and future issues. In Proceedings of Workshop XLDB on LREC 2000, Athens.
Google Scholar
Höge, H. and Andrassy, B. (2006) Human and machine recognition as a function of SNR. In LREC 2006 ELRA, Genoa, Italy, pp. 2060-2063.
Google Scholar
Junqua, J.C. (1993) The Lombard reflex and its role on human listeners and automatic speech recognizers. Journal Of the Acoustical Society of America, Vol. 93, pp. 510-524.
Article Google Scholar
Ramabadran, T., Sorin, A., McLaughlin, M., Chanzan, D., Pearce, D. and Hoory, R. (2004) The ETSI extended distributed speech recognition (DSR) standards. In Proceedings of IEEE ICASSP, Vol. I, pp. 53-56.
Google Scholar
Scalart, P. and Filho, J., (1996) Speech enhancement based on a priori signal to noise estimation. In Proceedings of ICASSP, pp. 629-632.
Google Scholar
Setiawan, P., Beaugeant, C., Stan, S. and Fingscheidt, T. (2005a) Least-squares weighting rule formulations in the frequency domain. In Proceedings of Electronic Speech Signal Processing Conference (ESSP), September 2005.
Google Scholar
Setiawan, P., Suhadi S., Fingscheidt, T. and Stan, S. (2005b) Robust speech recognition for mobile devices in car noise. In Proceedings of European Conference on Speech Communica-tion and Technology (EUROSPEECH). SpeechDat (2000) http://www.speechdat.org.
The Motor Industry Software Reliability Association (2004) MISRA-C: 2004—Guidelines for the use of the C language in critical systems, MIRA Ltd., Warwickshire.
Google Scholar
The SPICE User Group (2005) Automotive SPICE Process Assessment Model, Version 2.2, 2005-08-21 (see www.automotivespice.com)
Varga, I., Aalburg, S., Andrassy, B., Astrov, S., Bauer, J.G., Beaugeant, Ch., Geissler, Ch. and Höge, H. (2002) ASR in Mobile Phones—An Industrial Approach. IEEE Trans. Speech and Audio Processing, Vol. 10, no. 8, pp. 562-569.
Article Google Scholar
Wahlster, W. (2004) SmartWeb—Mobile applications of the semantic web. In P. Dadam and M. Reichert (eds.), Springer GI Jahrestagung 2004.
Google Scholar

Download references

Author information

Authors and Affiliations

Corporate Technology, Siemens AG, 81739, München, Germany
Harald Höge, Sascha Hohenner, Bernhard Kämmerer, Niels Kunstmann, Stefanie Schachtl & Martin Schönle
Universität der Bundeswehr München, München, Germany
Panji Setiawan

Authors

Harald Höge
View author publications
You can also search for this author in PubMed Google Scholar
Sascha Hohenner
View author publications
You can also search for this author in PubMed Google Scholar
Bernhard Kämmerer
View author publications
You can also search for this author in PubMed Google Scholar
Niels Kunstmann
View author publications
You can also search for this author in PubMed Google Scholar
Stefanie Schachtl
View author publications
You can also search for this author in PubMed Google Scholar
Martin Schönle
View author publications
You can also search for this author in PubMed Google Scholar
Panji Setiawan
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Höge, H. et al. (2008). Automotive Speech Recognition. In: Automatic Speech Recognition on Mobile Devices and over Communication Networks. Advances in Pattern Recognition. Springer, London. https://doi.org/10.1007/978-1-84800-143-5_16

Download citation

DOI: https://doi.org/10.1007/978-1-84800-143-5_16
Publisher Name: Springer, London
Print ISBN: 978-1-84800-142-8
Online ISBN: 978-1-84800-143-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics