Automotive Speech Recognition
In the coming years speech recognition will be a commodity feature in car. Control of communication systems integrated in the car infotainment system including telephony, audio devices and destination inputs for navigation can be done via voice. Concerning speech recognition technology biggest the challenge is the recognition of large vocabularies in noisy environments using cost sensitive hardware platforms. Further intuitive dialog design coupled with natural sounding text to speech systems has to be provided to achieve a smooth man-machine interaction. This chapter describes commercial driven activities to develop and produce speech technology components for various automotive applications including the used speech recognition, speaker characterization, speech synthesis and dialog technology, the used platforms, and a methodology for the evaluation of recognition performance.
Unable to display preview. Download preview PDF.
- Andrassy, B., Hilger, F. and Beaugeant, C. (2001) Investigations on the combination of four algorithms to increase the noise robustness of a DSR front-end for real world car data. In Proceedings of Automatic Speech Recognition and Understanding Workshop.Google Scholar
- Automotive Electronic Council (2003) Stress Test Qualification for Integrated Circuits, AEC— Q100—Rev-F.2, 2003-07-18, Automotive Electronics Council, Component Technical Committee.Google Scholar
- Bauer, J.G. (1997) Enhanced control and estimation of parameters for a telephone based isolated digit recognizer. In Proceedings of IEEE International Conference of Acoustics, Speech, and Signal Processing (ICASSP), pp. 1531-1534.Google Scholar
- Beaugeant, C., Gilg, V., Schönle, M., Jax, P. and Martin, R. (2002) Computationally efficient speech enhancement using RLS and psycho-acoustic motivated algorithm. In Proceedings of World Multi-Conference on Systemics, Cybernetics and Informatics.Google Scholar
- Berton, A., Regel-Brietzmann, P., Block, H.U. and Schachtl, S. (2006) Integration of Scalable Dialog Systems in Cars. In Proceedings of ESSV, Freiberg.Google Scholar
- Block, H.-U., Caspari, R. and Schachtl, S. (2004) Callable Manuals - Access to Product Docu-mentation via Voice. “it” Information Technology, Vol. 46, Oldenburg Verlag, München, pp. 299-305.Google Scholar
- Höge, H. (2000) Speech database technology for commercially used recognizers-status and future issues. In Proceedings of Workshop XLDB on LREC 2000, Athens.Google Scholar
- Höge, H. and Andrassy, B. (2006) Human and machine recognition as a function of SNR. In LREC 2006 ELRA, Genoa, Italy, pp. 2060-2063.Google Scholar
- Ramabadran, T., Sorin, A., McLaughlin, M., Chanzan, D., Pearce, D. and Hoory, R. (2004) The ETSI extended distributed speech recognition (DSR) standards. In Proceedings of IEEE ICASSP, Vol. I, pp. 53-56.Google Scholar
- Scalart, P. and Filho, J., (1996) Speech enhancement based on a priori signal to noise estimation. In Proceedings of ICASSP, pp. 629-632.Google Scholar
- Setiawan, P., Beaugeant, C., Stan, S. and Fingscheidt, T. (2005a) Least-squares weighting rule formulations in the frequency domain. In Proceedings of Electronic Speech Signal Processing Conference (ESSP), September 2005.Google Scholar
- Setiawan, P., Suhadi S., Fingscheidt, T. and Stan, S. (2005b) Robust speech recognition for mobile devices in car noise. In Proceedings of European Conference on Speech Communica-tion and Technology (EUROSPEECH). SpeechDat (2000) http://www.speechdat.org.
- The Motor Industry Software Reliability Association (2004) MISRA-C: 2004—Guidelines for the use of the C language in critical systems, MIRA Ltd., Warwickshire.Google Scholar
- The SPICE User Group (2005) Automotive SPICE Process Assessment Model, Version 2.2, 2005-08-21 (see www.automotivespice.com)
- Wahlster, W. (2004) SmartWeb—Mobile applications of the semantic web. In P. Dadam and M. Reichert (eds.), Springer GI Jahrestagung 2004.Google Scholar