Abstract
As in-vehicle speech systems become prevalent, there is a need for data collected specifically in vehicle scenarios to develop and benchmark speech algorithms. This paper describes the collection and analysis of two corpora: (1) the UT-Dallas Vehicle Noise (UTD-VN) corpus and (2) the CU-Move in-car speech and noise corpus. The UTD-VN corpus addresses the variability of in-car noise environments. It compiles distinct noise scenarios within the car (engine idling, AC windows closed, etc.) and captures how these scenarios vary across different makes and models. Beyond noise, in-car speech systems must also address the emotional and task stress the driver experiences while driving. The CU-Move corpus focuses on collecting data that describe the variability of conversational speech in an in-car environment. A sample study using the UTD-VN corpus shows that these noise environments are unique across different vehicles, which indicates that a detailed analysis of variability across vehicle platforms is necessary for successful deployment of speech systems. In our opinion, these corpora are the first to describe environment variability together with conversational speech in an in-car environment.
Copyright information
© 2012 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
Krishnamurthy, N., Lubag, R., Hansen, J.H.L. (2012). In-Vehicle Speech and Noise Corpora. In: Hansen, J., Boyraz, P., Takeda, K., Abut, H. (eds) Digital Signal Processing for In-Vehicle Systems and Safety. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-9607-7_9
DOI: https://doi.org/10.1007/978-1-4419-9607-7_9
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4419-9606-0
Online ISBN: 978-1-4419-9607-7
eBook Packages: Engineering, Engineering (R0)