Environmental Adaptation with a Small Data Set of the Target Domain
In this work we present an approach for adapting speaker-independent recognizers to a new acoustic environment. The recognizers were trained on data recorded with a close-talking microphone and are to be evaluated on distant-talking microphone data. The adaptation set was recorded with the same type of microphone as the evaluation data. To preserve speaker independence, this set includes 33 speakers. The adaptation itself is performed using maximum a posteriori (MAP) adaptation and maximum likelihood linear regression (MLLR) in combination with the Baum-Welch algorithm. Furthermore, the close-talking training data were artificially reverberated to reduce the mismatch between training and test data. In this manner, the performance could be increased from 9.9 % word accuracy (WA) to 40.0 % WA under speaker-open conditions. If further speaker-dependent adaptation is applied, this rate increases to 54.9 % WA.
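The artificial reverberation step can be illustrated with a minimal sketch, under stated assumptions. The abstract does not say how the close-talking data were reverberated. A common approach, assumed here, is to convolve each clean signal with a room impulse response. The function names, the exponentially decaying noise model, and the `rt60_s` parameter are hypothetical and not taken from the paper.

```python
import math
import random


def synthetic_impulse_response(rt60_s, sample_rate, seed=0):
    """Crude stand-in for a room impulse response (assumption, not the
    paper's method): Gaussian noise with an exponential amplitude decay."""
    rng = random.Random(seed)
    length = max(1, int(rt60_s * sample_rate))
    # The envelope falls by 60 dB (amplitude factor 1e-3) after rt60_s seconds.
    decay = math.log(1000.0) / (rt60_s * sample_rate)
    h = [rng.gauss(0.0, 1.0) * math.exp(-decay * n) for n in range(length)]
    h[0] = 1.0  # direct sound path
    return h


def reverberate(signal, impulse_response):
    """Full linear convolution of a clean signal with an impulse response."""
    out = [0.0] * (len(signal) + len(impulse_response) - 1)
    for i, s in enumerate(signal):
        if s == 0.0:
            continue  # zero samples contribute nothing
        for j, h in enumerate(impulse_response):
            out[i + j] += s * h
    return out


if __name__ == "__main__":
    clean = [0.0, 1.0, 0.0, 0.0]  # toy "close-talking" signal
    rir = synthetic_impulse_response(rt60_s=0.5, sample_rate=16)
    print(len(reverberate(clean, rir)))
```

In practice a measured impulse response from the target room would replace the synthetic one, and an FFT-based convolution would be used for signals of realistic length.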
Keywords: Target Domain, Adaptation Data, Environmental Adaptation, Microphone Array, Emotional Speech