Towards a Dynamic Adjustment of the Language Weight
Most speech recognition systems use a language weight to reduce the mismatch between the language model and the acoustic models. Usually a constant value of the language weight is chosen for the whole test set. In this paper, we evaluate the possibility to adapt the language weight dynamically to the state of the dialogue or to the current utterance. Our experiments show, that the gain in performance, that can be achieved with a dynamic adjustment of the language weight on our data is very limited. This result is independent of the information source that is used for the adaption of the language weight.
KeywordsLanguage Model Acoustic Model Dynamic Adjustment Speech Recognition System Spontaneous Speech
Unable to display preview. Download preview PDF.
- 1.V. Zeissler: Verbesserte Linguistische Gewichtung in einem Spracherkenner. Master thesis (in German), Chair for Pattern Recognition, University of Erlangen-Nuremberg, Erlangen (2001)Google Scholar
- 3.Ramesh R. Sarukkai and Dana H. Ballard: Word Set Probability Boosting for Improved Spontaneous Dialogue Recognition: The AB/TAB Algorithm. University of Rochester, Rochester (1995)Google Scholar
- 4.X. Huang and M. Belin and F. Alleva and M. Hwang: Unified Stochastic Engine (USE) for Speech Recognition. Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing, Minneapolis (1993) 636–639Google Scholar
- 5.F. Gallwitz: Integrated Stochastic Models for Spontaneous Speech Recognition. Dissertation, University of Erlangen-Nuremberg, Erlangen (to appear)Google Scholar
- 6.W. Eckert and F. Gallwitz and H. Niemann: Combining Stochastic and Linguistic Language Models for Recognition of Spontaneous Speech. Proc. IEEE Int. Conf. on Acoustics, Speech and Signal Processing, Atlanta (1996) 423–426Google Scholar
- 8.G. Stemmer and E. Nöth and H. Niemann: The Utility of Semantic-Pragmatic Information and Dialogue-State for Speech Recognition in Spoken Dialogue Systems. Proc. of the Third Workshop on Text, Speech, Dialogue, Brno (2000) 439–444Google Scholar
- 9.V. Fischer and S.J. Kunzmann: Acoustic Language Model Classes for a Large Vocabulary Continuous Speech Recognizer. Proc. Int. Conf. on Spoken Language Processing, Bejing (2000) 810–813Google Scholar