Science China Information Sciences

, Volume 54, Issue 12, pp 2481–2491

A flexible framework for HMM based noise robust speech recognition using generalized parametric space polynomial regression

Research Papers Special Focus

DOI: 10.1007/s11432-011-4490-6

Cite this article as:
Cheng, N., Liu, X. & Wang, L. Sci. China Inf. Sci. (2011) 54: 2481. doi:10.1007/s11432-011-4490-6


Handling variable, non-stationary ambient noise is a challenging task for automatic speech recognition (ASR) systems. To address this issue, multi-style, noise condition independent (CI) model training using speech data collected in diverse noise environments, or uncertainty decoding techniques can be used. An alternative approach is to explicitly approximate the continuous trajectory of Gaussian component mean and variance parameters against the varying noise level, for example, using variable parameter hidden Markov model (VPHMM). This paper investigates a more generalized form of variable parameter HMMs (GVP-HMM). In addition to Gaussian component means and variances, it can also provide a more compact trajectory modeling for tied linear transformations. An alternative noise condition dependent (CD) training algorithm is also proposed to handle the bias to training noise condition distribution. Consistent error rate gains were obtained over conventional VP-HMM mean and variance only trajectory modeling on a media vocabulary Mandarin Chinese in-car navigation command recognition task.


non-stationary noisegeneralized variable parameter HMMnoise robust speech recognition

Copyright information

© Science China Press and Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  1. 1.Shenzhen Institutes of Advanced TechnologyChinese Academy of Sciences/The Chinese University of Hong KongHong KongChina
  2. 2.Cambridge University Engineering DepartmentCambridgeUK