Spectral Subtraction Using Spectral Harmonics for Robust Speech Recognition in Car Environments
This paper addresses a novel noise-compensation scheme to solve the mismatch problem between training and testing condition for the automatic speech recognition (ASR) system, specifically in car environment. The conventional spectral subtraction schemes rely on the signal-to-noise ratio (SNR) such that attenuation is imposed on that part of the spectrum that appears to have low SNR, and accentuation is made on that part of high SNR. However, since these schemes are based on the postulation that the power spectrum of noise is in general at the lower level in magnitude than that of speech. Therefore, while such postulation is adequate for high SNR environment, it is grossly inadequate for low SNR scenarios such as that of car environment. This paper proposes an efficient spectral subtraction scheme focused specifically to low SNR noisy environment by representing harmonics distinctively in speech spectrum. Representative experiments confirm the superior performance of the proposed method over conventional methods. The experiments are conducted using car noise-corrupted utterances of Aurora2 corpus.
Unable to display preview. Download preview PDF.
- 2.Ealey, D., Kellher, H., Pearce, D.: Harmonic tunneling: track-ing non-stationary noises during speech, Eurospeech 2001(2001) 437–440Google Scholar
- 3.Berouti, M., Schwartz, R., Makhoul, J.: Enhancement of speech corrupted by additive noise, Proceedings of the IEEE Conference on Acoustics, Speech, and Signal Processing, April (1979) 208–211Google Scholar
- 6.Hess, W.: Pitch Determination of Speech Signals, Springer-Verlag Berlin Heidelberg New York Tokyo (1983)Google Scholar
- 7.Rabiner, L., Schafer, R.: Digital Processing of Speech Signals, Prentice-Hall (1978)Google Scholar