Speech Enhancement System Based on Auditory System and Time-Delay Neural Network

  • Jae-Seung Choi
  • Seung-Jin Park
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4432)

Abstract

This paper proposes a speech enhancement system based on an auditory system for noise reduction in speech that is degraded by background noises. Accordingly, the proposed system adjusts frame by frame the coefficients for both lateral inhibition and amplitude component according to the detected sections for each input frame, then reduces the noise signal using a time-delay neural network. Based on measuring signal-to-noise ratios, experiments confirm that the proposed system is effective for speech that is degraded by various noises.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Chien, J.T., Lee, L.M., Wang, H.C.: Noisy speech recognition by using variance adapted hidden Markov models. IEE Electronics Letters 31(18), 1555–1556 (1995)CrossRefGoogle Scholar
  2. 2.
    Sreenivas, T.V., Kirnapure, P.: Codebook constrained wiener filtering for speech enhancement. IEEE Transactions on Speech and Audio Processing 4(5), 383–389 (1996)CrossRefGoogle Scholar
  3. 3.
    Boll, S.F.: Suppression of acoustic noise in speech using spectral subtraction. IEEE Transactions on Acoustics, Speech, Signal Processing 27(2), 113–120 (1979)CrossRefGoogle Scholar
  4. 4.
    Shamma, S.A.: Speech Processing in the Auditory System II: Lateral Inhibition and the Central Processing of Speech Evoked Activity in the Auditory Nerve. The Journal of the Acoustical Society of America 78(7), 1622–1632 (1985)CrossRefGoogle Scholar
  5. 5.
    Cheng, Y.M., O’Shaughnessy, D.: Speech enhancement based conceptually on auditory evidence. IEEE Trans. Signal Processing. 39(9), 1943–1954 (1991)CrossRefGoogle Scholar
  6. 6.
    Hansen, J.H.L., Nandkumar, S.: Robust Estimation of Speech in Noisy Backgrounds Based on Aspects of the Auditory Process. The Journal of the Acoustical Society of America 97(6), 3833–3849 (1995)CrossRefGoogle Scholar
  7. 7.
    Waibel, A., Hanazawa, T., Hinton, G., Shikano, K., Lang, K.J.: Phoneme Recognition using Time-delay Neural Networks. IEEE Transactions on Acoustics, Speech, and Signal Processing 37(3), 328–339 (1989)CrossRefGoogle Scholar
  8. 8.
    Wu, Y., Li, Y.: Robust speech/non-speech detection in adverse conditions using the fuzzy polarity correlation method. In: IEEE International Conference on Systems, Man, and Cybernetics, vol. 4, pp. 2935–2939 (2000)Google Scholar
  9. 9.
    Ephraim, Y., Malah, D.: Speech Enhancement Using a Minimum Mean-Square Error Log-Spectral Amplitude Estimator. IEEE Transactions on Acoustics, Speech, and Signal Processing 33(2), 443–445 (1985)CrossRefGoogle Scholar
  10. 10.
    Ephraim, Y., Malah, D.: Speech Enhancement Using a Minimum-Mean Square Error Short-Time Spectral Amplitude Estimator. IEEE Transactions on Acoustics, Speech, and Signal Processing 32(6), 1109–1121 (1984)CrossRefGoogle Scholar

Copyright information

© Springer Berlin Heidelberg 2007

Authors and Affiliations

  • Jae-Seung Choi
    • 1
  • Seung-Jin Park
    • 2
  1. 1.Department of Electronics Engineering, Silla University, San 1-1, Gwaebop-dong, Sasang-gu, BusanKorea
  2. 2.Department of Biomedical Engineering, Chonnam National University Hospital & Medical School, GwangjuKorea

Personalised recommendations