Skip to main content

Introduction

  • Chapter
  • First Online:
Book cover Speech Enhancement in the STFT Domain

Abstract

This chapter introduces the problem of speech enhancement in the short-time Fourier transform (STFT) domain.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. N. Wiener, Extrapolation, Interpolation, and Smoothing of Stationary Time Series (Wiley, New York, 1949)

    MATH  Google Scholar 

  2. M.R. Schroeder, Apparatus for suppressing noise and distortion in communication signals, U.S. Patent No. 3,180,936, filed 1 Dec 1960, issued 27 Apr (1965)

    Google Scholar 

  3. M.R. Schroeder, Processing of communication signals to reduce effects of noise, U.S. Patent No. 3,403,224, filed 28 May 1965, issued 24 Sept (1968)

    Google Scholar 

  4. S.F. Boll, Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans. Acoust. Speech Signal Process. ASSP-27, 113–120 (1979)

    Article  Google Scholar 

  5. R.J. McAulay, M.L. Malpass, Speech enhancement using a soft-decision noise suppression filter. IEEE Trans. Acoust. Speech Signal Process. ASSP-28, 137–145 (1980)

    Article  Google Scholar 

  6. J.S. Lim, A.V. Oppenheim, Enhancement and bandwidth compression of noisy speech. Proc. IEEE 67, 1586–1604 (1979)

    Article  Google Scholar 

  7. M.M. Sondhi, C.E. Schmidt, L.R. Rabiner, Improving the quality of a noisy speech signal. Bell Syst. Techn. J. 60, 1847–1859 (1981)

    Google Scholar 

  8. Y. Ephraim, D. Malah, Speech enhancement using a minimum mean-square error short-time spectral amplitude estimator. IEEE Trans. Acoust. Speech Signal Process. ASSP-32, 1109–1121 (1984)

    Article  Google Scholar 

  9. Y. Ephraim, D. Malah, Speech enhancement using a minimum mean-square error log-spectral amplitude estimator. IEEE Trans. Acoust. Speech Signal Process. ASSP-33, 443–445 (1985)

    Article  Google Scholar 

  10. P.J. Wolfe, S.J. Godsill, Simple alternatives to the Ephraim and Malah suppression rule for speech enhancement, in Proceedings IEEE ICASSP, pp. 496–499 (2001)

    Google Scholar 

  11. L.E. Baum, T. Petrie, Statistical inference for probabilistic functons of finite state Markov chains. Ann. Math. Stat. 73, 1554–1563 (1966)

    Article  MathSciNet  Google Scholar 

  12. J.K. Baker, The dragon system–an overview. IEEE Trans. Acoust. Speech Signal Process. ASSP-23, 24–29 (1975)

    Article  Google Scholar 

  13. F. Jelinek, Continuous speech recognition by statistical methods. Proc. IEEE 64, 532–536 (1976)

    Article  Google Scholar 

  14. L.R. Rabiner, A tutorial on hidden Markov models and selected applications in speech recognition. Proc. IEEE 77, 257–286 (1989)

    Article  Google Scholar 

  15. Y. Ephraim, D. Malah, B.H. Juang, On the application of hidden Markov models for enhancing noisy speech. IEEE Trans. Acoust. Speech Signal Process. 37, 1846–1856 (1989)

    Article  Google Scholar 

  16. Y. Ephraim, A Bayesian estimation approach for speech enhancement using hidden Markov models. IEEE Trans. Signal Process. 40, 725–735 (1992)

    Article  Google Scholar 

  17. Y. Ephraim, Statstical-model-based speech enhancement systems. Proc. IEEE 80, 1526–1555 (1992)

    Article  Google Scholar 

  18. H. Sameti, H. Sheikhzadeh, L. Deng, R.L. Brennan, HMM-based strategies for enhancement of speech signals embedded in nonstationary noise. IEEE Trans. Speech Audio Process. 6, 445–455 (1998)

    Article  Google Scholar 

  19. L.R. Rabiner, B.H. Juang, Fundamentals of Speech Recognition (Prentice-Hall, Englewood Cliffs, 1993)

    Google Scholar 

  20. J. Benesty, Y. Huang, A single-channel noise reduction MVDR filter, in Proceedings IEEE ICASSP, pp. 273–276 (2011)

    Google Scholar 

  21. S. So, K.K. Paliwal, Modulation-domain Kalman filtering for single-channel speech enhancemen. Speech Commun. 53, 818–829 (2011)

    Article  Google Scholar 

  22. J. Chen, J. Benesty, Y. Huang, S. Doclo, New insights into the noise reduction Wiener filter. IEEE Trans. Audio Speech Lang. Process. 14, 1218–1234 (2006)

    Article  Google Scholar 

  23. J. Capon, High resolution frequency-wavenumber spectrum analysis. Proc. IEEE 57, 1408–1418 (1969)

    Article  Google Scholar 

  24. L.J. Griffiths, C.W. Jim, An alternative approach to linearly constrained adaptive beamforming. IEEE Trans. Antennas Propag. AP-30, 27–34 (1982)

    Article  Google Scholar 

  25. B.D. Van Veen, K.M. Buckley, Beamforming: a versatile approach to spatial filtering. IEEE Audio Speech Signal Process Mag. 5, 4–24 (1988)

    Google Scholar 

  26. D.B. Ward, R.C. Williamson, R.A. Kennedy, Broadband microphone arrays for speech acquisition. Acoust. Australia 26, 17–20 (1998)

    Google Scholar 

  27. O.L. Frost, An algorithm for linearly constrained adaptive array processing. Proc. IEEE 60, 926–935 (1972)

    Article  Google Scholar 

  28. J. Benesty, J. Chen, Y. Huang, J. Dmochowski, On microphone array beamforming from a MIMO acoustic signal processing perspective. IEEE Trans. Audio Speech Lang. Process. 15, 1053–1065 (2008)

    Article  Google Scholar 

  29. H. Cox, R. Zeskind, M. Owen, Adaptive beamforming. IEEE Trans. Acoust. Speech Signal Process. 35, 1365–1376 (1987)

    Article  Google Scholar 

  30. M. Brandstein, D.B. Ward (eds.), Microphone Arrays: Signal Processing Techniques and Applications (Springer, Berlin, 2001)

    Google Scholar 

  31. C.W. Jim, A comparison of two LMS constrained optimal array structures. Proc. IEEE 65, 1730–1731 (1977)

    Article  Google Scholar 

  32. W. Herbordt, W. Kellermann, Adaptive beamforming for audio signal acquisition. in Adaptive Signal Processing: Applications to Real-World Problems, ed. by J. Benesty, Y. Huang (Springer, Berlin, 2003), chap. 6, pp. 155–194

    Google Scholar 

  33. K.M. Buckley, Broad-band beamforming and the generalized sidelobe canceller. IEEE Trans. Acoust. Speech Signal Process ASSP-34, 1322–1323 (1986)

    Article  Google Scholar 

  34. S. Werner, J.A. Apolinario, M.L.R. de Campos, On the equivalence of RLS implementations of LCMV and GSC processors. IEEE Signal Process Lett. 10, 356–359 (2003)

    Article  Google Scholar 

  35. M. Miyoshi, Y. Kaneda, Inverse filtering of room acoustics. IEEE Trans. Acoust. Speech Signal Process 36, 145–152 (1988)

    Article  Google Scholar 

  36. S. Doclo, M. Moonen, GSVD-based optimal filtering for single and multimicrophone speech enhancement. IEEE Trans. Signal Process. 50, 2230–2244 (2002)

    Article  Google Scholar 

  37. S. Gannot, I. Cohen, Speech enhancement based on the general transfer function GSC and postfiltering. IEEE Trans. Speech Audio Process. 12, 561–571 (2004)

    Article  Google Scholar 

  38. J. Chen, J. Benesty, Y. Huang, A minimum distortion noise reduction algorithm with multiple microphones. IEEE Trans. Audio Speech Lang. Process. 16, 481–493 (2008)

    Article  Google Scholar 

  39. E.J. Diethorn, A subband noise-reduction method for enhancing speech in telephony and teleconferencing, in Proceedings IEEE WASPAA (1977)

    Google Scholar 

  40. J. Chen, Y. Huang, J. Benesty, Filtering techniques for noise reduction and speech enhancement, in Adaptive Signal Processing: Application to Real-World Problems, ed. by J. Benesty, Y. Huang (Springer, Berlin, 2003) pp. 129–154

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jacob Benesty .

Rights and permissions

Reprints and permissions

Copyright information

© 2012 The Author(s)

About this chapter

Cite this chapter

Benesty, J., Chen, J., Habets, E.A.P. (2012). Introduction. In: Speech Enhancement in the STFT Domain. SpringerBriefs in Electrical and Computer Engineering(). Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23250-3_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-23250-3_1

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-23249-7

  • Online ISBN: 978-3-642-23250-3

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics