Skip to main content

Speech Enhancement Based on Noise Type and Wavelet Thresholding the Multitaper Spectrum

  • Conference paper
  • First Online:
Advances in Machine Learning and Signal Processing

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 387))

Abstract

A speech enhancement system for noisy speech signals under different environments, namely airport, babble, car, exhibition, restaurant, station, street and train, is designed and implemented in this paper. Spectral subtraction is the most popular method for speech enhancement which suffers from perceptually annoying musical noise. Musical noise is reduced here by using low variance spectrum estimators and enhancement is achieved by wavelet thresholding the multitaper spectrum of the speech. The main feature of the proposed system is a novel switching mechanism to an optimal wavelet filter bank or wavelet packet filter bank matching to the critical bands of the human ear, based on the type of noise present in the input speech. The system is evaluated using noisy speech signals under different environment noises at different signal to noise ratios. The subjective and objective test results show that the proposed method improves the quality and intelligibility of speech signals.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Berouti M, Schwartz R, Makhoul J (1979) Enhancement of speech corrupted by acoustic noise. In: IEEE international conference on ICASSP’79 acoustics, speech, and signal processing, vol 4, pp 208–211. IEEE

    Google Scholar 

  2. Boll SF (1979) Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans Acoust Speech and Signal Process 27(2):113–120

    Article  Google Scholar 

  3. Cristán AC, Walden AT (2002) Multitaper power spectrum estimation and thresholding: wavelet packets versus wavelets. IEEE Tran Signal Process 50(12):2976–2986

    Article  MathSciNet  Google Scholar 

  4. Donoho DL (1995) De-noising by soft-thresholding. IEEE Trans Inf Theory 41(3):613–627

    Article  MathSciNet  MATH  Google Scholar 

  5. Donoho DL, Johnstone IM (1995) Adapting to unknown smoothness via wavelet shrinkage. J Am Stat Assoc 90(432):1200–1224

    Article  MathSciNet  MATH  Google Scholar 

  6. Donoho DL, Johnstone JM (1994) Ideal spatial adaptation by wavelet shrinkage. Biometrika 81(3):425–455

    Article  MathSciNet  MATH  Google Scholar 

  7. Ephraim Y, Malah D (1984) Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator. IEEE Trans Acoust Speech and Signal Process 32(6):1109–1121

    Article  Google Scholar 

  8. Gao HY (1997) Choice of thresholds for wavelet shrinkage estimate of the spectrum. J Time Ser Anal 18(3):231–251

    Article  MathSciNet  MATH  Google Scholar 

  9. Hu Y, Loizou PC (2003) A generalized subspace approach for enhancing speech corrupted by colored noise. IEEE Trans Speech Audio Process 11(4):334–341

    Google Scholar 

  10. Hu Y, Loizou PC (2004) Incorporating a psychoacoustical model in frequency domain speech enhancement. IEEE Signal Process Lett 11(2):270–273

    Article  Google Scholar 

  11. Hu Y, Loizou PC (2004) Speech enhancement based on wavelet thresholding the multitaper spectrum. IEEE Trans Speech Audio Process 12(1):59–67

    Google Scholar 

  12. Hu Y, Loizou PC (2007) Subjective comparison and evaluation of speech enhancement algorithms. Speech Commun 49(7):588–601

    Article  Google Scholar 

  13. Moulin P (1994) Wavelet thresholding techniques for power spectrum estimation. IEEE Trans Signal Process 42(11):3126–3136

    Article  Google Scholar 

  14. Percival D, Walden A (1993) Spectral analysis for physical applications: multitaper and conventional univariate techniques. Cambridge University Press

    Google Scholar 

  15. Riedel KS, Sidorenko A (1995) Minimum bias multiple taper spectral estimation. IEEE Trans Signal Process 43(1):188–195

    Article  Google Scholar 

  16. Shao Y, Chang CH (2007) A generalized time–frequency subtraction method for robust speech enhancement based on wavelet filter banks modeling of human auditory system. IEEE Trans Syst Man Cybern 37(4):877–889

    Google Scholar 

  17. Thomson DJ (1982) Spectrum estimation and harmonic analysis. Proc IEEE 70(9):1055–1096

    Article  Google Scholar 

  18. Virag N (1999) Single channel speech enhancement based on masking properties of the human auditory system. IEEE Trans Speech Audio Process 7(2):126–137

    Google Scholar 

  19. Walden AT, Percival DB, McCoy EJ (1998) Spectrum estimation by wavelet thresholding of multitaper estimators. IEEE Trans Signal Process 46(12):3153–3165

    Google Scholar 

  20. Yuan W, Xia B (2015) A speech enhancement approach based on noise classification. Appl Acoust 96:11–19

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to E.P. Jayakumar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Jayakumar, E., Sathidevi, P. (2016). Speech Enhancement Based on Noise Type and Wavelet Thresholding the Multitaper Spectrum. In: Soh, P., Woo, W., Sulaiman, H., Othman, M., Saat, M. (eds) Advances in Machine Learning and Signal Processing. Lecture Notes in Electrical Engineering, vol 387. Springer, Cham. https://doi.org/10.1007/978-3-319-32213-1_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-32213-1_17

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-32212-4

  • Online ISBN: 978-3-319-32213-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics