Abstract
In this paper we address the problem of enhancing single channel speech signal corrupted with additive background noise. We present a new scheme which utilizes a different time frequency representation along with the psychoacoustic features of human ear and combines these features with the well-known noise estimation method of minimum tracking. Instead of Fourier transform, we use a perceptual wavelet packet decomposition of speech, and perform spectral tracking and filtering on the envelope of the analytic signal.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Martin, R.: Spectral Subtraction Based on Minimum Statistics. In: Proc. of EUSIPCO 1994, Edinburgh, U.K, September 13–16, 1994, pp. 1182–1185 (1994)
Martin, R.: Noise Power Spectral Density Estimation Based on Optimal Smoothing and Minimum Statistics. IEEE Tran. Speech & Audio Proc. 9(5), 504–512 (2001)
Pinter, I.: Perceptual Wavelet Representation of Speech Signals and its Application to Speech Enhancement. Computer Speech and Language 10(1), 1–22 (1996)
Derakhshan, N., Savoji, M.H.: Perceptual Speech Enhancement Using a Hilbert Transform Based Time-Frequency Representation of Speech. In: 11th International Conference of Speech and Computer, St. Petersburg, Russia, June 25-29 (2006)
Omidi, M., Savoji, M.H.: A New Hilbert Transformer Based on Parametric Wavelet Kernel Application to Analytic Signal Decomposition of Speech Subband. In: 5th International Symposium on Image and Signal Processing and Analysis, ISPA 2007, Istanbul, Turkey (2007)
Flanagan, J.L.: Phase vocoder. The Bell System Technical Journal 45, 1493–1509 (1966)
Reilly, A., Frazer, G., Boashash, B.: Analytic Signal Generation - Tips and Traps. IEEE Trans. Signal Processing 42(11), 3241–3245 (1994)
Berouti, M., Schwartz, R., Makhoul, J.: Enhancement of Speech Corrupted by Acoustic Noise. In: Proc. ICASSP 1979, pp. 208–211 (1979)
Hu, Y., Loizou, P.: Evaluation of Objective Measures for Speech Enhancement. In: Proc. of Interspeech 2006, Philadelphia, PA (September 2006)
Perceptual Evaluation of Speech Quality (PESQ), an Objective Method for End-to-End Speech Quality Assessment of Narrowband Telephone Networks and Speech Codecs. ITU-T Recommendation, p. 862 (February 2001)
Martin, R.: An Efficient Algorithm to Estimate the Instantaneous SNR of Speech Signal. In: EuroSpeech 1993, pp. 1093–1096 (1993)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Omidi, M., Derakhshan, N., Hassan Savoji, M. (2008). The Advantage of Implementing Martin’s Noise Reduction Algorithm in Critical Bands Using Wavelet Packet Decomposition and Hilbert Transform. In: Sarbazi-Azad, H., Parhami, B., Miremadi, SG., Hessabi, S. (eds) Advances in Computer Science and Engineering. CSICC 2008. Communications in Computer and Information Science, vol 6. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89985-3_100
Download citation
DOI: https://doi.org/10.1007/978-3-540-89985-3_100
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89984-6
Online ISBN: 978-3-540-89985-3
eBook Packages: Computer ScienceComputer Science (R0)