A Threshold Denoising Algorithm Based on Mathematical Morphology for Speech Enhancement

Li, Guangyan; Zheng, Caixia; Xu, Tingfa; Cao, Xiaolin; Xingpeng, Mao; Wang, Shuangwei

doi:10.1007/978-981-10-6571-2_215


Part of the book series: Lecture Notes in Electrical Engineering (LNEE, volume 463)

Included in the following conference series:

International Conference in Communications, Signal Processing, and Systems


Abstract

The presence of noise in speech signals can significantly degrade the performance of speech recognition systems. A threshold denoising method based on mathematical morphology is proposed to reduce background white noise. The method treats speech spectrograms as images and constructs binary images from a normalized 256-level gray-scale spectrogram image. The average value of the binary image (the ratio of the number of ‘1’ pixels to the total number of pixels) slows suddenly, and we use the point of this slowdown as the threshold value: spectrogram elements below the threshold are set to zero, the spectrogram is normalized, and finally the speech signal is reconstructed to achieve speech enhancement. The main advantage of the algorithm is its speed, which is highly desirable in real-time speech processing.
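The abstract gives only the outline of the method, so the following is a minimal sketch under stated assumptions rather than the authors' implementation. It assumes the spectrogram is already available as a non-negative magnitude array, that a binary image is formed for each gray level by marking pixels at or above that level as ‘1’, and that the "sudden slowing" can be located with a simple knee rule (the level at which the step-to-step drop in the ‘1’-pixel ratio shrinks the most). The function names `binary_ratio_curve`, `pick_threshold`, and `denoise_spectrogram` and the knee rule itself are hypothetical, and the final reconstruction of the waveform from the cleaned spectrogram is omitted.

```python
import numpy as np


def binary_ratio_curve(gray):
    """Ratio of '1' pixels in the binary image for each gray-level threshold 0..255.

    Assumption: a pixel is '1' when its gray level is >= the threshold.
    """
    hist = np.bincount(gray.ravel(), minlength=256)
    at_or_above = np.cumsum(hist[::-1])[::-1]  # count of pixels with gray >= t
    return at_or_above / gray.size


def pick_threshold(ratio):
    """Hypothetical knee rule: level where the drop in the ratio slows the most."""
    drop = -np.diff(ratio)           # drop[t] = ratio[t] - ratio[t + 1]
    slowdown = drop[:-1] - drop[1:]  # how much the drop shrinks after level t
    return int(np.argmax(slowdown)) + 1


def denoise_spectrogram(mag):
    """Zero spectrogram elements below the chosen gray level, then normalize.

    Assumes `mag` is a non-negative, non-constant 2-D magnitude spectrogram.
    """
    span = mag.max() - mag.min()
    # Normalized 256-level gray-scale image of the spectrogram.
    gray = np.round(255 * (mag - mag.min()) / span).astype(np.uint8)
    threshold = pick_threshold(binary_ratio_curve(gray))
    cleaned = np.where(gray >= threshold, mag, 0.0)
    return cleaned / cleaned.max(), threshold
```

In practice the ratio curve of a real spectrogram is noisy, so a smoothed curve or a different knee detector may be needed; the rule above is only meant to make the thresholding step concrete.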


Acknowledgements

This work was supported by the Natural Science Foundation of China (No. 61471111).

Author information

Authors and Affiliations

School of Physics, Northeast Normal University, Changchun, China
Guangyan Li & Shuangwei Wang
School of Computer Science and Information Technology, Northeast Normal University, Changchun, China
Caixia Zheng
Photoelectric Imaging and Information Engineering Institute, Beijing Institute of Technology, Beijing, China
Tingfa Xu
College of Automotive Engineering, Jilin University, Changchun, China
Xiaolin Cao
School of Electronics and Information Engineering, Harbin Institute of Technology, Harbin, China
Mao Xingpeng

Corresponding author

Correspondence to Caixia Zheng.

Editor information

Editors and Affiliations

Department of Electrical Engineering, University of Texas at Arlington, Arlington, Texas, USA
Qilian Liang
College of Electronic and Communication Engineering, Tianjin Normal University, Tianjin, China
Jiasong Mu
Harbin Institute of Technology, Harbin, Heilongjiang, China
Min Jia
College of Electronic and Communication Engineering, Tianjin Normal University, Tianjin, China
Wei Wang
College of Electronic and Communication Engineering, Tianjin Normal University, Tianjin, China
Xuhong Feng
College of Electronic and Communication Engineering, Tianjin Normal University, Tianjin, China
Baoju Zhang

Cite this paper

Li, G., Zheng, C., Xu, T., Cao, X., Xingpeng, M., Wang, S. (2019). A Threshold Denoising Algorithm Based on Mathematical Morphology for Speech Enhancement. In: Liang, Q., Mu, J., Jia, M., Wang, W., Feng, X., Zhang, B. (eds) Communications, Signal Processing, and Systems. CSPS 2017. Lecture Notes in Electrical Engineering, vol 463. Springer, Singapore. https://doi.org/10.1007/978-981-10-6571-2_215

DOI: https://doi.org/10.1007/978-981-10-6571-2_215
Published: 07 June 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-6570-5
Online ISBN: 978-981-10-6571-2
eBook Packages: Engineering, Engineering (R0)