Introduction

Kunche, Prajna; Manikanthababu, N.

doi:10.1007/978-3-030-42746-7_1

Prajna Kunche⁴ &
N. Manikanthababu⁴

Part of the book series: SpringerBriefs in Speech Technology ((BRIEFSSPEECHTECH))

420 Accesses

Abstract

Speech enhancement is very essential for voice communication and speech recognition systems. It has wide range of applications like background noise suppression for mobile communications and voice coding. One of its best applications is to provide aids to hearing impaired people. Many speech enhancement algorithms have been existing in the literature. These algorithms are broadly classified as time and transform domain techniques. This chapter, being an introduction explores the various time and transform domain techniques such as artificial neural networks, discrete Fourier transform, discrete cosine transform, discrete wavelet transform, and Karhunen–Loeve transform based methods for speech enhancement.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 44.99; Price excludes VAT (USA)

Softcover Book: USD 59.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Boll, S. F. (1979). Suppression of acoustic noise in speech using spectral subtraction. IEEE Transactions on Acoustics, Speech, and Signal Processing, 27(2), 113–120. https://doi.org/10.1109/TASSP.1979.1163209
Article Google Scholar
Ephraim, Y., & Malah, D. (1984). Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator. IEEE Transactions on Acoustics, Speech, and Signal Processing, 32(6), 1109–1121. https://doi.org/10.1109/TASSP.1984.1164453
Article Google Scholar
Ephraim, Y., & Malah, D. (1985). Speech enhancement using a minimum mean-square error log-spectral amplitude estimator. IEEE Transactions on Acoustics, Speech, and Signal Processing, 33(2), 443–445. https://doi.org/10.1109/TASSP.1985.1164550
Article Google Scholar
Ephraim, Y., & Van Trees, H. L. (1995). A signal subspace approach for speech enhancement. IEEE Transactions on Speech and Audio Processing, 3(4), 251–266. https://doi.org/10.1109/89.397090
Article Google Scholar
Lim, J., Oppenheim, A., & Braida, L. (1978). Evaluation of an adaptive comb filtering method for enhancing speech degraded by white noise addition. IEEE Transactions on Acoustics, Speech, and Signal Processing, 26(4), 354–358. https://doi.org/10.1109/TASSP.1978.1163117
Article Google Scholar
Mahmmod, B. M., Ramli, A. R., Abdulhussian, S. H., Al-Haddad, S. A. R., & Jassim, W. A. (2017). Low-distortion MMSE speech enhancement estimator based on Laplacian prior. IEEE Access, 5, 9866–9881. https://doi.org/10.1109/ACCESS.2017.2699782
Article Google Scholar
Ram, R., & Mohanty, M. N. (2018). The use of deep learning in speech enhancement. In Proceedings of the First International Conference on Information Technology and Knowledge Management (Vol. 14, pp. 107–111). https://doi.org/10.15439/2017km40
Soon, I. Y., & Koh, S. N. (2000). Low distortion speech enhancement. IEE Proceedings: Vision, Image and Signal Processing, 147(3), 247–253. https://doi.org/10.1049/ip-vis:20000323
Article Google Scholar
Soon, I. Y., Koh, S. N., & Yeo, C. K. (1998). Noisy speech enhancement using discrete cosine transform. Speech Communication, 24(3), 249–257. https://doi.org/10.1016/S0167-6393(98)00019-3
Article Google Scholar
Wang, Y., & Wang, D. (2015). A deep neural network for time-domain signal reconstruction. In ICASSP, IEEE international conference on acoustics, speech and signal processing - proceedings, 2015-August (pp. 4390–4394). https://doi.org/10.1109/ICASSP.2015.7178800
Yu, H., Ouyang, Z., Zhu, W. P., Champagne, B., & Ji, Y. (2019). A deep neural network based Kalman filter for time domain speech enhancement. In Proceedings - IEEE International Symposium on Circuits and Systems, 2019-May. https://doi.org/10.1109/ISCAS.2019.8702161
Zhang, W., Benesty, J., & Chen, J. (2016). Single-channel noise reduction via semi-orthogonal transformations and reduced-rank filtering. Speech Communication, 78, 73–83. https://doi.org/10.1016/j.specom.2015.12.007
Article Google Scholar
Zou, X., & Zhang, X. (2007). Speech enhancement using an MMSE short time DCT coefficients estimator with supergaussian speech modeling. Journal of Electronics, 24(3), 332–337. https://doi.org/10.1007/s11767-005-0174-y
Article Google Scholar

Download references

Author information

Authors and Affiliations

Indira Gandhi Centre for Atomic Research, Kalpakkam, Tamil Nadu, India
Prajna Kunche & N. Manikanthababu

Authors

Prajna Kunche
View author publications
You can also search for this author in PubMed Google Scholar
N. Manikanthababu
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Kunche, P., Manikanthababu, N. (2020). Introduction. In: Fractional Fourier Transform Techniques for Speech Enhancement. SpringerBriefs in Speech Technology. Springer, Cham. https://doi.org/10.1007/978-3-030-42746-7_1

Download citation

DOI: https://doi.org/10.1007/978-3-030-42746-7_1
Published: 17 April 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-42745-0
Online ISBN: 978-3-030-42746-7
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics