Abstract
Speech enhancement is very essential for voice communication and speech recognition systems. It has wide range of applications like background noise suppression for mobile communications and voice coding. One of its best applications is to provide aids to hearing impaired people. Many speech enhancement algorithms have been existing in the literature. These algorithms are broadly classified as time and transform domain techniques. This chapter, being an introduction explores the various time and transform domain techniques such as artificial neural networks, discrete Fourier transform, discrete cosine transform, discrete wavelet transform, and Karhunen–Loeve transform based methods for speech enhancement.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Boll, S. F. (1979). Suppression of acoustic noise in speech using spectral subtraction. IEEE Transactions on Acoustics, Speech, and Signal Processing, 27(2), 113–120. https://doi.org/10.1109/TASSP.1979.1163209
Ephraim, Y., & Malah, D. (1984). Speech enhancement using a minimum-mean square error short-time spectral amplitude estimator. IEEE Transactions on Acoustics, Speech, and Signal Processing, 32(6), 1109–1121. https://doi.org/10.1109/TASSP.1984.1164453
Ephraim, Y., & Malah, D. (1985). Speech enhancement using a minimum mean-square error log-spectral amplitude estimator. IEEE Transactions on Acoustics, Speech, and Signal Processing, 33(2), 443–445. https://doi.org/10.1109/TASSP.1985.1164550
Ephraim, Y., & Van Trees, H. L. (1995). A signal subspace approach for speech enhancement. IEEE Transactions on Speech and Audio Processing, 3(4), 251–266. https://doi.org/10.1109/89.397090
Lim, J., Oppenheim, A., & Braida, L. (1978). Evaluation of an adaptive comb filtering method for enhancing speech degraded by white noise addition. IEEE Transactions on Acoustics, Speech, and Signal Processing, 26(4), 354–358. https://doi.org/10.1109/TASSP.1978.1163117
Mahmmod, B. M., Ramli, A. R., Abdulhussian, S. H., Al-Haddad, S. A. R., & Jassim, W. A. (2017). Low-distortion MMSE speech enhancement estimator based on Laplacian prior. IEEE Access, 5, 9866–9881. https://doi.org/10.1109/ACCESS.2017.2699782
Ram, R., & Mohanty, M. N. (2018). The use of deep learning in speech enhancement. In Proceedings of the First International Conference on Information Technology and Knowledge Management (Vol. 14, pp. 107–111). https://doi.org/10.15439/2017km40
Soon, I. Y., & Koh, S. N. (2000). Low distortion speech enhancement. IEE Proceedings: Vision, Image and Signal Processing, 147(3), 247–253. https://doi.org/10.1049/ip-vis:20000323
Soon, I. Y., Koh, S. N., & Yeo, C. K. (1998). Noisy speech enhancement using discrete cosine transform. Speech Communication, 24(3), 249–257. https://doi.org/10.1016/S0167-6393(98)00019-3
Wang, Y., & Wang, D. (2015). A deep neural network for time-domain signal reconstruction. In ICASSP, IEEE international conference on acoustics, speech and signal processing - proceedings, 2015-August (pp. 4390–4394). https://doi.org/10.1109/ICASSP.2015.7178800
Yu, H., Ouyang, Z., Zhu, W. P., Champagne, B., & Ji, Y. (2019). A deep neural network based Kalman filter for time domain speech enhancement. In Proceedings - IEEE International Symposium on Circuits and Systems, 2019-May. https://doi.org/10.1109/ISCAS.2019.8702161
Zhang, W., Benesty, J., & Chen, J. (2016). Single-channel noise reduction via semi-orthogonal transformations and reduced-rank filtering. Speech Communication, 78, 73–83. https://doi.org/10.1016/j.specom.2015.12.007
Zou, X., & Zhang, X. (2007). Speech enhancement using an MMSE short time DCT coefficients estimator with supergaussian speech modeling. Journal of Electronics, 24(3), 332–337. https://doi.org/10.1007/s11767-005-0174-y
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2020 The Author(s), under exclusive licence to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Kunche, P., Manikanthababu, N. (2020). Introduction. In: Fractional Fourier Transform Techniques for Speech Enhancement. SpringerBriefs in Speech Technology. Springer, Cham. https://doi.org/10.1007/978-3-030-42746-7_1
Download citation
DOI: https://doi.org/10.1007/978-3-030-42746-7_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-42745-0
Online ISBN: 978-3-030-42746-7
eBook Packages: EngineeringEngineering (R0)