DSP Real-Time Implementation of DOST Algorithm Used for Speech Enhancement

Saoud, Safa; Bousselmi, Souha; Nasr, Mouhamed Ben; Cherif, Adnen

doi:10.1007/978-3-030-21009-0_7

Safa Saoud⁵,
Souha Bousselmi⁵,
Mouhamed Ben Nasr⁵ &
…
Adnen Cherif⁵

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 147))

Included in the following conference series:

International conference on the Sciences of Electronics, Technologies of Information and Telecommunications

745 Accesses

Abstract

The real-time implementation of speech enhancement is a vital tool destined to ameliorate the speech quality and intelligibility for auditors. In this paper, a speech denoising hardware implementation is developed in order to be used in recognition, synthesis, and coding applications. So, we propose a real-time implementation of speech enhancement approach for single channel in a noisy environment on the basis of Discrete Orthonormal Stockwell Transform (DOST) at the aim to ameliorate the speech quality and intelligibility. The speech enhancement system was tested on DSP TMS320C6416 processor and the obtained results have shown that it has met the real-time requirements in terms of memory consumption (Ko) and number of cycles (MCPS). For a subjective criterion, we have used the Mean Opinion Score (MOS) to evaluate the perceptual quality.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Chabane, B., Daoued, B.: On the use of Kalman filter for enhancing speech corrupted by colored noise. WSEAS Trans. Sig. Process. 4, 657–666 (2008)
Google Scholar
Sreenivas, T.V., Kirnapure, P.: Codebook constrained Wiener filtering for speech enhancement. IEEE Trans Speech Audio Process. 4, 383–389 (1996)
Article Google Scholar
Boll, S.: Suppression of acoustic noise in speech using spectral subtraction. IEEE Sig. Process. 27(2), 113–120 (1979)
Google Scholar
Hassen, F.S.: Performance of discrete wavelet transform (DWT) based speech denoising in impulsive and Gaussian noise. J. Eng Sustain. Dev. 10(2), 175–193 (2018)
Google Scholar
Nasr, M.B., Talbi, M., Cherif, A.: Arabic speech recognition by bionic wavelet transform and MFCC using a multi layer perceptron. In: Proceedings of the SETIT’12, pp. 803–808 (2012)
Google Scholar
Zhang, Y., Zhao, Y.: Real and imaginary modulation spectral subtraction for speech enhancement. J. Speech Commun. 55, 509–522 (2012)
Article Google Scholar
Jensen, J., Hansen, J.H.L.: Speech enhancement using a constrained iterative sinusoidal model. IEEE Trans. Speech Audio Process. 9, 731–740 (2001)
Article Google Scholar
Anderson, D.V., Clements, M.A.: Audio signal noise reduction using harmonic modeling. In: Proceedings of the IEEE International Conference on Acoustics. ICASSP (1999)
Google Scholar
Epharaim, Y.: A minimum mean square error approach for speech enhancement. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (1990)
Google Scholar
Dash, T.K., Solanki, S.S.: Comparative study of speech enhancement algorithms and their effect on speech intelligibility. In: 2nd International Conference on Communication and Electronics Systems (ICCES). IEEE (2017)
Google Scholar
Paliwal, K.K., Basu, A.: A speech enhancement method based on Kalman filtering. In: Proceedings of ICASSP’87, pp. 177–180, Dallas, TX, USA (1987)
Google Scholar
Parchami, M., Zhu, W.P., Champagne, B., Plourde, E.: Recent developments in speech enhancement in the short-time Fourier transform domain. IEEE Circ. Syst. Mag. 16(3), 45–77 (2016)
Article Google Scholar
Wang, Y., Orchard, J.: On the use of the Stockwell transform for image compression. In: SPIE Electronic Imaging Algorithms System VII, p. 7245 (2009)
Google Scholar
Wójcicki, K., Milacic, M., Stark, A., Lyons, J., Paliwal, K.: Exploiting conjugate symmetry of the short-time Fourier spectrum for speech enhancement. IEEE Sig. Process. Lett. 15, 461–464 (2008)
Article Google Scholar
Stark, A.P., Wójcicki, K.K., Lyons, J.G., Paliwal, K.K.: Noise driven short-time phase spectrum compensation procedure for speech enhancement. In: Inter Speech, pp. 549–552, September 2008
Google Scholar
Samui, S., Chakrabarti, I., Ghosh, S.K.: Improved single channel phase-aware speech enhancement technique for low signal to- noise ratio signal. IET Sig. Process. 10(6), 641–650 (2016)
Article Google Scholar
Stockwell, R.G.: A basis for efficient representation of the S-transform. Digital Sig. Process. 17(1), 371–393 (2007)
Article Google Scholar
Yan, Y., Zhu, H.: The generalization of discrete Stockwell transforms. In: EURASIP, pp. 1209–1213 (2011)
Google Scholar
Huang, H., Sun, F., Babyn, P., Zhou, Z., Wang, L.: Medical-image denoising and compressing using discrete orthonormal S transform. In: 2nd International Conference on Electrical, computer Engineering and Electronics (ICECEE 2015), vol. 291, pp. 291–296. ICECEE (2015)
Google Scholar
Texas instruments: TMS320 DSP/BIOS v5. 42 users guide. -01-20(2010)
Google Scholar
Math Works: Real-time workshop for use with SIMULINK, user’s guide. Version 6, June 2004
Google Scholar
Texas instruments: TMS320 DSP/BIOS. v5.42, User Guide, spru423I, Août (2012)
Google Scholar
Hu, Y., Loizou, F.C.: Perceptual evaluation of speech quality (PESQ), and objective method for end-to-end of speech quality assessment of narrowband telephone network and speech codecs. ITUT Recommendation, p. 862. ITU (2000)
Google Scholar
Hu, Y., Loizou, P.: NOIZEUS: a noisy speech corpus for evaluation of speech enhancement algorithms (2005)
Google Scholar
Issaoui, H., Bouzid, A., Elloouze, N.: Comparison between soft and hard thresholding on selected intrinsic mode selection. In: Proceedings of SETIT’12, pp. 712–715 (2012)
Google Scholar
Talbi, M., et al.: Speech enhancement with bionic wavelet transform and recurrent neural network. In: 5th International Conference: Sciences of Electronic, Technologies of Information and Telecommunications SETIT (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

ATSEE Laboratory, Sciences Faculty of Tunis, University of Tunis El-Manar, Tunis, Tunisia
Safa Saoud, Souha Bousselmi, Mouhamed Ben Nasr & Adnen Cherif

Authors

Safa Saoud
View author publications
You can also search for this author in PubMed Google Scholar
Souha Bousselmi
View author publications
You can also search for this author in PubMed Google Scholar
Mouhamed Ben Nasr
View author publications
You can also search for this author in PubMed Google Scholar
Adnen Cherif
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Safa Saoud .

Editor information

Editors and Affiliations

SETIT Lab, University of Sfax, Sfax, Tunisia
Med Salim Bouhlel
DIBRIS - University of Genoa, Genova, Genova, Italy
Stefano Rovetta

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Saoud, S., Bousselmi, S., Nasr, M.B., Cherif, A. (2020). DSP Real-Time Implementation of DOST Algorithm Used for Speech Enhancement. In: Bouhlel, M., Rovetta, S. (eds) Proceedings of the 8th International Conference on Sciences of Electronics, Technologies of Information and Telecommunications (SETIT’18), Vol.2. SETIT 2018. Smart Innovation, Systems and Technologies, vol 147. Springer, Cham. https://doi.org/10.1007/978-3-030-21009-0_7

Download citation

DOI: https://doi.org/10.1007/978-3-030-21009-0_7
Published: 02 August 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-21008-3
Online ISBN: 978-3-030-21009-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics