Abstract
In this paper, we propose a field-programmable gate array (FPGA) based design and implementation of a novel speech enhancement system that works with both single-microphone and dual-microphone devices and provides immunity to background noise. We propose a microphone activity detector (MAD) that detects whether the device is operating in a single- or dual-microphone scenario. Once the microphone configuration is detected, a multiband spectral subtraction technique enhances the speech signal under different noisy background conditions. We have implemented the proposed design on a Spartan 6 LX45 FPGA using the Xilinx System Generator tools. Evaluation of the quality of the enhanced speech, and of the correctness of the MAD in detecting the single- or dual-microphone configuration, indicates that the proposed hardware can serve as an embedded component for hardware-based speech enhancement.
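As an illustration of the multiband spectral subtraction step named in the abstract, the following is a minimal software sketch for a single frame, not the chapter's FPGA design. It assumes the per-bin power spectra of the noisy frame and of a noise estimate are already available. The function names, the piecewise over-subtraction rule, the band weights, and the spectral floor value are illustrative assumptions; the abstract does not state the parameters actually used.

```python
import math


def oversubtraction_factor(snr_db):
    """Map a band SNR in dB to an over-subtraction factor.

    Assumed piecewise-linear rule: subtract more aggressively in
    low-SNR bands, and subtract the plain noise estimate at high SNR.
    """
    if snr_db < -5.0:
        return 4.75
    if snr_db <= 20.0:
        return 4.0 - 0.15 * snr_db
    return 1.0


def multiband_spectral_subtraction(noisy_power, noise_power, band_edges,
                                   band_weights=None, floor=0.002):
    """Enhance one frame's power spectrum band by band.

    noisy_power, noise_power: per-bin power values of equal length.
    band_edges: bin indices delimiting the bands, e.g. [0, 2, 4].
    band_weights: optional per-band tweaking factors (assumed 1.0).
    floor: fraction of the noisy power kept as a spectral floor.
    """
    if len(noisy_power) != len(noise_power):
        raise ValueError("spectra must have the same number of bins")
    n_bands = len(band_edges) - 1
    if band_weights is None:
        band_weights = [1.0] * n_bands

    eps = 1e-12  # avoids division by zero and log of zero
    enhanced = list(noisy_power)
    for b in range(n_bands):
        lo, hi = band_edges[b], band_edges[b + 1]
        band_signal = sum(noisy_power[lo:hi])
        band_noise = sum(noise_power[lo:hi])
        snr_db = 10.0 * math.log10((band_signal + eps) / (band_noise + eps))
        alpha = oversubtraction_factor(snr_db)
        for k in range(lo, hi):
            cleaned = noisy_power[k] - alpha * band_weights[b] * noise_power[k]
            # Keep a small fraction of the noisy power instead of a
            # negative value to limit musical-noise artifacts.
            enhanced[k] = max(cleaned, floor * noisy_power[k])
    return enhanced
```

In a complete pipeline this step would sit between a short-time Fourier transform and its inverse, with the noisy phase reused for resynthesis; those stages are omitted here to keep the sketch focused on the per-band subtraction.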
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Biswas, T., Bhattacharjee, S., Mandal, S.B., Saha, D., Chakrabarti, A. (2019). FPGA-Based Novel Speech Enhancement System Using Microphone Activity Detector. In: Chaki, R., Cortesi, A., Saeed, K., Chaki, N. (eds) Advanced Computing and Systems for Security. Advances in Intelligent Systems and Computing, vol 897. Springer, Singapore. https://doi.org/10.1007/978-981-13-3250-0_9
DOI: https://doi.org/10.1007/978-981-13-3250-0_9
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-3249-4
Online ISBN: 978-981-13-3250-0
eBook Packages: Intelligent Technologies and Robotics (R0)