FPGA-Based Novel Speech Enhancement System Using Microphone Activity Detector

Biswas, Tanmay; Bhattacharjee, Shuvadeep; Mandal, Sudhindu Bikash; Saha, Debasri; Chakrabarti, Amlan

doi:10.1007/978-981-13-3250-0_9

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 897)

Abstract

In this paper, we propose a field-programmable gate array (FPGA) based design and implementation of a novel speech enhancement system that works with both single-microphone and dual-microphone devices and provides immunity to background noise. We propose a microphone activity detector (MAD) that detects whether a single-microphone or a dual-microphone scenario is present. Once the microphone configuration is detected, a multiband spectral subtraction technique enhances the speech signal against different noisy backgrounds. We have implemented the proposed design on a Spartan 6 LX45 FPGA using the Xilinx System Generator tools. Evaluation of the speech quality of the enhanced signal, and of how correctly the MAD detects the single- or dual-microphone configuration, indicates that the proposed hardware can serve as an embedded component for hardware-based speech enhancement.
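
The abstract does not give the internals of the MAD or of the subtraction stage, so the following is only a minimal software sketch of the two ideas, not the authors' hardware design. It assumes an energy-threshold detector for deciding how many microphone channels are active, and a commonly used multiband spectral subtraction rule in which each frequency band gets its own over-subtraction factor based on that band's SNR. All function names, the power threshold, the over-subtraction schedule, and the spectral floor are illustrative assumptions. The sketch works on per-bin power spectra and leaves out framing, the FFT, and resynthesis.

```python
import math

def channel_power(samples):
    """Mean squared amplitude of one frame (0.0 for an empty frame)."""
    return sum(x * x for x in samples) / len(samples) if samples else 0.0

def detect_microphones(ch_a, ch_b, floor_power=1e-6):
    """Hypothetical MAD: count the channels whose frame power exceeds a floor.

    Returns 0, 1, or 2. The threshold is an assumed value for illustration.
    """
    return sum(channel_power(ch) > floor_power for ch in (ch_a, ch_b))

def over_subtraction_factor(snr_db):
    """Assumed schedule: subtract more aggressively in low-SNR bands."""
    if snr_db < -5.0:
        return 4.75
    if snr_db > 20.0:
        return 1.0
    return 4.0 - 0.15 * snr_db

def multiband_subtract(noisy_power, noise_power, band_edges, floor=0.002):
    """Subtract a noise power estimate band by band.

    noisy_power, noise_power: per-bin power spectra of equal length.
    band_edges: bin indices delimiting the bands, e.g. [0, 2, 4].
    floor: spectral floor as a fraction of the noisy power (assumed value).
    Bins outside the listed bands are returned unchanged.
    """
    cleaned = list(noisy_power)
    for lo, hi in zip(band_edges, band_edges[1:]):
        sig = sum(noisy_power[lo:hi])
        noi = sum(noise_power[lo:hi])
        # Degenerate bands (no signal or no noise) are treated as high SNR.
        snr_db = 10.0 * math.log10(sig / noi) if sig > 0 and noi > 0 else 20.0
        alpha = over_subtraction_factor(snr_db)
        for k in range(lo, hi):
            diff = noisy_power[k] - alpha * noise_power[k]
            # Clamp to a small fraction of the input to avoid negative power.
            cleaned[k] = max(diff, floor * noisy_power[k])
    return cleaned
```

In a two-channel setup, the detector's result could select between a single-channel path, where the noise estimate comes from speech pauses, and a dual-channel path, where the second microphone supplies the noise reference. How the published design makes that choice is not stated in the abstract.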

Author information

Authors and Affiliations

A. K. Choudhury School of Information Technology, University of Calcutta, Kolkata, 700098, India
Tanmay Biswas, Shuvadeep Bhattacharjee, Sudhindu Bikash Mandal, Debasri Saha & Amlan Chakrabarti

Corresponding author

Correspondence to Tanmay Biswas.

Editor information

Editors and Affiliations

A.K. Choudhury School of Information Technology, University of Calcutta, Kolkata, West Bengal, India
Rituparna Chaki
Dipartimento di Scienze Ambientali, Informatica e Statistica, Università Ca’ Foscari Venezia, Mestre, Venice, Italy
Agostino Cortesi
Faculty of Computer Science, Bialystok University of Technology, Bialystok, Poland
Khalid Saeed
Department of Computer Science and Engineering, University of Calcutta, Kolkata, West Bengal, India
Nabendu Chaki

About this chapter

Cite this chapter

Biswas, T., Bhattacharjee, S., Mandal, S.B., Saha, D., Chakrabarti, A. (2019). FPGA-Based Novel Speech Enhancement System Using Microphone Activity Detector. In: Chaki, R., Cortesi, A., Saeed, K., Chaki, N. (eds) Advanced Computing and Systems for Security. Advances in Intelligent Systems and Computing, vol 897. Springer, Singapore. https://doi.org/10.1007/978-981-13-3250-0_9

DOI: https://doi.org/10.1007/978-981-13-3250-0_9
Published: 08 December 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-3249-4
Online ISBN: 978-981-13-3250-0
eBook Packages: Intelligent Technologies and Robotics (R0)
