Skip to main content

FPGA-Based Novel Speech Enhancement System Using Microphone Activity Detector

  • Chapter
  • First Online:
Advanced Computing and Systems for Security

Abstract

In this paper, we have proposed field-programmable gate array (FPGA) based design and implementation of a novel speech enhancement system, which can work for a single microphone device as well as that of a dual microphone device providing background noise immunity. We proposed a microphone activity detector (MAD), which detects the presence of single or dual microphone scenario. After detecting the microphones, multiband spectral subtraction technique enhances the speech signal from different background noisy surrounds. We have implemented our proposed design in Spartan 6 LX45 FPGA using Xilinx system generator tools. The evaluation of the quality of speech of enhanced signal and its correctness of MAD to detect the single or dual microphone system implies that our proposed hardware can work as a proper embedded component for hardware-based execution for speech enhancement.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 16.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Boll, S.: Suppression of acoustic noise in speech using spectral subtraction. IEEE Trans. Acoust. Speech Signal Process. 27(2), 113–120 (1979)

    Google Scholar 

  2. Zhang, Y., Zhao, Y.: Real and imaginary modulation spectral subtraction for speech enhancement. Speech Commun. 55(4), 509–522. ISSN-0167-6393 (2013)

    Google Scholar 

  3. Kamath, S., et. al.: A multi band spectral subtraction method for enhancing speech corrupted by colored noise. In: Proceedings of Acoustics, Speech and Signal Processing, vol. 4 (2002)

    Google Scholar 

  4. Adiono, T. et al.: A hardware software co-design for a real-time spectral subtraction based noise cancellation system. In: Proceedings of ISPACS, Nov 2013

    Google Scholar 

  5. Biswas, T., et. al.: Audio denoising by spectral subtraction technique implemented on reconfigurable hardware. In: Proceedings of IC3, pp. 236–241 (2014). https://doi.org/10.1109/IC3.2014.6897179.

  6. Halupka, D., et. al.: Low power dual microphone speech enhancement using field programmable gate arrays. IEEE Trans. Signal Process. 55(7), 3526–3535 (2007)

    Google Scholar 

  7. Yousefian, N., Loizou, P.C.: A dual-microphone speech enhancement algorithm based on the coherence function. IEEE Trans. Audio Speech Lang. Process. 20(2), 599–609 (2012)

    Google Scholar 

  8. Carter, G.C.: Tutorial overview of coherence and time delay estimation. An applied tutorial for research, development, test, and evaluation engineers, vol. 1, pp. 1–27 (1993)

    Google Scholar 

  9. Knapp, C.H., Carter. G.C.: The generalized correlation method for estimation of time delay. IEEE Trans. Acoust. Speech Signal Process. 24, 320–327 (1976)

    Google Scholar 

  10. Champagne, B., et. al.: Performance of time delay estimation in the presence of room reverberation. IEEE Trans. Speech Audio Process. 4, 148–152 (1996)

    Google Scholar 

  11. McAllister, J.: FPGA Based DSP. Springer, US, pp. 363–392. https://doi.org/10.1007/978-1-4419-6345-1_14

  12. System Generator DSP User Guide. UG640, 2 Dec 2009. www.xilinx.com/support/sw-manual

  13. Matlab/Simulink hardware verifications. www.mathworks.com/products/hdl-verifier

  14. Microsoft kinect microphone array. https://msdn.microsoft.com/en-us/library/jj131033.aspx

  15. Dual Microphone Database: The 29th Conference on VLSI Design and 15th Conference on Embedded Systems—Design Contest: VLSI 2016 (2016)

    Google Scholar 

  16. Varga, A., et. al.: Assessment for automatic speech recognition: II. NOISEX-92: a database and an experiment to study the effect of additive noise on speech recognition system. Speech Commun. 12(3), 247–251 (1993)

    Google Scholar 

  17. Hu, Y., Loizou, P.C.: Perceptual evaluation of speech quality (PESQ), and objective method for end-to-end of speech quality assessment of narrow band telephone network and speech codecs. ITU-T Rec, p. 862 (2000)

    Google Scholar 

  18. Hardware Manual Support. www.xilinx.com/support/sw-manual

  19. Biswas, T., Mandal, S.B., Saha, D., Chakrabarti, A.: Dual microphone sound source localization using reconfigurable hardware. In: Proceedings of CICBA: Communications in Computer and Information Science, vol. 775. Springer, Singapore (2017)

    Google Scholar 

  20. Biswas, T., Mandal, S.B., Saha, D., Chakrabarti, A.: A Novel Reconfigurable Hardware Design for Speech Enhancement Based on Multi-band Spectral Subtraction Involving Magnitude and Phase Components. School Of Information Technology, University of Calcutta (2015)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tanmay Biswas .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Biswas, T., Bhattacharjee, S., Mandal, S.B., Saha, D., Chakrabarti, A. (2019). FPGA-Based Novel Speech Enhancement System Using Microphone Activity Detector. In: Chaki, R., Cortesi, A., Saeed, K., Chaki, N. (eds) Advanced Computing and Systems for Security. Advances in Intelligent Systems and Computing, vol 897. Springer, Singapore. https://doi.org/10.1007/978-981-13-3250-0_9

Download citation

Publish with us

Policies and ethics