Employing FPGA Accelerator in Real-Time Speaker Identification Systems

  • Conference paper
Recent Trends in Signal and Image Processing

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 922)

Abstract

In recent years, numerous approaches have been developed in the field of human voice recognition for building speaker identification systems. Frequency-domain and time-domain techniques are widely used to extract human voice features. This paper presents the most robust and popular Mel-frequency cepstral coefficient (MFCC) technique for parameterizing voices; the resulting parameters are later used in the voiced/unvoiced feature extraction methods. In addition, direct classical techniques for human voice feature extraction are used. To reduce processing time and speed up the system for use in real-time applications, a field-programmable gate array (FPGA) is utilized, specifically the Altera DE2 Cyclone II.
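
As a rough illustration of the Mel-frequency idea behind MFCC, the sketch below converts between hertz and mels and places filter edge frequencies evenly on the mel scale. It uses the common textbook formula m = 2595 log10(1 + f/700); the paper's own constants, filter count, and frequency range are not given in the abstract, so the function names and the example values (26 filters, 0 to 4000 Hz) are assumptions, not the authors' implementation.

```python
import math


def hz_to_mel(f_hz: float) -> float:
    """Convert a frequency in Hz to mels (common 2595/700 textbook variant)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(m: float) -> float:
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def mel_edge_frequencies(n_filters: int, f_low_hz: float, f_high_hz: float) -> list:
    """Return n_filters + 2 edge frequencies in Hz, equally spaced in mels.

    Triangular filter i would span edges[i] .. edges[i + 2], peaking at
    edges[i + 1]. The filter count and band limits are caller-chosen
    (hypothetical here, not taken from the paper).
    """
    lo, hi = hz_to_mel(f_low_hz), hz_to_mel(f_high_hz)
    step = (hi - lo) / (n_filters + 1)
    return [mel_to_hz(lo + i * step) for i in range(n_filters + 2)]


if __name__ == "__main__":
    # Assumed example values: 26 filters over 0-4000 Hz.
    edges = mel_edge_frequencies(26, 0.0, 4000.0)
    print([round(f, 1) for f in edges[:5]])
```

Because the edges are equally spaced in mels, their spacing in hertz widens toward higher frequencies, which mirrors the coarser frequency resolution of human hearing at high pitch.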

Author information

Corresponding author

Correspondence to Mohammed A. Fadhel.

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Al-Shamma, O., Fadhel, M.A., Hasan, H.S. (2019). Employing FPGA Accelerator in Real-Time Speaker Identification Systems. In: Bhattacharyya, S., Pal, S., Pan, I., Das, A. (eds) Recent Trends in Signal and Image Processing. Advances in Intelligent Systems and Computing, vol 922. Springer, Singapore. https://doi.org/10.1007/978-981-13-6783-0_12

  • DOI: https://doi.org/10.1007/978-981-13-6783-0_12

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-13-6782-3

  • Online ISBN: 978-981-13-6783-0

  • eBook Packages: Engineering, Engineering (R0)
