Hardware Implementation of MFCC-Based Feature Extraction for Speaker Recognition

  • Conference paper
Advanced Computer and Communication Engineering Technology

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 315))

Abstract

Feature extraction is one of the most important issues in speech recognition, because it determines how the speech signal is represented. Features based on Mel Frequency Cepstral Coefficients (MFCC) are among the most important features required across many kinds of speech applications. In this paper, an FPGA-based implementation of the MFCC speech feature extraction algorithm is proposed. The computational complexity and the memory requirements are characterized, analyzed, and improved. A look-up table (LUT) scheme is used to evaluate the elementary functions in the MFCC algorithm, and fixed-point arithmetic is used to reduce cost, with the word lengths chosen on the basis of an accuracy study. The final feature extraction design is implemented on a Xilinx Virtex2 XC2V6000 FF1157-4 FPGA chip.
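
The LUT and fixed-point ideas named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' design: the table size (256 entries), the Q15 output format, and the names `LOG2_LUT`, `log2_fixed`, and `to_float` are assumptions chosen for illustration only. The sketch approximates the base-2 logarithm of a positive integer (for example, a mel filter-bank energy) by splitting the input into an exponent and a truncated mantissa, then reading the fractional part of the logarithm from a precomputed table.

```python
import math

FRAC_BITS = 15   # assumed Q15 fractional precision of the result
LUT_BITS = 8     # assumed table address width (256 entries)

# Precompute log2(1 + i / 2^LUT_BITS), scaled to Q15, for i = 0..255.
# math.log2 is used only to build the table, never at run time.
LOG2_LUT = [
    round(math.log2(1.0 + i / (1 << LUT_BITS)) * (1 << FRAC_BITS))
    for i in range(1 << LUT_BITS)
]


def log2_fixed(x: int) -> int:
    """Approximate log2(x) for a positive integer x; result is in Q15."""
    if x <= 0:
        raise ValueError("x must be a positive integer")
    # Position of the leading one gives the integer part of log2(x).
    exponent = x.bit_length() - 1
    # The LUT_BITS bits just below the leading one form the table address.
    if exponent >= LUT_BITS:
        index = (x >> (exponent - LUT_BITS)) & ((1 << LUT_BITS) - 1)
    else:
        index = (x << (LUT_BITS - exponent)) & ((1 << LUT_BITS) - 1)
    # Integer part and table value are added in the same Q15 scale.
    return (exponent << FRAC_BITS) + LOG2_LUT[index]


def to_float(q: int) -> float:
    """Convert a Q15 value back to floating point for inspection."""
    return q / (1 << FRAC_BITS)


if __name__ == "__main__":
    for value in (1, 3, 1000, 65535):
        approx = to_float(log2_fixed(value))
        print(value, approx, math.log2(value))
```

Because the mantissa is truncated to 8 bits, the error of this sketch is bounded by roughly log2(1 + 1/256), about 0.0056. A larger table or linear interpolation between entries would presumably trade memory for accuracy, which is the kind of cost-versus-accuracy trade-off the abstract refers to.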

Author information

Corresponding author

Correspondence to P. Ehkan.

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Ehkan, P., Zakaria, F.F., Warip, M.N.M., Sauli, Z., Elshaikh, M. (2015). Hardware Implementation of MFCC-Based Feature Extraction for Speaker Recognition. In: Sulaiman, H., Othman, M., Othman, M., Rahim, Y., Pee, N. (eds) Advanced Computer and Communication Engineering Technology. Lecture Notes in Electrical Engineering, vol 315. Springer, Cham. https://doi.org/10.1007/978-3-319-07674-4_46

  • DOI: https://doi.org/10.1007/978-3-319-07674-4_46

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-07673-7

  • Online ISBN: 978-3-319-07674-4

  • eBook Packages: Engineering, Engineering (R0)
