Hardware Implementation of MFCC-Based Feature Extraction for Speaker Recognition

Ehkan, P.; Zakaria, F. F.; Warip, M. N. M.; Sauli, Z.; Elshaikh, M.

doi:10.1007/978-3-319-07674-4_46


Part of the book series: Lecture Notes in Electrical Engineering (LNEE, volume 315)


Abstract

Feature extraction is one of the most important issues in speech recognition, because it determines how the speech signal is represented. Features based on Mel Frequency Cepstral Coefficients (MFCC) are among the most important features required by many kinds of speech applications. This paper proposes an FPGA-based implementation of the MFCC speech feature extraction algorithm. The computational complexity and the memory requirements are characterized, analyzed, and improved. A look-up table (LUT) scheme is used to evaluate the elementary functions in the MFCC algorithm, and fixed-point arithmetic is implemented to reduce cost, subject to an accuracy study. The final feature extraction design is implemented on a Xilinx Virtex-II XC2V6000 FF1157-4 FPGA.
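As an illustration of the LUT and fixed-point ideas named in the abstract, the following is a minimal Python sketch of a table-driven base-2 logarithm, the kind of elementary function that the log filter-bank energy step of MFCC needs. It is not the authors' design: the abstract does not state which functions are tabulated, the table size, or the word lengths, so `FRAC_BITS`, `LUT_BITS`, `LOG2_LUT`, `log2_fixed`, and `to_float` are all hypothetical names and values chosen for the example.

```python
import math

# Assumed parameters (not taken from the paper).
FRAC_BITS = 12   # fractional bits of the fixed-point result
LUT_BITS = 6     # table address width, giving 64 entries

# Table of log2(1 + i / 2**LUT_BITS), stored as fixed-point integers.
# In hardware this would be a small ROM; here it is a Python list.
LOG2_LUT = [
    round(math.log2(1 + i / (1 << LUT_BITS)) * (1 << FRAC_BITS))
    for i in range(1 << LUT_BITS)
]


def log2_fixed(x: int) -> int:
    """Approximate log2(x) for a positive integer x.

    The result is an integer scaled by 2**FRAC_BITS.
    """
    if x <= 0:
        raise ValueError("x must be a positive integer")
    # Integer part: position of the leading one bit.
    exponent = x.bit_length() - 1
    # Fractional part: the LUT_BITS bits just below the leading one
    # address the table (a shift and mask, no multiplier needed).
    if exponent >= LUT_BITS:
        index = (x >> (exponent - LUT_BITS)) & ((1 << LUT_BITS) - 1)
    else:
        index = (x << (LUT_BITS - exponent)) & ((1 << LUT_BITS) - 1)
    return (exponent << FRAC_BITS) + LOG2_LUT[index]


def to_float(q: int) -> float:
    """Convert a fixed-point value back to float, for checking only."""
    return q / (1 << FRAC_BITS)
```

Under these assumptions the table costs 64 words of memory, and the truncation error stays below log2(1 + 1/64), roughly 0.022. Widening `LUT_BITS` trades memory for accuracy, which is the kind of cost versus accuracy trade-off the abstract refers to.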



Author information

Authors and Affiliations

School of Computer and Communication Engineering, Universiti Malaysia Perlis, Pauh Putra Campus, 02600, Arau, Perlis, Malaysia
P. Ehkan, F. F. Zakaria, M. N. M. Warip & M. Elshaikh
School of Microelectronic Engineering, Universiti Malaysia Perlis, Pauh Putra Campus, 02600, Arau, Perlis, Malaysia
Z. Sauli


Corresponding author

Correspondence to P. Ehkan.

Editor information

Editors and Affiliations

Universiti Teknikal Malaysia Melaka, Melaka, Malaysia
Hamzah Asyrani Sulaiman
Universiti Teknikal Malaysia Melaka, Melaka, Malaysia
Mohd Azlishah Othman
Universiti Teknikal Malaysia Melaka, Melaka, Malaysia
Mohd Fairuz Iskandar Othman
Universiti Teknikal Malaysia Melaka, Melaka, Malaysia
Yahaya Abd Rahim
Universiti Teknikal Malaysia Melaka, Melaka, Malaysia
Naim Che Pee


Cite this paper

Ehkan, P., Zakaria, F.F., Warip, M.N.M., Sauli, Z., Elshaikh, M. (2015). Hardware Implementation of MFCC-Based Feature Extraction for Speaker Recognition. In: Sulaiman, H., Othman, M., Othman, M., Rahim, Y., Pee, N. (eds) Advanced Computer and Communication Engineering Technology. Lecture Notes in Electrical Engineering, vol 315. Springer, Cham. https://doi.org/10.1007/978-3-319-07674-4_46


DOI: https://doi.org/10.1007/978-3-319-07674-4_46
Published: 02 November 2014
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-07673-7
Online ISBN: 978-3-319-07674-4
eBook Packages: Engineering, Engineering (R0)
