Speech Recognition System Based on OLLO French Corpus by Using MFCCs

Youcef, Braham Chaouche; Elemine, Yessaad Mohamed; Islam, Benmaiza; Farid, Bouttout

doi:10.1007/978-3-319-48929-2_25

Braham Chaouche Youcef⁴,
Yessaad Mohamed Elemine⁴,
Benmaiza Islam⁵ &
…
Bouttout Farid⁶

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 411))

Included in the following conference series:

International Conference on Electrical Engineering and Control Applications

1227 Accesses
3 Citations

Abstract

The automatic speech recognition is an area of active study since the early 1950s, and the latest technologies in the field of stochastic processes and the discovery of Hidden Markov Models have given a new direction for this area.

This paper describes an approach of speech recognition by using the Mel-Scale Frequency Cepstral Coefficients (MFCC) from speech recognition experiments done on OLLO French corpus by different features. Our work consists in finding the most appropriate choice for this task using the Mel-Scale Frequency Cepstral Coefficients (MFCC) extracted from speech signal.

To evaluate this analysis, we built an ASR reference system based on the modeling of phonemes by the HMM (Hidden Markov Models) associated with the GMM models (Gaussian Mixture Model) using the HTK tool. The implementation of this system was made using several experiments in order to choose the best parameters used in two main steps to build an ASR system, acoustic analysis and decoding. The experiments show that the choice of 25 Gaussian components provides a good compromise between recognition accuracy and computation time, and we found also that the best parameters leading to good recognition accuracy are MFCC_E_D_A coefficients with 92.5%.

In this paper the quality and testing of speaker recognition and gender recognition system is completed and analysed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Haton, J.-P.: Reconnaissance Automatique de la Parole. Paris (2006)
Google Scholar
Patel, K., Prasad, R.K.: Speech recognition and verification using MFCC & VQ. Int. J. Emerg. Sci. Eng. (IJESE) 1(7) (2013). ISSN: 2319–6378
Google Scholar
Huang, L., Zhang, X.: Speaker independent recognition on OLLO French corpus by using different features. In: 2010 First International Conference on Pervasive Computing, Signal Processing and Applications
Google Scholar
Modic, R.: Comparative wavelet and MFCC speech recognition experiments on the Slovenian and English SpeechDat2
Google Scholar
Nguyen, Q.C.: Reconnaissance de la Parole en Langue Vietnamienne. thèse de doctorat, Institut national polytechnique de Grenoble, Juin 2002
Google Scholar
Bakis, R.: Continuous speech recognition via centisecond acoustic states, 91th. Meeting of the Acoust.Soc, avril 1976
Google Scholar
Young, S., et al.: The HTK Book (for HTK Version 3.4), p. 198 (2006)
Google Scholar

Download references

Author information

Authors and Affiliations

LMSE Laboratory, Department of Electronics, University of Mohamed El Bachir El Ibrahimi, 34265, Bordj Bou Arréridj, Algeria
Braham Chaouche Youcef & Yessaad Mohamed Elemine
Laboratory of Spoken Communication and Signal Processing, Faculty of Electronics and Computer Sciences, USTHB, 16000, Algiers, Algeria
Benmaiza Islam
Laboratory of Signal Processing, Department of Electronics, University of Constantine, 25000, Constantine, Algeria
Bouttout Farid

Authors

Braham Chaouche Youcef
View author publications
You can also search for this author in PubMed Google Scholar
Yessaad Mohamed Elemine
View author publications
You can also search for this author in PubMed Google Scholar
Benmaiza Islam
View author publications
You can also search for this author in PubMed Google Scholar
Bouttout Farid
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Braham Chaouche Youcef .

Editor information

Editors and Affiliations

MIS (EA 4290), Université de Picardie Jules Verne, Amiens, France
Mohammed Chadli
Laboratoire d'Automatique et de Robotiqu, Université Abbes Laghrour, Khenchela, Khenchela, Algeria
Sofiane Bououden
VŠB-Technical University of Ostrava , Ostrava ba, Czech Republic
Ivan Zelinka

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Youcef, B.C., Elemine, Y.M., Islam, B., Farid, B. (2017). Speech Recognition System Based on OLLO French Corpus by Using MFCCs. In: Chadli, M., Bououden, S., Zelinka, I. (eds) Recent Advances in Electrical Engineering and Control Applications. ICEECA 2016. Lecture Notes in Electrical Engineering, vol 411. Springer, Cham. https://doi.org/10.1007/978-3-319-48929-2_25

Download citation

DOI: https://doi.org/10.1007/978-3-319-48929-2_25
Published: 02 December 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48928-5
Online ISBN: 978-3-319-48929-2
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics