Abstract
The purpose of this work is to show the results obtained when the latest technological advances in Automatic Speech Recognition (ASR) are applied to the Western-Huastec Náhuatl and Huastec languages. Western-Huastec Náhuatl and Huastec are not only native (indigenous) languages of Mexico but also minority languages, and the people who speak them are usually illiterate. A speech database was created by recording the voice of a native speaker reading a set of documents used in native bilingual primary schools in the official Mexican state education system. A pronunciation dictionary was created for each language. Continuous Hidden Markov Models (HMMs) were used for acoustic modeling, and bigrams were used for language modeling. A Viterbi decoder was used for recognition. The word error rate for this task is below 8.621% for the Western-Huastec Náhuatl language and 10.154% for the Huastec language.
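The word error rate reported above is conventionally defined as the number of word substitutions, deletions, and insertions needed to turn the recognizer output into the reference transcript, divided by the number of reference words. The abstract does not say which scoring tool was used, so the following is only a minimal sketch of that standard definition; the function name `word_error_rate` and the placeholder tokens in the usage note are assumptions made for illustration, not part of the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return (substitutions + deletions + insertions) / number of reference words.

    Hypothetical helper for illustration; assumes whitespace-separated words.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word missing from hypothesis
                curr[j - 1] + 1,     # insertion: extra word in hypothesis
                prev[j - 1] + cost,  # match (cost 0) or substitution (cost 1)
            )
        prev = curr
    return prev[-1] / len(ref)
```

For example, with the placeholder reference `"w1 w2 w3 w4"` and hypothesis `"w1 wX w3"`, one substitution and one deletion over four reference words give a rate of 0.5, that is, 50%.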
Keywords
- Automatic Speech Recognition
- Minority Language
- Speech Recognition System
- Word Error Rate
- Automatic Speech Recognition System
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nolazco-Flores, J.A., Salgado-Garza, L.R., Peña-Díaz, M. (2005). Speaker Dependent ASRs for Huastec and Western-Huastec Náhuatl Languages. In: Marques, J.S., Pérez de la Blanca, N., Pina, P. (eds) Pattern Recognition and Image Analysis. IbPRIA 2005. Lecture Notes in Computer Science, vol 3523. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11492542_73
DOI: https://doi.org/10.1007/11492542_73
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26154-4
Online ISBN: 978-3-540-32238-2
eBook Packages: Computer Science (R0)