Creating a Mexican Spanish Version of the CMU Sphinx-III Speech Recognition System

Varela, Armando; Cuayáhuitl, Heriberto; Nolazco-Flores, Juan Arturo

doi:10.1007/978-3-540-24586-5_30

Armando Varela⁶,
Heriberto Cuayáhuitl⁶ &
Juan Arturo Nolazco-Flores⁷

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2905))

Included in the following conference series:

Iberoamerican Congress on Pattern Recognition

1462 Accesses
11 Citations

Abstract

In this paper we present the creation of a Mexican Spanish version of the CMU Sphinx-III speech recognition system. We trained acoustic and N-gram language models with a phonetic set of 23 phonemes. Our speech data for training and testing was collected from an auto-attendant system under telephone environments. We present experiments with different language models. Our best result scored an overall error rate of 6.32%. Using this version is now possible to develop speech applications for Spanish speaking communities. This version of the CMU Sphinx system is freely available for non-commercial use under request.

Download to read the full chapter text

Chapter PDF

Amazigh Speech Recognition System Based on CMUSphinx

The NECTEC 2015 Thai Open-Domain Automatic Speech Recognition System

A Continuous Speech Recognition System for Bangla Language

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Huerta, J.M., Thayer, E., Ravishankar, M., Stern, R.M.: The Development of the 1997 CMU Spanish Broadcast News Transcription System. In: Proc. of the DARPA Broadcast News Transcription and Understanding Workshop, Landsdowne, Virginia (February 1998)
Google Scholar
Huerta, J.M., Chen, S.J., Stern, R.M.: The 1998 Carnegie Mellon University Sphinx-III Spanish Broadcast News Transcription System. In: The proceedigns of the DARPA Broadcast News Transcription and Understanding Workshop, Herndon, Virginia (March 1999)
Google Scholar
Cuayáhuitl, H., Serridge, B.: Out-Of-Vocabulary Word Modeling and Rejection for Spanish Keyword Spotting Systems. In: Coello Coello, C.A., de Albornoz, Á., Sucar, L.E., Battistutti, O.C. (eds.) MICAI 2002. LNCS (LNAI), vol. 2313, pp. 158–167. Springer, Heidelberg (2002)
Google Scholar
Hwang, M.-Y.: Subphonetic Acoustic Modeling for Speaker-Independent Continuous Speech Recognition. Ph.D. thesis, Carnegie Mellon University (1993)
Google Scholar
Hieronymus, L.J.: ASCII Phonetic Symbols for World’s Languages: worldbet. Technical report, Bell Labs (1993)
Google Scholar
Clarkson, P., Rosenfeld, R.: Statistical Language Modeling Using the CMU Cambridge Toolkit. In: The proceedings of Eurospeech, Rodhes, Greece, pp. 2707–2710 (1997)
Google Scholar
Farfán, F., Cuayáhuitl, H., Portilla, A.: Evaluating Dialogue Strategies in a Spoken Dialogue System for Email. In: The proceedings of the IASTED Artificial Intelligence and Applications, September 2003, ACTA Press, Manalmádena (2003)
Google Scholar
CMU Robust Speech Group, Carnegie Mellon University, http://www.cs.cmu.edu/afs/cs/user/robust/www/

Download references

Author information

Authors and Affiliations

Department of Engineering and Technology, Intelligent Systems Research Group, Universidad Autónoma de Tlaxcala, Apartado Postal #140, 90300, Apizaco, Tlaxcala, Mexico
Armando Varela & Heriberto Cuayáhuitl
Instituto Tecnológico de Estudios Superiores de Monterrey, Sucursal de Correos “J”, 64849, Monterrey, Nuevo Leon, Mexico
Juan Arturo Nolazco-Flores

Authors

Armando Varela
View author publications
You can also search for this author in PubMed Google Scholar
Heriberto Cuayáhuitl
View author publications
You can also search for this author in PubMed Google Scholar
Juan Arturo Nolazco-Flores
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dept. System Engineering and Automation, Universitat Politècnica de Catalunya (UPC), Barcelona, Spain
Alberto Sanfeliu
Advanced Technologies Applications Center, MINBAS, Cuba
José Ruiz-Shulcloper

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Varela, A., Cuayáhuitl, H., Nolazco-Flores, J.A. (2003). Creating a Mexican Spanish Version of the CMU Sphinx-III Speech Recognition System. In: Sanfeliu, A., Ruiz-Shulcloper, J. (eds) Progress in Pattern Recognition, Speech and Image Analysis. CIARP 2003. Lecture Notes in Computer Science, vol 2905. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24586-5_30

Download citation

DOI: https://doi.org/10.1007/978-3-540-24586-5_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20590-6
Online ISBN: 978-3-540-24586-5
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Societies and partnerships

The International Association for Pattern Recognition (opens in a new tab)

Creating a Mexican Spanish Version of the CMU Sphinx-III Speech Recognition System

Abstract

Chapter PDF

Similar content being viewed by others

Amazigh Speech Recognition System Based on CMUSphinx

The NECTEC 2015 Thai Open-Domain Automatic Speech Recognition System

A Continuous Speech Recognition System for Bangla Language

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Societies and partnerships

Navigation

Creating a Mexican Spanish Version of the CMU Sphinx-III Speech Recognition System

Abstract

Chapter PDF

Similar content being viewed by others

Amazigh Speech Recognition System Based on CMUSphinx

The NECTEC 2015 Thai Open-Domain Automatic Speech Recognition System

A Continuous Speech Recognition System for Bangla Language

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Societies and partnerships

Search

Navigation