Skip to main content

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7836))

Included in the following conference series:

Abstract

Automatic speech recognition technology can be integrated in an information retrieval process to allow searching on multimedia contents. But, in order to assure an adequate retrieval performance is necessary to state the quality of the recognition phase, especially in speaker-independent and domainindependent environments. This paper introduces a methodology to accomplish the evaluation of different speech recognition systems in several scenarios considering also the creation of new corpora of different types (broadcast news, interviews, etc.), especially in other languages apart from English that are not widely addressed in speech community.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 49.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval: The Concepts and Technology behind Search, 2nd edn. ACM Press Books (2011)

    Google Scholar 

  2. Varona, A., Rodríguez Fuentes, L.J., Penagarikano, M., Nieto, S., Diez, M., Bordel, G.: Search and access to information contained in the speech of multimedia resources. Procesamiento del Lenguaje Natural 45, 317–318 (2010)

    Google Scholar 

  3. Sphinx, http://cmusphinx.sourceforge.net/

  4. The HTK Speech Recognition Toolkit, http://htk.eng.cam.ac.uk/

  5. Dragon, http://www.nuance.com/naturallyspeaking/

  6. Sclite, ftp://jaguar.ncsl.nist.gov/current_docs/sctk/doc/sclite.htm

  7. Dybkjaer, L., Hemsen, H., Minker, W.: Evaluation of Text and Speech systems, pp. 1–64, 99–124. Springer (2007)

    Google Scholar 

  8. Moreno, J., Garrote, M., Martínez, P., Martínez-Fernández, J.L.: Some experiments in evaluating ASR systems applied to multimedia retrieval. In: Detyniecki, M., García-Serrano, A., Nürnberger, A. (eds.) AMR 2009. LNCS, vol. 6535, pp. 12–23. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  9. Spoken lecture processing system, MIT, http://web.sls.csail.mit.edu/lectures/

  10. Iglesias, A., Moreno, L., Ruiz-Mezcua, B., Pajares, J.L., Jiménez, J., López, J.F., Revuelta, P., Hernández, J.: Web Educational Services for All: The APEINTA project, Web Accessibility Challenge. In: 8th International Cross-Disciplinary Conference on Web Accessibility, Hyderabad, India (2011)

    Google Scholar 

  11. Oard, D., Wang, J., Jones, G., White, R., Pecina, P., Soergel, D., Huang, X., Shafran, I.: Overview of the CLEF-2006 Cross-Language speech retrieval track. In: Peters, C., Clough, P., Gey, F.C., Karlgren, J., Magnini, B., Oard, D.W., de Rijke, M., Stempfhuber, M. (eds.) CLEF 2006. LNCS, vol. 4730, pp. 744–758. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  12. Huang, X., Jack, M., Ariki, Y.: Hidden Markov Models for Speech Recognition. Edinburgh University Press (1990)

    Google Scholar 

  13. De Mori, R., Bechet, F., Hakkani-Tur, D., McTear, M., Riccardi, G., Tur, G.: Spoken Language Understanding: A Survey. IEEE Signal Processing Magazine 25, 50–58 (2008)

    Article  Google Scholar 

  14. Ogata, J., Goto, M.: Speech repair: quick error correction just by using selection operation for speech input interfaces. In: Proc. Eurospeech 2005, pp. 133–136 (2005)

    Google Scholar 

  15. Sarma, A., Palmer, D.: Context-based speech recognition error detection and correction. In: Proceedings of HLT-NAACL (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

González, M., Moreno, J., Martínez, J.L., Martínez, P. (2013). An Illustrated Methodology for Evaluating ASR Systems. In: Detyniecki, M., García-Serrano, A., Nürnberger, A., Stober, S. (eds) Adaptive Multimedia Retrieval. Large-Scale Multimedia Retrieval and Evaluation. AMR 2011. Lecture Notes in Computer Science, vol 7836. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37425-8_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-37425-8_3

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-37424-1

  • Online ISBN: 978-3-642-37425-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics