Skip to main content

Deploying DSR Technology on Today’s Mobile Phones: A Feasibility Study

  • Conference paper
Perception in Multimodal Dialogue Systems (PIT 2008)

Abstract

In this paper we study the feasibility for the deployment of Distributed Speech Recognition (DSR) technology on today’s mobile phones using publicly available tools and architectures. The existing standards for DSR front-ends are presented and discussed; arguments for the choice of an appropriate standard for a deployment on the particular platform are given. We point out hindrances and pitfalls during development and deployment. Standard conform solutions for the two most prominent development environments, Symbian C++ and Java Micro Edition, are presented and evaluated on real end-user devices.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Front-end Feature Extraction Algorithm; Compression Algorithm. ETSI Standard ES 201 108 (2000)

    Google Scholar 

  2. Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Advanced Front-end Feature Extraction Algorithm; Compression Algorithm. ETSI Standard ES 202 050 (2002)

    Google Scholar 

  3. Distributed Speech Recognition; Extended Advanced Front-end Feature Extraction Algorithm; Compression Algorithm, Back-end Speech Reconstruction Algorithm. ETSI Standard ES 202 212 (2003a)

    Google Scholar 

  4. Distributed Speech Recognition; Extended Front-end Feature Extraction Algorithm; Compression Algorithm, Back-end Speech Reconstruction Algorithm. ETSI Standard ES 202 211 (2003b)

    Google Scholar 

  5. Digital cellular telecommunications system (Phase 2+); Universal Mobile Telecommunications System (UMTS); ANSI C code for the fixed-point distributed speech recognition extended advanced front-end. ETSI Technical Specification TS 126 243 (2004)

    Google Scholar 

  6. Burileanu, C., Popescu, V.: An efficient distributed speech recognition frontend implementation using a Motorola Star Core 140 based platform. In: Volume du Symposion International d’Electronique et Télécommunications, ETC 2004, University of Timisoara, Romania (2004)

    Google Scholar 

  7. Hirsch, H.-G., Pearce, D.: The AURORA experimental framework for the performance evaluation of speech recognition systems under noisy conditions. In: Proc. ISCA ITRW ASR2000, Paris, France, pp. 181–188 (2000)

    Google Scholar 

  8. Macho, D., Mauuary, L., Noe, B., Cheng, Y., Eahey, D., Jouvet, D., Kelleher, H., Pearce, D., Saadoun, F.: Evaluation of a noise-robust DSR front-end on AURORA databases. In: Proc. ICSLP, pp. 17–20 (2002)

    Google Scholar 

  9. Mauuary, L.: Blind equalization in the cepstral domain for robust telephone based speech recognition. In: Proc. EUSPICO 1998, pp. 359–363 (1998)

    Google Scholar 

  10. Xu, H., Tan, Z.-H., Dalsgaard, P., Mattethat, R., Lindberg, B.: A configurable distributed speech recognition system. In: Proc. of the 9th European Conference on Speech Communication and Technology (2005)

    Google Scholar 

  11. Zaykovskiy, D., Schmitt, A.: Java (J2ME) front-end for distributed speech recognition. In: AINAW 2007: Proceedings of the 21st International Conference on Advanced Information Networking and Applications Workshops, pp. 353–357. IEEE Computer Society Press, Los Alamitos (2007)

    Google Scholar 

  12. Zaykovskiy, D., Schmitt, A., Lutz, M.: New Use of Mobile Phones: Towards Multimodal Information Access Systems. In: 3rd IET International Conference on Intelligent Environments, Ulm, Germany (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Elisabeth André Laila Dybkjær Wolfgang Minker Heiko Neumann Roberto Pieraccini Michael Weber

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zaykovskiy, D., Schmitt, A. (2008). Deploying DSR Technology on Today’s Mobile Phones: A Feasibility Study. In: André, E., Dybkjær, L., Minker, W., Neumann, H., Pieraccini, R., Weber, M. (eds) Perception in Multimodal Dialogue Systems. PIT 2008. Lecture Notes in Computer Science(), vol 5078. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69369-7_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-69369-7_17

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-69368-0

  • Online ISBN: 978-3-540-69369-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics