Skip to main content

Real-Time Statistical Speech Translation

  • Conference paper

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 275))

Abstract

This research investigates the Statistical Machine Translation approaches to translate speech in real time automatically. Such systems can be used in a pipeline with speech recognition and synthesis software in order to produce a real-time voice communication system between foreigners. We obtained three main data sets from spoken proceedings that represent three different types of human speech. TED, Europarl, and OPUS parallel text corpora were used as the basis for training of language models, for developmental tuning and testing of the translation system. We also conducted experiments involving part of speech tagging, compound splitting, linear language model interpolation, TrueCasing and morphosyntactic analysis. We evaluated the effects of variety of data preparations on the translation results using the BLEU, NIST, METEOR and TER metrics and tried to give answer which metric is most suitable for PL-EN language pair.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   219.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   279.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Koehn, P., Hoang, H.: Moses: Open Source Toolkit for Statistical Machine Translation, Prague (2007)

    Google Scholar 

  2. Marasek, K.: TED Polish-to-English translation system for the IWSLT 2012. In: IWSLT 2012, Hong Kong (2012)

    Google Scholar 

  3. Costa-Jussa, M., Fonollosa, J.: Using linear interpolation and weighted reordering hypotheses in the Moses system, Barcelona, Spain (2010)

    Google Scholar 

  4. Stolcke, A.: SRILM – An Extensible Language Modeling Toolkit. In: INTERSPEECH (2002)

    Google Scholar 

  5. Hsu, P., Glass, J.: Iterative Language Model Estimation: Efficient Data Structure & Algorithms, Cambridge, USA (2008)

    Google Scholar 

  6. Bojar, O.: Rich Morphology and What Can We Expect from Hybrid Approaches to MT. In: LIHMT 2011 (2011)

    Google Scholar 

  7. Radziszewski, A.: A tiered CRF tagger for Polish. In: Bembenik, R., Skonieczny, Ł., Rybiński, H., Kryszkiewicz, M., Niezgódka, M. (eds.) Intell. Tools for Building a Scientific Information. SCI, vol. 467, pp. 215–230. Springer, Heidelberg (2013)

    Google Scholar 

  8. Koehn, P., Hoang, H.: Factored Translation Models, Scotland, United Kingdom (2007)

    Google Scholar 

  9. Ratnaparkhi, A.: A Maximum Entropy Part-Of-Speech Tagger, Pennsylvania (1996)

    Google Scholar 

  10. Holz, F., Biemann, C.: Unsupervised and knowledge-free learning of compound splits and periphrases. In: Gelbukh, A. (ed.) CICLing 2008. LNCS, vol. 4919, pp. 117–127. Springer, Heidelberg (2008)

    Google Scholar 

  11. Cer, D., Manning, C., Jurafsky, D.: The Best Lexical Metric for Phrase-Based Statistical MT System Optimization. Stanford, USA (2010)

    Google Scholar 

  12. Gao, Q., Vogel, S.: Parallel Implementations of Word Alignment Tool (2008)

    Google Scholar 

  13. Heafield, K.: KenLM: Faster and smaller language model queries. Association for Computational Linguistics (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Krzysztof Wołk .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Wołk, K., Marasek, K. (2014). Real-Time Statistical Speech Translation. In: Rocha, Á., Correia, A., Tan, F., Stroetmann, K. (eds) New Perspectives in Information Systems and Technologies, Volume 1. Advances in Intelligent Systems and Computing, vol 275. Springer, Cham. https://doi.org/10.1007/978-3-319-05951-8_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-05951-8_11

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-05950-1

  • Online ISBN: 978-3-319-05951-8

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics