Interactive Translation of Conversational Speech

Waibel, Alex

doi:10.1007/978-3-642-60087-6_33

Interactive Translation of Conversational Speech

Alex Waibel²

Chapter

230 Accesses
1 Citations

Part of the book series: NATO ASI Series ((NATO ASI F,volume 169))

Summary

We present JANUS-II, a large scale system effort aimed at interactive spoken language translation. JANUS-II now accepts spontaneous conversational speech in a limited domain in English, German or Spanish and produces output in German, English, Spanish, Japanese and Korean. The challenges of coarticulated, disfluent, ill-formed speech are manifold, and have required advances in acoustic modeling, dictionary learning, language modeling, semantic parsing and generation, to achieve acceptable performance. A semantic “inter-lingua” that represents the intended meaning of an input sentence, facilitates the generation of culturally and contextually appropriate translation in the presence of irrelevant or erroneous information. Application of statistical, contextual, prosodic and discourse constraints permits a progressively narrowing search for the most plausible interpretation of an utterance. During translation, JANUS-II produces paraphrases that are used for interactive correction of translation errors. Beyond our continuing efforts to improve robustness and accuracy, we have also begun to study possible forms of deployment. Several system prototypes have been implemented to explore translation needs in different settings: speech translation in one-on-one video conferencing, as portable mobile interpreter, or as passive simultaneous conversation translator. We will discuss their usability and performance.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

T. Morimoto, T. Takezawa, F. Yato, S. Sagayama, T. Tashiro, M. Nagata, and A. Kurematsu, “ATR’s Speech Translation System: ASURA”, EUROSPEECH 1993, pp. 1295
Google Scholar
D.B. Roe, F.C.N. Pereira, R.W. Sproat, and M.D. Riley, “Efficient Grammar Processing for a Spoken Language Translation System”, ICASSP 1992, Vol. 1, pp. 213
Google Scholar
A. Waibel, A.M. Jain, A.E. McNair, H. Saito, A.G. Hauptmann, and J. Tebelskis “JANUS: A Speech-To-Speech Translation System Using Connectionist and Symbolic Processing Strategies”, ICASSP’91, 1991
Google Scholar
K. Hatazaki, J. Noguchi, A. Okumura, K. Yoshida, T. Watanabe, “INTERTALKER: An Experimental Automatic Interpretation System Using Conceptual Representation”, ICSLP 1992
Google Scholar
W. Wahlster “First Results of Verbmobil: Translation Assistance for Spontaneous Dialogues” ATR International Workshop on Speech Translation, November 8–9, 1993
Google Scholar
B. Suhm, P. Geutner, T. Kemp, A. Lavie, L. Mayfield, A.E. McNair, I. Rogina, T. Schultz, T. Sloboda, W. Ward, M. Woszczyna, and A. Waibel, “JANUS: Towards Multilingual Spoken Language Translation” in Proceedings of the ARPA Spoken Language Technology Workshop, Austin, TX, January 1995.
Google Scholar
H. Hild, ztnd A. Waibel, “Integrating Spelling into Spoken Dialogue Recognition”, EUROSPEECH, Vol. 2, pp. 1977
Google Scholar
W. Ward, “Understanding Spontaneous Speech: The Phoenix System”, ICASSP 1991, Vol. 1, pp. 365
Google Scholar
A. Lavie and M. Tomita, “GLR*-An Efficient Noise-Skipping Parsing Algorithm for Context-Free Grammars”, Proceedings of Third International Workshop on Parsing Technologies, 1993, p. 123
Google Scholar
M.T. Vo, R. Houghton, J. Yang, U. Bub, U. Maier and A. Waibel, “Multimodal Learning Interfaces”, in Proceedings of the ARPA Spoken Language Technology Workshop, Austin, TX, January 1995
Google Scholar
A.E. McNair and A. Waibel, “Improving Recognizer Acceptance through Robust, Natural Speech Repair”, ICSLP 1994, Vol. 3, pp. 1299
Google Scholar
Hutchins, W.J. and H. Somers. An Introduction to Machine Translation, Academic Press, San Diego, 1992
MATH Google Scholar
Hovy, E.H., “How MT Works”, Special Feature on Machine Translation, Byte Magazine, (167–176), January, 1993
Google Scholar
S. Nirenburg, J.C. Carbonell, M. Tomita, and K. Goodman, “Machine Translation: A Knowledge-Based Approach”, Morgan Kaufmann, San Mateo, 1992
Google Scholar

Download references

Author information

Authors and Affiliations

Interactive Systems Laboratories, Carnegie Mellon University, 5000 Forbes Avenue, Pittsburgh, PA, 15217, USA
Alex Waibel

Authors

Alex Waibel
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Speech Research Unit, DERA Malvern, St. Andrew’s Road, WR14 4DT, Great Malvern, Worcs, UK
Keith Ponting

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Waibel, A. (1999). Interactive Translation of Conversational Speech. In: Ponting, K. (eds) Computational Models of Speech Pattern Processing. NATO ASI Series, vol 169. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-60087-6_33

Download citation

DOI: https://doi.org/10.1007/978-3-642-60087-6_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-64250-0
Online ISBN: 978-3-642-60087-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics