Summary
We present JANUS-II, a large scale system effort aimed at interactive spoken language translation. JANUS-II now accepts spontaneous conversational speech in a limited domain in English, German or Spanish and produces output in German, English, Spanish, Japanese and Korean. The challenges of coarticulated, disfluent, ill-formed speech are manifold, and have required advances in acoustic modeling, dictionary learning, language modeling, semantic parsing and generation, to achieve acceptable performance. A semantic “inter-lingua” that represents the intended meaning of an input sentence, facilitates the generation of culturally and contextually appropriate translation in the presence of irrelevant or erroneous information. Application of statistical, contextual, prosodic and discourse constraints permits a progressively narrowing search for the most plausible interpretation of an utterance. During translation, JANUS-II produces paraphrases that are used for interactive correction of translation errors. Beyond our continuing efforts to improve robustness and accuracy, we have also begun to study possible forms of deployment. Several system prototypes have been implemented to explore translation needs in different settings: speech translation in one-on-one video conferencing, as portable mobile interpreter, or as passive simultaneous conversation translator. We will discuss their usability and performance.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
T. Morimoto, T. Takezawa, F. Yato, S. Sagayama, T. Tashiro, M. Nagata, and A. Kurematsu, “ATR’s Speech Translation System: ASURA”, EUROSPEECH 1993, pp. 1295
D.B. Roe, F.C.N. Pereira, R.W. Sproat, and M.D. Riley, “Efficient Grammar Processing for a Spoken Language Translation System”, ICASSP 1992, Vol. 1, pp. 213
A. Waibel, A.M. Jain, A.E. McNair, H. Saito, A.G. Hauptmann, and J. Tebelskis “JANUS: A Speech-To-Speech Translation System Using Connectionist and Symbolic Processing Strategies”, ICASSP’91, 1991
K. Hatazaki, J. Noguchi, A. Okumura, K. Yoshida, T. Watanabe, “INTERTALKER: An Experimental Automatic Interpretation System Using Conceptual Representation”, ICSLP 1992
W. Wahlster “First Results of Verbmobil: Translation Assistance for Spontaneous Dialogues” ATR International Workshop on Speech Translation, November 8–9, 1993
B. Suhm, P. Geutner, T. Kemp, A. Lavie, L. Mayfield, A.E. McNair, I. Rogina, T. Schultz, T. Sloboda, W. Ward, M. Woszczyna, and A. Waibel, “JANUS: Towards Multilingual Spoken Language Translation” in Proceedings of the ARPA Spoken Language Technology Workshop, Austin, TX, January 1995.
H. Hild, ztnd A. Waibel, “Integrating Spelling into Spoken Dialogue Recognition”, EUROSPEECH, Vol. 2, pp. 1977
W. Ward, “Understanding Spontaneous Speech: The Phoenix System”, ICASSP 1991, Vol. 1, pp. 365
A. Lavie and M. Tomita, “GLR*-An Efficient Noise-Skipping Parsing Algorithm for Context-Free Grammars”, Proceedings of Third International Workshop on Parsing Technologies, 1993, p. 123
M.T. Vo, R. Houghton, J. Yang, U. Bub, U. Maier and A. Waibel, “Multimodal Learning Interfaces”, in Proceedings of the ARPA Spoken Language Technology Workshop, Austin, TX, January 1995
A.E. McNair and A. Waibel, “Improving Recognizer Acceptance through Robust, Natural Speech Repair”, ICSLP 1994, Vol. 3, pp. 1299
Hutchins, W.J. and H. Somers. An Introduction to Machine Translation, Academic Press, San Diego, 1992
Hovy, E.H., “How MT Works”, Special Feature on Machine Translation, Byte Magazine, (167–176), January, 1993
S. Nirenburg, J.C. Carbonell, M. Tomita, and K. Goodman, “Machine Translation: A Knowledge-Based Approach”, Morgan Kaufmann, San Mateo, 1992
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Waibel, A. (1999). Interactive Translation of Conversational Speech. In: Ponting, K. (eds) Computational Models of Speech Pattern Processing. NATO ASI Series, vol 169. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-60087-6_33
Download citation
DOI: https://doi.org/10.1007/978-3-642-60087-6_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-64250-0
Online ISBN: 978-3-642-60087-6
eBook Packages: Springer Book Archive