Phrase-Based Statistical Machine Translation
This paper is based on the work carried out in the framework of the Verbmobil project, which is a limited-domain speech translation task (German-English). In the final evaluation, the statistical approach was found to perform best among five competing approaches.
In this paper, we will further investigate the used statistical translation models. A shortcoming of the single-word based model is that it does not take contextual information into account for the translation decisions. We will present a translation model that is based on bilingual phrases to explicitly model the local context. We will show that this model performs better than the single-word based model. We will compare monotone and non-monotone search for this model and we will investigate the benefit of using the sum criterion instead of the maximum approximation.
KeywordsTarget Sentence Translation Model Word Error Rate Sentence Pair Source Sentence
Unable to display preview. Download preview PDF.
- 1.Auerswald, M.: Example-based machine translation with templates.  418–427Google Scholar
- 2.Block, H.U.: Example-based incremental synchronous interpretation.  411–417Google Scholar
- 3.Brown, P.F., Della Pietra, S.A., Della Pietra, V.J., Mercer, R.L.: The mathematics of statistical machine translation: Parameter estimation. Computational Linguistics 19 (1993) 263–311Google Scholar
- 4.Emele, M.C., Dorna, M., Lüdeling, A., Zinsmeister, H., Rohrer, C.: Semantic-based transfer.  359–376Google Scholar
- 5.Germann, U., Jahr, M., Knight, K., Marcu, D., Yamada, K.: Fast decoding and optimal decoding for machine translation. In: Proc. of the 39th Annual Meeting of the Association for Computational Linguistics (ACL), Toulouse, France (2001) 228–235Google Scholar
- 8.Nießen, S., Och, F.J., Leusch, G., Ney, H.: An evaluation tool for machine translation: Fast evaluation for MT research. In: Proc. of the Second Int. Conf. on Language Resources and Evaluation (LREC), Athens, Greece (2000) 39–45Google Scholar
- 9.Och, F. J., Ney, H.: Discriminative training and maximum entropy models for statistical machine translation. In: Proc. of the 40th Annual Meeting of the Association for Computational Linguistics (ACL). (2002) 8 pages To appear.Google Scholar
- 10.Och, F.J., Tillmann, C., Ney, H.: Improved alignment models for statistical machine translation. In: Proc. of the Joint SIGDAT Conf. on Empirical Methods in Natural Language Processing and Very Large Corpora, University of Maryland, College Park, MD (1999) 20–28Google Scholar
- 11.Papineni, K.A., Roukos, S., Ward, T., Zhu, W.J.: Bleu: a method for automatic evaluation of machine translation. Technical Report RC22176 (W0109-022), IBM Research Division, Thomas J. Watson Research Center (2001)Google Scholar
- 12.Reithinger, N., Engel, R.: Robust content extraction for translation and dialog processing.  428–437Google Scholar
- 13.Tessiore, L., v. Hahn, W.: Functional validation of a machine interpretation system: Verbmobil.  611–631Google Scholar
- 14.Tillmann, C.: Word re-ordering and dynamic programming based search algorithms for statistical machine translation. PhD thesis, Computer Science Department, RWTH Aachen, Germany (2001)Google Scholar
- 15.Tillmann, C., Ney, H.: Word re-ordering and DP-based search in statistical machine translation. In: COLING’ 00: The 18th Int. Conf. on Computational Linguistics, Saarbrücken, Germany (2000) 850–856Google Scholar
- 16.Vogel, S., Ney, H., Tillmann, C.: HMM-based word alignment in statistical translation. In: COLING’ 96: The 16th Int. Conf. on Computational Linguistics, Copenhagen, Denmark (1996) 836–841Google Scholar
- 17.Wahlster, W., ed.: Verbmobil: Foundations of speech-to-speech translations. Springer Verlag, Berlin, Germany (2000)Google Scholar
- 18.Wang, Y.Y., Waibel, A.: Modeling with structures in statistical machine translation. In: COLING-ACL’ 98: 36th Annual Meeting of the Association for Computational Linguistics and 17th Int. Conf. on Computational Linguistics. Volume 2, Montreal, Canada (1998) 1357–1363Google Scholar