Advertisement

Journal of Intelligent Information Systems

, Volume 37, Issue 2, pp 139–154 | Cite as

A vector-space dynamic feature for phrase-based statistical machine translation

  • Marta R. Costa-jussàEmail author
  • Rafael E. Banchs
Article

Abstract

In this paper, we propose and evaluate a novel dynamic feature function for log-linear model combinations in phrase-based statistical machine translation. The feature function is inspired on the popularly known vector-space model which is typically used in information retrieval and text mining applications, and it aims at improving translation unit selection at decoding time by incorporating context information from the source language. Significant improvements on an English-Spanish experimental corpus are presented and discussed.

Keywords

Statistical machine translation Source context information Vector-space model 

Notes

Acknowledgements

The authors would like to thank Barcelona Media Innovation Center and Institute for Infocomm Research for its support and permission to publish this research. We would also like to thank Bart Mellebeek for his helpful contribution. We would like to give credit to the anonymous reviewers of this paper for their valuable suggestions.

This work has been partially funded by the Spanish Department of Education and Science through the Juan de la Cierva fellowship program and the Spanish Government under the BUCEADOR project (TEC2009-14094-C04-01).

References

  1. Carpuat, M., & Wu, D. (2007). Improving statistical machine translation using word sense disambiguation. In Empirical methods in natural language processing (EMNLP) (pp. 61–72). Prague.Google Scholar
  2. Chew, P. A., Verzi, S. J., Bauer, T. L., & McClain, J. T. (2006). Evaluation of the bible as a resource for cross-language information retrieval. In Proceedings of the workshop on multilingual language resources and interoperability (pp. 68–74). Sydney, Australia.Google Scholar
  3. Costa-jussà, M. R., & Fonollosa, J. A. R. (2009). State-of-the-art word reordering approaches in statistical machine translation. IEICE Transactions on Information and Systems, 92(11), 2179–2185.CrossRefGoogle Scholar
  4. Haque, R., Kumar Naskar, S., Ma, Y., & Way, A. (2009). Using supertags as source language context in smt. In 13th annual conference of the European association for machine translation (EAMT) (pp. 234–241). Barcelona.Google Scholar
  5. Koehn, K., & Knight, K. (2003). Empirical methods for compound splitting. In Proc. of the 10th conf. of the European chapter of the association for computational linguistics (pp. 347–354). Budapest, Hungary.Google Scholar
  6. Koehn, P., Hoang, H., Birch, A., Callison-Burch, C., Federico, M., Bertoldi, N., et al. (2007). Moses: Open source toolkit for statistical machine translation. In Proc. of the 45th annual meeting of the association for computational linguistics (pp. 177–180). Prague, Czech Republic.Google Scholar
  7. Och, F. J. (1999). An efficient method for determining bilingual word classes. In Proc. of the 9th conf. of the European chapter of the association for computational linguistics (pp. 71–76). Bergen, Norway.Google Scholar
  8. Och, F. J. (2003). Minimum error rate training in statistical machine translation. In Proc. of the 41th annual meeting of the association for computational linguistics (pp. 160–167). Sapporo.Google Scholar
  9. Och, F. J., & Ney, H. (2000). A comparison of alignment models for statistical machine translation. In Proc. of the 18th conference on computational linguistics (pp. 1086–1090). Morristown, USA.Google Scholar
  10. Och, F. J., & Ney, H. (2002). Discriminative training and maximum entropy models for statistical machine translation. In Proc. of the 40th annual meeting of the association for computational linguistics (pp. 295–302). Philadelphia, USA.Google Scholar
  11. Papineni, K., Roukos, S., Ward, T., & Zhu, W.-J. (2002). Bleu: A method for automatic evaluation of machine translation. In Proc. of the 40th annual meeting of the association for computational linguistics (pp. 311–318). Philadelphia, PA.Google Scholar
  12. Salton, G., & McGill, M. (1983). Introduction to modern information retrieval. McGraw-Hill.Google Scholar
  13. Salton, G., Wong, A., & Yang, C. S. (1975). A vector space model for automatic indexing. Communications of the ACM, 18(11), 613–620.zbMATHCrossRefGoogle Scholar
  14. Schwenk, H., Costa-jussà, M. R., & Fonollosa, J. A. R. (2007). Smooth bilingual translation. In Empirical methods in natural language processing (EMNLP) (pp. 430–438). Prague.Google Scholar
  15. Stolcke, A. (2002). SRILM—an extensible language modeling toolkit. In Proc. of the 7th int. conf. on spoken language processing, ICSLP’02 (pp. 901–904). Denver, USA.Google Scholar
  16. Stroppa, N., van de Bosch, A., & Way, A. (2007). Exploiting source similarity for smt using context-informed features. In 11th conference on theoretical and methodological issues in machine translation (TMI) (pp. 231–240). Skövde.Google Scholar

Copyright information

© Springer Science+Business Media, LLC 2010

Authors and Affiliations

  1. 1.Speech and Language DepartmentBarcelona Media Innovation CenterBarcelonaSpain
  2. 2.Human Language Technology DepartmentInstitute for Infocomm ResearchSingaporeSingapore

Personalised recommendations