Integration of a Noun Compound Translator Tool with Moses for English-Hindi Machine Translation and Evaluation

  • Prashant Mathur
  • Soma Paul
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7182)

Abstract

Noun Compounds are a frequently occurring multiword expression in English written texts. English noun compounds are translated into varied syntactic constructs in Hindi. The performance of existing translation system makes the point clear that there exists no satisfactorily efficient Noun Compound translation tool from English to Hindi although the need of one is unprecedented in the context of machine translation. In this paper we integrate Noun Compound Translator [13], a statistical tool for Noun Compound translation, with the state-of-the-art machine translation tool, Moses [10]. We evaluate the integrated system on test data of 300 source language sentences which contain Noun Compounds and are translated manually into Hindi. A gain of 29% on BLEU score and 27% on Human evaluation has been observed on the test data.

Keywords

Machine Translation Target Sentence Statistical Machine Translation Parallel Corpus Source Sentence 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Seaghdha, D.O.: Learning Noun Compounds semantics. PhD Thesis, Computer Laboratory, University of Cambridge. Technical Report 735 (2008)Google Scholar
  2. 2.
    Gawronska, B., Nordner, A., Johansson, C., Willners, C.: Interpreting compounds for machine translation. In: Proceedings of COLING 1994, Kyoto, Japan (1994)Google Scholar
  3. 3.
    Schmid, H.: Probabilistic Part-of-Speech Tagging Using Decision Trees. In: International Conference on New Methods in Language Processing. Manchester, UK (1994)Google Scholar
  4. 4.
    Reinhard, K., Ney, H.: Improved backing-off for n-gram language model. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Detroit, MI, vol. 1, pp. 181–182 (1995)Google Scholar
  5. 5.
    Bungum, L., Oepen, S.: Automatic Translation of Norwegian Noun Compounds. In: Proceedings of the 13th Annual Meeting of the European Association for Machine Translation, EAMT 2009 (2009)Google Scholar
  6. 6.
    Burnard, L.: User Reference Guide for the British National Corpus, Technical Report. Oxford University Computing Services (2000)Google Scholar
  7. 7.
    Federico, M., Bertoldi, N., Cettolo, M.: IRSTLM: an open source toolkit for handling large scale language models. In: INTERSPEECH 2008, pp. 1618–1621 (2008)Google Scholar
  8. 8.
    Och, F.J.: Statistical Machine Translation: From Single Words Models to Alignment Templates. PhD Thesis, RWTH Aachen, Germany (2002)Google Scholar
  9. 9.
    Och, F.: Minimum Error rate training in statistical machine translation. In: Proc. of the 41st Annual Meeting of the Association for Computational Linguistics (ACL 2003), Sapporo, Japan, pp. 160–167 (2003)Google Scholar
  10. 10.
    Koehn, P., Hoang, H., Birch, A., Callison-Burch, C., Federico, M., Bertoldi, N., Cowan, B., Shen, W., Moran, C., Zens, R., Dyer, C., Bojar, O., Constantin, A., Herbst, E.: Moses: Open Source Toolkit for Statistical Machine Translation. In: ACL 2007, Demonstration Session, Prague, Czech Republic (2007)Google Scholar
  11. 11.
    Koehn, P., Och, F.J., Marcu, D.: Statistical Phrase Based Translation. In: NAACL 2003 (2003)Google Scholar
  12. 12.
    Papineni, K., Roukos, S., Ward, T., Zhu, W.J.: BLEU: a method for automatic evaluation of machine translation. In: ACL 2002: 40th Annual Meeting of the Association for Computational Linguistics, pp. 311–318 (2002)Google Scholar
  13. 13.
    Mathur, P., Paul, S.: Automatic Translation of Nominal Compounds from English to Hindi. In: The Proceedings of International Conference on Natural Language Processing, Hyderabad, ICON (2009)Google Scholar
  14. 14.
    Patwardhan, S., Banerjee, S., Pedersen, T.: SenseRelate:TargetWord A Generalized Framework for Word Sense Disambiguation. In: Proceedings of the ACL Interactive Poster and Demonstration Sessions, Ann Arbor, MI (2005)Google Scholar
  15. 15.
    Baldwin, T., Tanaka, T.: Translation by Machine of Complex Nominals: Getting it Right. In: The Proceedings of ACL04 Workshop on Multiword Expression:Integrating Processing, Barcelona, Spain (2004)Google Scholar
  16. 16.
    Rackow, U., Dagan, I., Schwall, U.: Automatic Translation of Noun Compounds. In: COLING, pp. 1249–1253 (1992)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Prashant Mathur
    • 1
  • Soma Paul
    • 1
  1. 1.Language Technology Research CenterInternational Institute of Information TechnologyHyderabadIndia

Personalised recommendations