Advertisement

Translation Selection Through Machine Learning with Language Resources

  • Hyun Ah Lee
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4285)

Abstract

Knowledge acquisition is a critical problem for machine translation and translation selection. In this paper, I propose a tranlsation selection method that combines variable features from multiple language resources using machine learning. I introduce multiple measures for sense disambiguation and word selection that are based on language resources, and apply machine learning to combine those measures for translation selection. In evaluation, precision of translation selection improves even though a small-sized bilingual corpus is used as training data.

Keywords

Translation Selection Knowledge Acquisition Problem Machine Readable Dictionary Target Language Corpus Machine Learning 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Brown, P.F., Cocke, J., Pietra, V.D., Pietra, S.D., Jelinek, F., Lafferty, J.D., Mercer, R.L., Roossin, P.S.: A Statistical Approach to Machine Translation. Computational Linguistics 16(2) (1990)Google Scholar
  2. 2.
    Munteanu, D., Marcu, D.: Improving Machine Translation Performance by Exploiting Comparable Corpora. Computational Linguistics 31(4) (2005)Google Scholar
  3. 3.
    Dagan, I., Itai, A.: Word Sense Disambiguation Using a Second Language Monolingual Corpus. Computational Linguistics 20(4) (1994)Google Scholar
  4. 4.
    Prescher, D., Riezler, S., Rooth, M.: Using a Probabilistic Class-Based Lexicon for Lexical Ambiguity Resolution. In: Proceedings of the 18th International Conference on Computational Linguistics (2000)Google Scholar
  5. 5.
    Koehn, P., Knight, K.: Estimating Word Translation Probabilities from Unrelated Monolingual Corpora Using the EM Algorithm. In: Proceedings of National Conference on Artificial Intelligence (2000)Google Scholar
  6. 6.
    Gaussier, E., Renders, J.-M., Matveeva, I., Goutte, C., Déjean, H.: A Geometric view on bilingual lexicon extraction from comparable corpora. In: Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics (2004)Google Scholar
  7. 7.
    Lee, H.A., Yoon, J., Kim, G.C.: Translation Selection by Combining Multiple Measures for Sense Disambiguation and Word Selection. International Journal of Computer Processing of Oriental Languages 16(3) (2003)Google Scholar
  8. 8.
    Daelemans, W., Zavrel, J., Van der Sloot, K., Van den Bosch, A.: TiMBL: Tilburg Memory Based Learner, version 5.1, Reference Guide. ILK Technical Report Series 04-02 (2004)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Hyun Ah Lee
    • 1
  1. 1.School of Computer and Software Engineering, Kumoh National Institute of TechnologyGyeongbukRepublic of Korea

Personalised recommendations