Lexical Ambiguity Resolution for Turkish in Direct Transfer Machine Translation Models

  • A. Cüneyd Tantuğ
  • Eşref Adalı
  • Kemal Oflazer
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4263)

Abstract

This paper presents a statistical lexical ambiguity resolution method in direct transfer machine translation models in which the target language is Turkish. Since direct transfer MT models do not have full syntactic information, most of the lexical ambiguity resolution methods are not very helpful. Our disambiguation model is based on statistical language models. We have investigated the performances of some statistical language model types and parameters in lexical ambiguity resolution for our direct transfer MT system.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Hajič, J., Hric, J., Kubon, V.: Machine Translation of Very Close Languages. Applied NLP Processing. NAACL, Washington (2000)Google Scholar
  2. 2.
    Canals, R., et al.: interNOSTRUM: A Spanish-Catalan Machine Translation System. EAMT Machine Translation Summit VIII, Spain (2001)Google Scholar
  3. 3.
    Hirst, G., Charniak, E.: Word Sense and Case Slot Disambiguation. In: AIII 1982, pp. 95–98 (1982)Google Scholar
  4. 4.
    Black, E.: An Experiment in Computational Discrimination of English Word Senses. IBM Journal of Research and Development 32(2), 185–194 (1988)CrossRefGoogle Scholar
  5. 5.
    Gale, W., Church, K.W., Yarowsky, D.: A Method for Disambiguating Word Senses in a Large Corpus, Statistical Research Report 104, Bell Laboratories (1992)Google Scholar
  6. 6.
    Leacock, C., Towell, G., Voorhees, E.: Corpus-based statistical sense resolution. In: Proceedings of the ARPA Human Language Technology Worskshop, Morgan Kaufmann, San Francisco (1993)Google Scholar
  7. 7.
    Schütze, H.: Automatic word sense discrimination. Computational Linguistics 24(1) (1998)Google Scholar
  8. 8.
    Yarowsky, D.: Unsupervised word sense disambiguation rivaling supervised methods. In: Proceedings of the 33rd Annual Meeting of the Association for Computational Linguistics (ACL 1995), Cambridge, MA (1995)Google Scholar
  9. 9.
    Hearst, M.A.: Noun homograph disambiguation using local context in large corpora. In: Proceedings of the 7th Annual Conf. of the University of Waterloo Centre for the New OED and Text Research, Oxford, United Kingdom, pp. 1–19 (1991)Google Scholar
  10. 10.
    McRoy, S.W.: Using multiple knowledge sources for word sense discrimination. Computational Linguistics 18(1), 1–30 (1992)Google Scholar
  11. 11.
    Clarkson, P.R., Rosenfeld, R.: Statistical Language Modeling Using the CMU-Cambridge Toolkit. In: Proceedings ESCA Eurospeech (1997)Google Scholar
  12. 12.
    Fomey Jr., G.D.: The Viterbi Algorithm. IEEE Proc. 61, 268–278 (1973)CrossRefGoogle Scholar
  13. 13.
    NIST Report, Automatic Evaluation of Machine Translation Quality Using N-gram Co-Occurrence Statistics (2002), http://www.nist.gov/speech/tests/mt/doc/ngramstudy.pdf

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • A. Cüneyd Tantuğ
    • 1
  • Eşref Adalı
    • 1
  • Kemal Oflazer
    • 2
  1. 1.Computer Engineering DepartmentIstanbul Technical University Faculty of Eletrical-Electronic EngineeringIstanbulTürkiye
  2. 2.Faculty Of Engineering and Natural SciencesSabancı UniversityTuzlaTürkiye

Personalised recommendations