An Unsupervised Method for Ranking Translation Words Using a Bilingual Dictionary and WordNet
In the context of machine translation, picking the correct translation for a target word among multiple candidates is an important process. In this paper, we propose an unsupervised method for ranking translation word selection for Korean verbs relying on only a bilingual Korean-English dictionary and WordNet. We focus on deciding which translation of the verb target word is the most appropriate by using a measure of inter-word semantic relatedness through the five extended relations between possible translations pair of target verb and some indicative noun clues. In order to reduce the weight of application of possibly unwanted senses for the noun translation, we rank the weight of possible senses for each noun translation word in advance. The evaluation shows that our method outperforms the default baseline performance and previous works. Moreover, this approach provides an alternative to the supervised corpus based approaches that rely on a large corpus of senses annotated data.
KeywordsTarget Word Semantic Relatedness Machine Translation Word Sense Word Sense Disambiguation
Unable to display preview. Download preview PDF.
- 1.Banerjee, S., Pedersen, T.: Extended Gloss Overlaps as a Measure of Semantic Relatedness. In: Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence, pp. 805–810 (2003)Google Scholar
- 2.Brown, P., Cocke, J., Pietra, V., Peitra, S., Jelinek, F., Lafferty, J., Mercer, R., Roosin, P.: A Statistical Approach to Machine Translation. Computational Linguistics 16(2), 79–85 (1990)Google Scholar
- 4.Dagan, I., And Itai, A.: Word Sense Disambiguation Using a Second Language Monolingual Corpus. Computational Linguistics 20(4), 563–596 (1994)Google Scholar
- 6.Inkpen, D., Hirst, G.: Automatic Sense Disambiguation of the Near-Synonyms in a Dictionary Entry. In: Proceedings, Fourth Conference on Intelligent Text Processing and Computational Linguistics, pp. 258–267 (2003)Google Scholar
- 7.Jiang, J., Conrath, D.: Semantic Similarity based on Corpus Statistics and Lexicon Taxonomy. In: Proceedings on International Conference on Research in Computational Linguistics, pp. 19–33 (1997)Google Scholar
- 8.Klavans, J., Tzoukermann, E.: Combining Corpus and Machine Readable Dictionary Data for Building Bilingual Lexicons. Machine Translation 10(3), 1–34 (1996)Google Scholar
- 9.Koehn, P., Knight, K.: Knowledge Sources for Word-Level Translation Models. In: Empirical Methods in Natural Language Processing Conference, pp. 27–35 (2001)Google Scholar
- 10.Lesk, M.: Automatic Sense Disambiguation Using Machine Readable Dictionaries: how to tell a pine code from an ice cream cone. In: Proceedings of the Fifth Annual International Conference on Systems Documentations, pp. 24–26 (1986)Google Scholar
- 12.Patwardhan, S., Perdersen, T.: The cpan wordnet:similarity package (2005), http://search.cpan.org/~sid/WordNet-Similarity-0.06/