A Korean Part-of-Speech Tagging System Using Resolution Rules for Individual Ambiguous Word

  • Young-Min Ahn
  • Seung-Eun Shin
  • Hee-Geun Park
  • Hyungsuk Ji
  • Young-Hoon Seo
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4488)

Abstract

In this paper we present a Korean part-of-speech tagging system using resolution rules for individual ambiguous word. Our system resolves lexical ambiguities by common rules, rules for individual ambiguous word, and statistical approach. We built resolution rules for each word which has several distinct morphological analysis results with a view to enhancing tagging accuracy. Statistical approach based on Hidden Markov Model (HMM) is applied for ambiguous words that are not resolved by the rules. The experiment on the test set shows that the part-of-speech tagging system has high accuracy and broad coverage.

Keywords

Part-of-Speech Tagging Resolution Rules 

References

  1. 1.
    Merialdo, B.: Tagging English Text with a Probabilistic Model. Computational Linguistics 20(2), 155–171 (1994)Google Scholar
  2. 2.
    Brill, E.: Transformation-Based Error-Driven Learning and Natural Language Processing: A Case Study in Part-of-Speech Tagging. Computational Linguistics 21(4), 543–564 (1995)Google Scholar
  3. 3.
    Tapanainen, P., Voutilainen, A.: Tagging accurately — Don’t guess if you know. In: Proceedings of the 7th Conference of the European Chapter of the Association for Computational Linguistics, pp. 149–156 (1994)Google Scholar

Copyright information

© Springer Berlin Heidelberg 2007

Authors and Affiliations

  • Young-Min Ahn
    • 1
  • Seung-Eun Shin
    • 2
  • Hee-Geun Park
    • 1
  • Hyungsuk Ji
    • 3
  • Young-Hoon Seo
    • 1
  1. 1.School of Electrical & Computer Engineering, Chungbuk National UniversityKorea
  2. 2.BK21 Chungbuk Information Technology Center, Chungbuk National UniversityKorea
  3. 3.School of Information & Communication Engineering, Sungkyunkwan UniversityKorea

Personalised recommendations