Noun and Keyword Detection of Korean in Ubiquitous Environment

  • Seong-Yoon Shin
  • Oh-Hyung Kang
  • Sang-Joon Park
  • Jong-Chan Lee
  • Seong-Bae Pyo
  • Yang-Won Rhee
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5592)

Abstract

Noun and keyword extraction is a key element of language processing in a ubiquitous environment. For Korean language information processing, however, noun and keyword extraction still poses many problems. This paper proposes an effective noun extraction method that considers noun emergence features. The proposed method can be used effectively in areas such as information retrieval, where large volumes of documents and data must be processed quickly. This paper also presents a category-based keyword construction method that uses an unsupervised learning technique so that high volumes of queries are classified automatically. Our experimental results show that, in terms of classification precision, the proposed method outperformed both the supervised learning-based χ2 method, which is known to excel in keyword extraction, and the DF method.
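For orientation, the two baselines named in the abstract can be sketched as follows. This is a minimal illustration of the textbook χ2 statistic and of DF read as document frequency (an assumption about the abbreviation), not the authors' implementation and not the proposed method. The function names, the toy English tokens, and the category labels are all hypothetical.

```python
from collections import Counter


def document_frequency(docs):
    """Count, for each term, how many documents contain it at least once."""
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))  # set() so a term counts once per document
    return df


def chi_square(docs, labels, term, category):
    """Chi-square statistic of a term for a category (2x2 contingency table)."""
    a = b = c = d = 0
    for tokens, label in zip(docs, labels):
        has_term = term in tokens
        in_category = label == category
        if has_term and in_category:
            a += 1  # term present, document in category
        elif has_term:
            b += 1  # term present, document in another category
        elif in_category:
            c += 1  # term absent, document in category
        else:
            d += 1  # term absent, document in another category
    n = a + b + c + d
    denominator = (a + c) * (b + d) * (a + b) * (c + d)
    if denominator == 0:
        return 0.0  # term or category is constant; no association measurable
    return n * (a * d - c * b) ** 2 / denominator


# Hypothetical toy corpus: each document is a list of already extracted nouns.
docs = [
    ["phone", "price"],
    ["phone", "camera"],
    ["soccer", "goal"],
    ["soccer", "price"],
]
labels = ["tech", "tech", "sports", "sports"]

df = document_frequency(docs)
phone_score = chi_square(docs, labels, "phone", "tech")
price_score = chi_square(docs, labels, "price", "tech")
```

In this toy corpus "phone" and "price" have the same document frequency, but only "phone" is associated with the "tech" category, which is why χ2 needs labeled documents while DF does not.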

Keywords

  • Keyword Detection
  • Noun Detection
  • Supervised Learning-based Method
  • χ2 Method

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Seong-Yoon Shin (1)
  • Oh-Hyung Kang (1)
  • Sang-Joon Park (1)
  • Jong-Chan Lee (1)
  • Seong-Bae Pyo (2)
  • Yang-Won Rhee (1)

  1. Dept. of Computer Information Engineering, Kunsan Natl. Univ., Korea
  2. Dept. of Computer Software, Induk College, South Korea
