An Investigation of an Interontologia: Comparison of the Thousand-Character Text and Roget’s Thesaurus

  • Sang-Rak Kim
  • Jae-Gun Yang
  • Jae-Hak J. Bae
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5459)

Abstract

The present study presents the lexical category analysis of the Thousand-Character Text and Roget’s Thesaurus. Through preprocessing, the Thousand-Character Text and Roget’s Thesaurus have been built into databases. In addition, for easier analysis and more efficient research, we have developed a system to search Roget’s Thesaurus for the categories corresponding to Chinese characters in the Thousand-Character Text. According to the results of this study, most of the 39 sections of Roget’s Thesaurus except the ’Creative Thought’ section were relevant to Chinese characters in the Thousand-Character Text. Three sections ’Space in General’, ’Dimensions’ and ’Matter in General’ have higher mapping rate. The correlation coefficient is also around 0.94, showing high category relevancy on the section level between the Thousand-Character Text and Roget’s Thesaurus.

Keywords

Thousand-Character Text Roget’s Thesaurus Ontology Interontologia 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
  2. 2.
  3. 3.
    Lexical FreeNet, http://www.cinfn.com/doc/
  4. 4.
    Ohno, S., Hamanishi, M.: New Synonyms Dictionary, Kadogawa Shoten, Tokyo (1981) (written in Japanese)Google Scholar
  5. 5.
    The EDR Electronic Dictionary, http://www2.nict.go.jp/r/r312/EDR/index.html
  6. 6.
  7. 7.
    CYC Ontology, http://www.cyc.com/
  8. 8.
  9. 9.
  10. 10.
  11. 11.
  12. 12.
  13. 13.
  14. 14.
  15. 15.
  16. 16.
  17. 17.
    Richard, E.N.: The Geography of Thought: How Asians and Westerners Think Differently... and Why. Simon & Schuster, New York (2004)Google Scholar
  18. 18.
    Kalfoglou, Y., Schorlemmer, M.: Ontology mapping: the state of the art. The Knowledge Engineering Review 18(1), 1–31 (2003)CrossRefGoogle Scholar
  19. 19.
    Noy, N.F.: Semantic Integration: A Survey of Ontology-Based Approaches. SIGMOD Record 33(4), 65–70 (2004)CrossRefGoogle Scholar
  20. 20.
    Kim, J.-T., Song, C.-S.: Comparison of Vocabulary Classification Systems among Thousand-Character Text, Yuhap, and Hunmongjahoi, Korean Literature Society, Linguistics and Literature, vol. 52, pp. 159–192 (1991) (written in Korean)Google Scholar
  21. 21.
    Jin, T.-H.: Problems in the Translations and Sounds of Thousand-Character Text, Hangeul-Chinese Character Culture, vol. 104, pp. 80–82 (2008) (written in Korean)Google Scholar
  22. 22.
    Kingsoft 2008 (谷歌金山词霸), http://g.iciba.com/
  23. 23.

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Sang-Rak Kim
    • 1
  • Jae-Gun Yang
    • 2
  • Jae-Hak J. Bae
    • 2
  1. 1.Institute of e-Vehicle TechnologyUniversity of Ulsan; ITSTAR Co., Ltd.UlsanSouth Korea
  2. 2.School of Computer Engineering & Information TechnologyUniversity of UlsanUlsanSouth Korea

Personalised recommendations