Abstract
The present study presents the lexical category analysis of the Thousand-Character Text and Roget’s Thesaurus. Through preprocessing, the Thousand-Character Text and Roget’s Thesaurus have been built into databases. In addition, for easier analysis and more efficient research, we have developed a system to search Roget’s Thesaurus for the categories corresponding to Chinese characters in the Thousand-Character Text. According to the results of this study, most of the 39 sections of Roget’s Thesaurus except the ’Creative Thought’ section were relevant to Chinese characters in the Thousand-Character Text. Three sections ’Space in General’, ’Dimensions’ and ’Matter in General’ have higher mapping rate. The correlation coefficient is also around 0.94, showing high category relevancy on the section level between the Thousand-Character Text and Roget’s Thesaurus.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Roget’s Thesauri, http://www.bartleby.com/thesauri/
WordNet, http://wordnet.princeton.edu/
Lexical FreeNet, http://www.cinfn.com/doc/
Ohno, S., Hamanishi, M.: New Synonyms Dictionary, Kadogawa Shoten, Tokyo (1981) (written in Japanese)
The EDR Electronic Dictionary, http://www2.nict.go.jp/r/r312/EDR/index.html
KR Ontology, http://www.jfsowa.com/ontology/
CYC Ontology, http://www.cyc.com/
Mikrokosmos Ontology, http://crl.nmsu.edu/Research/Projects/mikro/htmls/ontology-htmls/onto.index.html
SENSUS Ontology, http://www.isi.edu/natural-language/projects/ONTOLOGIES.html
Enterprise Ontology, http://www.aiai.ed.ac.uk/project/enterprise/enterprise/ontology.html
UNSPSC, http://www.unspsc.org/
RosettaNet, http://www.rosettanet.org
ISO 2788, http://www.collectionscanada.gc.ca/iso/tc46sc9/standard/2788e.htm
ANSI Z39.19, http://www.niso.org/standards/resources/Z39-19-2005.pdf
Richard, E.N.: The Geography of Thought: How Asians and Westerners Think Differently... and Why. Simon & Schuster, New York (2004)
Kalfoglou, Y., Schorlemmer, M.: Ontology mapping: the state of the art. The Knowledge Engineering Review 18(1), 1–31 (2003)
Noy, N.F.: Semantic Integration: A Survey of Ontology-Based Approaches. SIGMOD Record 33(4), 65–70 (2004)
Kim, J.-T., Song, C.-S.: Comparison of Vocabulary Classification Systems among Thousand-Character Text, Yuhap, and Hunmongjahoi, Korean Literature Society, Linguistics and Literature, vol. 52, pp. 159–192 (1991) (written in Korean)
Jin, T.-H.: Problems in the Translations and Sounds of Thousand-Character Text, Hangeul-Chinese Character Culture, vol. 104, pp. 80–82 (2008) (written in Korean)
Kingsoft 2008 (谷歌金山词霸), http://g.iciba.com/
Classical Chinese Character FrequencyList, http://lingua.mtsu.edu/chinese-computing/statistics/char/list.php?Which=CL
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kim, SR., Yang, JG., Bae, JH.J. (2009). An Investigation of an Interontologia: Comparison of the Thousand-Character Text and Roget’s Thesaurus. In: Li, W., Mollá-Aliod, D. (eds) Computer Processing of Oriental Languages. Language Technology for the Knowledge-based Economy. ICCPOL 2009. Lecture Notes in Computer Science(), vol 5459. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00831-3_40
Download citation
DOI: https://doi.org/10.1007/978-3-642-00831-3_40
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-00830-6
Online ISBN: 978-3-642-00831-3
eBook Packages: Computer ScienceComputer Science (R0)