An Integrated Approach for Automatic Construction of Bilingual Chinese-English WordNet

  • Renjie Xu
  • Zhiqiang Gao
  • Yingji Pan
  • Yuzhong Qu
  • Zhisheng Huang
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5367)

Abstract

This paper compares various approaches for constructing Chinese-English bilingual WordNet. First, we implement three independent approaches that translate English WordNet to Chinese WordNet automatically, including Minimum Distance (MDA), Intersection (IA) and Words Co-occurrence (WCA). Minimum Distance compares the gloss of synset with the explanations of words from dictionaries. Intersection chooses the intersection part of Chinese in a synset. Words Co-occurrence counts the results of Chinese and English words from Google. Then, we integrate these three approaches into an integrated one, which is named MIWA. Experimental results show that the integrated approach MIWA has better performance: F-measure reaches 0.615, which is higher than that of each independent one.

Keywords

WordNet Chinese WordNet Co-occurrence 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Miller, G.A., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.J.: Introduction to WordNet: An On-line Lexical Database. International Journal of Lexicography (1990)Google Scholar
  2. 2.
    George, A.: Miller.: Nouns in WordNet: A Lexical Inheritance System. International Journal of Lexicography (1990)Google Scholar
  3. 3.
    Gross, D., Miller, K.J.: Adjectives in WordNet. International Journal of Lexicography (1990)Google Scholar
  4. 4.
    Fellbaum, C.: English Verbs as a Semantic Net. International Journal of Lexicography (1990)Google Scholar
  5. 5.
    Beckwith, R., Miller, G.A., Tengi, R.: Design and Implementation of the WordNet Lexical Database and Searching Software. International Journal of Lexicography (1990)Google Scholar
  6. 6.
    Vossen, P.: Introduction to EuroWordNet. In: Computers and the Humanities, vol. 32, pp. 73–89 (1998)Google Scholar
  7. 7.
    Vossen, P., Diez-Orzas, P., Peters, W.: Multilingual design of EuroWordNet. In: Proceedings of the IJCAI 1997 workshop Multilingual Ontologies for NLP Applications (1997)Google Scholar
  8. 8.
    Choi, K.-S., Bae, H.-S.: Procedures and Problems in Korean-Chinese-Japanese Wordnet with Shared Semantic Hierarchy. In: GWC 2004, Proceedings (2004)Google Scholar
  9. 9.
    Choi, K.-S., Bae, H.-S., Kang, W., Lee, J., Kim, E., Kim, H., Kim, D., Song, Y., Shin, H.: Korean-Chinese-Japanese Multilingual Wordnet with Shared Semantic Hierarchy. In: Proceedings of LREC 2004, Portugal (2004)Google Scholar
  10. 10.
    Choi, K.-S.: CoreNet: Chinese-Japanese-Korean wordnet with shared semantic hierarchy. In: Natural Language Processing and Knowledge Engineering (2003)Google Scholar
  11. 11.
    Zhendong, D.: Knowledge Description: What, How and Who? In: The Proceedings of the International Symposium on Electronic Dictionaries, Tokyo, JapanGoogle Scholar
  12. 12.
    Dorr, B.J., Levow, G.-A., Lin, D.: Construction of Chinese-English Semantic Hierarchy for Information Retrieval. In: Proceedings of the Workshop on English-Chinese Cross Language Information Retrieval (2000)Google Scholar
  13. 13.
    Levenshtein, V.: Binary codes capable of correcting deletions, insertions, and reversals: Soviet Physics Doklady (1966)Google Scholar
  14. 14.
    Chen, H.-H., Lin, C.-C., Lin, W.-C.: Construction of a Chinese-English WordNet and Its Application to CLIR. In: Proceedings of the 5th International Workshop Information Retrieval with Asian LanguagesGoogle Scholar
  15. 15.
    Maedche, A., Staab, S.: Ontology Learning for the Semantic Web. IEEE Intelligenct Systems 16(2), 72–79 (2001)CrossRefGoogle Scholar
  16. 16.
    Alfonseca, E., Ruiz-Casado, M., Okumura, M., Castells, P.: Towards Large-scale Non-taxonomic Relation Extraction: Estimating the Precision of Rote Extractors. In: Proceedings of the 2nd Workshop on Ontology Learning and PopulationGoogle Scholar
  17. 17.
    Olsen, S.: Wordnet Wordsense Disambiguation using an Automatically Generated Ontology. In: Proceedings of the Class of, Senior Conference (2003)Google Scholar
  18. 18.
    Church, K., Gale, W., Hanks, P., Hindle, D.: Bell Laboratories and Collins.: Parsing, Word Associations and Typical Predicate-Argument Relations. In: International Workshop on Parsing Technologies, CMU (1989)Google Scholar
  19. 19.
    Liu, Y., Yu, S., Yu, J.: In: Building a Bilingual WordNet-Like Lexicon: the New Approach and Algorithms. In: Proceedings of the 19th international conference on Computational linguisticsGoogle Scholar
  20. 20.
    Yamaguchi, T.: Constructing Domain Ontologies Based on Concept Drift Analysis. In: Proceedings of the IJCAI 1999 workshop on Ontologies and Problem-Solving Methods(KRR5) Stockholm, Sweden, August 2 (1999)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Renjie Xu
    • 1
  • Zhiqiang Gao
    • 1
  • Yingji Pan
    • 1
  • Yuzhong Qu
    • 1
  • Zhisheng Huang
    • 2
  1. 1.School of Computer Science and EngineeringSoutheast UniversityChina
  2. 2.Department of Computer ScienceVrije Universiteit AmsterdamThe Netherlands

Personalised recommendations