Merging Case Relations into VSM to Improve Information Retrieval Precision

  • Wang Hongtao
  • Sun Maosong
  • Liu Shaoming
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3406)

Abstract

This paper presents an approach that merges case relations into the well-known Vector Space Model (VSM), yielding a new model named C-VSM (Case relation-based VSM). A Chinese case system with 23 case relations is established, and a Chinese Olympic news corpus of 7,662 sentences, denoted COCS, is constructed by manual annotation with these 23 case relations. We use 50 queries on COCS as a test set. Experimental results on the test set show that C-VSM outperforms W-VSM (Word-based VSM) by 3.4% on the average 11-point precision. It is worth pointing out that almost all previous studies on semantic IR obtained results no better, and sometimes worse, than W-VSM. Our work thus validates the usefulness of case relations in IR, though the validation is still preliminary. The proposed model is believed to be language-independent.
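
The abstract reports its result as average 11-point precision but does not define the metric. The following is a minimal sketch, assuming the standard interpolated definition from the IR evaluation literature: precision is interpolated at recall levels 0.0, 0.1, ..., 1.0 and the eleven values are averaged. The function name, argument names, and document identifiers are hypothetical and are not taken from the paper; the paper's own evaluation procedure may differ in detail.

```python
def eleven_point_average_precision(ranked_ids, relevant_ids):
    """Interpolated 11-point average precision for a single query.

    Assumption: ranked_ids is a ranking without duplicate documents,
    and relevant_ids is the full set of relevant documents.
    """
    relevant = set(relevant_ids)
    if not relevant:
        raise ValueError("at least one relevant document is required")

    # Record (recall, precision) each time a relevant document is found.
    points = []
    hits = 0
    for rank, doc_id in enumerate(ranked_ids, start=1):
        if doc_id in relevant:
            hits += 1
            points.append((hits / len(relevant), hits / rank))

    # Interpolated precision at a recall level is the highest precision
    # observed at any recall greater than or equal to that level.
    eps = 1e-12  # guards against floating-point noise in the comparison
    interpolated = []
    for i in range(11):
        level = i / 10
        candidates = [p for r, p in points if r >= level - eps]
        interpolated.append(max(candidates, default=0.0))

    return sum(interpolated) / 11


# Hypothetical ranking: relevant documents appear at ranks 1 and 3.
score = eleven_point_average_precision(["d1", "d2", "d3", "d4"], {"d1", "d3"})
print(round(score, 4))
```

To compare two systems over a query set such as the 50 queries mentioned above, this per-query value would typically be averaged across queries for each system.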

Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • Wang Hongtao (1)
  • Sun Maosong (1)
  • Liu Shaoming (2)
  1. The State Key Laboratory of Intelligent Technology and Systems, Department of Computer Science and Technology, Tsinghua University, Beijing, China
  2. Future Technology Institute, Fuji Xerox Co. Ltd, Japan