Merging Case Relations into VSM to Improve Information Retrieval Precision
This paper presents an approach that merges case relations into the well-known Vector Space Model (VSM), leading to a new model named C-VSM (Case relation-based VSM). A Chinese case system with 23 case relations is established, and a Chinese Olympic news corpus of 7,662 sentences, denoted COCS, is constructed by manual annotation with these 23 case relations. We use 50 queries on COCS as a test set. Experimental results on the test set show that C-VSM outperforms W-VSM (Word-based VSM) by 3.4% on the average 11-point precision. It is worth pointing out that almost all the previous studies on semantic IR obtained no better, even worse, results than W-VSM, our work thus validates the usefulness of case relations in IR through the validation is still preliminary. The proposed model is believed to be language-independent.
Unable to display preview. Download preview PDF.
- 3.Lin, X.G.: Lexical Semantics and Computational Linguistics. YuWen Press, Beijing (1999)Google Scholar
- 4.Lu, X.: An Application of Case Relations to Document Retrieval, Doctoral dissertation, University of Western Ontario (1990)Google Scholar
- 5.Fillmore, C.J.: The Case for Case. In: Universals in Linguistic Theory. Holt, Rinehart and Winston, Inc., New York (1968)Google Scholar
- 6.Somers, H.L.: Valency and Case in Computational Linguistics. Edinburgh University Press, Edinburgh (1987)Google Scholar
- 7.Lewis, D.A.: Case Grammar and Functional Relations. Doctoral dissertation, University of Western Ontario (1984)Google Scholar
- 8.Young, C.: Development of Language Analysis Procedures with Application to Automatic Indexing. Doctoral dissertation, The Ohio State University (1973)Google Scholar
- 9.Croft, W.B., Turtle, H.R., Lewis, D.D.: The Use of Phrases and Structured Queries in Information Retrieval. In: Proc. of the Fourteenth Annual International ACM/SIGIR Conference on Research and Development in Information Retrieval (1991)Google Scholar
- 10.Hyoudo, Y., Niimi, K., Ikeda, T.: Comparison between Proximity Operation and Dependency Operation in Japanese Full-text Retrieval. In: Proc. of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (1998)Google Scholar
- 11.Smeaton, A.F., O’Donnell, R., Kelledy, F.: Indexing Structures Derived from Syntax in TREC-3: System Description. In: Overview of the Third Text REtrieval Conference (TREC-3), National Institute of Standards and Technology Special Publication 500-225, pp. 55–67 (1995)Google Scholar