Abstract
Traditional information retrieval (IR) systems respond to user queries with ranked lists of relevant documents. The separation of content and structure in XML documents allows individual XML elements to be selected in isolation. Thus, users expect XML-IR systems to return highly relevant results that are more precise than entire documents. In this paper we describe the implementation of a search engine for XML document collections. The system is keyword based and is built upon an XML inverted file system. We describe the approach that was adopted to meet the requirements of Content Only (CO) and Vague Content and Structure (VCAS) queries in INEX 2004.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Fuhr, N., Malik, S.: Overview of the Initiative for the Evaluation of XML Retrieval (INEX) 2003. In: INEX 2003 Workshop Proceedings, Schloss Dagstuhl, Germany, December 15-17, pp. 1–11 (2003)
Sigurbjornsson, B., Kamps, J., de Rijke, M.: An Element-based Approach to XML Retrieval. In: INEX 2003 Workshop Proceedings, Schloss Dagstuhl, Germany, December 15-17, pp. 19–26 (2003)
Trotman, A., O’Keefe: The Simplest Query Language That Could Possibly Work. In: INEX 2003 Workshop Proceedings, Schloss Dagstuhl, Germany, December 15-17, 2003, vol. 2004, pp. 167–174 (2004)
Trotman, A., Sigurbjörnsson, B.: Narrowed Extended XPath I, NEXI (2004), http://www.cs.otago.ac.nz/postgrads/andrew/2004-4.pdf
Van Rijsbergen, R.J.: Information Retrieval, 2nd edn. Butterworths (1979)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Geva, S. (2005). GPX – Gardens Point XML Information Retrieval at INEX 2004. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds) Advances in XML Information Retrieval. INEX 2004. Lecture Notes in Computer Science, vol 3493. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11424550_17
Download citation
DOI: https://doi.org/10.1007/11424550_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26166-7
Online ISBN: 978-3-540-32053-1
eBook Packages: Computer ScienceComputer Science (R0)