The VLDB Journal

, Volume 15, Issue 1, pp 53–83

Integrating document and data retrieval based on XML

Regular Paper

DOI: 10.1007/s00778-004-0150-4

Cite this article as:
Bremer, JM. & Gertz, M. The VLDB Journal (2006) 15: 53. doi:10.1007/s00778-004-0150-4

Abstract

For querying structured and semistructured data, data retrieval and document retrieval are two valuable and complementary techniques that have not yet been fully integrated. In this paper, we introduce integrated information retrieval (IIR), an XML-based retrieval approach that closes this gap. We introduce the syntax and semantics of an extension of the XQuery language called XQuery/IR. The extended language realizes IIR and thereby allows users to formulate new kinds of queries by nesting ranked document retrieval and precise data retrieval queries. Furthermore, we detail index structures and efficient query processing approaches for implementing XQuery/IR. Based on a new identification scheme for nodes in node-labeled tree structures, the extended index structures require only a fraction of the space of comparable index structures that only support data retrieval.

Keywords

Integrated information retrievals Data retrieval Document retrieval XML Index structures Structural join 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag 2006

Authors and Affiliations

  1. 1.Department of Computer ScienceUniversity of California at DavisDavisUSA

Personalised recommendations