Definition
Integrated DB&IR semi-structured text retrieval combines IR-style scoring and ranking methods for effective search with indexing techniques and processing algorithms from the database world for efficient query evaluation.
Historical Background
Database research has traditionally focused on semi-structured documents that represent structured data with a well-defined schema and only little unstructured, textual content (aka. “data-centric” XML). Typical examples for such documents are invoices, purchase orders, or even complete bibliographies.
Early work in the field concentrated on “classical” data management problems for XML: storing XML data in relational or native XML systems, defining query languages that integrate conditions on the structure and the content of results (like SQL for relational data), efficiently processing these queries on huge collections of...
Recommended Reading
Abiteboul S, Quass D, McHugh J, Widom J, Wiener JL. The lorel query language for semistructured data. Int J Digit Libr. 1997;1(1):68–88.
Amer-Yahia S, Botev C, Shanmugasundaram J. TeXQuery: a full-text search extension to XQuery. In: Proceedings of 12th international world wide web conference. 2004. p. 583–94.
Amer-Yahia S, Cho S, Srivastava D. Tree pattern relaxation. In: Advances in database technology, proceedings of 8th international conference on extending database technology. 2002. p. 496–513.
Amer-Yahia S, Lakshmanan LVS, Pandit S. FleXPath: flexible structure and full-text querying for XML. In: Proceedings of ACM SIGMOD international conference on management of data. 2004. p. 83–94.
Cohen S, Mamou J, Kanza Y, Sagiv Y. XSEarch: a semantic search engine for XML. In: Proceedings of 29th international conference on very large data bases. 2003. p. 45–56.
Fuhr N, Großjohann K. XIRQL: a query language for information retrieval in XML documents. In: Proceedings of 24th annual international ACM SIGIR conference on research and development in information retrieval. 2001. p. 172–80.
Guo L, Shao F, Botev C, Shanmugasundaram J. XRANK: ranked keyword search over XML documents. In: Proceedings of ACM SIGMOD international conference on management of data. 2003. p. 16–27.
Hiemstra D, Rode H, Van Os R, Flokstra J PF/Tijah: text search in an XML database system. In: Proceedings of 2nd international workshop on open source information retrieval. 2006.
Hristidis V, Papakonstantinou Y, and Balmin A. Keyword proximity search on XML graphs. In: Proceeings of 19th internatonal conference on data engineering. 2003. p. 367–78.
Marian A, Amer-Yahia S, Koudas N, Srivastava D. Adaptive processing of Top-k queries in XML. In: Proceedings of 21st international conference on data engineering. 2005. p. 162–73.
Schlieder T, Meuss H. Querying and ranking XML documents. J Am Soc Inf Sci Tech. 2002;53(6):489–503.
Theobald M, Bast H, Majumdar D, Schenkel R, Weikum G. TopX: efficient and versatile top-k query processing for semistructured data. VLDB J. 2008;17(1):81–115.
Theobald A, Weikum G. Adding relevance to XML. In: Proceedings of 3rd international workshop on the world wide web and databases. 2000. p. 105–124.
Xu Y, Papakonstantinou Y. Efficient keyword search for smallest LCAs in XML databases. In: Proceedings of ACM SIGMOD international confernce on management of data. 2005. p. 537–8.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Section Editor information
Rights and permissions
Copyright information
© 2017 Springer Science+Business Media LLC
About this entry
Cite this entry
Schenkel, R., Theobald, M. (2017). Integrated DB and IR Approaches. In: Liu, L., Özsu, M. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4899-7993-3_206-3
Download citation
DOI: https://doi.org/10.1007/978-1-4899-7993-3_206-3
Received:
Accepted:
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4899-7993-3
Online ISBN: 978-1-4899-7993-3
eBook Packages: Springer Reference Computer SciencesReference Module Computer Science and Engineering
Publish with us
Chapter history
-
Latest
Integrated DB and IR Approaches- Published:
- 14 February 2017
DOI: https://doi.org/10.1007/978-1-4899-7993-3_206-3
-
Original
Integrated DB and IR Approaches- Published:
- 08 December 2016
DOI: https://doi.org/10.1007/978-1-4899-7993-3_206-2