Encyclopedia of Database Systems

2018 Edition
Editors: Ling Liu, M. Tamer Özsu

Contextualization in Structured Text Retrieval

  • Jaana Kekäläinen
  • Paavo Arvola
  • Marko Junkkari
Reference work entry
DOI: https://doi.org/10.1007/978-1-4614-8265-9_81

Definition

In information retrieval, contextualization refers to a (re)scoring method in which the relevance of a retrievable unit (e.g., a document, image, sentence, or passage) is estimated by taking its context into account. The context of the unit may consist of the surrounding text or of external texts associated with the unit by links. In contextualization, the unit’s retrieval status value (RSV) is not calculated in isolation but as a function of its explicitly defined context.
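
The idea of rescoring a unit with its context can be illustrated with a minimal sketch. The entry does not prescribe a formula at this point, so the linear interpolation below, the function name contextualized_rsv, and the parameter context_weight are assumptions chosen for illustration rather than the authors' method.

```python
from typing import Sequence


def contextualized_rsv(own_rsv: float,
                       context_rsvs: Sequence[float],
                       context_weight: float = 0.5) -> float:
    """Blend a unit's own RSV with the mean RSV of its context.

    Assumption: linear interpolation is used purely for illustration;
    other combination functions are possible.
    """
    if not 0.0 <= context_weight <= 1.0:
        raise ValueError("context_weight must lie in [0, 1]")
    if not context_rsvs:
        # No context available: fall back to the isolated score.
        return own_rsv
    context_score = sum(context_rsvs) / len(context_rsvs)
    return (1.0 - context_weight) * own_rsv + context_weight * context_score


# Two passages with the same isolated score but different contexts.
strong = contextualized_rsv(0.4, [0.8, 0.6])  # well-matching surroundings
weak = contextualized_rsv(0.4, [0.1, 0.1])    # poorly matching surroundings
print(strong > weak)  # the passage in the better context ranks higher
```

With context_weight set to 0, the score reduces to the isolated RSV, which makes the effect of the context easy to compare against a baseline.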

Historical Background

In hyperlinked semi-structured documents, context is considered both external, in the form of citations and hyperlinks, and internal, in the form of the document’s structure. Both sources of information are exploited as contextual evidence. It is hypothesized that units in a good context (one with strong contextual evidence) are better candidates for relevance to the posed query than units in a poor context. The term contextualization was introduced in this domain by Arvola et al. [1].

Focused...


Recommended Reading

  1. Arvola P, Junkkari M, Kekäläinen J. Generalized contextualization method for XML information retrieval. In: Proceedings of the International Conference on Information and Knowledge Management; 2005. p. 20–7.
  2. Arvola P, Junkkari M, Kekäläinen J. Query evaluation with structural indices. In: Fuhr N, Lalmas M, Malik S, Kazai G, editors. Advances in XML information retrieval and evaluation, LNCS vol. 3977. 2006. p. 134–45.
  3. Arvola P, Kekäläinen J, Junkkari M. Contextualization models for XML retrieval. Inform Process Manag. 2011;47(5):762–76.
  4. Callan JP. Passage-level evidence in document retrieval. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 2004. p. 302–10.
  5. Carmel D, Shtok A, Kurland O. Position-based contextualization for passage retrieval. In: Proceedings of the International Conference on Information and Knowledge Management; 2013. p. 1241–4.
  6. Dunlop MD, Van Rijsbergen CJ. Hypermedia and free text retrieval. Inform Process Manag. 1993;29:287–98.
  7. Fernandez RT, Losada DE, Azzopardi LA. Extending the language modeling framework for sentence retrieval to include local context. Inform Retrieval. 2011;14:355–89.
  8. Kazai G, Lalmas M, Reid J. Construction of a test collection for the focused retrieval of structured documents. In: Sebastiani F, editor. Advances in information retrieval. LNCS vol. 2633. Heidelberg: Springer; 2003. p. 88–103.
  9. Norozi MA, Arvola P. Kinship contextualization: utilizing the preceding and following structural elements. In: Proceedings of the Annual International Conference on Research and Development in Information Retrieval; 2013a. p. 837–40.
  10. Norozi MA, Arvola P. When is the structural context effective? In: Proceedings of the 13th Dutch-Belgian Workshop on Information Retrieval; 2013b. p. 72–5.
  11. Norozi MA, de Vries AP, Arvola P. Contextualization using hyperlinks and internal hierarchical structure of Wikipedia documents. In: Proceedings of the International Conference on Information and Knowledge Management; 2012. p. 1291–300.
  12. Sigurbjörnsson B, Kamps J, De Rijke M. An element-based approach to XML retrieval. In: Proceedings of the 2nd International Workshop of the Initiative for the Evaluation of XML Retrieval; 2003. p. 19–26. https://inex.mmci.uni-saarland.de/static/proceedings/INEX2003-preproceedings.pdf. Accessed 27 June 2014.

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  • Jaana Kekäläinen (1)
  • Paavo Arvola (1)
  • Marko Junkkari (1)
  1. University of Tampere, Tampere, Finland

Section editors and affiliations

  • Jaap Kamps (1)
  1. University of Amsterdam, Amsterdam, The Netherlands