Abstract
All search happens in a particular context—such as the particular collection of a digital library, its associated search tasks, and its associated users. Information retrieval researchers usually agree on the importance of context, but they rarely address the issue. In particular, evaluation in the Cranfield tradition requires abstracting away from individual differences between users. This paper investigates whether we can bring some of this context into the Cranfield paradigm. Our approach is as follows: we attempt to record the “context” of the humans already in the loop—the topic authors/assessors—by designing targeted questionnaires. The questionnaire data becomes part of the evaluation test suite as valuable data on the context of the search requests. We have experimented with this questionnaire approach during the evaluation campaign of the INitiative for the Evaluation of XML Retrieval (INEX). The results of this case study demonstrate the viability of the questionnaire approach as a means to capture context in evaluation. This can help explain and control some of the user or topic variation in the test collection. Moreover, it allows us to break down the set of topics into various meaningful categories, e.g. those that suit a particular task scenario, and zoom in on the relative performance for such a group of topics.
This research was partly funded by DELOS (an EU network of excellence in Digital Libraries) through the INEX initiative for the evaluation of XML retrieval.
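The topic breakdown described in the abstract—grouping topics by a questionnaire-derived category and comparing performance within each group—can be sketched as follows. This is a minimal illustrative sketch: the topic identifiers, category labels, and scores are invented for the example and are not taken from the INEX test suite.

```python
# Hypothetical sketch: break per-topic evaluation scores down by a
# questionnaire-derived category (e.g. task scenario) and average
# within each group. All names and numbers here are illustrative.
from collections import defaultdict
from statistics import mean

# Per-topic effectiveness scores (e.g. average precision) for one system.
scores = {"t1": 0.42, "t2": 0.58, "t3": 0.31, "t4": 0.77}

# Questionnaire answers assigning each topic to a task-scenario category.
category = {"t1": "known-item", "t2": "exploratory",
            "t3": "exploratory", "t4": "known-item"}

def mean_score_by_category(scores, category):
    """Group per-topic scores by their category and average each group."""
    groups = defaultdict(list)
    for topic, score in scores.items():
        groups[category[topic]].append(score)
    return {cat: mean(vals) for cat, vals in groups.items()}

print(mean_score_by_category(scores, category))
# → {'known-item': 0.595, 'exploratory': 0.445}
```

Zooming in on one category is then a matter of reporting the per-group means (or running a significance test within a group) instead of a single mean over all topics.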
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
Cite this paper
Kamps, J., Lalmas, M., Larsen, B. (2009). Evaluation in Context. In: Agosti, M., Borbinha, J., Kapidakis, S., Papatheodorou, C., Tsakonas, G. (eds) Research and Advanced Technology for Digital Libraries. ECDL 2009. Lecture Notes in Computer Science, vol 5714. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04346-8_33
DOI: https://doi.org/10.1007/978-3-642-04346-8_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04345-1
Online ISBN: 978-3-642-04346-8