Evaluating Web Archive Search Systems

  • Miguel Costa
  • Mário J. Silva
Conference paper

DOI: 10.1007/978-3-642-35063-4_32

Part of the Lecture Notes in Computer Science book series (LNCS, volume 7651)
Cite this paper as:
Costa M., Silva M.J. (2012) Evaluating Web Archive Search Systems. In: Wang X.S., Cruz I., Delis A., Huang G. (eds) Web Information Systems Engineering - WISE 2012. WISE 2012. Lecture Notes in Computer Science, vol 7651. Springer, Berlin, Heidelberg

Abstract

The information published on the web, a representation of our collective memory, is rapidly vanishing. At least 77 web archives have been developed to cope with the web’s transience problem, but despite their technology having achieved a good maturity level, the retrieval effectiveness of the search services they provide still presents unsatisfactory results. In this work, we propose an evaluation methodology for web archive search systems based on a list of requirements compiled from previous characterizations of web archives and their users. The methodology includes the design of a test collection and the selection of evaluation measures to support realistic and reproducible experiments. The test collection enabled, for the first time, to measure the effectiveness of state-of-the-art IR technology employed in web archives. Results confirm the poor quality of search results retrieved with such technology. However, we show how to combine temporal features, along with the regular topical features, to improve the search effectiveness on web archives. The test collection is available to the research community.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Miguel Costa
    • 1
    • 2
  • Mário J. Silva
    • 3
  1. 1.Foundation for National Scientific ComputingLisbonPortugal
  2. 2.LaSIGE, Faculty of ScienceUniversity of LisbonLisbonPortugal
  3. 3.IST/INESC-IDLisbonPortugal

Personalised recommendations