Using String Comparison in Context for Improved Relevance Feedback in Different Text Media

  • Adenike M. Lam-Adesina
  • Gareth J. F. Jones
Conference paper

DOI: 10.1007/11880561_19

Volume 4209 of the book series Lecture Notes in Computer Science (LNCS)
Cite this paper as:
Lam-Adesina A.M., Jones G.J.F. (2006) Using String Comparison in Context for Improved Relevance Feedback in Different Text Media. In: Crestani F., Ferragina P., Sanderson M. (eds) String Processing and Information Retrieval. SPIRE 2006. Lecture Notes in Computer Science, vol 4209. Springer, Berlin, Heidelberg

Abstract

Query expansion is a long standing relevance feedback technique for improving the effectiveness of information retrieval systems. Previous investigations have shown it to be generally effective for electronic text, to give proportionally better improvement for automatic transcriptions of spoken documents, and to be at best of questionable utility for optical character recognized scanned text documents. We introduce two corpus-based methods based on using a string-edit distance measure in context to automatically detect and correct transcription errors. One method operates at query-time and requires no modification of the document index file, and the other at index-time and operates using the standard query-time expansion process. Experimental investigations show these methods to produce improvements in relevance feedback for all three media types, but most significantly mean that relevance feedback can now successfully be applied to scanned text documents.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Adenike M. Lam-Adesina
    • 1
  • Gareth J. F. Jones
    • 1
  1. 1.Centre for Digital Video Processing & School of ComputingDublin City UniversityDublin 9Ireland