Evaluating the Potential of Explicit Phrases for Retrieval Quality

  • Andreas Broschart
  • Klaus Berberich
  • Ralf Schenkel
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5993)

Abstract

This paper evaluates the potential impact of explicit phrases on retrieval quality through a case study with the TREC Terabyte benchmark. It compares the performance of user- and system-identified phrases with a standard score and a proximity-aware score, and shows that an optimal choice of phrases, including term permutations, can significantly improve query performance.

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Andreas Broschart (1, 2)
  • Klaus Berberich (2)
  • Ralf Schenkel (1, 2)
  1. Saarland University, Saarbrücken, Germany
  2. Max-Planck-Institut für Informatik, Saarbrücken, Germany
