Effects of Language and Topic Size in Patent IR: An Empirical Study

  • Florina Piroi
  • Mihai Lupu
  • Allan Hanbury
Conference paper

DOI: 10.1007/978-3-642-33247-0_7

Volume 7488 of the book series Lecture Notes in Computer Science (LNCS)
Cite this paper as:
Piroi F., Lupu M., Hanbury A. (2012) Effects of Language and Topic Size in Patent IR: An Empirical Study. In: Catarci T., Forner P., Hiemstra D., Peñas A., Santucci G. (eds) Information Access Evaluation. Multilinguality, Multimodality, and Visual Analytics. CLEF 2012. Lecture Notes in Computer Science, vol 7488. Springer, Berlin, Heidelberg

Abstract

We revisit the effects that various characteristics of topic documents have on system effectiveness in the task of finding prior art in the patent domain. In doing so, we provide readers who are new to the domain with a guide to the issues that need to be addressed in this context.

For the current study, we select two patent-based test collections with a common document representation schema and look at topic characteristics specific to the objectives of the collections. We look at the effect of language on retrieval and at the length of the topic documents. We present the correlations between these topic facets and their retrieval results, as well as their relevant documents.

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Florina Piroi
    • 1
  • Mihai Lupu
    • 1
  • Allan Hanbury
    • 1
  1. Vienna University of Technology, Vienna, Austria