Defining the Dynamicity and Diversity of Text Collections

  • Ilya Markov
  • Fabio Crestani
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6273)

Abstract

In Information Retrieval collections are often considered to be relatively dynamic or diverse, but no general definition has been given for these notions and no actual measure has been proposed to quantify them. We give intuitive definitions of the dynamicity and diversity properties of text collections and present measures for calculating them based on the notion of novelty. Experimental results show that the proposed measures are consistent with the definitions and can distinguish collections effectively according to their dynamicity and diversity properties.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Allan, J., Wade, C., Bolivar, A.: Retrieval and novelty detection at the sentence level. In: Proc. of the ACM SIGIR, pp. 314–321. ACM, New York (2003)Google Scholar
  2. 2.
    Callan, J.: Advances in Information Retrieval. In: Distributed Information Retrieval, ch. 5, pp. 127–150. Kluwer Academic Publishers, Dordrecht (2000)Google Scholar
  3. 3.
    Carbonell, J., Goldstein, J.: The use of MMR, diversity-based reranking for reordering documents and producing summaries. In: Proc. of the ACM SIGIR, pp. 335–336. ACM, New York (1998)Google Scholar
  4. 4.
    Ponte, J.M., Croft, W.B.: A language modeling approach to information retrieval. In: Proc. of the ACM SIGIR, pp. 275–281. ACM, New York (1998)Google Scholar
  5. 5.
    Song, K., Tian, Y., Gao, W., Huang, T.: Diversifying the image retrieval results. In: Proc. of the ACM MM, pp. 707–710. ACM, New York (2006)Google Scholar
  6. 6.
    Zhang, Y., Callan, J., Minka, T.: Novelty and redundancy detection in adaptive filtering. In: Proc. of the ACM SIGIR, pp. 81–88. ACM, New York (2002)Google Scholar
  7. 7.
    Ziegler, C.N., McNee, S.M., Konstan, J.A., Lausen, G.: Improving recommendation lists through topic diversification. In: Proc. of the WWW, pp. 22–32. ACM, New York (2005)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Ilya Markov
    • 1
  • Fabio Crestani
    • 1
  1. 1.Faculty of InformaticsUniversity of LuganoLuganoSwitzerland

Personalised recommendations