The BNB Distribution for Text Modeling

  • Stéphane Clinchant
  • Eric Gaussier
Conference paper

DOI: 10.1007/978-3-540-78646-7_16

Part of the Lecture Notes in Computer Science book series (LNCS, volume 4956)
Cite this paper as:
Clinchant S., Gaussier E. (2008) The BNB Distribution for Text Modeling. In: Macdonald C., Ounis I., Plachouras V., Ruthven I., White R.W. (eds) Advances in Information Retrieval. ECIR 2008. Lecture Notes in Computer Science, vol 4956. Springer, Berlin, Heidelberg

Abstract

We first review in this paper the burstiness and aftereffect of future sampling phenomena, and propose a formal, operational criterion to characterize distributions according to these phenomena. We then introduce the Beta negative binomial distribution for text modeling, and show its relations to several models (in particular to the Laplace law of succession and to the tf-itf model used in the Divergence from Randomness framework of [2]). We finally illustrate the behavior of this distribution on text categorization and information retrieval experiments.

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Stéphane Clinchant (1)
  • Eric Gaussier (2)
  1. Xerox Research Centre Europe, Meylan, France
  2. University Joseph Fourier (LIG), BP 53, 38041 Grenoble cedex 9, France
