Synonyms
Harter’s model; Probabilistic model of indexing
Definition
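The 2-Poisson model is a mixture, that is, a linear combination, of two Poisson distributions:
\[ P(X = x) = \alpha \, \frac{e^{-\lambda} \lambda^{x}}{x!} + (1 - \alpha) \, \frac{e^{-\mu} \mu^{x}}{x!}, \qquad 0 \le \alpha \le 1, \quad \lambda, \mu > 0. \]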
In the context of IR, the 2-Poisson is used to model the probability distribution of the frequency X of a term in a collection of documents.
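As an illustration, a mixture of two Poisson distributions can be evaluated directly from its definition. The following is a minimal sketch, not taken from the entry itself: the function names and the parameter names `alpha` (mixing weight), `lam`, and `mu` (the two Poisson means) are assumptions chosen for this example, and the numeric values are hypothetical.

```python
import math


def poisson_pmf(x: int, mean: float) -> float:
    """Probability that a Poisson variable with the given mean equals x."""
    return math.exp(-mean) * mean ** x / math.factorial(x)


def two_poisson_pmf(x: int, alpha: float, lam: float, mu: float) -> float:
    """Mixture of two Poisson distributions with weights alpha and 1 - alpha."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    if lam <= 0 or mu <= 0:
        raise ValueError("both Poisson means must be positive")
    return alpha * poisson_pmf(x, lam) + (1.0 - alpha) * poisson_pmf(x, mu)


# Hypothetical parameters: a term that occurs often (mean 4.0) in 20% of
# the documents and rarely (mean 0.3) in the remaining 80%.
alpha, lam, mu = 0.2, 4.0, 0.3

# Probability of each within-document frequency x = 0, 1, ..., 59.
probs = [two_poisson_pmf(x, alpha, lam, mu) for x in range(60)]

total = sum(probs)                                    # close to 1
mean_freq = sum(x * p for x, p in enumerate(probs))   # close to alpha*lam + (1-alpha)*mu
```

Truncating the support at 60 is a convenience for the check: with means this small, the probability mass beyond that point is negligible, so the probabilities sum to roughly 1 and the mean frequency is roughly the weighted average of the two Poisson means.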
Historical Background
The 2-Poisson model was introduced by Harter [5–7], although Bookstein [1, 2] and Harter had been exchanging ideas about probabilistic models of indexing during those years. Harter coined the word “elite” to introduce his 2-Poisson model [5, pp. 68–74].
The origin of the 2-Poisson model can be traced back through Luhn, Maron, Damerau, Edmundson, and Wyllys [3, 4, 8, 9]. The first...
Recommended Reading
Bookstein A, Kraft D. Operations research applied to document indexing and retrieval decisions. J ACM. 1977;24(3):418–27.
Bookstein A, Swanson D. Probabilistic models for automatic indexing. J Am Soc Inf Sci. 1974;25(5):312–8.
Damerau F. An experiment in automatic indexing. Am Doc. 1965;16(4):283–9.
Edmundson HP, Wyllys RE. Automated abstracting and indexing: survey and recommendations. Commun ACM. 1961;4(5):226–34. Reprinted in Sharp H, editor. Readings in information retrieval. New York: Scarecrow; 1964. p. 390–412.
Harter SP. A probabilistic approach to automatic keyword indexing. PhD thesis, Thesis No. T25146. Graduate Library, The University of Chicago; 1974.
Harter SP. A probabilistic approach to automatic keyword indexing. Part I: on the distribution of specialty words in a technical literature. J Am Soc Inf Sci. 1975;26(4):197–216.
Harter SP. A probabilistic approach to automatic keyword indexing. Part II: an algorithm for probabilistic indexing. J Am Soc Inf Sci. 1975;26(5):280–9.
Luhn HP. A statistical approach to mechanized encoding and searching of literary information. IBM J Res Dev. 1957;1(4):309–17.
Maron ME. Automatic indexing: an experimental inquiry. J ACM. 1961;8(3):404–17.
Puri PS, Goldie CM. Poisson mixtures and quasi-infinite divisibility of distributions. J Appl Probab. 1979;16(1):138–53.
Stone D, Rubinoff B. Statistical generation of a technical vocabulary. Am Doc. 1968;19(4):411–2.
Copyright information
© 2018 Springer Science+Business Media, LLC, part of Springer Nature
About this entry
Cite this entry
Amati, G. (2018). Two-Poisson Model. In: Liu, L., Özsu, M.T. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-8265-9_920
DOI: https://doi.org/10.1007/978-1-4614-8265-9_920
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-8266-6
Online ISBN: 978-1-4614-8265-9
eBook Packages: Computer Science; Reference Module Computer Science and Engineering