Limiting size index distributions for ball-bin models with Zipf-type frequencies
- 97 Downloads
We consider a random ball-bin model where balls are thrown randomly and sequentially into a set of bins. The frequency of choices of bins follows the Zipf-type (power-law) distribution; that is, the probability with which a ball enters the ith most popular bin is asymptotically proportional to 1/i α , α > 0. In this model, we derive the limiting size index distributions to which the empirical distributions of size indices converge almost surely, where the size index of degree k at time t represents the number of bins containing exactly k balls at t. While earlier studies have only treated the case where the power α of the Zipf-type distribution is greater than unity, we here consider the case of α ≤ 1 as well as α > 1. We first investigate the limiting size index distributions for the independent throw models and then extend the derived results to a case where bins are chosen dependently. Simulation experiments demonstrate not only that our analysis is valid but also that the derived limiting distributions well approximate the empirical size index distributions in a relatively short period.
KeywordsLimiting distributions Random ball-bin occupancy models Size indices Zipf-type distribution
Unable to display preview. Download preview PDF.
- Davis P.J. (1972) Gamma function and related functions. In: Abramowitz M., Stegun I.A. (eds) Handbook of mathematical functions with formulas, graphs, and mathematical tables, 9th printing (Chap. 6). Dover, New York, pp 253–293Google Scholar
- Evert, S. (2004). A simple LNRE model for random character sequences. Proceedings of the 7èmes Journées Internationales d’Analyse Statistique des Données Textuelles (JADT2004), 411–422.Google Scholar
- Khmaladze, E. V. (1987). The statistical analysis of a large number of rare events. Report MS-R8804, Centre for Mathematics and Computer Science, CWI, Amsterdam, The Netherlands.Google Scholar
- R Development Core Team (2007). R: A language and environment for statistical computing. http://www.R-project.org.