Abstract
The present article investigates a class of random partitioning distributions of a positive integer. This class is called the limiting conditional compound poisson (LCCP) distribution and characterized by the law of small numbers. Accordingly the LCCP distribution explains the limiting behavior of counts on a sparse contingency table by the frequencies of frequencies. The LCCP distribution is constructed via some combinations of conditioning and limiting, and this view reveals that the LCCP distribution is a subclass of several known classes that depend on a Bell polynomial. It follows that the limiting behavior of a Bell polynomial provides new asymptotics for a sparse contingency table. Also the Neyman Type A distribution and the Thomas distribution are revisited as the basis of the sparsity.
Similar content being viewed by others
References
Arratia R., Barbour A.D., Tavarè S. (2003) Logarithmic combinatorial structures: A probabilistic approach. European Mathematical Society, Zürich
Borel E. (1942) Sur l’emploi du théorème de Bernoulli pour faciliter le calcul d’un infinité de coefficients. Application au problème de l’attente à un guichet. Comptes Rendus, Académie des Sciences, Paris, Series A 214: 452–456
Breiman L. (1992) Probability. SIAM, Philadelphia
Bunge J., Fitzpatrick M. (1993) Estimating the number of species: A review. Journal of the American Statistical Association 88: 364–373
Charalambides Ch.A. (2002) Enumerative combinatorics. Chapman and Hall/CRC, Florida
Charalambides Ch.A. (2007) Distributions of random partitions and their applications. Methodology and Computing in Applied Probability 9: 163–193
Comtet L. (1974) Advanced combinatorics. D. Reidel Publishing Company, Boston
Ewens W.J. (1972) The sampling theory of selectively neutral alleles. Theoretical Population Biology 3: 87–112
Fienberg S.E., Holland P.W. (1973) Simultaneous estimation of multinomial cell probabilities. Journal of the American Statistical Association 68: 683–691
Good I.J. (1953) The population frequencies of species and the estimation of population parameters. Biometrika 40: 237–264
Hoshino N. (2005a) Engen’s extended negative binomial model revisited. Annals of the Institute of Statistical Mathematics 57: 369–387
Hoshino, N. (2005b). On a limiting quasi-multinomial distribution. Discussion Paper CIRJE-F-361. Tokyo: Faculty of Economics, University of Tokyo
Hoshino N. (2006) A discrete multivariate distribution resulting from the law of small numbers. Journal of Applied Probability 43: 852–866
Hoshino N. (2009) The quasi-multinomial distribution as a tool for disclosure risk assessment. Journal of Official Statistics 25: 269–291
Johnson N.L., Kotz S., Kemp A.W. (1993) Univariate discrete distributions (2nd ed.). Wiley, New York
Karlin S. (1967) Central limit theorems for certain infinite urn schemes. Journal of Mathematics and Mechanics 17: 373–401
Kerov S. (1995) Coherent random allocations and the Ewens–Pitman formula. PDMI Preprint. Steklov Math Institute, St. Petersburg
Kingman J.F. (1978) Random partitions in population genetics. Proceedings of the Royal Society of London, A 361: 1–20
Kolchin V.F. (1971) A problem on allocations of particles into boxes, and the cycles of random permutations. Teoriya Veroyatnostei i ee Primeneniya 16: 67–82
Koopman B.O. (1950) Necessary and sufficient conditions for Poisson’s distribution. Proceedings of the American Mathematical Society 1: 813–823
Lehmann E.L. (1991) Theory of point estimation. Wadsworth, California
Meyer R.M. (1973) A Poisson-type limit theorem for mixing sequences of dependent ‘rare’ events. Annals of Probability 1: 480–483
Nandi S.B., Dutta S.K. (1988) Some developments in the generalized Bell distribution. Sankhyā, B 50: 362–375
Pitman J. (1999) Coalescent random forests. Journal of Combinatorial Theory, A 85: 165–193
Pitman, J. (2006). Combinatorial stochastic processes. Lecture notes in mathematics (Vol. 1875). New York: Springer
Riordan J. (1968) Combinatorial identities. Wiley, New York
Sibuya M. (1993) A random clustering process. Annals of the Institute of Statistical Mathematics 45: 459–465
Steutel F.W., van Harn K. (2004) Infinite divisibility of probability distributions on the real line. Marcel Dekker, New York
Thomas M. (1949) A generalization of Poisson’s binomial limit for use in ecology. Biometrika 36: 18–25
Uppuluri V.R., Carpenter J.R. (1969) Numbers generated by the function exp(1−e x). The Fibonacci Quarterly 7: 437–448
Vershik A.M. (1996) Statistical mechanics of combinatorial partitions, and their limit shapes. Functional Analysis and its Applications 30: 90–105
Wang Y.H. (1993) On the number of successes in independent trials. Statistica Sinica 3: 295–312
Wang Y.H., Ji S. (1993) Derivations of the compound Poisson distribution and process. Statistics and Probability Letters 18: 1–7
Author information
Authors and Affiliations
Corresponding author
About this article
Cite this article
Hoshino, N. Random partitioning over a sparse contingency table. Ann Inst Stat Math 64, 457–474 (2012). https://doi.org/10.1007/s10463-011-0327-8
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10463-011-0327-8