Abstract
We present exact solutions to the birthday and generalized occupancy problems using multiple approaches. We start with two alternative ways to solve the classical birthday problem for uniform distributions by iterating over people and iterating over days, respectively. The iterate-over-day method is used to find the probabilities of not only the birthday problem but also several variants, including the exact number of people sharing a birthday and the strong birthday problem, for non-uniform distributions. This method decomposes multinomial based problems as a recursive sum of binomial distributions and leads readily to a unifying solution for generalized occupancy and coincidence problems for any probability distribution and arbitrary sets of occupancies. The birthday, strong birthday and exact number of coincidences with arbitrary probability distribution and varying number of coincidences/occupancies can all be found from the generalized solution. The generalized solution also allows us to study the details of specific coincidences. We define a z-value, which is similar to the commonly used p-value but is calculated from the generalized occupancy model, to test the conformity of two distributions and show that z-value is more effective than p-value in hypothesis testing when one category (out of five) is an outlier. Several examples are used to illustrate the practical applications of the generalized occupancy model in real-world situations.
Similar content being viewed by others
Data Availability Statement
All data generated or analyzed during this study are included in this published article.
References
Boutsikas MV, Koutras MV (2002) On the number of overflown urns and excess balls. J Stat Plan Infer 104:259–286
DasGupta A (2004) The matching, birthday and the strong birthday problem: a contemporary review. J Stat Plan Infer 130:377–389
Diaconis P, Mosteller F (1989) Methods of studying coincidences. J Amer Statist Assoc 84:853–886
Flajolet P, Gardy D, Thimonier L (1992) Birthday paradox, coupon collectors, caching algorithms and self-organizing search. Discret Appl Math 39(3):207–229. https://doi.org/10.1016/0166-218X(92)90177-C
Gail MH, Weiss GH, Mantel N, O’Brien SJ (1979) A solution to the generalized birthday problem with application to allozyme screening for cell culture contamination. J Appl Probab 16(2):242–251
Inoue K, Aki S (2008) Methods for studying generalized birthday and coupon collection problems. Commun Stat Simul Comput 37(5):844–862. https://doi.org/10.1080/03610910801943669
Knight W, Bloom DM (1973) A birthday problem. Am Math Mon 80(10):1141–1142
Kounavis M, Deutsch S, Durham D, Komijani S (2017) Non-recursive computation of the probability of more than two people having the same birthday, IEEE Symposium on Computers and Communications (ISCC), pp 1263–1270
Le Conte P, Finch S (1997) Coincident birthdays. https://web.archive.org/web/20150911211220/http://www.people.fas.harvard.edu/~sfinch/csolve/coincid.pdf. Accessed 23 Nov 2021
Liu QW, Xia AH (2020) On moderate deviations in Poisson approximation. J Appl Probab 57(3):1005–1027. https://doi.org/10.1017/jpr.2020.47
Mase S (1992) Approximations to the birthday problem with unequal occurrence probabilities and their application to the surname problem in Japan. Ann Inst Stat Math 44(3):479–499
McKinney EH (1966) Generalized birthday problem. Am Math Monthly 73:385–387
Munford A (1977) A note on the uniformity assumption in the birthday problem. Am Stat 31(3):119–119
Nunnikhoven T (1992) Birthday problem solution for nonuniform birth frequencies. Am Stat 46(4):270–274
O’Neill B (2020) The classical occupancy distribution: computation and approximation. Am Stat. https://doi.org/10.1080/00031305.2019.1699445
Von Mises R (1939) Über Aufteilungs- und Besetzungswahrscheinlichkeiten", Revue de la faculté des sciences de l'Université d'Istanbul 4:145-163, 1939, reprinted in Frank, P.; Goldstein, S.; Kac, M.; Prager, W.; Szegö, G.; Birkhoff, G., eds. (1964). Selected Papers of Richard von Mises. 2. Providence, Rhode Island: Amer. Math. Soc. pp. 313–334
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Declarations
The author did not receive support from any organization for the submitted work. The author has no relevant financial or non-financial interests to disclose. The author has no competing interests to declare that are relevant to the content of this article.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zhou, Q. Birth, Death, Coincidences and Occupancies: Solutions and Applications of Generalized Birthday and Occupancy Problems. Methodol Comput Appl Probab 25, 53 (2023). https://doi.org/10.1007/s11009-023-10028-z
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11009-023-10028-z
Keywords
- Generalized occupancy problem
- Hypothesis test
- Generalized birthday problem
- Coupon collector problem
- Multinomial distribution