Abstract
A bandit problem with infinitely many Bernoulli arms is considered. The parameters of Bernoulli arms are independent and identically distributed random variables from a generalized beta distributionG3B(a, b, λ) witha, b>0 and 0<λ<2. Under the generalized beta prior distributions, we first derive the asymptotic expected failure rates ofk-failure strategies, and then obtain a lower bound for the expected failure rate over all strategies investigated in Berry et al. (1997). The asymptotic expected failure rates for the other three strategies studied in Berry et al. (1997) are also included. Numerical estimations for a variety of generalized beta prior distributions are presented to illustrate the performances of these strategies.
Similar content being viewed by others
References
Banks, J. S., Sundaram, R. K. (1992) Denumerable-armed bandits.Econometrica 60, 1071–1096
Berry, D. A., Chen, R. W., Zame, A, Heath, D. C., Shepp, L. A. (1997) Bandit problems with infinitely many arms.Ann. Statist. 25, 2103–2116
Berry, D. A., Fristedt, B. (1985)Bandit Problems: Sequential Allocations of Experiments. Chapman and Hall, London
Chen, J. J., Novick, M. R. (1984) Bayesian analysis for Binomial models with generalized Beta prior distributions.J. Educ. Statist. 9, 163–175
Gittins, J. C. (1989)Multi-armed Bandit Allocation Indices. John Wiley & Sons, New York
Herschkorn, S. J., Pekoz, E., Ross, S. M. (1996) Policies without memory for the infinite-armed Bernoulli bandit under the average-reward criterion.Probab. Engrg. Inform. Sci. 10, 21–28
Libby, D. L., Novick, M. R. (1982) Multivariate generalized beta distributions with applications to utility assessment.J. Educ. Statist. 7, 271–294
Lin, C. T., Shiau, C. J. (2000) Some optimal strategies for bandit problems with Beta prior distributions.Ann. Inst. Statist. Math. 52, 397–405
Pham-Gia, T., Duong, Q. P. (1989) The generalized Beta- and F-distributions in statistical modelling.Mathl Comput. Modelling 12, 1613–1625
Robbins, H. (1952) Some aspects of the sequential design of experiments.Bull. Amer. Math. Soc. 58, 527–536
Whittle, P. (1982),Optimization over Time 1. John Wiley & Sons, New York
Whittle, P. (1983)Optimization over Time 2. John Wiley & Sons, New York
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Chen, KY., Lin, CT. A note on infinite-armed Bernoulli bandit problems with generalized beta prior distributions. Statistical Papers 46, 129–140 (2005). https://doi.org/10.1007/BF02762039
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/BF02762039