A note on infinite-armed Bernoulli bandit problems with generalized beta prior distributions

Chen, Kung-Yu; Lin, Chien-Tai

doi:10.1007/BF02762039

A note on infinite-armed Bernoulli bandit problems with generalized beta prior distributions

Notes
Published: January 2005

Volume 46, pages 129–140, (2005)
Cite this article

Statistical Papers Aims and scope Submit manuscript

Kung-Yu Chen¹ &
Chien-Tai Lin¹

75 Accesses
1 Citation
Explore all metrics

Abstract

A bandit problem with infinitely many Bernoulli arms is considered. The parameters of Bernoulli arms are independent and identically distributed random variables from a generalized beta distributionG3B(a, b, λ) witha, b>0 and 0<λ<2. Under the generalized beta prior distributions, we first derive the asymptotic expected failure rates ofk-failure strategies, and then obtain a lower bound for the expected failure rate over all strategies investigated in Berry et al. (1997). The asymptotic expected failure rates for the other three strategies studied in Berry et al. (1997) are also included. Numerical estimations for a variety of generalized beta prior distributions are presented to illustrate the performances of these strategies.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Banks, J. S., Sundaram, R. K. (1992) Denumerable-armed bandits.Econometrica 60, 1071–1096
Article MATH MathSciNet Google Scholar
Berry, D. A., Chen, R. W., Zame, A, Heath, D. C., Shepp, L. A. (1997) Bandit problems with infinitely many arms.Ann. Statist. 25, 2103–2116
Article MATH MathSciNet Google Scholar
Berry, D. A., Fristedt, B. (1985)Bandit Problems: Sequential Allocations of Experiments. Chapman and Hall, London
Google Scholar
Chen, J. J., Novick, M. R. (1984) Bayesian analysis for Binomial models with generalized Beta prior distributions.J. Educ. Statist. 9, 163–175
Article Google Scholar
Gittins, J. C. (1989)Multi-armed Bandit Allocation Indices. John Wiley & Sons, New York
MATH Google Scholar
Herschkorn, S. J., Pekoz, E., Ross, S. M. (1996) Policies without memory for the infinite-armed Bernoulli bandit under the average-reward criterion.Probab. Engrg. Inform. Sci. 10, 21–28
Article MATH MathSciNet Google Scholar
Libby, D. L., Novick, M. R. (1982) Multivariate generalized beta distributions with applications to utility assessment.J. Educ. Statist. 7, 271–294
Article Google Scholar
Lin, C. T., Shiau, C. J. (2000) Some optimal strategies for bandit problems with Beta prior distributions.Ann. Inst. Statist. Math. 52, 397–405
Article MATH MathSciNet Google Scholar
Pham-Gia, T., Duong, Q. P. (1989) The generalized Beta- and F-distributions in statistical modelling.Mathl Comput. Modelling 12, 1613–1625
Article MATH MathSciNet Google Scholar
Robbins, H. (1952) Some aspects of the sequential design of experiments.Bull. Amer. Math. Soc. 58, 527–536
Article MATH MathSciNet Google Scholar
Whittle, P. (1982),Optimization over Time 1. John Wiley & Sons, New York
MATH Google Scholar
Whittle, P. (1983)Optimization over Time 2. John Wiley & Sons, New York
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Department of Mathematics, Tmkang University, 251, Tamsui, Taiwan
Kung-Yu Chen & Chien-Tai Lin

Authors

Kung-Yu Chen
View author publications
You can also search for this author in PubMed Google Scholar
Chien-Tai Lin
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chen, KY., Lin, CT. A note on infinite-armed Bernoulli bandit problems with generalized beta prior distributions. Statistical Papers 46, 129–140 (2005). https://doi.org/10.1007/BF02762039

Download citation

Received: 31 October 2002
Revised: 04 July 2003
Issue Date: January 2005
DOI: https://doi.org/10.1007/BF02762039

Key Words

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A note on infinite-armed Bernoulli bandit problems with generalized beta prior distributions

Abstract

Access this article

Similar content being viewed by others

Refined Algorithms for Infinitely Many-Armed Bandits with Deterministic Rewards

An asymptotically optimal strategy for constrained multi-armed bandit problems

The non-stationary stochastic multi-armed bandit problem

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Key Words

Navigation

A note on infinite-armed Bernoulli bandit problems with generalized beta prior distributions

Abstract

Access this article

Similar content being viewed by others

Refined Algorithms for Infinitely Many-Armed Bandits with Deterministic Rewards

An asymptotically optimal strategy for constrained multi-armed bandit problems

The non-stationary stochastic multi-armed bandit problem

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Key Words

Search

Navigation