Numerical Investigation of the Two Armed Bandit

Jones, P. W.; Kandeel, H. A.

doi:10.1007/978-1-4612-5612-0_11

P. W. Jones³ &
H. A. Kandeel³

Part of the book series: Lecture Notes in Statistics ((LNS,volume 20))

225 Accesses
1 Citations

Abstract

This paper is concerned with Bernoulli two armed bandits with independent beta priors for the unknown success probabilities where there are a finite number of trials, N, and the objective is to maximise the overall expected return. The two armed bandit with one probability known is also considered.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bather, J. A. (1981). Randomised allocation of treatments in sequential experiments, (with discussion). Journ. Roy. Statist. Soc. B, 43, 265–292.
MathSciNet MATH Google Scholar
Berry, D. A. (1972). A Bernoulli two armed bandit. Ann. Math. Statist., 43. 871–897.
Article MathSciNet MATH Google Scholar
Gittins, J. C. (1979). Bandit processes and dynamic allocation indices, (with discussion). Journ. Roy. Statist. Soc. B, 41, 148–177.
MathSciNet MATH Google Scholar
Gittins, J. C. and Jones, D. M. (1979). A dynamic allocation index for the discounted multiarmed bandit problem. Biometrika, 66, 561–5.
Article Google Scholar
Hengartner, W., Kalin, D. and Theodorescu, R. (1981). On the Bernoulli two armed bandit problem. Math. Operationsforsch. Statist., 12, 307–316.
Article MathSciNet MATH Google Scholar
Jones, P. W. (1975). The two armed bandit. Biometrika, 62, 523–4.
Article MathSciNet MATH Google Scholar
Jones, P. W. (1976). Some results for the two armed bandit problem. Math. Operationsforsch. Statist., 7, 471–475.
Article MathSciNet Google Scholar
Jones, P. W. (1978). On the two armed bandit with one probability known. Metrika, 25, 235–9.
Article MathSciNet MATH Google Scholar
Kalin, D and Theodorescu, R. (1981). Abstract 81t-80. Bull. Inst. Math. Statist., 10, 5, 224. (See also unpublished research report from the authors).
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Mathematics, University of Keele, Keele, Staffordshire, ST5 5BG, UK
P. W. Jones & H. A. Kandeel

Authors

P. W. Jones
View author publications
You can also search for this author in PubMed Google Scholar
H. A. Kandeel
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institut für Angewandte Mathematik der Universität Bonn, Wegelerstrasse 6, 5300, Bonn, Federal Republic of Germany
Ulrich Herkenrath & Walter Vogel &
Abt. Mathematik VII, Universität Ulm, Oberer Eselsberg, 7900, Ulm, Federal Republic of Germany
Dieter Kalin

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jones, P.W., Kandeel, H.A. (1983). Numerical Investigation of the Two Armed Bandit. In: Herkenrath, U., Kalin, D., Vogel, W. (eds) Mathematical Learning Models — Theory and Algorithms. Lecture Notes in Statistics, vol 20. Springer, New York, NY. https://doi.org/10.1007/978-1-4612-5612-0_11

Download citation

DOI: https://doi.org/10.1007/978-1-4612-5612-0_11
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-90913-4
Online ISBN: 978-1-4612-5612-0
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics