Skip to main content

Numerical Investigation of the Two Armed Bandit

  • Conference paper
Book cover Mathematical Learning Models — Theory and Algorithms

Part of the book series: Lecture Notes in Statistics ((LNS,volume 20))

Abstract

This paper is concerned with Bernoulli two armed bandits with independent beta priors for the unknown success probabilities where there are a finite number of trials, N, and the objective is to maximise the overall expected return. The two armed bandit with one probability known is also considered.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Bather, J. A. (1981). Randomised allocation of treatments in sequential experiments, (with discussion). Journ. Roy. Statist. Soc. B, 43, 265–292.

    MathSciNet  MATH  Google Scholar 

  • Berry, D. A. (1972). A Bernoulli two armed bandit. Ann. Math. Statist., 43. 871–897.

    Article  MathSciNet  MATH  Google Scholar 

  • Gittins, J. C. (1979). Bandit processes and dynamic allocation indices, (with discussion). Journ. Roy. Statist. Soc. B, 41, 148–177.

    MathSciNet  MATH  Google Scholar 

  • Gittins, J. C. and Jones, D. M. (1979). A dynamic allocation index for the discounted multiarmed bandit problem. Biometrika, 66, 561–5.

    Article  Google Scholar 

  • Hengartner, W., Kalin, D. and Theodorescu, R. (1981). On the Bernoulli two armed bandit problem. Math. Operationsforsch. Statist., 12, 307–316.

    Article  MathSciNet  MATH  Google Scholar 

  • Jones, P. W. (1975). The two armed bandit. Biometrika, 62, 523–4.

    Article  MathSciNet  MATH  Google Scholar 

  • Jones, P. W. (1976). Some results for the two armed bandit problem. Math. Operationsforsch. Statist., 7, 471–475.

    Article  MathSciNet  Google Scholar 

  • Jones, P. W. (1978). On the two armed bandit with one probability known. Metrika, 25, 235–9.

    Article  MathSciNet  MATH  Google Scholar 

  • Kalin, D and Theodorescu, R. (1981). Abstract 81t-80. Bull. Inst. Math. Statist., 10, 5, 224. (See also unpublished research report from the authors).

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1983 Springer-Verlag New York Inc.

About this paper

Cite this paper

Jones, P.W., Kandeel, H.A. (1983). Numerical Investigation of the Two Armed Bandit. In: Herkenrath, U., Kalin, D., Vogel, W. (eds) Mathematical Learning Models — Theory and Algorithms. Lecture Notes in Statistics, vol 20. Springer, New York, NY. https://doi.org/10.1007/978-1-4612-5612-0_11

Download citation

  • DOI: https://doi.org/10.1007/978-1-4612-5612-0_11

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-0-387-90913-4

  • Online ISBN: 978-1-4612-5612-0

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics