Skip to main content
Log in

Risk aversion in expected intertemporal discounted utilities bandit problems

  • Published:
Theory and Decision Aims and scope Submit manuscript

Abstract

We consider a situation where an individual is facing an uncertain situation, but may costly alter his knowledge of the uncertainties. We study in this context how risk aversion may modify the individual search behavior. We consider a one-armed bandit problem (where one arm is safe and the other is risky) and study how the agent risk aversion can change the sequence of arms selected. The main result is that when the utility function is more concave, the agent has more chances to select the safe arm. We also discuss how search is affected by risk aversion.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Berry, D. A., & Fristedt, B. (1985). Bandit problems: Sequential allocation of experiments. Chapman and Hall.

  • Chancelier J.-P., De Lara M., Palma A. (2007) Risk aversion, road choice and the one-armed bandit problem. Transportation Science 41(1): 1–14

    Article  Google Scholar 

  • Denardo E.V., Park H., Rothblum U.G. (2007) Risk-sensitive and risk-neutral multiarmed bandits. Mathematics of Operations Research 32(2): 374–394

    Article  Google Scholar 

  • Diamond, P., Rothschild, M. (eds) (1978) Uncertainty in economics. Academic Press, Orlando

    Google Scholar 

  • Gittins J.C. (1979) Bandit processes and dynamic allocation indices. Journal of the Royal Statistical Society. Series B 41(2): 148–177

    Google Scholar 

  • Gittins J.C. (1989) Multi-armed bandit allocation indices. New York, Wiley

    Google Scholar 

  • Gollier C. (2001) The economics of risk and time. MIT Press, Cambridge

    Google Scholar 

  • Magnac T., Robin J.-M. (1999) Dynamic stochastic dominance in bandit decision problems. Theory and Decision 47: 267–295

    Article  Google Scholar 

  • Pratt J.W. (1964) Risk aversion in the small and in the large. Econometrica 32(1–2): 61–75

    Google Scholar 

  • Rothschild M. (1974) Searching for the lowest price when the distribution of prices is unknown. Journal of Political Economy 82(4): 689–711

    Article  Google Scholar 

  • Whittle, P. (1982). Optimization over time: Dynamic programming and stochastic control (Vol. 1). New York: John Wiley & Sons.

    Google Scholar 

  • Wolpin K. (1984) An estimable stochastic model of fertility and child mortality. Journal of Political Economy 92(5): 852–874

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Michel De Lara.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chancelier, JP., De Lara, M. & de Palma, A. Risk aversion in expected intertemporal discounted utilities bandit problems. Theory Decis 67, 433–440 (2009). https://doi.org/10.1007/s11238-008-9105-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11238-008-9105-3

Keywords

Navigation