Risk aversion in expected intertemporal discounted utilities bandit problems

Chancelier, Jean-Philippe; De Lara, Michel; de Palma, André

doi:10.1007/s11238-008-9105-3

Risk aversion in expected intertemporal discounted utilities bandit problems

Published: 21 April 2008

Volume 67, pages 433–440, (2009)
Cite this article

Theory and Decision Aims and scope Submit manuscript

Jean-Philippe Chancelier¹,
Michel De Lara¹ &
André de Palma²

134 Accesses
7 Citations
Explore all metrics

Abstract

We consider a situation where an individual is facing an uncertain situation, but may costly alter his knowledge of the uncertainties. We study in this context how risk aversion may modify the individual search behavior. We consider a one-armed bandit problem (where one arm is safe and the other is risky) and study how the agent risk aversion can change the sequence of arms selected. The main result is that when the utility function is more concave, the agent has more chances to select the safe arm. We also discuss how search is affected by risk aversion.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Robust Risk-Averse Stochastic Multi-armed Bandits

Bandit Problems

Allocation Strategies Based on Possibilistic Rewards for the Multi-armed Bandit Problem: A Numerical Study and Regret Analysis

References

Berry, D. A., & Fristedt, B. (1985). Bandit problems: Sequential allocation of experiments. Chapman and Hall.
Chancelier J.-P., De Lara M., Palma A. (2007) Risk aversion, road choice and the one-armed bandit problem. Transportation Science 41(1): 1–14
Article Google Scholar
Denardo E.V., Park H., Rothblum U.G. (2007) Risk-sensitive and risk-neutral multiarmed bandits. Mathematics of Operations Research 32(2): 374–394
Article Google Scholar
Diamond, P., Rothschild, M. (eds) (1978) Uncertainty in economics. Academic Press, Orlando
Google Scholar
Gittins J.C. (1979) Bandit processes and dynamic allocation indices. Journal of the Royal Statistical Society. Series B 41(2): 148–177
Google Scholar
Gittins J.C. (1989) Multi-armed bandit allocation indices. New York, Wiley
Google Scholar
Gollier C. (2001) The economics of risk and time. MIT Press, Cambridge
Google Scholar
Magnac T., Robin J.-M. (1999) Dynamic stochastic dominance in bandit decision problems. Theory and Decision 47: 267–295
Article Google Scholar
Pratt J.W. (1964) Risk aversion in the small and in the large. Econometrica 32(1–2): 61–75
Google Scholar
Rothschild M. (1974) Searching for the lowest price when the distribution of prices is unknown. Journal of Political Economy 82(4): 689–711
Article Google Scholar
Whittle, P. (1982). Optimization over time: Dynamic programming and stochastic control (Vol. 1). New York: John Wiley & Sons.
Google Scholar
Wolpin K. (1984) An estimable stochastic model of fertility and child mortality. Journal of Political Economy 92(5): 852–874
Article Google Scholar

Download references

Author information

Authors and Affiliations

Université Paris-Est, CERMICS, 6 et 8 avenue Blaise Pascal, 77455, Marne la Vallée Cedex 2, France
Jean-Philippe Chancelier & Michel De Lara
Université de Cergy-Pontoise, Cergy-Pontoise Cedex and ENPC, France
André de Palma

Authors

Jean-Philippe Chancelier
View author publications
You can also search for this author in PubMed Google Scholar
Michel De Lara
View author publications
You can also search for this author in PubMed Google Scholar
André de Palma
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Michel De Lara.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chancelier, JP., De Lara, M. & de Palma, A. Risk aversion in expected intertemporal discounted utilities bandit problems. Theory Decis 67, 433–440 (2009). https://doi.org/10.1007/s11238-008-9105-3

Download citation

Received: 27 March 2008
Accepted: 01 April 2008
Published: 21 April 2008
Issue Date: October 2009
DOI: https://doi.org/10.1007/s11238-008-9105-3

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Risk aversion in expected intertemporal discounted utilities bandit problems

Abstract

Access this article

Similar content being viewed by others

Robust Risk-Averse Stochastic Multi-armed Bandits

Bandit Problems

Allocation Strategies Based on Possibilistic Rewards for the Multi-armed Bandit Problem: A Numerical Study and Regret Analysis

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Risk aversion in expected intertemporal discounted utilities bandit problems

Abstract

Access this article

Similar content being viewed by others

Robust Risk-Averse Stochastic Multi-armed Bandits

Bandit Problems

Allocation Strategies Based on Possibilistic Rewards for the Multi-armed Bandit Problem: A Numerical Study and Regret Analysis

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation