Abstract.
This note concerns Markov decision processes on a discrete state space. It is supposed that the reward function is nonnegative and that the decision maker has a nonnull constant risk-sensitivity, which leads to grading random rewards via the expectation of an exponential utility function. The performance index is the risk-sensitive expected-total-reward criterion, and the existence of approximately optimal stationary policies, in the absolute and relative senses, is studied. The main results, derived under mild conditions, extend classical theorems in risk-neutral positive dynamic programming and can be summarized as follows: Assuming that the optimal value function is finite, it is proved that (i) ε-optimal stationary policies exist when the state and action spaces are both finite, and (ii) this conclusion extends to the denumerable state space case whenever (a) the decision maker is risk-averse, and (b) the optimal value function is bounded. This latter result is a (weak) risk-sensitive version of a classical theorem formulated by Ornstein (1969).
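The exponential-utility grading of random rewards mentioned in the abstract can be illustrated through the associated certainty equivalent. The sketch below is not from the paper; it assumes the standard formulation in which a risk-sensitivity coefficient λ ≠ 0 grades a random reward X by (1/λ) log E[exp(λX)], with λ < 0 corresponding to risk aversion. All names are hypothetical.

```python
import math

def certainty_equivalent(values, probs, lam):
    """Certainty equivalent of a discrete random reward under
    exponential utility: (1/lam) * log E[exp(lam * X)], lam != 0.
    lam < 0 models a risk-averse decision maker."""
    if lam == 0:
        raise ValueError("risk-sensitivity coefficient must be nonnull")
    expected_utility = sum(p * math.exp(lam * x) for x, p in zip(values, probs))
    return math.log(expected_utility) / lam

# A fair gamble paying 0 or 10 with equal probability:
values, probs = [0.0, 10.0], [0.5, 0.5]
mean = sum(p * x for x, p in zip(values, probs))        # risk-neutral value: 5.0
ce_averse = certainty_equivalent(values, probs, -0.5)   # below the mean (risk aversion)
ce_seeking = certainty_equivalent(values, probs, 0.5)   # above the mean (risk seeking)
print(mean, ce_averse, ce_seeking)
```

As λ → 0 the certainty equivalent approaches the ordinary expectation, recovering the risk-neutral criterion of classical positive dynamic programming.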
Manuscript received: October 1999/Final version received: April 2000
Cite this article
Cavazos-Cadena, R., Montes-de-Oca, R. Nearly optimal policies in risk-sensitive positive dynamic programming on discrete spaces. Mathematical Methods of OR 52, 133–167 (2000). https://doi.org/10.1007/s001860000068