Abstract
This chapter deals with the estimation of the optimality deviation of control policies under a discounted criterion in semi-Markov control models. Assuming that the holding times distributionF is unknown, but it is possible to get an approximate distribution\(\tilde{F}\), such optimality deviation measures the quality of this approximation according if an optimal policy for the control model corresponding to\(\tilde{F}\) is still good for the original one determined forF.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Abbad, M.; Filar, J.A. Perturbation and stability theory for Markov control processes. IEEE Trans. Automat. Control. 1992, 37: 1415–1420.
Gordienko, E.; Lemus-Rodríguez, E. Estimation of robustness for controlled diffusion processes. Stochastics. 1999, 17: 421–441.
Gordienko, E.; Minjárez-Sosa, J.A. Adaptive control for discrete-time Markov processes with unbounded costs: discounted criterion. Kybernetika. 1998, 34: 217–234.
Gordienko, E.; Salem-Silva, F. Robustness inequality for Markov control processes with unbounded costs. Systems Control Lett. 1998, 33, 125–130.
Hernández-Lerma, O.; Lasserre, J.B. Further Topics on Discrete-Time Markov Control Processes; Springer, New York, 1999.
Hilgert, N.; Minjárez-Sosa, J.A. Adaptive policies for time-varying stochastic systems under discounted criterion. Math. Meth. Oper. Res. 2001, 54: 491–505.
Luque-Vásquez, F.; Minjárez-Sosa, J.A. Semi-Markov control processes with unknown holding times distribution under a discounted criterion. Math. Meth. Oper. Res. 2005, 61: 455–468.
Luque-Vásquez, F.; Robles-Alcaraz, M.T. Controlled semi-Markov models with discounted unbounded costs. Bol. Soc. Mat. Mexicana. 1994, 39: 51–68.
Minjárez-Sosa, J.A. Approximation and estimation in Markov control processes under a discounted criterion. Kybernetika. 2004, 40,6: 681–690.
Montes-de-Oca, R.; Sakhanenko, A.I.; Salem-Silva, F. Estimates for perturbations of general discounted Markov control chains. Appl. Math. 2003, 30,3 pp.287–304.
Van Dijk, N.M. Perturbation theory for unbounded Markov reward processes with applications to queueing. Adv. Appl. Probab. 1988, 20: 99–111.
Van Dijk, N.M.; Puterman, M.L. Perturbation theory for Markov reward processes with applications to queueing systems. Adv. Appl. Probab. 1988, 20: 79–98.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
del Carmen Rosas-Rosas, L. (2012). Estimation of the Optimality Deviation in Discounted Semi-Markov Control Models. In: Hernández-Hernández, D., Minjárez-Sosa, J. (eds) Optimization, Control, and Applications of Stochastic Systems. Systems & Control: Foundations & Applications. Birkhäuser, Boston. https://doi.org/10.1007/978-0-8176-8337-5_15
Download citation
DOI: https://doi.org/10.1007/978-0-8176-8337-5_15
Published:
Publisher Name: Birkhäuser, Boston
Print ISBN: 978-0-8176-8336-8
Online ISBN: 978-0-8176-8337-5
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)