Limit Synchronization in Markov Decision Processes

Doyen, Laurent; Massart, Thierry; Shirmohammadi, Mahsa

doi:10.1007/978-3-642-54830-7_4

Laurent Doyen¹⁷,
Thierry Massart¹⁸ &
Mahsa Shirmohammadi^17,18

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8412))

Included in the following conference series:

International Conference on Foundations of Software Science and Computation Structures

878 Accesses
4 Citations

Abstract

Markov decision processes (MDP) are finite-state systems with both strategic and probabilistic choices. After fixing a strategy, an MDP produces a sequence of probability distributions over states. The sequence is eventually synchronizing if the probability mass accumulates in a single state, possibly in the limit. Precisely, for 0 ≤ p ≤ 1 the sequence is p-synchronizing if a probability distribution in the sequence assigns probability at least p to some state, and we distinguish three synchronization modes: (i) sure winning if there exists a strategy that produces a 1-synchronizing sequence; (ii) almost-sure winning if there exists a strategy that produces a sequence that is, for all ε > 0, a (1-ε)-synchronizing sequence; (iii) limit-sure winning if for all ε > 0, there exists a strategy that produces a (1-ε)-synchronizing sequence. We consider the problem of deciding whether an MDP is sure, almost-sure, or limit-sure winning, and we establish the decidability and optimal complexity for all modes, as well as the memory requirements for winning strategies. Our main contributions are as follows: (a) for each winning modes we present characterizations that give a PSPACE complexity for the decision problems, and we establish matching PSPACE lower bounds; (b) we show that for sure winning strategies, exponential memory is sufficient and may be necessary, and that in general infinite memory is necessary for almost-sure winning, and unbounded memory is necessary for limit-sure winning; (c) along with our results, we establish new complexity results for alternating finite automata over a one-letter alphabet.

This work was partially supported by the Belgian Fonds National de la Recherche Scientifique (FNRS), and by the PICS project Quaverif funded by the French Centre National de la Recherche Scientifique (CNRS).

Download to read the full chapter text

Chapter PDF

Robust Synchronization in Markov Decision Processes

Strategy Complexity of Finite-Horizon Markov Decision Processes and Simple Stochastic Games

Bounds for Synchronizing Markov Decision Processes

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Agrawal, M., Akshay, S., Genest, B., Thiagarajan, P.S.: Approximate verification of the symbolic dynamics of Markov chains. In: LICS, pp. 55–64. IEEE (2012)
Google Scholar
Aspnes, J., Herlihy, M.: Fast randomized consensus using shared memory. J. Algorithm 11(3), 441–461 (1990)
Article MATH MathSciNet Google Scholar
Baier, C., Bertrand, N., Größer, M.: On decision problems for probabilistic Büchi automata. In: Amadio, R.M. (ed.) FOSSACS 2008. LNCS, vol. 4962, pp. 287–301. Springer, Heidelberg (2008)
Chapter Google Scholar
Baier, C., Bertrand, N., Schnoebelen, P.: On computing fixpoints in well-structured regular model checking, with applications to lossy channel systems. In: Hermann, M., Voronkov, A. (eds.) LPAR 2006. LNCS (LNAI), vol. 4246, pp. 347–361. Springer, Heidelberg (2006)
Chapter Google Scholar
Baldoni, R., Bonnet, F., Milani, A., Raynal, M.: On the solvability of anonymous partial grids exploration by mobile robots. In: Baker, T.P., Bui, A., Tixeuil, S. (eds.) OPODIS 2008. LNCS, vol. 5401, pp. 428–445. Springer, Heidelberg (2008)
Chapter Google Scholar
Chadha, R., Korthikanti, V.A., Viswanathan, M., Agha, G., Kwon, Y.: Model checking MDPs with a unique compact invariant set of distributions. In: Proc. of QEST, pp. 121–130. IEEE Computer Society (2011)
Google Scholar
Chatterjee, K., Henzinger, T.A.: A survey of stochastic ω-regular games. J. Comput. Syst. Sci. 78(2), 394–413 (2012)
Article MATH MathSciNet Google Scholar
de Alfaro, L.: Formal Verification of Probabilistic Systems. PhD thesis, Stanford University (1997)
Google Scholar
de Alfaro, L., Henzinger, T.A., Kupferman, O.: Concurrent reachability games. Theor. Comput. Sci. 386(3), 188–217 (2007)
Article MATH Google Scholar
Doyen, L., Massart, T., Shirmohammadi, M.: Infinite synchronizing words for probabilistic automata. In: Murlak, F., Sankowski, P. (eds.) MFCS 2011. LNCS, vol. 6907, pp. 278–289. Springer, Heidelberg (2011)
Chapter Google Scholar
Doyen, L., Massart, T., Shirmohammadi, M.: Infinite synchronizing words for probabilistic automata (Erratum). CoRR, abs/1206.0995 (2012)
Google Scholar
Doyen, L., Massart, T., Shirmohammadi, M.: Limit synchronization in Markov decision processes. CoRR, abs/1310.2935 (2013)
Google Scholar
Fokkink, W., Pang, J.: Variations on Itai-Rodeh leader election for anonymous rings and their analysis in PRISM. Journal of Universal Computer Science 12(8), 981–1006 (2006)
Google Scholar
Gimbert, H., Oualhadj, Y.: Probabilistic automata on finite words: Decidable and undecidable problems. In: Abramsky, S., Gavoille, C., Kirchner, C., Meyer auf der Heide, F., Spirakis, P.G. (eds.) ICALP 2010, part II. LNCS, vol. 6199, pp. 527–538. Springer, Heidelberg (2010)
Chapter Google Scholar
Henzinger, T.A., Mateescu, M., Wolf, V.: Sliding window abstraction for infinite Markov chains. In: Bouajjani, A., Maler, O. (eds.) CAV 2009. LNCS, vol. 5643, pp. 337–352. Springer, Heidelberg (2009)
Chapter Google Scholar
Holzer, M.: On emptiness and counting for alternating finite automata. In: Developments in Language Theory, pp. 88–97 (1995)
Google Scholar
Jancar, P., Sawa, Z.: A note on emptiness for alternating finite automata with a one-letter alphabet. Inf. Process. Lett. 104(5), 164–167 (2007)
Article MATH MathSciNet Google Scholar
Korthikanti, V.A., Viswanathan, M., Agha, G., Kwon, Y.: Reasoning about MDPs as transformers of probability distributions. In: Proc. of QEST, pp. 199–208. IEEE Computer Society (2010)
Google Scholar
Vardi, M.Y.: Automatic verification of probabilistic concurrent finite-state programs. In: Proc. of FOCS, pp. 327–338. IEEE Computer Society (1985)
Google Scholar
Volkov, M.V.: Synchronizing automata and the Černý conjecture. In: Martín-Vide, C., Otto, F., Fernau, H. (eds.) LATA 2008. LNCS, vol. 5196, pp. 11–27. Springer, Heidelberg (2008)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

LSV, ENS Cachan & CNRS, France
Laurent Doyen & Mahsa Shirmohammadi
Université Libre de Bruxelles, Belgium
Thierry Massart & Mahsa Shirmohammadi

Authors

Laurent Doyen
View author publications
You can also search for this author in PubMed Google Scholar
Thierry Massart
View author publications
You can also search for this author in PubMed Google Scholar
Mahsa Shirmohammadi
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

LaBRI, Universiy of Bordeaux, France
Anca Muscholl

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Doyen, L., Massart, T., Shirmohammadi, M. (2014). Limit Synchronization in Markov Decision Processes. In: Muscholl, A. (eds) Foundations of Software Science and Computation Structures. FoSSaCS 2014. Lecture Notes in Computer Science, vol 8412. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-54830-7_4

Download citation

DOI: https://doi.org/10.1007/978-3-642-54830-7_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-54829-1
Online ISBN: 978-3-642-54830-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Limit Synchronization in Markov Decision Processes

Abstract

Chapter PDF

Similar content being viewed by others

Robust Synchronization in Markov Decision Processes

Strategy Complexity of Finite-Horizon Markov Decision Processes and Simple Stochastic Games

Bounds for Synchronizing Markov Decision Processes

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Limit Synchronization in Markov Decision Processes

Abstract

Chapter PDF

Similar content being viewed by others

Robust Synchronization in Markov Decision Processes

Strategy Complexity of Finite-Horizon Markov Decision Processes and Simple Stochastic Games

Bounds for Synchronizing Markov Decision Processes

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation