Abstract
In stochastic optimal control, a key issue is the fact that “solutions” are searched for in terms of “closed-loop control laws” over available information and, as a consequence, a major potential difficulty is the fact that present control may affect future available information. This is known as the “dual effect” of control. Our main result consists in characterizing the maximal set of closed-loop control laws containing open-loop ones and for which the information provided by observations closed with such a feedback remains fixed. We give more specific results in the two following cases: multi-agent systems and discrete time stochastic input-output systems with dynamic information structure.
Keywords
Stochastic control Dual effect Information structurePreview
Unable to display preview. Download preview PDF.
References
- Bar-Shalom, Y. and E. Tse. (1974). “Dual effect, Certainty Equivalence and Separation in Stochastic Control.” IEEE Trans. Automat. Control 19, 494–500.CrossRefGoogle Scholar
- Bismut, J.M. (1973). “An Example of Interaction Between Information and Control.” IEEE Trans. Automat. Control 18, 63–64.CrossRefGoogle Scholar
- Carpentier, P., G. Cohen, and J.-C. Culioli. (1995). “Stochastic Optimal Control and Decomposition-Coordination Methods—Part I: Theory.” In Roland Durier and Christian Michelot (Eds.), Recent Developments in Optimization. LNEMS 429, 72–87, Springer-Verlag, Berlin.Google Scholar
- Cormen, T.H., C.E. Leiserson, and R.L. Rivest. (1990). Introduction to Algorithms. The MIT Press, Cambridge, Massachusetts.Google Scholar
- Dellacherie, C. and P. Meyer. (1975). Probabilités et Potentiel. Hermann, Paris.Google Scholar
- Feldbaum, A.A. (1965). Optimal Control Systems. Academic, New York.Google Scholar
- Ho, Y.C. and K.C. Chu. (1972). “Team Decision Theory and Information Structure in Optimal Control Problems – Part I.” IEEE Trans. Automat. Control, 17(1), 15–22.Google Scholar
- Ho, Y.C. and K.C. Chu. (1974). “Information Structure in Dynamic Multi-Person Control Problems.” Automatica 10, 341–351.CrossRefGoogle Scholar
- Szász, G. (1971). Théorie des Treillis. Dunod, Paris.Google Scholar
- Tse, E. and Y. Bar-Shalom. (1975). “Generalized Certainty Equivalence and Dual Effect in Stochastic Control.” IEEE Trans. Automat. Control, pp. 817–819.Google Scholar
- Witsenhausen, H.S. (1968). “A Counterexample in Stochastic Optimal Control.” SIAM J. Control 6(1):131–147.CrossRefGoogle Scholar
- Witsenhausen, H.S. (1971a). “On Information Structures, Feedback and Causality.” SIAM J. Control 9(2), 149–160.CrossRefGoogle Scholar
- Witsenhausen, H.S. (1971b). “Separation of Estimation and Control for Discrete Time Systems.” Proceedings of the IEEE 69(11), 1557–1566.Google Scholar
- Witsenhausen, H.S. (1973). “A Standard form for Sequential Stochastic Control.” Mathematical Systems Theory 7(1), 5–11.CrossRefGoogle Scholar
- Witsenhausen, H.S. (1975). The Intrinsic Model for Discrete Stochastic Control: Some Open Problems. In A. Bensoussan, and J.L. Lions (eds.), Control Theory, Numerical Methods and Computer Systems Modelling, volume 107 of Lecture Notes in Economics and Mathematical Systems, pp. 322–335. Springer-Verlag.Google Scholar