J. Bobadilla, F. Ortega, A. Hernando, A. Gutiérrez, Recommender systems survey. Knowl. Based Syst. 46, 109–132 (2013)
CrossRef
Google Scholar
B.L. Bowerman, Nonstationary Markov Decision Processes and Related Topics in Nonstationary Markov Chains (1974)
Google Scholar
G. Casella, R.L. Berger, Statistical Inference, vol. 2 (Duxbury Pacific Grove, CA, 2002)
Google Scholar
K.A. Ciosek, S. Whiteson, OFFER: off-environment reinforcement learning, in Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, February 4-9, 2017, San Francisco, California, USA, ed. by S.P. Singh, S. Markovitch (AAAI Press, 2017), pp. 1819–1825
Google Scholar
V. Conitzer, T. Sandholm, Computing the optimal strategy to commit to, in Proceedings 7th ACM Conference on Electronic Commerce (EC-2006), Ann Arbor, Michigan, USA, June 11-15, 2006, ed. by J. Feigenbaum, J.C.-I. Chuang, D.M. Pennock (ACM, 2006), pp. 82–90
Google Scholar
D. Ernst, P. Geurts, L. Wehenkel, Tree-based batch mode reinforcement learning. J. Mach. Learn. Res. 6, 503–556 (2005)
MathSciNet
MATH
Google Scholar
C. Florensa, D. Held, M. Wulfmeier, M. Zhang, P. Abbeel, Reverse curriculum generation for reinforcement learning, in 1st Annual Conference on Robot Learning, CoRL 2017, Mountain View, California, USA, November 13-15, 2017, Proceedings, vol. 78 of Proceedings of Machine Learning Research (PMLR, 2017), pp. 482–495
Google Scholar
V. Gallego, R. Naveiro, D.R. Insua, Reinforcement learning under threats, in The Thirty-Third AAAI Conference on Artificial Intelligence, AAAI 2019, The Thirty-First Innovative Applications of Artificial Intelligence Conference, IAAI 2019, The Ninth AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019, Honolulu, Hawaii, USA, January 27 - February 1, 2019 (AAAI Press, 2019), pp. 9939–9940
Google Scholar
V. Gallego, R. Naveiro, D.R. Insua, D. Gómez-Ullate, Opponent aware reinforcement learning, in CoRR, abs/1908.08773 (2019)
Google Scholar
J. García, F. Fernández, A comprehensive survey on safe reinforcement learning. J. Mach. Learn. Res. 16, 1437–1480 (2015)
MathSciNet
MATH
Google Scholar
T. Haarnoja, S. Ha, A. Zhou, J. Tan, G. Tucker, S. Levine, Learning to walk via deep reinforcement learning, in Robotics: Science and Systems XV, University of Freiburg, Freiburg im Breisgau, Germany, June 22-26, 2019, ed. by A. Bicchi, H. Kress-Gazit, S. Hutchinson (2019)
Google Scholar
S. Keren, L.E. Pineda, A. Gal, E. Karpas, S. Zilberstein, Equi-reward utility maximizing design in stochastic environments, in Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, IJCAI 2017, Melbourne, Australia, August 19-25, 2017, ed. by C. Sierra, ijcai.org (2017), pp. 4353–4360
Google Scholar
B. Ravi Kiran, I. Sobh, V. Talpaert, P. Mannion, A.A. Al Sallab, S.K. Yogamani, P. Pérez, Deep reinforcement learning for autonomous driving: a survey, in CoRR. abs/2002.00444 (2020)
Google Scholar
J. Kober, J. Andrew Bagnell, J. Peters, Reinforcement learning in robotics: a survey. I. J. Robotics Res. 32(11), 1238–1274 (2013)
Google Scholar
D. Loiacono, A. Prete, P.L. Lanzi, L. Cardamone, Learning to overtake in TORCS using simple reinforcement learning, in Proceedings of the IEEE Congress on Evolutionary Computation, CEC 2010, Barcelona, Spain, 18-23 July 2010 (IEEE, 2010), pp. 1–8
Google Scholar
D. Lu, Q. Weng, A survey of image classification methods and techniques for improving classification performance. Int. J. Remote Sensing 28(5), 823–870 (2007)
Google Scholar
D.G. Luenberger, Introduction to dynamic systems; theory, models, and applications. Technical report (1979)
Google Scholar
A.M. Metelli, Exploiting Environment Configurability in Reinforcement Learning. PhD thesis, Politecnico di Milano, March 2021
Google Scholar
A.M. Metelli, E. Ghelfi, M. Restelli, Reinforcement learning in configurable continuous environments, in Proceedings of the 36th International Conference on Machine Learning, ICML 2019, 9-15 June 2019, Long Beach, California, USA, ed. by K. Chaudhuri, R. Salakhutdinov, vol. 97 of Proceedings of Machine Learning Research (PMLR, 2019), pp. 4546–4555
Google Scholar
A.M. Metelli, G. Manneschi, M. Restelli, Policy space identification in configurable environments, in CoRR, abs/1909.03984 (2019)
Google Scholar
A.M. Metelli, F. Mazzolini, L. Bisi, L. Sabbioni, M. Restelli, Control frequency adaptation via action persistence in batch reinforcement learning, in Proceedings of the 37th International Conference on Machine Learning, ICML 2020, 13-18 July 2020, Virtual Event, vol. 119 of Proceedings of Machine Learning Research (PMLR, 2020), pp. 6862–6873
Google Scholar
A.M. Metelli, M. Mutti, M. Restelli, Configurable markov decision processes, in Proceedings of the 35th International Conference on Machine Learning, ICML 2018, Stockholmsmässan, Stockholm, Sweden, July 10-15, 2018, ed. by J.G. Dy A. Krause, vol. 80 of Proceedings of Machine Learning Research (PMLR, 2018), 3488–3497
Google Scholar
T.M. Mitchell. Machine Learning, International Edition. McGraw-Hill Series in Computer Science (McGraw-Hill, 1997)
Google Scholar
V. Mnih, K. Kavukcuoglu, D. Silver, A. Graves, I. Antonoglou, D. Wierstra, M.A. Riedmiller, Playing atari with deep reinforcement learning, in CoRR, abs/1312.5602 (2013)
Google Scholar
J.E. Moody, M. Saffell, Learning to trade via direct reinforcement. IEEE Trans. Neural Netw. 12(4), 875–889 (2001)
CrossRef
Google Scholar
J. Nash, Non-cooperative games. Ann. Math. 54(2), 286–295 (1951)
Google Scholar
T. Osa, J. Pajarinen, G. Neumann, J. Andrew Bagnell, P. Abbeel, J. Peters, An algorithmic perspective on imitation learning. Foundations Trends Robot. 7(1–2), 1–179 (2018)
Google Scholar
J. Peters, K. Mülling, Y. Altun, Relative entropy policy search, in Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence, AAAI 2010, Atlanta, Georgia, USA, July 11-15, 2010, ed. by M. Fox, D. Poole (AAAI Press, 2010)
Google Scholar
J. Puigcerver, Are multidimensional recurrent layers really necessary for handwritten text recognition?, in 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), vol. 1, pp. 67–72 (IEEE, 2017)
Google Scholar
M.L. Puterman, Markov Decision Processes: Discrete Stochastic Dynamic Programming. John Wiley & Sons (2014)
Google Scholar
G. Ramponi, A.M. Metelli, A. Concetti, M. Restelli, Online learning in non-cooperative configurable Markov decision process, in AAAI-21 Workshop on Reinforcement Learning in Games (2021)
Google Scholar
S.J. Russell, P. Norvig, Artificial Intelligence– - Modern Approach (Third International Edition, Pearson Education, 2010)
Google Scholar
L.S Shapley, Stochastic games. Proc. Natl. Acad. Sci. 39(10), 1095–1100 (1953)
Google Scholar
R. Silva, F.S. Melo, M. Veloso, What if the world were different? gradient-based exploration for new optimal policies, in GCAI-2018, 4th Global Conference on Artificial Intelligence, Luxembourg, September 18-21, 2018, ed. by D.D. Lee, A. Steen, T. Walsh, vol. 55 of EPiC Series in Computing (EasyChair, 2018), pp. 229–242
Google Scholar
B.F. Skinner, The Behavior of Organisms: An Experimental Analysis (1938)
Google Scholar
R.S. Sutton, A.G. Barto, Reinforcement Learning: An Introduction. MIT press (2018)
Google Scholar
H. Von Stackelberg, Marktform und gleichgewicht. J. Springer (1934)
Google Scholar
H. Zhang, Y. Chen, D.C. Parkes, A general approach to environment design with one agent, in IJCAI 2009, Proceedings of the 21st International Joint Conference on Artificial Intelligence, Pasadena, California, USA, July 11-17, 2009, ed. by C. Boutilier (2009), pp. 2002–2014
Google Scholar