Definition
Relational reinforcement learning is concerned with learning behavior or control policies based on a numerical feedback signal, much like standard reinforcement learning, in complex domains where states (and actions) are largely characterized by the presence of objects, their properties, and the existing relations between those objects. Relational reinforcement learning uses approaches similar to those used for standard reinforcement learning, but extends these with methods that can abstract over specific object identities and exploit the structural information available in the environment.
Motivation and Background
Reinforcement learningis a very attractive machine learning framework, as it tackles, in a sense, the whole artificial intelligence problem at a small scale: an agent acts in an unknown environment and has to learn how to behave optimally by reinforcement, i.e., through...
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Recommended Reading
Blockeel H, De Raedt L (1998) Top-down induction of first order logical decision trees. Artif Intell 101(1–2):285–297
Boutilier C, Reiter R, Price B (2001) Symbolic dynamic programming for first-order MDPs. In: Proceedings of the 17th international joint conference on artificial intelligence (IJCAI-2001), Seattle, pp 690–700
Croonenborghs T (2009) Model-assisted approaches for relational reinforcement learning. Ph.D. thesis, Department of Compute Science, Katholieke Universiteit Leuven
Driessens K (2004) Relational reinforcement learning. Ph.D. thesis, Department of Computer Science, Katholieke Universiteit Leuven
Driessens K, Fern A, van Otterlo M (eds) (2005) Proceedings of ICML-2005 workshop on rich representation for reinforcement learning, Bonn
Džeroski S, De Raedt L, Blockeel H (1998) Relational reinforcement learning. In: Proceedings of the 15th international conference on machine learning (ICML-1998), San Francisco. Morgan Kaufmann, Madison, pp 136–143
Džeroski S, De Raedt L, Driessens K (2001) Relational reinforcement learning. Mach Learn 43:7–52
Fern A, Yoon S, Givan R (2006) Approximate policy iteration with a policy language bias: solving relational Markov decision processes. J Artif Intell Res 25:85–118
Friedman J (2001) Greedy function approximation: a gradient boosting machine. Ann Stat 29:1189–1232
Kersting K, Driessens K (2008) Non-parametric policy gradients: a unified treatment of propositional and relational domains. In: McAllum A, Roweis S (eds) Proceedings of the 25th international conference on machine learning (ICML 2008), Helsinki, pp 456–463
Kersting K, van Otterlo M, De Raedt L (2004) Bellman goes relational. In: Proceedings of the twenty-first international conference on machine learning (ICML-2004), Banff, pp 465–472
Rubinstein RY (1997) Optimization of computer simulation models with rare events. Eur J Oper Res 99(1):89–112
Sanner S (2008) First-order decision-theoretic planning in structured relational environments. Ph.D. thesis, Department of Compute Science, University of Toronto
Sanner S, Boutilier C (2005) Approximate linear programming for first-order MDPs. In: Proceedings of the 21st conference on Uncertainty in AI (UAI), Edinburgh
Sarjant S (2013) Policy search based relational reinforcement learning using the cross-entropy method. Ph.D. thesis, Department of Computer Science, University of Waikato
Sarjant S, Pfahringer B, Driessens K, Smith T (2014) A Direct Policy-Search Algorithm for Relational Reinforcement Learning. In: Proceedings of the 25th international conference on inductive logic programming (ILP 2013), Rio de Janeiro, pp 76–92
Sutton RS, Barto AG (1998) Reinforcement learning: an introduction. MIT, Cambridge
Sutton RS, McAllester D, Singh S, Mansour Y (2000) Policy gradient methods for reinforcement learning with function approximation. In: Advances in neural information processing systems, vol 12. MIT, Cambridge, pp 1057–1063
Tadepalli P, Givan R, Driessens K (eds) (2004) Proceedings of the ICML-2004 workshop on relational reinforcement learning, Banff
van Otterlo M (2008) The logic of adaptive learning. Ph.D. thesis, Centre for Telematics and Information Technology, University of Twente
van Otterlo M (2009) The logic of adaptive behavior: knowledge representation and algorithms for adaptive sequential decision making under uncertainty in first-order and relational domains. IOS Press, Amsterdam
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Science+Business Media New York
About this entry
Cite this entry
Driessens, K. (2017). Relational Reinforcement Learning. In: Sammut, C., Webb, G.I. (eds) Encyclopedia of Machine Learning and Data Mining. Springer, Boston, MA. https://doi.org/10.1007/978-1-4899-7687-1_726
Download citation
DOI: https://doi.org/10.1007/978-1-4899-7687-1_726
Published:
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4899-7685-7
Online ISBN: 978-1-4899-7687-1
eBook Packages: Computer ScienceReference Module Computer Science and Engineering