Abstract
We present relational reinforcement learning, a learning technique that combines reinforcement learning with relational learning or inductive logic programming. Owing to its more expressive representation language for states, actions and Q-functions, relational reinforcement learning can potentially be applied to a new range of learning tasks. One such task, investigated here, is planning in the blocks world, where the effects of actions are assumed unknown to the agent and the agent has to learn a policy. Within this simple domain we show that relational reinforcement learning solves some existing problems with reinforcement learning. In particular, it allows us to employ structural representations, to abstract from the specific goals pursued, and to exploit the results of previous learning phases when addressing new (more complex) situations.
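To make the setting concrete, below is a minimal sketch of Q-learning in a relationally represented blocks world. It is illustrative only: states are sets of on/2 facts, actions are move(x, y) terms, and a plain Q-table stands in for the first-order logical decision trees that the paper uses to generalize Q-values; all identifiers (initial_state, actions, apply, GOAL) are hypothetical, not the authors' code.

```python
# Toy Q-learning over relational blocks-world states.
# A state is a frozenset of facts like ("on", "a", "floor");
# an action is a term like ("move", "a", "b").
# The paper replaces the Q-table with a learned relational
# regression tree; this sketch only illustrates the representation.
import random

BLOCKS = ("a", "b", "c")
GOAL = ("on", "a", "b")            # illustrative goal: a on b

def initial_state():
    # every block starts on the floor
    return frozenset(("on", b, "floor") for b in BLOCKS)

def clear(state, x):
    # x is clear if no block sits on it
    return all(not (f[0] == "on" and f[2] == x) for f in state)

def actions(state):
    # move(x, y): x must be clear; y is the floor or a clear block != x
    acts = []
    for x in BLOCKS:
        if not clear(state, x):
            continue
        for y in BLOCKS + ("floor",):
            if y != x and (y == "floor" or clear(state, y)):
                acts.append(("move", x, y))
    return acts

def apply(state, act):
    # replace x's single on-fact with the new destination
    _, x, y = act
    src = next(f for f in state if f[0] == "on" and f[1] == x)
    return (state - {src}) | {("on", x, y)}

def q_learn(episodes=2000, alpha=0.3, gamma=0.9, eps=0.2):
    Q = {}
    for _ in range(episodes):
        s = initial_state()
        for _ in range(20):                      # episode step limit
            acts = actions(s)
            if random.random() < eps:
                a = random.choice(acts)          # explore
            else:
                a = max(acts, key=lambda a: Q.get((s, a), 0.0))
            s2 = apply(s, a)
            r = 1.0 if GOAL in s2 else 0.0       # reward only at the goal
            best_next = max((Q.get((s2, a2), 0.0) for a2 in actions(s2)),
                            default=0.0)
            Q[(s, a)] = Q.get((s, a), 0.0) + alpha * (
                r + gamma * best_next - Q.get((s, a), 0.0))
            s = s2
            if r > 0:
                break
    return Q

if __name__ == "__main__":
    Q = q_learn()
    s = initial_state()
    a = max(actions(s), key=lambda a: Q.get((s, a), 0.0))
    print("greedy first action:", a)   # expect ('move', 'a', 'b')
```

A tabular learner like this cannot transfer what it learns to a different goal or a different number of blocks; replacing the table with a relational generalization over the facts is precisely the step that relational reinforcement learning contributes.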
Cite this article
Džeroski, S., De Raedt, L. & Driessens, K. Relational Reinforcement Learning. Machine Learning 43, 7–52 (2001). https://doi.org/10.1023/A:1007694015589