Gradient-based boosting for statistical relational learning: The relational dependency network case

Natarajan, Sriraam; Khot, Tushar; Kersting, Kristian; Gutmann, Bernd; Shavlik, Jude

doi:10.1007/s10994-011-5244-9

Published: 10 May 2011

Machine Learning, volume 86, pages 25–56 (2012)

Sriraam Natarajan¹, Tushar Khot², Kristian Kersting³, Bernd Gutmann⁴ & Jude Shavlik²

Abstract

Dependency networks approximate a joint probability distribution over multiple random variables as a product of conditional distributions. Relational Dependency Networks (RDNs) are graphical models that extend dependency networks to relational domains. This higher expressivity, however, comes at the expense of a more complex model-selection problem: an unbounded number of relational abstraction levels might need to be explored. Whereas current learning approaches for RDNs learn a single probability tree per random variable, we propose to turn the problem into a series of relational function-approximation problems using gradient-based boosting. In doing so, one can easily induce highly complex features over several iterations and in turn quickly estimate a very expressive model. Our experimental results on several data sets show that this boosting method learns RDNs efficiently compared with state-of-the-art statistical relational learning approaches.
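To make the idea of a "series of function-approximation problems" concrete, the following is a minimal sketch of functional-gradient boosting for a single conditional distribution. It is not the authors' implementation: it is propositional rather than relational, it substitutes a one-feature regression stump for a relational regression tree, and the names `fit_stump` and `boost` are hypothetical. It assumes the conditional is modeled as P(y = 1 | x) = sigmoid(ψ(x)), in which case the pointwise gradient of the log-likelihood with respect to ψ(x) is I(y = 1) − P(y = 1 | x); each round fits a regression model to these gradients and adds it to ψ.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def fit_stump(xs, targets):
    """Fit a least-squares regression stump on one numeric feature.

    Stand-in (assumption) for the regression learner used at each
    boosting round; returns a function mapping x to a real value.
    """
    best = None
    for t in sorted(set(xs)):
        left = [r for x, r in zip(xs, targets) if x <= t]
        right = [r for x, r in zip(xs, targets) if x > t]
        if not left or not right:
            continue
        lm = sum(left) / len(left)
        rm = sum(right) / len(right)
        err = sum((r - lm) ** 2 for r in left) + sum((r - rm) ** 2 for r in right)
        if best is None or err < best[0]:
            best = (err, t, lm, rm)
    if best is None:
        # No valid split: fall back to a constant prediction.
        mean = sum(targets) / len(targets)
        return lambda x: mean
    _, t, lm, rm = best
    return lambda x: lm if x <= t else rm


def boost(xs, ys, n_rounds=10, step=1.0):
    """Build psi(x) as a sum of stumps fitted to functional gradients."""
    stumps = []

    def psi(x):
        return sum(step * s(x) for s in stumps)

    for _ in range(n_rounds):
        # Pointwise gradient of the log-likelihood: I(y = 1) - P(y = 1 | x).
        grads = [y - sigmoid(psi(x)) for x, y in zip(xs, ys)]
        stumps.append(fit_stump(xs, grads))

    # Return the learned conditional probability estimate.
    return lambda x: sigmoid(psi(x))


if __name__ == "__main__":
    predict = boost([0, 1, 2, 3, 4, 5, 6, 7], [0, 0, 0, 0, 1, 1, 1, 1])
    print(predict(1), predict(6))
```

In the relational setting described in the abstract, the stump would be replaced by a learner that can test first-order conditions, and one such boosted model would be learned per random variable (predicate) of the dependency network.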

Author information

Authors and Affiliations

School of Medicine, Wake Forest University, Winston Salem, USA
Sriraam Natarajan
University of Wisconsin-Madison, Madison, USA
Tushar Khot & Jude Shavlik
Fraunhofer IAIS, Sankt Augustin, Germany
Kristian Kersting
K.U. Leuven, Leuven, Belgium
Bernd Gutmann

Corresponding author

Correspondence to Sriraam Natarajan.

Additional information

Editors: Paolo Frasconi and Francesca Lisi.

About this article

Cite this article

Natarajan, S., Khot, T., Kersting, K. et al. Gradient-based boosting for statistical relational learning: The relational dependency network case. Mach Learn 86, 25–56 (2012). https://doi.org/10.1007/s10994-011-5244-9

Received: 23 July 2010
Accepted: 27 February 2011
Published: 10 May 2011
Issue Date: January 2012
DOI: https://doi.org/10.1007/s10994-011-5244-9
