Relational Learning with Statistical Predicate Invention: Better Models for Hypertext

Craven, Mark; Slattery, Seán

doi:10.1023/A:1007676901476

Relational Learning with Statistical Predicate Invention: Better Models for Hypertext

Published: April 2001

Volume 43, pages 97–119, (2001)
Cite this article

Download PDF

Machine Learning Aims and scope Submit manuscript

Relational Learning with Statistical Predicate Invention: Better Models for Hypertext

Download PDF

Mark Craven¹ &
Seán Slattery²

584 Accesses
56 Citations
Explore all metrics

Abstract

We present a new approach to learning hypertext classifiers that combines a statistical text-learning method with a relational rule learner. This approach is well suited to learning in hypertext domains because its statistical component allows it to characterize text in terms of word frequencies, whereas its relational component is able to describe how neighboring documents are related to each other by hyperlinks that connect them. We evaluate our approach by applying it to tasks that involve learning definitions for (i) classes of pages, (ii) particular relations that exist between pairs of pages, and (iii) locating a particular class of information in the internal structure of pages. Our experiments demonstrate that this new approach is able to learn more accurate classifiers than either of its constituent methods alone.

References

Cestnik, B. (1990). Estimating probabilities: A crucial task in machine learning. In Proceedings of the Ninth European Conference on Artificial Intelligence (pp. 147–150). Stockholm, Sweden: Pitman.
Google Scholar
Cohen, W. W. (1995a). Fast effective rule induction. In Proceedings of the Twelfth International Conference on Machine Learning (pp. 115–123). Tahoe City, CA: Morgan Kaufmann.
Google Scholar
Cohen, W. W. (1995b). Learning to classify English text with ILP methods. In L. D. Raedt (Ed.), Advances in Inductive Logic Programming. Amsterdam, The Netherlands: IOS Press.
Google Scholar
Craven, M., DiPasquo, D., Freitag, D., McCallum, A., Mitchell, T., Nigam, K.,& Slattery, S. (1998a). Learning to extract symbolic knowledge from the World Wide Web. In Proceedings of the Fifteenth National Conference on Artificial Intelligence (pp. 509–516). Madison, WI: AAAI Press.
Google Scholar
Craven, M., Slattery, S.,& Nigam, K. (1998b). First-order learning for Web mining. In Proceedings of the Tenth European Conference on Machine Learning (pp. 250–255). Chemnitz, Germany: Springer-Verlag.
Google Scholar
DiPasquo, D. (1998). Using HTML formatting to aid in natural language processing on the World Wide Web. Senior Thesis, Computer Science Department, Carnegie Mellon University.
Domingos, P.& Pazzani, M. (1997). On the optimality of the simple Bayesian classifier under zero-one loss. Machine Learning, 29, 103–130.
Google Scholar
Džeroski, S.& Bratko, I. (1992). Handling noise in inductive logic programming. In Proceedings of the Second International Workshop on Inductive Logic Programming (pp. 109–125). Tokyo, Japan.
Ehrenfeucht, A., Haussler, D., Kearns, M.,& Valiant, L. (1989). A general lower bound on the number of examples needed for learning. Information and Computation, 82(3), 247–251.
Google Scholar
Friedman, N., Getoor, L., Koller, D., & Pfeffer, A. (1999). Learning probabilistic relational models. In Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence (pp. 1300–1307). Stockholm, Sweden: Morgan Kaufmann.
Google Scholar
Joachims, T., Freitag, D.,& Mitchell, T. (1997).WebWatcher: A tour guide for the World Wide Web. In Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence (pp. 770–775). Nogoya, Japan: Morgan Kaufmann.
Google Scholar
Kijsirikul, B., Numao, M.,& Shimura, M. (1992). Discrimination-based constructive induction of logic programs. In Proceedings of the Tenth National Conference on Artificial Intelligence (pp. 44–49). San Jose, CA: AAAI Press.
Google Scholar
Koller, D.& Pfeffer, A. (1997). Learning probabilities for noisy first-order rules. In Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence (pp. 1316–1321). Nagoya, Japan: Morgan Kaufmann.
Google Scholar
Kramer, S. (1995). Predicate invention: A comprehensive view. Technical Report OFAI-TR-95-32, Austrian Research Institute for Artificial Intelligence, Vienna, Austria.
Google Scholar
Kushmerick, N., Weld, D. S.,& Doorenbos, R. (1997). Wrapper induction for information extraction. In Proceedings of the Fifteenth International Joint Conference on Artificial Intelligence (pp. 729–737). Nagoya, Japan: Morgan Kaufmann.
Google Scholar
Lewis, D. D.& Ringuette, M. (1994). A comparison of two learning algorithms for text categorization. In Proceedings of the Third Annual Symposium on Document Analysis and Information Retrieval. (pp. 81–93). ISRI; University of Nevada, Las Vegas.
Google Scholar
Lewis, D. D., Schapire, R. E., Callan, J. P.,& Papka, R. (1996). Training algorithms for linear classifiers. In Proceedings of the Nineteenth Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval (pp. 298–306). Zurich, Switzerland: ACM.
Google Scholar
Mitchell, T. (1997). Machine learning. New York: McGraw Hill.
Google Scholar
Mladeni´c, D. (1996). PersonalWebWatcher: Design and implementation. Technical Report IJS-DP-7472, Department for Intelligent Systems, J. Stefan Institute, Ljubljana, Slovenia.
Google Scholar
Moulinier, I., Raškinis, G.,& Ganascia, J.-G. (1996). Text categorization: A symbolic approach. In Proceedings of the 6th Annual Symposium on Document Analysis and Information Retrieval. Las Vegas, NV.
Pazzani, M. J., Muramatsu, J.,& Billsus, D. (1996). Syskill&Webert: Identifying interesting Web sites. In Proceedings of the Thirteenth National Conference on Artificial Intelligence (pp. 54–59). Portland, OR: AAAI Press.
Google Scholar
Porter, M. F. (1980). An algorithm for suffix stripping. Program, 14(3), 130–137.
Google Scholar
Quinlan, J. R. (1990). Learning logical definitions from relations. Machine Learning, 5, 239–266.
Google Scholar
Quinlan, J. R.& Cameron-Jones, R. M. (1993). FOIL: A midterm report. In Proceedings of the Fifth European Conference on Machine Learning (pp. 3–20). Vienna, Austria: Springer-Verlag.
Google Scholar
Richards, B.& Mooney, R. (1992). Learning relations by pathfinding. In Proceedings of the Tenth National Conference on Artificial Intelligence (pp. 50–55). San Jose, CA: AAAI Press.
Google Scholar
Silverstein, G.& Pazzani, M. J. (1991). Relational clichés: Constraining constructive induction during relational learning. In Proceedings of the Eighth International Workshop on Machine Learning (pp. 203–207). Evanston, IL: Morgan Kaufmann.
Google Scholar
Soderland, S. (1997). Learning to extract text-based information from the World Wide Web. In Proceedings of the Third International Conference on Knowledge Discovery and Data Mining (pp. 251–254). Newport Beach, CA: AAAI Press.
Google Scholar
Srinivasan, A.& Camacho, R. (1999). Numerical reasoning with an ILP system capable of lazy evaluation and customised search. The Journal of Logic Programming, 40(2/3), 185–213.
Google Scholar
Srinivasan, A., Muggleton, S.,& Bain, M. (1992). Distinguishing exceptions from noise in non-monotonic learning. In Proceedings of the Second International Workshop on Inductive Logic Programming. Tokyo, Japan.
Stahl, I. (1996). Predicate invention in inductive logic programming. In L. DeRaedt (Ed.), Advances in Inductive Logic Programming. Amsterdam, The Netherlands: IOS Press.
Google Scholar
van Rijsbergen, C. J. (1979). Information retrieval. London, England: Butterworths.
Google Scholar
Wrobel, S. (1994). Concept formation during interactive theory revision. Machine Learning, 14(2), 169–191.
Google Scholar
Yang, Y.& Pedersen, J. (1997). A comparative study on feature set selection in text categorization. In Proceedings of the Fourteenth International Conference on Machine Learning (pp. 412–420). Nashville, TN: Morgan Kaufmann.
Google Scholar
Zelle, J. M., Mooney, R. J.,& Konvisser, J. B. (1994). Combining top-down and bottom-up techniques in inductive logic programming. In Proceedings of the Eleventh International Conference on Machine Learning (pp. 343 351). Rutgers, NJ: Morgan Kaufmann.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Biostatistics & Medical Informatics, University of Wisconsin, Madison, WI, 53706, USA
Mark Craven
School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, 15213, USA
Seán Slattery

Authors

Mark Craven
View author publications
You can also search for this author in PubMed Google Scholar
Seán Slattery
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Craven, M., Slattery, S. Relational Learning with Statistical Predicate Invention: Better Models for Hypertext. Machine Learning 43, 97–119 (2001). https://doi.org/10.1023/A:1007676901476

Download citation

Issue Date: April 2001
DOI: https://doi.org/10.1023/A:1007676901476

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Relational Learning with Statistical Predicate Invention: Better Models for Hypertext

Abstract

Article PDF

Similar content being viewed by others

Lazy and Eager Relational Learning Using Graph-Kernels

Web as a Corpus: Going Beyond the n-gram

Teaching Machine Learning: A Geometric View of Naïve Bayes

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Navigation

Relational Learning with Statistical Predicate Invention: Better Models for Hypertext

Abstract

Article PDF

Similar content being viewed by others

Lazy and Eager Relational Learning Using Graph-Kernels

Web as a Corpus: Going Beyond the n-gram

Teaching Machine Learning: A Geometric View of Naïve Bayes

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation