Abstract
In this paper, we provide a study on the use of tree kernels to encode syntactic parsing information in natural language learning. In particular, we propose a new convolution kernel, namely the Partial Tree (PT) kernel, to fully exploit dependency trees. We also propose an efficient algorithm for its computation which is futhermore sped-up by applying the selection of tree nodes with non-null kernel. The experiments with Support Vector Machines on the task of semantic role labeling and question classification show that (a) the kernel running time is linear on the average case and (b) the PT kernel improves on the other tree kernels when applied to the appropriate parsing paradigm.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Jackendoff, R.: Semantic Structures, Current Studies in Linguistics series. The MIT Press, Cambridge (1990)
Collins, M., Duffy, N.: New ranking algorithms for parsing and tagging: Kernels over discrete structures, and the voted perceptron. In: ACL (2002)
Zelenko, D., Aone, C., Richardella, A.: Kernel methods for relation extraction. JMLR (2003)
Cumby, C., Roth, D.: Kernel methods for relational learning. In: ICML (2003)
Culotta, A., Sorensen, J.: Dependency tree kernels for relation extraction. In: Proceedings ACL, Barcelona, Spain (2004)
Moschitti, A.: A study on convolution kernels for shallow semantic parsing. In: Proceedings of ACL, Barcelona, Spain (2004)
Vishwanathan, S., Smola, A.: Fast kernels on strings and trees. In: Proceedings of NIPS (2002)
Kingsbury, P., Palmer, M.: From Treebank to PropBank. In: Proceedings of LREC, Las Palmas, Spain (2002)
Fillmore, C.J.: Frame semantics. In: Linguistics in the Morning Calm. (1982)
Marcus, M.P., Santorini, B., Marcinkiewicz, M.A.: Building a large annotated corpus of english: The Penn Treebank. CLJ (1993)
Collins, M.: Three generative, lexicalized models for statistical parsing. In: Proceedings of the ACL, Somerset, New Jersey (1997)
Klein, D., Manning, C.D.: Fast exact inference with a factored model for natural language parsing. In: NIPS (2002)
Shawe-Taylor, J., Cristianini, N.: Kernel Methods for Pattern Analysis. Cambridge University Press, Cambridge (2004)
Haussler, D.: Convolution kernels on discrete structures. Technical report ucs-crl-99-10, University of California, Santa Cruz (1999)
Gildea, D., Jurasfky, D.: Automatic labeling of semantic roles. CLJ (2002)
Zhang, D., Lee, W.S.: Question classification using support vector machines. In: Proceedings of SIGIR (2003)
Li, X., Roth, D.: Learning question classifiers: The role of semantic information. JNLE (2005)
Kazama, J., Torisawa, K.: Speeding up training with tree kernels for node relation labeling. In: Proceedings of EMNLP, Toronto, Canada (2005)
Kudo, T., Suzuki, J., Isozaki, H.: Boosting-based parse reranking with subtree features. In: Proceedings ACL 2005 (2005)
Carreras, X., Màrquez, L.: Introduction to the CoNLL-2005 shared task: Semantic role labeling. In: Proceedings of CoNLL-2005 (2005)
Joachims, T.: Making large-scale SVM learning practical. In: Schölkopf, B., Burges, C., Smola, A. (eds.) Advances in Kernel Methods - Support Vector Learning (1999)
Pradhan, S., Hacioglu, K., Krugler, V., Ward, W., Martin, J.H., Jurafsky, D.: Support vector learning for semantic argument classification. MLJ (2005)
Kudo, T., Matsumoto, Y.: Fast methods for kernel-based text analysis. In: Proceedings of ACL (2003)
Suzuki, J., Isozaki, H., Maeda, E.: Convolution kernels with feature selection for natural language processing tasks. In: Proceedings of ACL, Spain (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Moschitti, A. (2006). Efficient Convolution Kernels for Dependency and Constituent Syntactic Trees. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds) Machine Learning: ECML 2006. ECML 2006. Lecture Notes in Computer Science(), vol 4212. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11871842_32
Download citation
DOI: https://doi.org/10.1007/11871842_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45375-8
Online ISBN: 978-3-540-46056-5
eBook Packages: Computer ScienceComputer Science (R0)