Languages as Hyperplanes: Grammatical Inference with String Kernels

Clark, Alexander; Florêncio, Christophe Costa; Watkins, Chris

doi:10.1007/11871842_13

Alexander Clark²¹,
Christophe Costa Florêncio²¹ &
Chris Watkins²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4212))

Included in the following conference series:

European Conference on Machine Learning

5530 Accesses
7 Citations

Abstract

Using string kernels, languages can be represented as hyperplanes in a high dimensional feature space. We present a new family of grammatical inference algorithms based on this idea. We demonstrate that some mildly context sensitive languages can be represented in this way and it is possible to efficiently learn these using kernel PCA. We present some experiments demonstrating the effectiveness of this approach on some standard examples of context sensitive languages using small synthetic data sets.

Download to read the full chapter text

Chapter PDF

Distributional Models for Lexical Semantics: An Investigation of Different Representations for Natural Language Learning

The Strong, Weak, and Very Weak Finite Context and Kernel Properties

Selecting and Weighting N-Grams to Identify 1100 Languages

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Bach, E.: Discontinuous constituents in generalized categorial grammars. North East Linguistics Society (NELS 11), 1–12 (1981)
Google Scholar
Becker, T., Rambow, O., Niv, M.: The Derivational Generative Power of Formal Systems or Scrambling is Beyond LCFRS (Technical Report 92–38). Institute For Research in Cognitive Science, University of Pennsylvania (1992)
Google Scholar
Chalup, S., Blair, A.D.: Hill climbing in recurrent neural networks for learning the a ⁿ b ⁿ c ⁿ language. In: Proceedings of the Sixth International Conference on Neural Information Processing, pp. 508–513 (1999)
Google Scholar
Clark, A., Costa Florêncio, C., Watkins, C., Serayet, M.: Planar languages and learnability. In: International Colloquium on Grammatical Inference (ICGI), Tokyo (to appear, 2006)
Google Scholar
Gentner, T.Q., Fenn, K.M., Margoliash, D., Nusbaum, H.C.: Recursive syntactic pattern learning by songbirds. Nature 440, 1204–1207 (2006)
Article Google Scholar
Kearns, M., Valiant, G.: Cryptographic limitations on learning boolean formulae and finite automata. In: 21st annual ACM symposium on Theory of computation, pp. 433–444. ACM Press, New York (1989)
Google Scholar
Lodhi, H., Saunders, C., Shawe-Taylor, J., Cristianini, N., Watkins, C.: Text classification using string kernels. JMLR 2, 419–444 (2002)
Article MATH Google Scholar
Parikh, R.J.: On context-free languages. Journal of the ACM 13, 570–581 (1966)
Article MATH MathSciNet Google Scholar
Radzinski, D.: Chinese number-names, tree adjoining languages, and mild context-sensitivity. Comput. Linguist. 17, 277–299 (1991)
Google Scholar
Salomaa, A.: On languages defined by numerical parameters (Technical Report 663). Turku Centre for Computer Science (2005)
Google Scholar
Schölkopf, B., Smola, A.K. M.: Nonlinear component analysis as a kernel eigenvalue problem. Neural Computation 10 (1998)
Google Scholar
Shawe-Taylor, J., Christianini, N.: Kernel methods for pattern analysis. Cambridge University Press, Cambridge (2004)
Google Scholar
Shieber, S.M.: Evidence against the context-freeness of natural language. Linguistics and Philosophy 8, 333–343 (1985)
Article Google Scholar
Starkie, B., Coste, F., van Zaanen, M.: The Omphalos context-free grammar learning competition. In: International Colloquium on Grammatical Inference, Athens, Greece, pp. 16–27 (2004)
Google Scholar
Vijay-Shanker, K., Weir, D.J., Joshi, A.K.: Characterizing structural descriptions produced by various grammatical formalisms. In: Proceedings of the 25th annual meeting on Association for Computational Linguistics, pp. 104–111. Association for Computational Linguistics, Morristown (1987)
Chapter Google Scholar
Watkins, C.: Dynamic alignment kernels. In: Smola, A.J., Bartlette, P.L., Schölkopf, B., Schuurmans, D. (eds.) Advances in large margin classifiers, pp. 39–50. MIT Press, Cambridge (2000)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of London, Royal Holloway, Egham, TW20 0EX
Alexander Clark, Christophe Costa Florêncio & Chris Watkins

Authors

Alexander Clark
View author publications
You can also search for this author in PubMed Google Scholar
Christophe Costa Florêncio
View author publications
You can also search for this author in PubMed Google Scholar
Chris Watkins
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Knowledge Engineering Group, Technische Universität Darmstadt,
Johannes Fürnkranz
Max Planck Institute for Computer Science, Saarbrücken, Germany
Tobias Scheffer
Faculty of Computer Science, Otto-von-Guericke-University Magdeburg, Germany
Myra Spiliopoulou

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Clark, A., Florêncio, C.C., Watkins, C. (2006). Languages as Hyperplanes: Grammatical Inference with String Kernels. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds) Machine Learning: ECML 2006. ECML 2006. Lecture Notes in Computer Science(), vol 4212. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11871842_13

Download citation

DOI: https://doi.org/10.1007/11871842_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45375-8
Online ISBN: 978-3-540-46056-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Languages as Hyperplanes: Grammatical Inference with String Kernels

Abstract

Chapter PDF

Similar content being viewed by others

Distributional Models for Lexical Semantics: An Investigation of Different Representations for Natural Language Learning

The Strong, Weak, and Very Weak Finite Context and Kernel Properties

Selecting and Weighting N-Grams to Identify 1100 Languages

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Languages as Hyperplanes: Grammatical Inference with String Kernels

Abstract

Chapter PDF

Similar content being viewed by others

Distributional Models for Lexical Semantics: An Investigation of Different Representations for Natural Language Learning

The Strong, Weak, and Very Weak Finite Context and Kernel Properties

Selecting and Weighting N-Grams to Identify 1100 Languages

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation