A Spectral Approach for Probabilistic Grammatical Inference on Trees

Bailly, Raphaël; Habrard, Amaury; Denis, François

doi:10.1007/978-3-642-16108-7_10

Raphaël Bailly²³,
Amaury Habrard²³ &
François Denis²³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6331))

Included in the following conference series:

International Conference on Algorithmic Learning Theory

1164 Accesses
3 Citations

Abstract

We focus on the estimation of a probability distribution over a set of trees. We consider here the class of distributions computed by weighted automata - a strict generalization of probabilistic tree automata. This class of distributions (called rational distributions, or rational stochastic tree languages - RSTL) has an algebraic characterization: All the residuals (conditional) of such distributions lie in a finite-dimensional vector subspace. We propose a methodology based on Principal Components Analysis to identify this vector subspace. We provide an algorithm that computes an estimate of the target residuals vector subspace and builds a model which computes an estimate of the target distribution.

This work was partially supported by the ANR LAMPADA ANR-09-EMER-007 project and by the IST Programme of the European Community, under the PASCAL2 Network of Excellence, IST-2007-216886. This publication only reflects the authors’ views.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Carrasco, R., Oncina, J., Calera-Rubio, J.: Stochastic inference of regular tree languages. Machine Learning 44, 185–197 (2001)
Article MATH Google Scholar
Denis, F., Esposito, Y.: On rational stochastic languages. Fundamenta Informaticae 86, 41–77 (2008)
MATH MathSciNet Google Scholar
Denis, F., Habrard, A.: Learning rational stochastic tree languages. In: Hutter, M., Servedio, R.A., Takimoto, E. (eds.) ALT 2007. LNCS (LNAI), vol. 4754, pp. 242–256. Springer, Heidelberg (2007)
Chapter Google Scholar
Denis, F., Gilbert, E., Habrard, A., Ouardi, F., Tommasi, M.: Relevant representations for the inference of rational stochastic tree languages. In: Grammatical Inference: Algorithms and Applications, 9th International Colloquium, pp. 57–70. Springer, Heidelberg (2008)
Chapter Google Scholar
Bailly, R., Denis, F., Ralaivola, L.: Grammatical inference as a principal component analysis problem. In: Proceedings of the 26th International Conference on Machine Learning, Montréal, Canada, pp. 33–40. Omnipress (2009)
Google Scholar
Clark, A., Costa Florêncio, C., Watkins, C.: Languages as hyperplanes: grammatical inference with string kernels. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol. 4212, pp. 90–101. Springer, Heidelberg (2006)
Chapter Google Scholar
Hsu, D., Kakade, S., Zhang, T.: A spectral algorithm for learning hidden markov models. In: Proceedings of COLT 2009, Springer, Heidelberg (2009)
Google Scholar
Hsu, D., Kakade, S., Zhang, T.: A spectral algorithm for learning hidden markov models. Technical report, Arxiv archive (2009), http://arxiv.org/abs/0811.4413
Denis, F., Esposito, Y.: Learning classes of probabilistic automata. In: Shawe-Taylor, J., Singer, Y. (eds.) COLT 2004. LNCS (LNAI), vol. 3120, pp. 124–139. Springer, Heidelberg (2004)
Chapter Google Scholar
Comon, H., Dauchet, M., Gilleron, R., Jacquemard, F., Lugiez, D., Löding, C., Tison, S., Tommasi, M.: Tree automata techniques and applications (2007), http://tata.gforge.inria.fr/ (release October 12, 2007)
Berstel, J., Reutenauer, C.: Recognizable formal power series on trees. Theorical computer science 18, 115–148 (1982)
Article MATH MathSciNet Google Scholar
Högberg, J., Maletti, A., Vogler, H.: Bisimulation minimisation of weighted automata on unranked trees. Fundam. Inform. 92(1-2), 103–130 (2009)
MATH Google Scholar
Borchardt, B.: The Theory of Recognizable Tree Series. PhD thesis, TU Dresden (2004)
Google Scholar
McDiarmid, C.: On the method of bounded differences. In: Surveys in Combinatorics, pp. 148–188. Cambridge University Press, Cambridge (1989)
Google Scholar
Stewart, G., Sun, J.G.: Matrix Perturbation Theory. Academic Press, London (1990)
MATH Google Scholar
Zwald, L., Blanchard, G.: On the convergence of eigenspaces in kernel principal component analysis. In: Proceedings of NIPS 2005 (2006)
Google Scholar
Shawe-Taylor, J., Cristianini, N., Kandola, J.: On the concentration of spectral properties. In: Proc. of NIPS, vol. 14, pp. 511–517. MIT Press, Cambridge (2001)
Google Scholar
Smola, A., Gretton, A., Song, L., Schölkopf, B.: A hilbert space embedding for distributions. In: 18th International Conference on Algorithmic Learning Theory, pp. 13–31. Springer, Heidelberg (2007)
Chapter Google Scholar
Song, L., Boots, B., Saddiqi, S., Gordon, G., Smola, A.: Hilbert space embeddings of Hidden Markov Models. In: Proceedings of ICML 2010 (2010)
Google Scholar

Download references

Author information

Authors and Affiliations

Laboratoire d’Informatique Fondamentale de Marseille, UMR CNRS 6166, Aix-Marseille Université CMI, 39 rue F. Joliot Curie, 13453, Marseille cedex 13, France
Raphaël Bailly, Amaury Habrard & François Denis

Authors

Raphaël Bailly
View author publications
You can also search for this author in PubMed Google Scholar
Amaury Habrard
View author publications
You can also search for this author in PubMed Google Scholar
François Denis
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Research School of Information Sciences and Engineering, Australian National University and NICTA, 0200, Canberra, ACT, Australia
Marcus Hutter
Department of Mathematics, National University of Singapore, Block S17, 10 Lower Kent Ridge Road, 119076, Singapore, Republic of Singapore
Frank Stephan
Department of Computer Science, University of London, Royal Holloway, TW20 0EX, Egham, Surrey, UK
Vladimir Vovk
Division of Computer Science, Hokkaido University, , ,, N-14, W-9, Sapporo, 060-0814, Japan
Thomas Zeugmann

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bailly, R., Habrard, A., Denis, F. (2010). A Spectral Approach for Probabilistic Grammatical Inference on Trees. In: Hutter, M., Stephan, F., Vovk, V., Zeugmann, T. (eds) Algorithmic Learning Theory. ALT 2010. Lecture Notes in Computer Science(), vol 6331. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16108-7_10

Download citation

DOI: https://doi.org/10.1007/978-3-642-16108-7_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16107-0
Online ISBN: 978-3-642-16108-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics