Learning Tree Languages from Text

Fernau, Henning

doi:10.1007/3-540-45435-7_11

Henning Fernau³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2375))

Included in the following conference series:

International Conference on Computational Learning Theory

1173 Accesses
8 Citations

Abstract

We study the problem of learning regular tree languages from text. We show that the framework of function distinguishability as introduced in our ALT 2000 paper is generalizable from the case of string languages towards tree languages, hence providing a large source of identifiable classes of regular tree languages. Each of these classes can be characterized in various ways. Moreover, we present a generic inference algorithm with polynomial update time and prove its correctness. In this way, we generalize previous works of Angluin, Sakakibara and ourselves. Moreover, we show that this way all regular tree languages can be identified approximately.

Most of the work was done while the author was with Wilhelm-Schickard-Institut für Informatik, Universität Tübingen, Sand 13, D-72076 Tübingen, Germany.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

D. Angluin. Inductive inference of formal languages from positive data. Information and Control, 45:117–135, 1980.
Article MATH MathSciNet Google Scholar
D. Angluin. Inference of reversible languages. Journal of the Association for Computing Machinery, 29(3):741–765, 1982.
MATH MathSciNet Google Scholar
M. Bernard and C. de la Higuera. GIFT: Grammatical Inference For Terms. International Conference on Inductive Logic Programming ILP. Late Breaking Paper, 1999. French journal version: Apprentissage de programmes logiques par inférence grammaticale. Revue d’Intelligence Artificielle, 14:375–396, 2001.
Google Scholar
J. Besombes and J.-Y. Marion. Identification of reversible dependency tree languages. In L. Popelínský and M. Nepil, editors, Proceedings of 3rd Workshop on Learning Languages in Logic LLL’01, pages 11–22, 2001.
Google Scholar
S. Crespi-Reghizzi, M. A. Melkanoff, and L. Lichten. The use of grammatical inference for designing programming languages. Communications of the ACM, 16:83–90, 1972.
Article MathSciNet Google Scholar
L. F. Fass. Learning context-free languages from their structured sentences. SIGACT News, 15(3):24–35, 1983.
Article Google Scholar
H. Fernau. Identification of function distinguishable languages. In H. Arimura, S. Jain, and A. Sharma, editors, Proceedings of the 11th International Conference Algorithmic Learning Theory ALT, volume 1968 of LNCS/LNAI, pages 116–130. Springer, 2000.
Chapter Google Scholar
H. Fernau. Learning of terminal distinguishable languages. In Proceedings AMAI 2000, see http://rutcor.rutgers.edu/~amai/aimath00/AcceptedCont.htm.
H. Fernau. Approximative learning of regular languages. In L. Pacholski and P. Ružička, editors, SOFSEM; Theory and Practice of Informatics, volume 2234 of LNCS, pages 223–232. Springer, 2001.
Google Scholar
H. Fernau. Learning tree languages from text. Technical Report WSI-2001-19, Universität Tübingen (Germany), Wilhelm-Schickard-Institut für Informatik, 2001.
Google Scholar
H. Fernau and A. Radl. Algorithms for learning function distinguishable regular languages. In Statistical and Syntactical Methods of Pattern Recognition SPR+SSPR, to appear in the LNCS series. Springer, 2002.
Google Scholar
C. C. Florêncio. Consistent identification in the limit of any of the classes k-valued is NP-hard. In Proceedings of the Conference on Logical Aspects of Computational Linguistics LACL, volume 2099 of LNCS/LNAI, pages 125–138. Springer, 2001.
Chapter Google Scholar
E. M. Gold. Language identification in the limit. Information and Control, 10:447–474, 1967.
Article MATH Google Scholar
R. C. Gonzalez and M. G. Thomason. Syntactic Pattern Recognition; An Introduction. Addison-Wesley, 1978.
Google Scholar
C. de la Higuera. Current trends in grammatical inference. In F. J. Ferri et al., editors, Advances in Pattern Recognition, Joint IAPR International Workshops SSPR+SPR, volume 1876 of LNCS, pages 28–31. Springer, 2000.
Chapter Google Scholar
M. Kanazawa. Learnable Classes of Categorial Grammars. CSLI, 1998.
Google Scholar
T. Knuutila. How to invent characterizable methods for regular languages. In K. P. Jantke et al., editors, 4th Workshop on Algorithmic Learning Theory ALT, volume 744 of LNCS/LNAI, pages 209–222. Springer, 1993.
Google Scholar
T. Knuutila and M. Steinby. The inference of tree languages from finite samples: an algebraic approach. Theoretical Computer Science, 129:337–367, 1994.
Article MATH MathSciNet Google Scholar
S. Kobayashi and T. Yokomori. Learning approximately regular languages with reversible languages. Theoretical Computer Science, 174(1–2):251–257, 1997.
Article MATH MathSciNet Google Scholar
D. López and S. España. Error correcting tree language inference. Pattern Recognition Letters, 23:1–12, 2002.
Article Google Scholar
D. López and I. Piñaga. Syntactic pattern recognition by error correcting analysis on tree automata. In F. J. Ferri et al., editors, Advances in Pattern Recognition, Joint IAPR International Workshops SSPR+SPR, volume 1876 of LNCS, pages 133–142. Springer, 2000.
Chapter Google Scholar
V. Radhakrishnan and G. Nagaraja. Inference of regular grammars via skeletons. IEEE Transactions on Systems, Man and Cybernetics, 17(6):982–992, 1987.
MathSciNet Google Scholar
J. R. Rico-Juan, J. Calera-Rubio, and R. C. Carrasco. Probabilistic k-testable tree languages. In A. L. Oliveira, editor, Grammatical Inference: Algorithms and Applications, 5th International Colloquium, ICGI, volume 1891 of LNCS/LNAI, pages 221–228. Springer, 2000.
Google Scholar
G. Rozenberg and A. Salomaa, editors. Handbook of Formal Languages, Volume III. Berlin: Springer, 1997.
MATH Google Scholar
Y. Sakakibara. Learning context-free grammars from structural data in polynomial time. Theoretical Computer Science, 76:223–242, 1990.
Article MathSciNet MATH Google Scholar
Y. Sakakibara. Efficient learning of context-free grammars from positive structural examples. Information and Computation, 97(1):23–60, March 1992.
Google Scholar
Y. Sakakibara and H. Muramatsu. Learning context-free grammars from partially structured examples. In A. L. Oliveira, editor, Grammatical Inference: Algorithms and Applications, 5th International Colloquium, ICGI, volume 1891 of LNCS/LNAI, pages 229–240. Springer, 2000.
Google Scholar
Y. Takada and T. Y. Nishida. A note on grammatical inference of slender context-free languages. In L. Miclet and C. de la Higuera, editors, Proceedings of the Third International Colloquium on Grammatical Inference ICGI: Learning Syntax from Sentences, volume 1147 of LNCS/LNAI, pages 117–125. Springer, 1996.
Chapter Google Scholar
H. Volger. Grammars with generalized contextfree rules and their tree automata. In Proceedings of Computational Linguists in the Netherlands Meeting CLIN; Selected Papers, pages 223–233. see http://www-uilots.let.uu.nl/publications/clin1999/papers.html, 1999.
T. Yokomori. Inductive inference of context-free languages based on context-free expressions. International Journal of Computer Mathematics, 24:115–140, 1988.
Article MATH Google Scholar
T. Yokomori. Polynomial-time learning of very simple grammars from positive data. In Proceedings 4th Workshop on Computational Learning Theory COLT, pages 213–227, San Mateo, CA, 1991. Morgan Kaufmann.
Google Scholar
T. Yokomori. On learning systolic languages. In K. P. Jantke, S. Doshita, K. Furukawa, and T. Nishida, editors, Proceedings of the 3rd Workshop on Algorithmic Learning Theory ALT, volume 743 of LNCS/LNAI, pages 41–52. Springer, 1992.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Software Engineering, University of Newcastle, University Drive, NSW, 2308, Callaghan, Australia
Henning Fernau

Authors

Henning Fernau
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Research School of Information Sciences and Engineering, Australian National University, Canberra, ACT, 0200, Australia
Jyrki Kivinen
Computer Science Department, University of Illinois at Chicago, 851 S. Morgan St., Chicago, IL, 60607, USA
Robert H. Sloan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Fernau, H. (2002). Learning Tree Languages from Text. In: Kivinen, J., Sloan, R.H. (eds) Computational Learning Theory. COLT 2002. Lecture Notes in Computer Science(), vol 2375. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45435-7_11

Download citation

DOI: https://doi.org/10.1007/3-540-45435-7_11
Published: 25 June 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43836-6
Online ISBN: 978-3-540-45435-9
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics