Skip to main content

Learning Tree Languages from Text

  • Conference paper
  • First Online:
Computational Learning Theory (COLT 2002)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2375))

Included in the following conference series:

Abstract

We study the problem of learning regular tree languages from text. We show that the framework of function distinguishability as introduced in our ALT 2000 paper is generalizable from the case of string languages towards tree languages, hence providing a large source of identifiable classes of regular tree languages. Each of these classes can be characterized in various ways. Moreover, we present a generic inference algorithm with polynomial update time and prove its correctness. In this way, we generalize previous works of Angluin, Sakakibara and ourselves. Moreover, we show that this way all regular tree languages can be identified approximately.

Most of the work was done while the author was with Wilhelm-Schickard-Institut für Informatik, Universität Tübingen, Sand 13, D-72076 Tübingen, Germany.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. D. Angluin. Inductive inference of formal languages from positive data. Information and Control, 45:117–135, 1980.

    Article  MATH  MathSciNet  Google Scholar 

  2. D. Angluin. Inference of reversible languages. Journal of the Association for Computing Machinery, 29(3):741–765, 1982.

    MATH  MathSciNet  Google Scholar 

  3. M. Bernard and C. de la Higuera. GIFT: Grammatical Inference For Terms. International Conference on Inductive Logic Programming ILP. Late Breaking Paper, 1999. French journal version: Apprentissage de programmes logiques par inférence grammaticale. Revue d’Intelligence Artificielle, 14:375–396, 2001.

    Google Scholar 

  4. J. Besombes and J.-Y. Marion. Identification of reversible dependency tree languages. In L. Popelínský and M. Nepil, editors, Proceedings of 3rd Workshop on Learning Languages in Logic LLL’01, pages 11–22, 2001.

    Google Scholar 

  5. S. Crespi-Reghizzi, M. A. Melkanoff, and L. Lichten. The use of grammatical inference for designing programming languages. Communications of the ACM, 16:83–90, 1972.

    Article  MathSciNet  Google Scholar 

  6. L. F. Fass. Learning context-free languages from their structured sentences. SIGACT News, 15(3):24–35, 1983.

    Article  Google Scholar 

  7. H. Fernau. Identification of function distinguishable languages. In H. Arimura, S. Jain, and A. Sharma, editors, Proceedings of the 11th International Conference Algorithmic Learning Theory ALT, volume 1968 of LNCS/LNAI, pages 116–130. Springer, 2000.

    Chapter  Google Scholar 

  8. H. Fernau. Learning of terminal distinguishable languages. In Proceedings AMAI 2000, see http://rutcor.rutgers.edu/~amai/aimath00/AcceptedCont.htm.

  9. H. Fernau. Approximative learning of regular languages. In L. Pacholski and P. Ružička, editors, SOFSEM; Theory and Practice of Informatics, volume 2234 of LNCS, pages 223–232. Springer, 2001.

    Google Scholar 

  10. H. Fernau. Learning tree languages from text. Technical Report WSI-2001-19, Universität Tübingen (Germany), Wilhelm-Schickard-Institut für Informatik, 2001.

    Google Scholar 

  11. H. Fernau and A. Radl. Algorithms for learning function distinguishable regular languages. In Statistical and Syntactical Methods of Pattern Recognition SPR+SSPR, to appear in the LNCS series. Springer, 2002.

    Google Scholar 

  12. C. C. Florêncio. Consistent identification in the limit of any of the classes k-valued is NP-hard. In Proceedings of the Conference on Logical Aspects of Computational Linguistics LACL, volume 2099 of LNCS/LNAI, pages 125–138. Springer, 2001.

    Chapter  Google Scholar 

  13. E. M. Gold. Language identification in the limit. Information and Control, 10:447–474, 1967.

    Article  MATH  Google Scholar 

  14. R. C. Gonzalez and M. G. Thomason. Syntactic Pattern Recognition; An Introduction. Addison-Wesley, 1978.

    Google Scholar 

  15. C. de la Higuera. Current trends in grammatical inference. In F. J. Ferri et al., editors, Advances in Pattern Recognition, Joint IAPR International Workshops SSPR+SPR, volume 1876 of LNCS, pages 28–31. Springer, 2000.

    Chapter  Google Scholar 

  16. M. Kanazawa. Learnable Classes of Categorial Grammars. CSLI, 1998.

    Google Scholar 

  17. T. Knuutila. How to invent characterizable methods for regular languages. In K. P. Jantke et al., editors, 4th Workshop on Algorithmic Learning Theory ALT, volume 744 of LNCS/LNAI, pages 209–222. Springer, 1993.

    Google Scholar 

  18. T. Knuutila and M. Steinby. The inference of tree languages from finite samples: an algebraic approach. Theoretical Computer Science, 129:337–367, 1994.

    Article  MATH  MathSciNet  Google Scholar 

  19. S. Kobayashi and T. Yokomori. Learning approximately regular languages with reversible languages. Theoretical Computer Science, 174(1–2):251–257, 1997.

    Article  MATH  MathSciNet  Google Scholar 

  20. D. López and S. España. Error correcting tree language inference. Pattern Recognition Letters, 23:1–12, 2002.

    Article  Google Scholar 

  21. D. López and I. Piñaga. Syntactic pattern recognition by error correcting analysis on tree automata. In F. J. Ferri et al., editors, Advances in Pattern Recognition, Joint IAPR International Workshops SSPR+SPR, volume 1876 of LNCS, pages 133–142. Springer, 2000.

    Chapter  Google Scholar 

  22. V. Radhakrishnan and G. Nagaraja. Inference of regular grammars via skeletons. IEEE Transactions on Systems, Man and Cybernetics, 17(6):982–992, 1987.

    MathSciNet  Google Scholar 

  23. J. R. Rico-Juan, J. Calera-Rubio, and R. C. Carrasco. Probabilistic k-testable tree languages. In A. L. Oliveira, editor, Grammatical Inference: Algorithms and Applications, 5th International Colloquium, ICGI, volume 1891 of LNCS/LNAI, pages 221–228. Springer, 2000.

    Google Scholar 

  24. G. Rozenberg and A. Salomaa, editors. Handbook of Formal Languages, Volume III. Berlin: Springer, 1997.

    MATH  Google Scholar 

  25. Y. Sakakibara. Learning context-free grammars from structural data in polynomial time. Theoretical Computer Science, 76:223–242, 1990.

    Article  MathSciNet  MATH  Google Scholar 

  26. Y. Sakakibara. Efficient learning of context-free grammars from positive structural examples. Information and Computation, 97(1):23–60, March 1992.

    Google Scholar 

  27. Y. Sakakibara and H. Muramatsu. Learning context-free grammars from partially structured examples. In A. L. Oliveira, editor, Grammatical Inference: Algorithms and Applications, 5th International Colloquium, ICGI, volume 1891 of LNCS/LNAI, pages 229–240. Springer, 2000.

    Google Scholar 

  28. Y. Takada and T. Y. Nishida. A note on grammatical inference of slender context-free languages. In L. Miclet and C. de la Higuera, editors, Proceedings of the Third International Colloquium on Grammatical Inference ICGI: Learning Syntax from Sentences, volume 1147 of LNCS/LNAI, pages 117–125. Springer, 1996.

    Chapter  Google Scholar 

  29. H. Volger. Grammars with generalized contextfree rules and their tree automata. In Proceedings of Computational Linguists in the Netherlands Meeting CLIN; Selected Papers, pages 223–233. see http://www-uilots.let.uu.nl/publications/clin1999/papers.html, 1999.

  30. T. Yokomori. Inductive inference of context-free languages based on context-free expressions. International Journal of Computer Mathematics, 24:115–140, 1988.

    Article  MATH  Google Scholar 

  31. T. Yokomori. Polynomial-time learning of very simple grammars from positive data. In Proceedings 4th Workshop on Computational Learning Theory COLT, pages 213–227, San Mateo, CA, 1991. Morgan Kaufmann.

    Google Scholar 

  32. T. Yokomori. On learning systolic languages. In K. P. Jantke, S. Doshita, K. Furukawa, and T. Nishida, editors, Proceedings of the 3rd Workshop on Algorithmic Learning Theory ALT, volume 743 of LNCS/LNAI, pages 41–52. Springer, 1992.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Fernau, H. (2002). Learning Tree Languages from Text. In: Kivinen, J., Sloan, R.H. (eds) Computational Learning Theory. COLT 2002. Lecture Notes in Computer Science(), vol 2375. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45435-7_11

Download citation

  • DOI: https://doi.org/10.1007/3-540-45435-7_11

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-43836-6

  • Online ISBN: 978-3-540-45435-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics