On the Regularity and Learnability of Ordered DAG Languages

  • Henrik Björklund
  • Johanna Björklund
  • Petter Ericson
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10329)

Abstract

Order-Preserving DAG Grammars (OPDGs) is a subclass of Hyper-Edge Replacement Grammars that can be parsed in polynomial time. Their associated class of languages is known as Ordered DAG Languages, and the graphs they generate are characterised by being acyclic, rooted, and having a natural order on their nodes. OPDGs are useful in natural-language processing to model abstract meaning representations. We state and prove a Myhill-Nerode theorem for ordered DAG languages, and translate it into a MAT-learning algorithm for the same class. The algorithm infers a minimal OPDG G for the target language in time polynomial in G and the samples provided by the MAT oracle.

References

  1. 1.
    Angluin, D.: Learning regular sets from queries and counterexamples. Inf. Comput. 75, 87–106 (1987)MathSciNetCrossRefMATHGoogle Scholar
  2. 2.
    Banarescu, L., Bonial, C., Cai, S., Georgescu, M., Griffitt, K., Hermjakob, U., Knight, K., Koehn, P., Palmer, M., Schneider, N.: Abstract meaning representation for sembanking. In: 7th Linguistic Annotation Workshop & Interoperability with Discourse, Sofia, Bulgaria (2013)Google Scholar
  3. 3.
    Björklund, H., Björklund, J., Ericson, P.: On the regularity and learnability of ordered DAG languages. Technical report UMINF 17.12, Umeå University (2017)Google Scholar
  4. 4.
    Björklund, H., Drewes, F., Ericson, P.: Between a rock and a hard place - uniform parsing for hyperedge replacement DAG grammars. In: 10th International Conference on Language and Automata Theory and Applications, Prague, Czech Republic, pp. 521–532 (2016)Google Scholar
  5. 5.
    Björklund, J., Fernau, H., Kasprzik, A.: Polynomial inference of universal automata from membership and equivalence queries. Inf. Comput. 246, 3–19 (2016)MathSciNetCrossRefMATHGoogle Scholar
  6. 6.
    Chiang, D., Andreas, J., Bauer, D., Hermann, K.M., Jones, B., Knight, K.: Parsing graphs with hyperedge replacement grammars. In: 51st Annual Meeting of the Association for Computational Linguistics (ACL 2013), Sofia, Bulgaria, pp. 924–932 (2013)Google Scholar
  7. 7.
    Drewes, F., Björklund, J., Maletti, A.: MAT learners for tree series: an abstract data type and two realizations. Acta Informatica 48(3), 165 (2011)MathSciNetCrossRefMATHGoogle Scholar
  8. 8.
    Drewes, F., Högberg, J.: Query learning of regular tree languages: how to avoid dead states. Theory Comput. Syst. 40(2), 163–185 (2007)MathSciNetCrossRefMATHGoogle Scholar
  9. 9.
    Drewes, F., Kreowski, H.-J., Habel, A.: Hyperedge replacement graph grammars. In: Rozenberg, G. (ed.), Handbook of Graph Grammars, vol. 1, pp. 95–162. World Scientific (1997)Google Scholar
  10. 10.
    Drewes, F., Vogler, H.: Learning deterministically recognizable tree series. J. Automata, Lang. Comb 12(3), 332–354 (2007)MathSciNetMATHGoogle Scholar
  11. 11.
    Hara, S., Shoudai, T.: Polynomial time MAT learning of c-deterministic regular formal graph systems. In: International Conference on Advanced Applied Informatics (IIAI AAI 2014), Kitakyushu, Japan, pp. 204–211 (2014)Google Scholar
  12. 12.
    Högberg, J.: A randomised inference algorithm for regular tree languages. Nat. Lang. Eng. 17(02), 203–219 (2011)CrossRefGoogle Scholar
  13. 13.
    Kozen, D.: On the Myhill-Nerode theorem for trees. Bull. EATCS 47, 170–173 (1992)MATHGoogle Scholar
  14. 14.
    Maletti, A.: Learning deterministically recognizable tree series — revisited. In: Bozapalidis, S., Rahonis, G. (eds.) CAI 2007. LNCS, vol. 4728, pp. 218–235. Springer, Heidelberg (2007). doi:10.1007/978-3-540-75414-5_14 CrossRefGoogle Scholar
  15. 15.
    Okada, R., Matsumoto, S., Uchida, T., Suzuki, Y., Shoudai, T.: Exact learning of finite unions of graph patterns from queries. In: Hutter, M., Servedio, R.A., Takimoto, E. (eds.) ALT 2007. LNCS (LNAI), vol. 4754, pp. 298–312. Springer, Heidelberg (2007). doi:10.1007/978-3-540-75225-7_25 CrossRefGoogle Scholar
  16. 16.
    Rozenberg, G., Welzl, E.: Boundary NLC graph grammars-basic definitions, normal forms, and complexity. Inf. Control 69(1–3), 136–167 (1986)MathSciNetCrossRefMATHGoogle Scholar
  17. 17.
    Sakakibara, Y.: Learning context-free grammars from structural data in polynomial time. Theor. Comput. Sci. 76(2–3), 223–242 (1990)MathSciNetCrossRefMATHGoogle Scholar
  18. 18.
    Shirakawa, H., Yokomori, T.: Polynomial-time MAT learning of c-deterministic context-free grammars. Trans. Inf. Process. Soc. Jpn. 34, 380–390 (1993)Google Scholar

Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  • Henrik Björklund
    • 1
  • Johanna Björklund
    • 1
  • Petter Ericson
    • 1
  1. 1.Department of Computing ScienceUmeå UniversityUmeåSweden

Personalised recommendations