Skip to main content

On Dependency Analysis via Contractions and Weighted FSTs

  • Chapter
Book cover Shall We Play the Festschrift Game?
  • 494 Accesses

Abstract

Arc contractions in syntactic dependency graphs can be used to decide which graphs are trees. The paper observes that these contractions can be expressed with weighted finite-state transducers (weighted FST) that operate on string-encoded trees. The observation gives rise to a finite-state parsing algorithm that computes the parse forest and extracts the best parses from it. The algorithm is customizable to functional and bilexical dependency parsing, and it can be extended to non-projective parsing via a multi-planar encoding with prior results on high recall. Our experiments support an analysis of projective parsing according to which the worst-case time complexity of the algorithm is quadratic to the sentence length, and linear to the overlapping arcs and the number of functional categories of the arcs. The results suggest several interesting directions towards efficient and high-precision dependency parsing that takes advantage of the flexibility and the demonstrated ambiguity-packing capacity of such a parser.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    It would be interesting to study how the minorization relation compares with the derivation relation of tree adjoining grammars. In both cases, the derived tree is manipulated from inside.

  2. 2.

    This article is published on the occasion of Professor Lauri Carlson’s birthday. As he co-supervised my Ph.D. research together with Kimmo Koskenniemi a decade ago, it is now a great privilege for me to write about these new advances in the research area where we started together.

  3. 3.

    The tree is drawn with the xdag.sty package written by Denys Duchier, Ralph Debusmann and Robert Grabowski. For convenience, the orientation of the tree is flipped in the context of the linguistic example that is typeset with expex.sty.

  4. 4.

    This definition excludes arc deletion that is normally included in the definition of graph minors.

  5. 5.

    These minorization and “majorization” phases could be compared to the forward and backward procedures used in trellis algorithms for Hidden Markov Models.

References

  • Abney, Steven. 1996. Partial parsing via finite state cascades. In Proceedings of the ESSLLI’96 robust parsing workshop, Prague, Czech Republic.

    Google Scholar 

  • Debusmann, Ralph, Denys Duchier, and Geert-Jan M. Kruijff. 2004. Extensible dependency grammar: A new methodology. In Proceedings of the COLING 2004 workshop of recent advances in dependency grammar, Geneva, Switzerland, eds. Geert-Jan M. Kruijff and Denys Duchier, 78–84.

    Google Scholar 

  • Didriksen, Tino. 2010. Constraint grammar manual: 3rd version of the CG formalism variant. GrammarSoft ApS, Denmark. http://beta.visl.sdu.dk/cg3/vislcg3.pdf.

  • Eisner, Jason. 1997. Bilexical grammars and a cubic-time probabilistic parser. In Proceedings of the 4th international workshop on parsing technologies, 54–65. Cambridge: MIT.

    Google Scholar 

  • Elworthy, David. 2000. A finite state parser with dependency structure output. In Proceedings of sixth international workshop on parsing technologies (IWPT 2000). Trento: Institute for Scientific and Technological Research.

    Google Scholar 

  • Gómez-Rodríguez, Carlos, and Joakim Nivre. 2010. A transition-based parser for 2-planar dependency structures. In Proceedings of the 48th annual meeting of the Association for Computational Linguistics (ACL 2010), Uppsala, Sweden, 1492–1501.

    Google Scholar 

  • Gómez-Rodríguez, Carlos, David Weir, and John Carroll. 2009. Parsing mildly non-projective dependency structures. In Proceedings of the 12th conference of the European chapter of the Association for Computational Linguistics (EACL 2009), 291–299.

    Google Scholar 

  • Hulden, Mans. 2011. Constraint Grammar parsing with left and right sequential finite transducers. In Proceedings of the 9th international workshop on finite state methods and natural language processing (FSMNLP 2011), 39–47. Blois: Association for Computational Linguistics. http://www.aclweb.org/anthology/W11-4406.

    Google Scholar 

  • Joshi, Aravind K. 1996. A parser from antiquity: an early application of finite state transducers to natural language parsing. In Proceedings of the ECAI’96 workshop on extended finite state models of language, ed. András Kornai. Studies in natural language processing, 33–34. Cambridge: Cambridge University Press.

    Google Scholar 

  • Koskenniemi, Kimmo, Pasi Tapanainen, and Atro Voutilainen. 1992. Compiling and using finite-state syntactic rules. In Proceedings of the 14th COLING 1992 conference, Nantes, France. Vol. 1, 156–162.

    Google Scholar 

  • Kuhlmann, Marco. 2010. In Dependency structures and lexicalized grammars. An algebraic approach, Vol. 6270 of Lecture notes in artificial intelligence, FoLLI publications on logic, language and information. Springer: Berlin.

    Chapter  Google Scholar 

  • Kuhlmann, Marco, and Giorgio Satta. 2009. Treebank grammar techniques for non-projective dependency parsing. In Proceedings of the 12th conference of the European chapter of the Association for Computational Linguistics (EACL’09), 478–486.

    Google Scholar 

  • Lombardo, Vincenzo, and Leonardo Lesmo. 1996. An Earley-type recognizer for dependency grammar. In Proceedings of the 16th COLING conference, Copenhagen, Denmark, Vol. 2, 723–728.

    Google Scholar 

  • Maruyama, Hiroshi. 1990. Structural disambiguation with constraint propagation. In Proceedings of the 28th ACL 1989 conference, Pittsburgh, Pennsylvania, 31–38.

    Google Scholar 

  • Mohri, Mehryar. 2009. Weighted automata algorithms. In Handbook of weighted automata, eds. Manfred Droste, Werner Kuich, and Heiko Vogler, 213–254. Berlin: Springer.

    Chapter  Google Scholar 

  • Neuhaus, Peter, and Norbert Bröker. 1997. The complexity of recognition of linguistically adequate dependency grammars. In Proceedings of the 35th annual meeting of the Association for Computational Linguistics and the 8th conf. of the European chapter of the Association for Computational Linguistics, Madrid, Spain, 337–343.

    Chapter  Google Scholar 

  • Nivre, Joakim. 2006. Constraints on non-projective dependency parsing. In Proceedings of the 11th conference of the European chapter of the Association for Computational Linguistics (EACL 2006), 73–80.

    Google Scholar 

  • Nivre, Joakim. 2008. Algorithms for deterministic incremental dependency parsing. Computational Linguistics 34: 513–553.

    Article  MathSciNet  Google Scholar 

  • Oflazer, Kemal. 2003. Dependency parsing with an extended finite-state approach. Computational Linguistics 29: 515–544.

    Article  MathSciNet  Google Scholar 

  • Plátek, Martin, Markéta Lopatková, and Karel Oliva. 2003. Restarting automata: motivations and applications. In Workshop ‘Petrinetze’ and 13. Theorietag ‘Formale Sprachen und Automaten’, ed. M. Holzer, 90–96. Institut für Informatik, Technische Universität München.

    Google Scholar 

  • Roche, Emmanuel. 1997. Parsing with finite-state transducers. In Finite-state language processing, eds. Emmanuel Roche and Yves Schabes, 241–281. Cambridge: MIT Press. Chap. 8.

    Google Scholar 

  • Tapanainen, Pasi. 1999. Parsing in two frameworks: finite-state and functional dependency grammar. Doctoral diss., University of Helsinki, Finland.

    Google Scholar 

  • Tesnière, Lucien. 1959. Éléments de syntaxe structurale. Paris: Éditions Klincksieck.

    Google Scholar 

  • Voutilainen, Atro. 1994. Designing a parsing grammar, Vol. 22 of Publications of the Department of General Linguistics, University of Helsinki. Helsinki: Yliopistopaino.

    Google Scholar 

  • Yli-Jyrä, Anssi Mikael. 2003. Multiplanarity—a model for dependency structures in treebanks. In TLT 2003. Proceedings of the second workshop on treebanks and linguistic theories, eds. Joakim Nivre and Erhard Hinrichs, Vol. 9 of Mathematical modelling in physics, engineering and cognitive sciences, 189–200. Växjö: Växjö University Press.

    Google Scholar 

  • Yli-Jyrä, Anssi. 2004. Axiomatization of restricted non-projective dependency trees through finite-state constraints that analyse crossing bracketings. In Proceedings of the COLING 2004 workshop of recent advances in dependency grammar, Geneva, Switzerland, 33–40.

    Google Scholar 

  • Yli-Jyrä, Anssi. 2005. Approximating dependency grammars through intersection of star-free regular languages. International Journal of Foundations of Computer Science 16: 565–579.

    Article  MathSciNet  MATH  Google Scholar 

  • Yli-Jyrä, Anssi. 2008. Transducers from parallel replace rules and modes with generalized lenient composition. In 6th international workshop, finite-state methods and natural language processing, FSMNLP-2007. Revised papers, 197–212. Potsdam: Potsdam University Press.

    Google Scholar 

  • Yli-Jyrä, Anssi. 2011. An efficient constraint grammar parser based on inward deterministic automata. In Proceedings of the NODALIDA 2011 workshop constraint grammar applications, eds. Eckhard Bick, Kristin Hagen, Kaili Müürisep, and Trond Trosterud, Vol. 14 of NEALT proceedings series, 50–60.

    Google Scholar 

  • Yli-Jyrä, Anssi, and Kimmo Koskenniemi. 2004. Compiling contextual restrictions on strings into finite-state automata. In Proceedings of the Eindhoven FASTAR days 2004, eds. Loek Cleophas and Bruce W. Watson, Vol. 04-40 of Computer science reports. Eindhoven: Technische Universiteit Eindhoven.

    Google Scholar 

  • Yli-Jyrä, Anssi, and Matti Nykänen. 2004. A hierarchy of mildly context sensitive dependency grammars. In Proceedings of the 9th conference on formal grammar (FGNancy 2004), eds. Gerald Penn, Gerhard Jäger, Paola Monachesi, and Shuly Wintner, 151–165.

    Google Scholar 

Download references

Acknowledgements

The research has been made possible by the Academy of Finland grant number 128536 “Open and Language Independent Automata-Based Resource Production Methods for Common Language Research Infrastructure”, and, more recently, by the FIN-CLARIN project steered by Krister Lindén. Kimmo Koskenniemi, Pasi Tapanainen, Atro Voutilainen and Lauri Carlson supported my first investigations into contractions in finite-state intersection parsing since 1995. More recently, my thinking has benefited from several related discussions with Carlos Gómez-Rodríguez, Jason Eisner, Joakim Nivre, Marco Kuhlmann, and John Hale. During the multi-year creative process, I felt several times need for heavenly empowerment. I look gratefully back to every inspired moment.

I am indebted to the prior anonymous reviewers of the CIAA 2011 and FSMNLP 2011 meetings, as well as Aarne Ranta, Wanjiku Nganga, Jussi Piitulainen, and Miikka Silfverberg for their valuable comments pointing out many areas for further study. The remaining imperfections in the text are mine, of course.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Anssi Yli-Jyrä .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Yli-Jyrä, A. (2012). On Dependency Analysis via Contractions and Weighted FSTs. In: Santos, D., Lindén, K., Ng’ang’a, W. (eds) Shall We Play the Festschrift Game?. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-30773-7_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-30773-7_10

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-30772-0

  • Online ISBN: 978-3-642-30773-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics