Probabilistic LR-Parsing with Symbolic Postprocessing

  • Tobias Ruland
Part of the Artificial Intelligence book series (AI)

Abstract

This article describes a novel approach to probabilistic LR-parsing of spontaneously spoken utterances developed in Verbmobil. It extends the use of context knowledge within the probabilistic model of the parser and improves its output by applying tree transformation rules learned from corpora. The parser was developed for German, English and Japanese and achieves more than 90% Labeled Recall/Precision on parsed Verbmobil utterances.

Keywords

Context Free Grammar Training Corpus Syntactic Analysis Word Lattice Tree Transformation 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Aho, A.V., Sethi, R., and Ullman, J.D. (1986) Compilers: Principle, Techniques and Tools. Reading, Mass.: Addison Wesley.Google Scholar
  2. Batliner, A., Block, H.-U., Kießling, A., Kompe, R., Niemann, H., Nöth, E., Ruland, T., and Schachtl, S. (1997). Improving Parsing of Spontaneous Speech With the Help of Prosodic Boundaries. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP ’97). München, Germany.Google Scholar
  3. Block, H. U. (1993). Compiling Trace and Unification Grammar for Parsing and Generation. In Strzalkowski, T., ed., Reversible Grammar in Natural Language Processing, 155–174, Boston, Dordrecht, London: Kluwer.Google Scholar
  4. Bod, R. (1995). The Problem of Computing the Most Probable Tree in Data-Oriented Parsing and Stochastic Tree Grammars. In Proceedings of the Seventh Conference of the European Chapter of the ACL. Dublin, 104–111.Google Scholar
  5. Brill, E. (1993a). A Corpus-Based Approach to Language Learning. Ph.D. Dissertation, University of Pennsylvania, Department of Computer and Information Science.Google Scholar
  6. Brill, E. (1993b). Automatic Grammar Induction and Parsing Free Text: A Transformation Based Approach. In Proceedings of the 31st Annual Meeting of the Association for Computational Linguistics. Columbus, Ohio.Google Scholar
  7. Briscoe, T. and Carroll, J. (1993). Generalized Probabilistic LR Parsing of Natural Language (Corpora) With Unification-Based Grammars. Computational Linguistics 19 (1).Google Scholar
  8. Briscoe, T. and Carroll, J. (1996) Apportioning Development Effort in a Probabilistic LR-Parsing System Through Evaluation. In Proceedings of the ACL SIGDAT Conference on Empirical Methods in Natural Language Processing. Philadelphia, PA., 92–100.Google Scholar
  9. Charniak, E. (1993). Statistical Language Learning. Cambridge, Mass.: MIT Press.Google Scholar
  10. Charniak, E. (1997). Statistical Techniques for Natural Language Parsing. AI Magazine. Google Scholar
  11. Collins, M. (1999) Head-Driven Statistical Models for Natural Language Parsing. Ph.D. Dissertation, University of Pennsylvania, Philadelphia.Google Scholar
  12. Good, I.J. (1953) The Population Frequencies of Species and the Estimation of Population Parameters. Biometrika 40(3,4). 237–263.MathSciNetMATHGoogle Scholar
  13. Hermjakob, U. (1997). Learning Parse and Translation Decisions From Examples With Rich Context. Ph.D. Dissertation, University of Texas, Austin, TX.Google Scholar
  14. Hinrichs, E.W., Kübler, S., Kordoni, V., and Müller, F., (a). Robust Chunk Parsing For Spontaneous Speech. In this volume. Google Scholar
  15. Hinrichs, E.W., Bartels, J., Kawata, Y., S., Kordoni, V., and Telljohann, H., (b). The Tübingen Treebanks for Spoken German, English and Japanese. In this volume. Google Scholar
  16. Inui, K., Sornlertlamvanich, V., Tanaka, H., and Tokunaga, T. (1997a). A New Formalization of Probabilistic GLR Parsing. In Proceedings of the International Workshop on Parsing Technologies. Google Scholar
  17. Inui, K., Shirai, K., Sornlertlamvanich, V., Tanaka, H., and Tokunaga, T. (1997b). Empirical Evaluation of Probabilistic GLR Parsing. Natural Language Pacific-Rim Symposium.Google Scholar
  18. Kiefer B., Krieger, H.-U., and Nederhof, M.-J. Efficient and Robust HPSG Parsing of Word Graphs. In this volume. Google Scholar
  19. Lavie, A. (1996). GLR*: A Robust Grammar-Focused Parser for Spontaneously Spoken Language. Ph.D. Dissertation, Carnegie Mellon University, Pittsburgh.Google Scholar
  20. Magerman, D. (1994). Natural Language Parsing as Statistical Pattern Recognition. Ph.D. Dissertation, Stanford University, Stanford, CA.Google Scholar
  21. Marcus, M. P. (1980). A Theory of Syntactic Recognition for Natural Language. Cambridge, Mass.: MIT Press.MATHGoogle Scholar
  22. Nagao, M. (1990). Knowledge and Inference. San Diego: Academic Press.MATHGoogle Scholar
  23. Ney, H. and Oerder, M. (1993). An Efficient Interface Between Continuous-Speech Recognition and Language Understanding. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP ’93). Minneapolis, MN.Google Scholar
  24. Pinkal, M., Rupp C., and Worm, K. Robust Semantic Processing of Spoken Language. In this volume. Google Scholar
  25. Quinlan, J. R. (1986). Induction of Decision Trees. In: Machine Learning 1 (1). 81–106.Google Scholar
  26. Ruland, T. (1995). Inkrementelles probabilistisches Parsing von Worthypothesengraphen. Diploma Thesis, University of Erlangen-Nürnberg, IMMD 8.Google Scholar
  27. Rupp, C.J., Spilker, J., Klarner, M., and Worm, K. Combining Analyses From Various Parsers. In this volume. Google Scholar
  28. Schiehlen, M. Semantic Construction. In this volume. Google Scholar
  29. Schmid, L. (1994). Parsing Word Graphs Using a Linguistic Grammar and a Statistical Language Model. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP ’94). Adelaide, Australia.Google Scholar
  30. Tomita, M. (1991). ed., Generalised LR Parsing. Boston: Kiuwer Academic Publishers.Google Scholar
  31. Waibel, A. et al. (1996) Janus-II—Translation of Spontaneous Conversational Speech. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP ’96). Atlanta, GA.Google Scholar
  32. Weber, H. (1994). LR-inkrementelles, probabilistisches Chartparsing von Worthypothesenmengen mit Unifikationsgrammatiken. Ph.D. Dissertation, University of Hamburg.Google Scholar
  33. Wright, J. H. and Wrigley, E. N. (1991). GLR Parsing With Probability. In Tomita, M.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2000

Authors and Affiliations

  • Tobias Ruland
    • 1
  1. 1.Siemens AGCorporate TechnologyMünchenGermany

Personalised recommendations