Probabilistic LR-Parsing with Symbolic Postprocessing
Chapter
Abstract
This article describes a novel approach to probabilistic LR-parsing of spontaneously spoken utterances developed in Verbmobil. It extends the use of context knowledge within the probabilistic model of the parser and improves its output by applying tree transformation rules learned from corpora. The parser was developed for German, English and Japanese and achieves more than 90% Labeled Recall/Precision on parsed Verbmobil utterances.
Keywords
Context Free Grammar Training Corpus Syntactic Analysis Word Lattice Tree Transformation
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
Preview
Unable to display preview. Download preview PDF.
References
- Aho, A.V., Sethi, R., and Ullman, J.D. (1986) Compilers: Principle, Techniques and Tools. Reading, Mass.: Addison Wesley.Google Scholar
- Batliner, A., Block, H.-U., Kießling, A., Kompe, R., Niemann, H., Nöth, E., Ruland, T., and Schachtl, S. (1997). Improving Parsing of Spontaneous Speech With the Help of Prosodic Boundaries. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP ’97). München, Germany.Google Scholar
- Block, H. U. (1993). Compiling Trace and Unification Grammar for Parsing and Generation. In Strzalkowski, T., ed., Reversible Grammar in Natural Language Processing, 155–174, Boston, Dordrecht, London: Kluwer.Google Scholar
- Bod, R. (1995). The Problem of Computing the Most Probable Tree in Data-Oriented Parsing and Stochastic Tree Grammars. In Proceedings of the Seventh Conference of the European Chapter of the ACL. Dublin, 104–111.Google Scholar
- Brill, E. (1993a). A Corpus-Based Approach to Language Learning. Ph.D. Dissertation, University of Pennsylvania, Department of Computer and Information Science.Google Scholar
- Brill, E. (1993b). Automatic Grammar Induction and Parsing Free Text: A Transformation Based Approach. In Proceedings of the 31st Annual Meeting of the Association for Computational Linguistics. Columbus, Ohio.Google Scholar
- Briscoe, T. and Carroll, J. (1993). Generalized Probabilistic LR Parsing of Natural Language (Corpora) With Unification-Based Grammars. Computational Linguistics 19 (1).Google Scholar
- Briscoe, T. and Carroll, J. (1996) Apportioning Development Effort in a Probabilistic LR-Parsing System Through Evaluation. In Proceedings of the ACL SIGDAT Conference on Empirical Methods in Natural Language Processing. Philadelphia, PA., 92–100.Google Scholar
- Charniak, E. (1993). Statistical Language Learning. Cambridge, Mass.: MIT Press.Google Scholar
- Charniak, E. (1997). Statistical Techniques for Natural Language Parsing. AI Magazine. Google Scholar
- Collins, M. (1999) Head-Driven Statistical Models for Natural Language Parsing. Ph.D. Dissertation, University of Pennsylvania, Philadelphia.Google Scholar
- Good, I.J. (1953) The Population Frequencies of Species and the Estimation of Population Parameters. Biometrika 40(3,4). 237–263.MathSciNetMATHGoogle Scholar
- Hermjakob, U. (1997). Learning Parse and Translation Decisions From Examples With Rich Context. Ph.D. Dissertation, University of Texas, Austin, TX.Google Scholar
- Hinrichs, E.W., Kübler, S., Kordoni, V., and Müller, F., (a). Robust Chunk Parsing For Spontaneous Speech. In this volume. Google Scholar
- Hinrichs, E.W., Bartels, J., Kawata, Y., S., Kordoni, V., and Telljohann, H., (b). The Tübingen Treebanks for Spoken German, English and Japanese. In this volume. Google Scholar
- Inui, K., Sornlertlamvanich, V., Tanaka, H., and Tokunaga, T. (1997a). A New Formalization of Probabilistic GLR Parsing. In Proceedings of the International Workshop on Parsing Technologies. Google Scholar
- Inui, K., Shirai, K., Sornlertlamvanich, V., Tanaka, H., and Tokunaga, T. (1997b). Empirical Evaluation of Probabilistic GLR Parsing. Natural Language Pacific-Rim Symposium.Google Scholar
- Kiefer B., Krieger, H.-U., and Nederhof, M.-J. Efficient and Robust HPSG Parsing of Word Graphs. In this volume. Google Scholar
- Lavie, A. (1996). GLR*: A Robust Grammar-Focused Parser for Spontaneously Spoken Language. Ph.D. Dissertation, Carnegie Mellon University, Pittsburgh.Google Scholar
- Magerman, D. (1994). Natural Language Parsing as Statistical Pattern Recognition. Ph.D. Dissertation, Stanford University, Stanford, CA.Google Scholar
- Marcus, M. P. (1980). A Theory of Syntactic Recognition for Natural Language. Cambridge, Mass.: MIT Press.MATHGoogle Scholar
- Nagao, M. (1990). Knowledge and Inference. San Diego: Academic Press.MATHGoogle Scholar
- Ney, H. and Oerder, M. (1993). An Efficient Interface Between Continuous-Speech Recognition and Language Understanding. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP ’93). Minneapolis, MN.Google Scholar
- Pinkal, M., Rupp C., and Worm, K. Robust Semantic Processing of Spoken Language. In this volume. Google Scholar
- Quinlan, J. R. (1986). Induction of Decision Trees. In: Machine Learning 1 (1). 81–106.Google Scholar
- Ruland, T. (1995). Inkrementelles probabilistisches Parsing von Worthypothesengraphen. Diploma Thesis, University of Erlangen-Nürnberg, IMMD 8.Google Scholar
- Rupp, C.J., Spilker, J., Klarner, M., and Worm, K. Combining Analyses From Various Parsers. In this volume. Google Scholar
- Schiehlen, M. Semantic Construction. In this volume. Google Scholar
- Schmid, L. (1994). Parsing Word Graphs Using a Linguistic Grammar and a Statistical Language Model. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP ’94). Adelaide, Australia.Google Scholar
- Tomita, M. (1991). ed., Generalised LR Parsing. Boston: Kiuwer Academic Publishers.Google Scholar
- Waibel, A. et al. (1996) Janus-II—Translation of Spontaneous Conversational Speech. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP ’96). Atlanta, GA.Google Scholar
- Weber, H. (1994). LR-inkrementelles, probabilistisches Chartparsing von Worthypothesenmengen mit Unifikationsgrammatiken. Ph.D. Dissertation, University of Hamburg.Google Scholar
- Wright, J. H. and Wrigley, E. N. (1991). GLR Parsing With Probability. In Tomita, M.Google Scholar
Copyright information
© Springer-Verlag Berlin Heidelberg 2000