Probabilistic Feature Grammars

Goodman, Joshua

doi:10.1007/978-94-015-9470-7_4

Part of the book series: Text, Speech and Language Technology (TLTB, volume 16)

Abstract

We present a new formalism, the probabilistic feature grammar (PFG). PFGs combine most of the best properties of several other formalisms, including those of Collins, Magerman, and Charniak, and in experiments achieve comparable or better performance. PFGs generate features one at a time, probabilistically, conditioning the probability of each feature on other features in a local context. Because the conditioning is local, efficient polynomial-time parsing algorithms exist for computing inside and outside probabilities and Viterbi parses. PFGs can produce probabilities of strings, making them potentially useful for language modeling. Precision and recall results are comparable to the state of the art with words, and are the best reported without words.
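
The idea of generating features one at a time, each conditioned on a local context, can be illustrated with a minimal sketch. This is not the model from the chapter: the feature names, their ordering, the choice of context (the parent's category plus the features already generated), and every probability below are hypothetical, chosen only to show the chain-rule factorization.

```python
# Hypothetical illustration only: feature names, their order, the conditioning
# context, and all probabilities below are invented for this sketch.
FEATURE_ORDER = ("category", "head_tag", "head_word")

# TABLES[feature][context] maps each value to P(value | context), where the
# context is the parent's category plus the features already generated.
TABLES = {
    "category": {("S", ()): {"NP": 0.6, "VP": 0.4}},
    "head_tag": {("S", ("NP",)): {"NN": 0.7, "PRP": 0.3}},
    "head_word": {("S", ("NP", "NN")): {"dog": 0.5, "cat": 0.5}},
}


def feature_vector_probability(parent_category, child, tables=TABLES):
    """Multiply the conditional probabilities of the child's features,
    generating them one at a time in FEATURE_ORDER (chain rule)."""
    probability = 1.0
    generated = ()
    for name in FEATURE_ORDER:
        context = (parent_category, generated)
        distribution = tables[name].get(context, {})
        # Unseen contexts or values contribute probability zero in this sketch.
        probability *= distribution.get(child[name], 0.0)
        generated += (child[name],)
    return probability


child = {"category": "NP", "head_tag": "NN", "head_word": "dog"}
print(feature_vector_probability("S", child))
```

Because each factor looks only at a small, fixed context, the product can be computed locally for each constituent, which is the property the abstract credits for making polynomial-time parsing possible.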

References

Abney, S. (1996). Stochastic attribute-value grammars. Available as cmp-lg/9610003.
Baker, J. K. (1979). Trainable grammars for speech recognition. In Proceedings of the Spring Conference of the Acoustical Society of America, pp. 547–550, Boston, MA.
Black, E., Garside, R., and Leech, G. (1993). Statistically-Driven Computer Grammars of English: the IBM/Lancaster Approach, volume 8 of Language and Computers: Studies in Practical Linguistics. Rodopi, Amsterdam.
Black, E., Jelinek, F., Lafferty, J., Magerman, D. M., Mercer, R., and Roukos, S. (1992a). Towards history-based grammars: Using richer models for probabilistic parsing. In Proceedings of the February 1992 DARPA Speech and Natural Language Workshop.
Black, E., Lafferty, J., and Roukos, S. (1992b). Development and evaluation of a broad-coverage probabilistic grammar of English-language computer manuals. In Proceedings of the 30th Annual Meeting of the ACL, pp. 185–192.
Brew, C. (1995). Stochastic HPSG. In Proceedings of the Seventh Conference of the European Chapter of the ACL, pp. 83–89, Dublin, Ireland.
Briscoe, T. and Carroll, J. (1993). Generalized probabilistic LR parsing of natural language (corpora) with unification-based grammars. Computational Linguistics, 19:25–59.
Carroll, J. and Briscoe, T. (1992). Probabilistic normalisation and unpacking of packed parse forests for unification-based grammars. In Proceedings of the AAAI Fall Symposium on Probabilistic Approaches to Natural Language, pp. 33–38, Cambridge, MA.
Charniak, E. (1996). Tree-bank grammars. Technical Report CS-96-02, Department of Computer Science, Brown University. Available from ftp://ftp.cs.brown.edu/pub/techreports/96/cs96-02.ps.Z.
Charniak, E. (1997). Statistical parsing with a context-free grammar and word statistics. In Proceedings of the AAAI, pp. 598–603, Providence, RI. AAAI Press/MIT Press.
Collins, M. (1996). A new statistical parser based on bigram lexical dependencies. In Proceedings of the 34th Annual Meeting of the ACL, pp. 184–191, Santa Cruz, CA. Available as cmp-lg/9605012.
Collins, M. (1997). Three generative, lexicalised models for statistical parsing. In Proceedings of the 35th Annual Meeting of the ACL, pp. 16–23, Madrid, Spain. Available as cmp-lg/9706022.
Eisele, A. (1994). Towards probabilistic extensions of constraint-based grammars. DYANA-2 Deliverable R1.2.B. Available from ftp://ftp.ims.uni-stuttgart.de/papers/DYANA2/R1.2.B.
Goodman, J. (1996). Parsing algorithms and metrics. In Proceedings of the 34th Annual Meeting of the ACL, pp. 177–183, Santa Cruz, CA. Available as cmp-lg/9605036.
Goodman, J. (1997). Global thresholding and multiple-pass parsing. In Proceedings of the Second Conference on Empirical Methods in Natural Language Processing, pp. 11–25.
Goodman, J. (1998). Parsing Inside-Out. PhD thesis, Harvard University. Available as cmp-lg/9805007 and from http://www.research.microsoft.com/~joshuago/thesis.ps.
Lari, K. and Young, S. (1990). The estimation of stochastic context-free grammars using the inside-outside algorithm. Computer Speech and Language, 4:35–56.
Magerman, D. (1994). Natural Language Parsing as Statistical Pattern Recognition. PhD thesis, Stanford University. Available as cmp-lg/9405009.
Magerman, D. (1995). Statistical decision-models for parsing. In Proceedings of the 33rd Annual Meeting of the ACL, pp. 276–283, Cambridge, MA.
Miller, S., Stallard, D., Bobrow, R., and Schwartz, R. (1996). A fully statistical approach to natural language interfaces. In Proceedings of the 34th Annual Meeting of the ACL, pp. 55–61, Santa Cruz, CA.
Ratnaparkhi, A. (1997). A linear observed time statistical parser based on maximum entropy models. In Proceedings of the Second Conference on Empirical Methods in Natural Language Processing, pp. 1–10.
Stolcke, A. (1993). An efficient probabilistic context-free parsing algorithm that computes prefix probabilities. Technical Report TR-93-065, International Computer Science Institute, Berkeley, CA. Available as cmp-lg/9411029.
Zavaliagkos, G., Anastasakos, T., Chou, G., Lapre, C., Kubala, F., Makhoul, J., Nguyen, L., Schwartz, R., and Zhao, Y. (1994). Improved search, acoustic and language modeling in the BBN Byblos large vocabulary CSR system. In Proceedings of the ARPA Workshop on Spoken Language Technology, pp. 81–88, Plainsboro, New Jersey.

Author information

Authors and Affiliations

Harvard University, 40 Oxford St., Cambridge, MA, 02138, USA
Joshua Goodman

Editor information

Editors and Affiliations

Tilburg University, The Netherlands
Harry Bunt
University of Twente, Enschede, The Netherlands
Anton Nijholt

About this chapter

Cite this chapter

Goodman, J. (2000). Probabilistic Feature Grammars. In: Bunt, H., Nijholt, A. (eds) Advances in Probabilistic and Other Parsing Technologies. Text, Speech and Language Technology, vol 16. Springer, Dordrecht. https://doi.org/10.1007/978-94-015-9470-7_4

DOI: https://doi.org/10.1007/978-94-015-9470-7_4
Publisher Name: Springer, Dordrecht
Print ISBN: 978-90-481-5579-8
Online ISBN: 978-94-015-9470-7
eBook Packages: Springer Book Archive
