High Precision Extraction of Grammatical Relations

Carroll, John; Briscoe, Ted

doi:10.1007/1-4020-2295-6_3

John Carroll¹⁵ &
Ted Briscoe¹⁶

Part of the book series: Text, Speech and Language Technology ((TLTB,volume 23))

Abstract

A parsing system returning analyses in the form of sets of grammatical relations can obtain high precision if it hypothesises a particular grammatical relation only when it is certain that the relation is correct. We operationalise this technique-in a statistical parser using a manually-developed wide-coverage grammar of English — by only returning relations that form part of all analyses licensed by the grammar. We observe an increase in precision from 75% to over 90% (at the cost of a reduction in recall) on a test corpus of naturally-occurring text.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

When Rules Meet Bigrams

Constraining Parse Ambiguity with Grammatical Codes

Predicate Argument Structures for Information Extraction from Dependency Representations: Null Elements are Missing

References

Aït-Mokhtar, S. and J-P. Chanod (1997). Subject and object dependency extraction using finite-state transducers. In Proceedings of the ACL/EACL Workshop on Automatic Information Extraction and Building of Lexical Semantic Resources, 71–77. Madrid, Spain.
Google Scholar
Argamon, S., I. Dagan and Y. Krymolowski (1998). A memory-based approach to learning shallownatural language patterns. In Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics, 67–73. Montreal.
Google Scholar
Blaheta, D. and E. Charniak (2000). Assigning function tags to parsed text. In Proceedings of the 1st Conference of the North American Chapter of the Association for Computational Linguistics, 234–240. Seattle, WA.
Google Scholar
Bouma, G., G. van Noord and R. Malouf (2001). Alpino: wide-coverage computational analysis of Dutch. Computational Linguistics in the Netherlands 2000. Selected Papers from the 11th CLIN Meeting.
Google Scholar
Brants, T., W. Skut and B. Krenn (1997). Tagging grammatical functions. In Proceedings of the 2nd Conference on Empirical Methods in Natural Language Processing, 64–74. Providence, RI.
Google Scholar
Brent, M. (1993). From grammar to lexicon: unsupervised learning of lexical syntax. Computational Linguistics, 19(3):243–262.
Google Scholar
Briscoe, E. and J. Carroll (1997). Automatic extraction of subcategorization from corpora. In Proceedings of the 5th Association for Computational Linguistics Conference on Applied Natural Language Processing, 356–363. Washington, DC.
Google Scholar
Briscoe, E. and J. Carroll (2002). Robust Accurate Statistical Annotation of GeneralText. In Proceedings of the 3rd International Conference on Language Resources and Evaluation, Las Palmas, Gran Canaria. 1499–1504.
Google Scholar
Buchholz, S., J. Veenstra and W. Daelemans (1999). Cascaded grammatical relation assignment. In Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, College Park, MD. 239–246.
Google Scholar
Carroll, J., E. Briscoe and A. Sanfilippo (1998). Parser evaluation: a survey and a new proposal. In Proceedings of the 1st International Conference on Language Resources and Evaluation, 447–454. Granada, Spain.
Google Scholar
Carroll, J., G. Minnen and E. Briscoe (1998). Can subcategorisation probabilities help a statistical parser?. In Proceedings of the 6th ACL/S1GDAT Workshop on Very Large Corpora, 118–126. Montreal, Canada.
Google Scholar
Clark, S. and D. Weir (2001). Class-based probability estimation using a semantic hierarchy. In Proceedings of the 2nd Conference of the North American Chapter of the Association for Computational Linguistics, 95–102. Pittsburgh, PA.
Google Scholar
Collins, M. (1999). Head-driven statistical models for natural language parsing. PhD thesis, University of Pennsylvania.
Google Scholar
Grefenstette, G. (1997). SQLET: short query linguistic expansion techniques, palliating one-word queries by providing intermediate structure to text. In Proceedings of the RIAO’97, 500–509. Montreal, Canada.
Google Scholar
Grefenstette, G. (1999). Light parsing as finite-state filtering. In A. Kornai (Ed.), Extended Finite State Models of Language, Cambridge University Press. 86–94.
Google Scholar
Harrison, P., S. Abney, E. Black, D. Flickinger, C. Gdaniec, R. Grishman, D. Hindle, B. Ingria, M. Marcus, B. Santorini and T. Strzalkowski (1991). Evaluating syntax performance of parser/grammars of English. In Proceedings of the ACL Workshop on Evaluating Natural Language Processing Systems, 71–78. Berkeley, CA.
Google Scholar
Karlsson, F., A. Voutilainen, J. Heikkilä and A. Anttila (1995). Constraint Grammar: a Language-Independent System for Parsing Unrestricted Text. Berlin, Germany: de Gruyter.
Book Google Scholar
Kiefer, B., H-U. Krieger, J. Carroll and R. Malouf (1999). A bag of useful techniques for efficient and robust parsing. In Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, 473–80. University of Maryland.
Google Scholar
Lafferty, J., D. Sleator and D. Temperley (1992). Grammatical trigrams: a probabilistic model of link grammar. In Proceedings of the AAAI Fall Symposium on Probabilistic Approaches to Natural Language, 89–97. Cambridge, MA.
Google Scholar
Leech, G. (1992). 100 million words of English: the British National Corpus. Language Research, 28(1):1–13.
MathSciNet Google Scholar
Lin, D. (1998). Dependency-based evaluation of MINIPAR. In Proceedings of the Evaluation of Parsing Systems: Workshop at the 1st International Conference on Language Resources and Evaluation. Granada, Spain (also available as University of Sussex technical report CSRP-489).
Google Scholar
Lin, D. (1999). Automatic identification of non-compositional phrases. In Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, 317–324. College Park, MD.
Google Scholar
McCarthy, D. (2000). Using semantic preferences to identify verbal participation in role switching alternations. In Proceedings of the 1st Conference of the North American Chapter of the Association for Computational Linguistics, 256–263. Seattle, WA.
Google Scholar
Oepen, S. and J. Carroll (2000). Ambiguity packing in constraint-based parsing — practical results. In Proceedings of the 1st Conference of the North American Chapter of the Association for Computational Linguistics, 162–169. Seattle, WA.
Google Scholar
Palmer, M., R. Passonneau, C. Weir and T. Finin (1993). The KERNEL text understanding system. Artificial Intelligence, 63:17–68.
Article Google Scholar
Pantel, P. and D. Lin (2000). An unsupervised approach to prepositional phrase attachment using contextually similar words. In Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics, 101–108. Hong Kong.
Google Scholar
Sampson, G. (1995). English for the Computer. Oxford University Press.
Google Scholar
Schmid, H. and M. Rooth (2001). Parse forest computation of expected governors. In Proceedings of the 39th Annual Meeting of the Association for Computational Linguistics, 458–465. Toulouse, France.
Google Scholar
Srinivas, B. (2000). A lightweight dependency analyzer for partialparsing. Natural Language Engineering, 6(2):113–138.
Article Google Scholar
Yeh, A. (2000). Using existing systems to supplement small amounts of annotated grammatical relations training data. In Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics, 126–132. Hong Kong.
Google Scholar

Download references

Author information

Authors and Affiliations

Cognitive and Computing Sciences, University of Sussex, Falmer, Brighton, BN1 9QH, UK
John Carroll
Computer Laboratory, University of Cambridge, JJ Thomson Avenue, Cambridge, CB3 0FD, UK
Ted Briscoe

Authors

John Carroll
View author publications
You can also search for this author in PubMed Google Scholar
Ted Briscoe
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Tilburg University, Tilburg, The Netherlands
Harry Bunt
University of Sussex, Brighton, UK
John Carroll
University of Padua, Padua, Italy
Giorgio Satta

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Carroll, J., Briscoe, T. (2004). High Precision Extraction of Grammatical Relations. In: Bunt, H., Carroll, J., Satta, G. (eds) New Developments in Parsing Technology. Text, Speech and Language Technology, vol 23. Springer, Dordrecht. https://doi.org/10.1007/1-4020-2295-6_3

Download citation

DOI: https://doi.org/10.1007/1-4020-2295-6_3
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-2293-7
Online ISBN: 978-1-4020-2295-1
eBook Packages: Humanities, Social Sciences and Law

Publish with us

Policies and ethics

High Precision Extraction of Grammatical Relations

Abstract

Access this chapter

Preview

Similar content being viewed by others

When Rules Meet Bigrams

Constraining Parse Ambiguity with Grammatical Codes

Predicate Argument Structures for Information Extraction from Dependency Representations: Null Elements are Missing

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Navigation

High Precision Extraction of Grammatical Relations

Abstract

Access this chapter

Preview

Similar content being viewed by others

When Rules Meet Bigrams

Constraining Parse Ambiguity with Grammatical Codes

Predicate Argument Structures for Information Extraction from Dependency Representations: Null Elements are Missing

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation