Interactive Parsing

Toselli, Alejandro Héctor; Vidal, Enrique; Casacuberta, Francisco

doi:10.1007/978-0-85729-479-1_9

Alejandro Héctor Toselli,
Enrique Vidal &
Francisco Casacuberta

600 Accesses

Abstract

This chapter introduces the Interactive Parsing (IP) framework for obtaining the correct syntactic parse tree of a given sentence. This formal framework allows us to make the construction of interactive systems for tree annotation. These interactive systems can help to human annotators in creating error-free parse trees with little effort, when compared with manual post-editing of the trees provided by an automatic parser.

In principle, the interaction protocol defined in the IP framework differs from the left-to-right interaction protocol used throughout this book. Specifically, the IP protocol will be of desultory order; that is, in IP the user can edit any part of the parse tree and in any order. However, in order to efficiently calculate the next best tree in IP framework, in Sect. 9.4, a left-to-right depth-first tree review order will be introduced. In addition, this order also introduces computational advantages into the lookout of most probable tree for interactive bottom-up parsing algorithms. The use of Confidence Measures in IP is also presented as an efficient technique to detect erroneous parse trees. Confidence Measures can be efficiently computed in the IP framework and can help in detecting erroneous constituents within the IP process more quickly, as they provide discriminant information over all the IP process.

With Contribution Of: José Miguel Benedí, Joan Andreu Sánchez and Ricardo Sánchez-Sáez.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
http://nltk.sourceforge.net/.

References

Baker, J. K. (1979). Trainable grammars for speech recognition. The Journal of the Acoustical Society of America, 65, 31–35.
Google Scholar
Benedí, J. M., & Sánchez, J. A. (2005). Estimation of stochastic context-free grammars and their use as language models. Computer Speech & Language, 19(3), 249–274.
Article Google Scholar
Benedí, J. M., Sánchez, J. A., & Sanchis, A. (2007). Confidence measures for stochastic parsing. In Proceedings of the international conference recent advances in natural language processing (pp. 58–63), Borovets, Bulgaria.
Google Scholar
Carter, D. (1997). The TreeBanker. A tool for supervised training of parsed corpora. In Proceedings of the workshop on computational environments for grammar development and linguistic engineering (pp. 9–15), Madrid, Spain.
Google Scholar
Charniak, E. (1997). Statistical parsing with a context-free grammar and word statistics. In Proceedings of the national conference on artificial intelligence (pp. 598–603), Providence, Rhode Island, USA.
Google Scholar
Charniak, E. (2000). A maximum-entropy-inspired parser. In Proceedings of the first conference on North American chapter of the association for computational linguistics (pp. 132–139), Seattle, Washington, USA.
Google Scholar
Charniak, E., Knight, K., & Yamada, K. (2003). Syntax-based language models for statistical machine translation. In Machine translation summit, IX international association for machine translation, New Orleans, Louisiana, USA.
Google Scholar
Chelba, F., & Jelinek, C. (2000). Structured language modeling. Computer Speech and Language, 14(4), 283–332.
Article Google Scholar
Chiang, D. (2007). Hierarchical phrase-based translation. Computational Linguistics, 33(2), 201–228.
Article MATH Google Scholar
Collins, M. (2003). Head-driven statistical models for natural language parsing. Computational Linguistics, 29(4), 589–637.
Article MathSciNet MATH Google Scholar
de la Clergerie, E. V., Hamon, O., Mostefa, D., Ayache, C., Paroubek, P., & Vilnat, A. (2008). PASSAGE: from French parser evaluation to large sized treebank. In Proceedings of the sixth international language resources and evaluation (pp. 3570–3577), Marrakech, Morocco.
Google Scholar
Earley, J. (1970). An efficient context-free parsing algorithm. Communications of the ACM, 8(6), 451–455.
Google Scholar
Gascó, G., & Sánchez, J. A. (2007). A* parsing with large vocabularies. In Proceedings of the international conference recent advances in natural language processing (pp. 215–219), Borovets, Bulgaria.
Google Scholar
Gascó, G., Sánchez, J. A., & Benedí, J. M. (2010). Enlarged search space for sitg parsing. In Proceedings of the North American chapter of the association for computational linguistics—human language technologies conference (pp. 653–656), Los Angeles, California.
Google Scholar
Hopcroft, J. E., & Ullman, J. D. (1979). Introduction to automata theory, languages and computation. Reading: Addison-Wesley.
MATH Google Scholar
Huang, L., & Chiang, D. (2005). Better k-best parsing. In Proceedings of the ninth international workshop on parsing technology (pp. 53–64), Vancouver, British Columbia. Menlo Park: Association for Computational Linguistics.
Chapter Google Scholar
Jain, A. K., Duin, R. P., & Mao, J. (2000). Statistical pattern recognition: A review. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22, 4–37.
Article Google Scholar
Klein, D., & Manning, C. D. (2003). Accurate unlexicalized parsing. In Proceedings of the 41st annual meeting on association for computational linguistics (Vol. 1, pp. 423–430), Association for Computational Linguistics Morristown, NJ, USA.
Google Scholar
Lease, M., Charniak, E., Johnson, M., & McClosky, D. (2006). A look at parsing and its applications. In Proceedings of the twenty-first national conference on artificial intelligence, Boston, Massachusetts, USA.
Google Scholar
Marcus, M. P., Santorini, B., & Marcinkiewicz, M. A. (1994). Building a large annotated corpus of English: The Penn Treebank. Computational Linguistics, 19(2), 313–330.
Google Scholar
Oepen, S., Flickinger, D., Toutanova, K., & Manning, C. D. (2004). LinGO redwoods. Research on Language and Computation, 2(4), 575–596.
Article Google Scholar
Pereira, F., & Schabes, Y. (1992). Inside-outside reestimation from partially bracketed corpora. In Proceedings of the 30th annual meeting of the association for computational linguistics (pp. 128–135). Newark: University of Delaware.
Chapter Google Scholar
Petrov, S., & Klein, D. (2007). Improved inference for unlexicalized parsing. In Conference of the North American chapter of the association for computational linguistics; proceedings of the main conference (pp. 404–411), Rochester, New York.
Google Scholar
Roark, B. (2001). Probabilistic top-down parsing and language modeling. Computational Linguistics, 27(2), 249–276.
Article MathSciNet Google Scholar
Salvador, I., & Benedí, J. M. (2002). RNA modeling by combining stochastic context-free grammars and n-gram models. International Journal of Pattern Recognition and Artificial Intelligence, 16(3), 309–315.
Article Google Scholar
San-Segundo, R., Pellom, B., Hacioglu, K., Ward, W., & Pardo, J. M. (2001). Confidence measures for spoken dialogue systems. In IEEE international conference on acoustic speech and signal processing (Vol. 1), Salt Lake City, Utah, USA.
Google Scholar
Sánchez-Sáez, R., Sánchez, J. A., & Benedí, J. M. (2009). Statistical confidence measures for probabilistic parsing. In Proceedings of the international conference on recent advances in natural language processing (pp. 388–392), Borovets, Bulgaria.
Google Scholar
Sánchez-Sáez, R., Leiva, L., Sánchez, J. A., & Benedí, J. M. (2010). Confidence measures for error discrimination in an interactive predictive parsing framework. In 23rd International conference on computational linguistics (pp. 1220–1228), Beijing, China.
Google Scholar
Serrano, N., Sanchis, A., & Juan, A. (2010). Balancing error and supervision effort in interactive-predictive handwriting recognition. In Proceeding of the 14th international conference on intelligent user interfaces (pp. 373–376), Hong Kong, China.
Chapter Google Scholar
Stolcke, A. (1995). An efficient probabilistic context-free parsing algorithm that computes prefix probabilities. Computational Linguistics, 21(2), 165–200.
MathSciNet Google Scholar
Tarazón, L., Pérez, D., Serrano, N., Alabau, V., Terrades, O. R., Sanchis, A., & Juan, A. (2009). Confidence measures for error correction in interactive transcription of handwritten text. In LNCS: Vol. 5716. Proceedings of the 15th international conference on image analysis and processing (pp. 567–574), Salerno, Italy.
Google Scholar
Ueffing, N., & Ney, H. (2007). Word-level confidence estimation for machine translation. Computational Linguistics, 33(1), 9–40.
Article MATH Google Scholar
Wessel, F., Schluter, R., Macherey, K., & Ney, H. (2001). Confidence measures for large vocabulary continuous speech recognition. IEEE Transactions on Speech and Audio Processing, 9(3), 288–298.
Article Google Scholar
Wu, D. (1997). Stochastic inversion transduction grammars and bilingual parsing of parallel corpora. Computational Linguistics, 23(3), 377–404.
Google Scholar
Yamada, K., & Knight, K. (2002). A decoder for syntax-based statistical MT. In Meeting of the association for computational linguistics, Philadelphia, Pensilvania, USA.
Google Scholar
Yamamoto, R., Sako, S., Nishimoto, T., & Sagayama, S. (2006). On-line recognition of handwritten mathematical expressions based on stroke-based stochastic context-free grammar. In 10th international workshop on frontiers in handwriting recognition (pp. 249–254), La Baule, France.
Google Scholar

Download references

Author information

Authors and Affiliations

Authors

Dr. Alejandro Héctor Toselli
View author publications
You can also search for this author in PubMed Google Scholar
Dr. Enrique Vidal
View author publications
You can also search for this author in PubMed Google Scholar
Prof. Francisco Casacuberta
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Alejandro Héctor Toselli .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Toselli, A.H., Vidal, E., Casacuberta, F. (2011). Interactive Parsing. In: Multimodal Interactive Pattern Recognition and Applications. Springer, London. https://doi.org/10.1007/978-0-85729-479-1_9

Download citation

DOI: https://doi.org/10.1007/978-0-85729-479-1_9
Publisher Name: Springer, London
Print ISBN: 978-0-85729-478-4
Online ISBN: 978-0-85729-479-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics