Abstract
Pronominal anaphora resolution denotes antecedent identification for anaphoric pronouns expressed in discourses. Effective resolution relies on the kinds of features to be concerned and how they are appropriately weighted at antecedent identification. In this paper, a rich feature set including the innovative discourse features are employed so as to resolve those commonly-used Chinese pronouns in modern Chinese written texts. Moreover, a maximum-entropy based model is presented to estimate the confidence for each antecedent candidate. Experimental results show that our method achieves 83.5% success rate which is better than those obtained by rule-based and SVM-based methods.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Berger, A.L., Piertra, S.A., Pietra, V.J.: A maximum entropy approach to natural language processing. Compuational Linguistics 22(1), 39–71 (1996)
Bergsma, S., Lin, D.: Bootstrapping Path-Based Pronoun Resolution. In: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the ACL, pp. 33–40 (2006)
Chen, Shou-Yi.: Corpus-based Coherence Relation Tagging in Chinese Discourses, Master Thesis, National Chiao Tung University, Taiwan (2006)
Converse, S.P.: Resolving Pronominal References in Chinese with the Hobbs Algorithm. In: Proceedings of the 4th SIGHAN Workshop on Chinese Language Processing, pp. 116–122 (2005)
Grosz, B.J., Joshi, A.K., Weinstein, S.: Centering: A Framework for Modeling the Local Coherence of Discourse. Computational Linguistics 21(2), 203–225 (1995)
Lappin, S., Leass, H.: An Algorithm for Pronominal Anaphora Resolution. Computational Linguistics 20(4), 535–561 (1994)
Liang, T., Yeh, C.H., Wu, D.S.: A Corpus-based Categorization for Chinese Proper Nouns. In: Proceedings of the National Computer Symposium, pp. 434–443 (2003)
Kehler, A.: Coherence, reference, and the theory of grammar. CSLI Publications, Stanford (2002)
McCallum, A.K.: MALLET: A Machine Learning for Language Toolkit (2002), http://www.cs.umass.edu/~mccallum/mallet
Mitkov, R.: Multilingual Anaphora Resolution. Machine Translation 14(3-4), 281–299 (1999)
Ng, V.: Machine learning for coreference resolution: From local classification to global ranking. In: Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics, pp. 157–164 (2005)
Ng, V., Cardie, C.: Improving machine learning approaches to coreference resolution. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pp. 104–111 (2002)
Strube, M., Muller, C.: A machine learning approach to pronoun resolution in spoken dialogue. In: Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, pp. 168–175 (2003)
Strube, M., Hahn, U.: Functional centering. In: Proceedings of the 34th annual meeting on Association for Computational Linguistics, pp. 270–277 (1996)
Wang, H.F., Mei, Z.: Robust Pronominal Resolution within Chinese Text. Journal of Software 16, 700–707 (2005)
Wang, N., Yuan, C.F., Wang, K.F., Li, W.J.: Anaphora Resolution in Chinese Financial News for Information Extraction. In: Proceedings of the 4th World Congress on Intelligent Control and Automation, pp. 2422–2426 (2002)
Yang, X.F., Su, J., Tan, C.L.: Kernel-Based Pronoun Resolution with Structured Syntactic Knowledge. In: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the ACL, pp. 41–48 (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Liang, T., Wu, DS. (2008). Improving Chinese Pronominal Anaphora Resolution by Extensive Feature Representation and Confidence Estimation. In: Nordström, B., Ranta, A. (eds) Advances in Natural Language Processing. GoTAL 2008. Lecture Notes in Computer Science(), vol 5221. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85287-2_28
Download citation
DOI: https://doi.org/10.1007/978-3-540-85287-2_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85286-5
Online ISBN: 978-3-540-85287-2
eBook Packages: Computer ScienceComputer Science (R0)