Abstract
This paper presents a two-step approach to determining whether occurrences of a transliterated personal name in different Chinese texts refer to the same person. First, a heuristic strategy based on biographical information and “colleague” names forms an initial set of coreference chains. Then, a clustering algorithm based on the Vector Space Model (VSM) merges these chains under a full-name consistency constraint. Experimental results show that the approach performs well.
Supported by the National Natural Science Foundation of China (Nos. 60473138, 60173005)
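As a rough illustration of the second step described in the abstract, the following Python sketch merges coreference chains by cosine similarity over term-frequency vectors, while refusing any merge that would mix different full names. The chain layout (`docs`, `full_names`, `terms`), the greedy best-pair-first merging, the `threshold` value of 0.5, the made-up sample data, and the rule that a merged chain may carry at most one distinct full name are all assumptions made for illustration; the paper's actual features, term weighting, and clustering procedure may differ.

```python
import math
from collections import Counter
from itertools import combinations


def cosine(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def names_consistent(x, y):
    """Assumed constraint: a merged chain may hold at most one full name."""
    return len(x["full_names"] | y["full_names"]) <= 1


def merge_chains(chains, threshold=0.5):
    """Greedily merge the most similar name-consistent pair until none remains."""
    # Work on copies so the caller's chains are left untouched.
    chains = [
        {
            "docs": set(c["docs"]),
            "full_names": set(c["full_names"]),
            "terms": Counter(c["terms"]),
        }
        for c in chains
    ]
    while True:
        best = None  # (similarity, i, j) of the best admissible pair
        for i, j in combinations(range(len(chains)), 2):
            if not names_consistent(chains[i], chains[j]):
                continue  # the constraint vetoes this merge outright
            sim = cosine(chains[i]["terms"], chains[j]["terms"])
            if sim >= threshold and (best is None or sim > best[0]):
                best = (sim, i, j)
        if best is None:
            return chains
        _, i, j = best
        merged = {
            "docs": chains[i]["docs"] | chains[j]["docs"],
            "full_names": chains[i]["full_names"] | chains[j]["full_names"],
            "terms": chains[i]["terms"] + chains[j]["terms"],
        }
        chains = [c for k, c in enumerate(chains) if k not in (i, j)]
        chains.append(merged)


# Hypothetical initial chains, e.g. as produced by a heuristic first step.
initial = [
    {"docs": {"d1"}, "full_names": {"George Bush"},
     "terms": {"president": 3, "white": 1, "house": 1}},
    {"docs": {"d2"}, "full_names": set(),
     "terms": {"president": 2, "house": 2}},
    {"docs": {"d3"}, "full_names": {"Laura Bush"},
     "terms": {"president": 2, "house": 1, "white": 1}},
]
result = merge_chains(initial)
for chain in result:
    print(sorted(chain["docs"]), sorted(chain["full_names"]))
```

In this toy input the chains for `d1` and `d3` have the most similar contexts, but they carry different full names, so the constraint keeps them apart; the chain with no known full name is free to join whichever consistent chain it resembles most.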
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, H. (2005). Cross-Document Transliterated Personal Name Coreference Resolution. In: Wang, L., Jin, Y. (eds) Fuzzy Systems and Knowledge Discovery. FSKD 2005. Lecture Notes in Computer Science, vol 3614. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11540007_2
DOI: https://doi.org/10.1007/11540007_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28331-7
Online ISBN: 978-3-540-31828-6
eBook Packages: Computer Science, Computer Science (R0)