Abstract
This paper proposes a bi-gram model based on dynamic programming to Chinese person named entity recognition. By studying the previous work, we concluded that we can improve the precision of NER by improving the recall rate and narrowing the gap between the recall rate and the precision rate. The algorithm defines five recognition rules which ensure the names can be recognized and returned firstly to improve the recall rate. This paper’s innovation is a filtering stage introduced to filter out the invalid names by combining the inverse-maximum-matching with bi-gram model. The bi-gram model takes four pairs of transition probability into consideration when segments the sentence which can effectively narrow the gap between precision rate and recall rate. We take the open test in different corpus and materials extracted from the Internet straightly, its precision rate achieves 83.53 %, recall rate achieves 91.43 % and its F-value achieves 87.3 %.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Kashif R (2010) Rule-based named entity recognition in Urdu. In: Proceedings of the 2010 named entities workshop, Curran Associates, Inc., Uppsala, Sweden, pp 126–135
Laura C, Rajasekar K, Yunyao Li Frederick R, Shivakumar V (2010) Domain adaptation of rule-based annotators for named-entity recognition tasks. In: Conference on empirical methods in natural language processing, Massachusetts, pp 1002–1012
Dilek K, Adnan Y (2009) Named entity recognition experiments on Turkish texts. In: 8th international conference flexible query answering systems, Springer, Denmark, pp 524–535
Zhang HuaPing, Liu Qun (2003) Chinese named entity recognition using role model. J Comput Linguist Chin Lang Process 8:29–60
GuoHong Fu, Jian Su (2002) Named entity recognition using an HMM-based tagger. In: 40th annual meeting of the Association for Computational Linguistics (ACL), Philadelphia, pp 473–480
GuoHong, Kang-Kwong Luke (2005) Chinese named entity recognition using lexicalized HMMs. J ACM SIGKDD Explor Newsl 7:19–25 (New York)
Hua Y, Tan Y, Hao W (2009) A method of Chinese named entity recognition based on maximum entropy model. In: ICMA international conference, IEEE Press, China, pp 2472–2477
Lufeng Z, Pascale F, Richard S, Marine C, Dekai W (2004) Using N-best lists for named entity recognition from Chinese speech. In: Proceedings of HLT-NAACL, short papers, IEEE Press, Boston, Massachusetts
FuChun Peng, FangFang Feng, McCallum A (2004) Chinese segmentation and new word detection using conditional random fields. In: COLING, Geneva, pp 562–568
HongPing Hu, HuiPing Zhou (2008) Chinese named entity recognition with CRFs. In: 2008 international conference on computational intelligence and security, NW Washington, pp 1–6
XiaoFeng Yu (2007) Chinese named entity recognition with cascaded hybrid model. In: Proceedings of the NAACL-Short 07’ human language technologies 2007: the conference of the North American chapter of the association for computational linguistics, companion volume, short papers, New York, pp 197–200
Xiaoyan Zhang, TingWang, Tang J, Zhou H, HuoWang Chen (2005) Chinese named entity recognition with a hybrid-statistical model. In: Web technologies research and development—APWeb 2005, Lecture notes in computer science, Vol 3399/2005, pp 900–912
ZhuoYe Ding, DeGen Huang, Huiwei Zhou (2008) A hybrid model based on CRFs for Chinese named entity recognition. In: 2008 International conference on advanced language processing and web information technology, IEEE Press, China, pp 127–132
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Le, J., Niu, Z. (2014). Chinese Named Entity Recognition Using Improved Bi-gram Model Based on Dynamic Programming. In: Sun, F., Li, T., Li, H. (eds) Knowledge Engineering and Management. Advances in Intelligent Systems and Computing, vol 214. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37832-4_40
Download citation
DOI: https://doi.org/10.1007/978-3-642-37832-4_40
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37831-7
Online ISBN: 978-3-642-37832-4
eBook Packages: EngineeringEngineering (R0)