Advertisement

Chinese Named Entity Recognition Using Improved Bi-gram Model Based on Dynamic Programming

Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 214)

Abstract

This paper proposes a bi-gram model based on dynamic programming to Chinese person named entity recognition. By studying the previous work, we concluded that we can improve the precision of NER by improving the recall rate and narrowing the gap between the recall rate and the precision rate. The algorithm defines five recognition rules which ensure the names can be recognized and returned firstly to improve the recall rate. This paper’s innovation is a filtering stage introduced to filter out the invalid names by combining the inverse-maximum-matching with bi-gram model. The bi-gram model takes four pairs of transition probability into consideration when segments the sentence which can effectively narrow the gap between precision rate and recall rate. We take the open test in different corpus and materials extracted from the Internet straightly, its precision rate achieves 83.53 %, recall rate achieves 91.43 % and its F-value achieves 87.3 %.

Keywords

Named entity recognition Chinese person named recognition Bi-gram Dynamic programming Inverse-maximum-matching 

References

  1. 1.
    Kashif R (2010) Rule-based named entity recognition in Urdu. In: Proceedings of the 2010 named entities workshop, Curran Associates, Inc., Uppsala, Sweden, pp 126–135Google Scholar
  2. 2.
    Laura C, Rajasekar K, Yunyao Li Frederick R, Shivakumar V (2010) Domain adaptation of rule-based annotators for named-entity recognition tasks. In: Conference on empirical methods in natural language processing, Massachusetts, pp 1002–1012Google Scholar
  3. 3.
    Dilek K, Adnan Y (2009) Named entity recognition experiments on Turkish texts. In: 8th international conference flexible query answering systems, Springer, Denmark, pp 524–535Google Scholar
  4. 4.
    Zhang HuaPing, Liu Qun (2003) Chinese named entity recognition using role model. J Comput Linguist Chin Lang Process 8:29–60 Google Scholar
  5. 5.
    GuoHong Fu, Jian Su (2002) Named entity recognition using an HMM-based tagger. In: 40th annual meeting of the Association for Computational Linguistics (ACL), Philadelphia, pp 473–480Google Scholar
  6. 6.
    GuoHong, Kang-Kwong Luke (2005) Chinese named entity recognition using lexicalized HMMs. J ACM SIGKDD Explor Newsl 7:19–25 (New York)Google Scholar
  7. 7.
    Hua Y, Tan Y, Hao W (2009) A method of Chinese named entity recognition based on maximum entropy model. In: ICMA international conference, IEEE Press, China, pp 2472–2477Google Scholar
  8. 8.
    Lufeng Z, Pascale F, Richard S, Marine C, Dekai W (2004) Using N-best lists for named entity recognition from Chinese speech. In: Proceedings of HLT-NAACL, short papers, IEEE Press, Boston, MassachusettsGoogle Scholar
  9. 9.
    FuChun Peng, FangFang Feng, McCallum A (2004) Chinese segmentation and new word detection using conditional random fields. In: COLING, Geneva, pp 562–568Google Scholar
  10. 10.
    HongPing Hu, HuiPing Zhou (2008) Chinese named entity recognition with CRFs. In: 2008 international conference on computational intelligence and security, NW Washington, pp 1–6Google Scholar
  11. 11.
    XiaoFeng Yu (2007) Chinese named entity recognition with cascaded hybrid model. In: Proceedings of the NAACL-Short 07’ human language technologies 2007: the conference of the North American chapter of the association for computational linguistics, companion volume, short papers, New York, pp 197–200Google Scholar
  12. 12.
    Xiaoyan Zhang, TingWang, Tang J, Zhou H, HuoWang Chen (2005) Chinese named entity recognition with a hybrid-statistical model. In: Web technologies research and development—APWeb 2005, Lecture notes in computer science, Vol 3399/2005, pp 900–912Google Scholar
  13. 13.
    ZhuoYe Ding, DeGen Huang, Huiwei Zhou (2008) A hybrid model based on CRFs for Chinese named entity recognition. In: 2008 International conference on advanced language processing and web information technology, IEEE Press, China, pp 127–132Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2014

Authors and Affiliations

  1. 1.College of Computer ScienceBeijing Institute of TechnologyBeijingChina

Personalised recommendations