A Novel Method for Chinese Named Entity Recognition Based on Character Vector

  • Jing LuEmail author
  • Mao Ye
  • Zhi Tang
  • Xiao-Jun Huang
  • Jia-Le Ma
Conference paper
Part of the Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering book series (LNICST, volume 163)


In this paper, a novel method using for Chinese named entity recognition is proposed. For each class, A posteriori probability model is acquired by combing probabilistic model and character vector, which are acquired from each class by using training data. After segment Chinese sentence into words, the posteriori probability of every words in each class can be calculated by using model we proposed, and thus the type of word could be determined according to maximum posteriori probability.


Named entity recognition Word vector Character vector 


  1. 1.
    Qi, Z., Zhao, J., Yang, F.: A new method for open named entity recognition of Chinese (2009)Google Scholar
  2. 2.
    Iwakur, T.: A named entity recognition method based on decomposition and concatenation of word chunks. ACM Trans. Asian Lang. Inf. Process. (TALIP) 12 (2013)Google Scholar
  3. 3.
    Pan, S.J.: Transfer joint embedding for cross-domain named entity recognition. ACM Trans. Inf. Syst. 31(2), 7:1–7:27 (2013)CrossRefGoogle Scholar
  4. 4.
    Konkol, M., Brychcín, T., Konopík, M.: Latent semantics in Named Entity Recognition. Expert Syst. Appl. 42(7), 3470–3479 (2015)CrossRefGoogle Scholar
  5. 5.
    Zhang, H., Liu, Q.: Automatic recognition of Chinese personal name based on role tagging. Chin. J. Comput. 27(1), 85–91 (2004)Google Scholar
  6. 6.
    Yu, H.: Recognition of Chinese organization name based on role tagging. In: Advances in Computation of Oriental Languages, pp. 79–87 (2003)Google Scholar
  7. 7.
    Wang, N., Ge, R.: Company name identification in Chinese financial domain. J. Chin. Inf. Process. 16(2), 1–6 (2002)Google Scholar
  8. 8.
    Zeng, G.: CRFs-based Chinese named entity recognition with improved tag set. Master degree theses of master of Being University of Posts and Telecommunications (2009)Google Scholar
  9. 9.
    Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., Kuksa, P.: Natural language processing (almost) from scratch. J. Mach. Learn. Res. (JMLR) 12, 2493–2537 (2011)zbMATHGoogle Scholar
  10. 10.
    Tomáš, M.: Statistical language models based on neural networks. PhD thesis, Brno University of Technology (2012)Google Scholar
  11. 11.
    Yao, J.: Study on CRF-based Chinese named entity recognition. Master degree theses of master of Suzhou University (2010)Google Scholar
  12. 12.
    Yu, H.: Chinese named entity identification using cascaded hidden Markov model. J. Commun. 27(2), 87–94 (2006)Google Scholar
  13. 13.
    Zhou, J.: Chinese named entity recognition via joint identification and categorization. Chin. J. Electron. 22(2), 225–230 (2013)Google Scholar

Copyright information

© Institute for Computer Sciences, Social Informatics and Telecommunications Engineering 2016

Authors and Affiliations

  • Jing Lu
    • 1
    • 2
    • 3
    Email author
  • Mao Ye
    • 2
  • Zhi Tang
    • 1
    • 2
  • Xiao-Jun Huang
    • 2
  • Jia-Le Ma
    • 2
  1. 1.Institute of Computer Science and TechnologyPeking UniversityBeijingChina
  2. 2.State Key Laboratory of Digital Publishing TechnologyPeking University Founder Group Co., Ltd.BeijingChina
  3. 3.Postdoctoral Workstation of the Zhongguancun Haidian Science ParkBeijingChina

Personalised recommendations