Chinese Named Entity Recognition Based on Dynamically Adjusting Feature Weights

Lv, Qing; Zheng, Limin; Wang, Miao

doi:10.1007/978-3-030-92635-9_1

Qing Lv¹⁷,
Limin Zheng¹⁷ &
Miao Wang¹⁷

Part of the book series: Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering ((LNICST,volume 406))

Included in the following conference series:

International Conference on Collaborative Computing: Networking, Applications and Worksharing

976 Accesses

Abstract

Named entity recognition is a basic task in NLP, and it is an important basic tool for many NLP tasks such as information extraction, parsing, question answering system and machine translation. The extraction of sequence features of datasets directly affects the recognition effect of named entities, and only the accumulation of local sequence features cannot capture the long distance dependencies. The extraction of global sequence features improves this problem, but loses some local features. Long entities are nested within short entities and have different entity attributes from short entities, resulting in identification errors. To solve these problems, a Chinese named entity recognition algorithm based on Bert +FL-LGWF+CRF is proposed. In this method, the text is encoded into a word vector matrix by Bert as the input to FL-LGWF (Entity Level-Local And Global Weighted Fusion). FL-LGWF utilizes CNN (Convolutional Neural) to extract the local sequence features of the text vector, and use BISTM (Bidirectional Long Short-Term Memory) to extract contextual global sequence features, and perform dynamic weight fusion on the extracted sequence features. Then the score matrix of the tag is obtained according to the entity attribute level. Finally, the global optimal tag sequence is obtained through the CRF layer. Experimental results show that the proposed Bert +FL-LGWF+CRF model has higher F1 value on both public data sets and self-created data sets.

Supported by the National Key Research and Development Program of China (2017YFC1601803) and Beijing innovation team project of modern agricultural industrial technology system(BAIC02-2020).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Sun, P., Yang, X., Zhao, X., et al.: An overview of named entity recognition. In: 2018 International Conference on Asian Language Processing (IALP). IEEE (2019)
Google Scholar
Xie, R., Liu, Z., Jia, J., et al.: Representation learning of knowledge graphs with entity descriptions. In: Thirtieth AAAI Conference on Artificial Intelligence (2016)
Google Scholar
Riedel, S., Yao, L., McCallum, A., et al.: Relation extraction with matrix factorization and universal schemas. In: Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies 2013, pp. 74–84 (2013)
Google Scholar
Shen, W., Wang, J., Luo, P., et al.: Linden: linking named entities with knowledge base via semantic knowledge. In: Proceedings of the 21st International Conference on World Wide Web, pp. 449–458 (2012)
Google Scholar
Zhu, J., Uren, V., Motta, E.: ESpotter: adaptive named entity recognition for web browsing. In: Althoff, K.-D., Dengel, A., Bergmann, R., Nick, M., Roth-Berghofer, T. (eds.) WM 2005. LNCS (LNAI), vol. 3782, pp. 518–529. Springer, Heidelberg (2005). https://doi.org/10.1007/11590019_59
Chapter Google Scholar
Babych, B., Hartley, A.: Improving machine translation quality with automatic named entity recognition. In: Proceedings of the 7th International EAMT Work- shop on MT and Other Language Technology Tools, Improving MT Through Other Language Technology Tools, Resource and Tools for Building MT at EACL 2003 (2003)
Google Scholar
Bordes, A., Usunier, N., Chopra, S., et al.: Large-scale simple question answering with memory networks. arXiv preprint arXiv:15060.02075 (2015)
Rau, L.F.: Extracting company names from text. In: IEEE Conference on Artificial Intelligence Application. IEEE (1991)
Google Scholar
Bengio, Y., Frasconi, P.: An input output HMM architecture. Adv. Neural Inf. Process. Syst. 7(4), 427–434 (1995)
Google Scholar
Sutton, C.: Dynamic conditional random fields: factorized probabilistic models for labeling and segmenting sequence data. In: Proceedings of the 21st International Conference on Machine Learning 2004 (2007)
Google Scholar
Bishop, C.M.: Neural Networks or Pattern Recognition (2005)
Google Scholar
Collobert, R.: Natural language processing from scratch (2011)
Google Scholar
Lample, G., Ballesteros, M., Subramanian, S., et al.: Neural architectures for named entity recognition (2016)
Google Scholar
Huang, Z., Xu, W., Yu, K.: Bidirectional LSTM-CRF Models for Sequence Tagging (2015)
Google Scholar
Devlin, J., Chang, M.W., Lee, K., et al.: BERT: pre-training of deep bidirectional transformers for language understanding (2018)
Google Scholar
Dong, C., Zhang, J., Zong, C., et al.: Character-based LSTM-CRF with radical-level features for Chinese named entity recognition (2016)
Google Scholar
Cui, Y., Che, W., Liu, T., et al.: Pre-training with whole word masking for Chinese BERT (2019)
Google Scholar
Vaswani, A., Shazeer, N., Parmar, N., et al.: Attention is all you need. arXiv (2017)
Google Scholar
Gu, J., Wang, Z., Kuen, J., et al.: Recent advances in convolutional neural networks. Pattern Recogn. (2015)
Google Scholar
Ye, J., Zou, B., Hong, Y., Shen, L., Zhu, Q., Zhou, Q.: Negation and speculation scope detection in Chinese. J. Comput. Res. Dev. 56(7), 1506–1516 (2019)
Google Scholar
Pinto, D., McCallum, A., Wei, X., et al.: Table extraction using conditional random fields. In: 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 235–242 (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

College of Information and Electrical Engineering, China Agriculture University, Beijing, China
Qing Lv, Limin Zheng & Miao Wang

Authors

Qing Lv
View author publications
You can also search for this author in PubMed Google Scholar
Limin Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Miao Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Limin Zheng .

Editor information

Editors and Affiliations

Shanghai University, Shanghai, China
Honghao Gao
Xi’an Jiaotong-Liverpool University, Suzhou, China
Xinheng Wang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lv, Q., Zheng, L., Wang, M. (2021). Chinese Named Entity Recognition Based on Dynamically Adjusting Feature Weights. In: Gao, H., Wang, X. (eds) Collaborative Computing: Networking, Applications and Worksharing. CollaborateCom 2021. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 406. Springer, Cham. https://doi.org/10.1007/978-3-030-92635-9_1

Download citation

DOI: https://doi.org/10.1007/978-3-030-92635-9_1
Published: 01 January 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-92634-2
Online ISBN: 978-3-030-92635-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Chinese Named Entity Recognition Based on Dynamically Adjusting Feature Weights