Abstract
Named entity recognition is a basic task in NLP, and it is an important basic tool for many NLP tasks such as information extraction, parsing, question answering system and machine translation. The extraction of sequence features of datasets directly affects the recognition effect of named entities, and only the accumulation of local sequence features cannot capture the long distance dependencies. The extraction of global sequence features improves this problem, but loses some local features. Long entities are nested within short entities and have different entity attributes from short entities, resulting in identification errors. To solve these problems, a Chinese named entity recognition algorithm based on Bert +FL-LGWF+CRF is proposed. In this method, the text is encoded into a word vector matrix by Bert as the input to FL-LGWF (Entity Level-Local And Global Weighted Fusion). FL-LGWF utilizes CNN (Convolutional Neural) to extract the local sequence features of the text vector, and use BISTM (Bidirectional Long Short-Term Memory) to extract contextual global sequence features, and perform dynamic weight fusion on the extracted sequence features. Then the score matrix of the tag is obtained according to the entity attribute level. Finally, the global optimal tag sequence is obtained through the CRF layer. Experimental results show that the proposed Bert +FL-LGWF+CRF model has higher F1 value on both public data sets and self-created data sets.
Supported by the National Key Research and Development Program of China (2017YFC1601803) and Beijing innovation team project of modern agricultural industrial technology system(BAIC02-2020).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Sun, P., Yang, X., Zhao, X., et al.: An overview of named entity recognition. In: 2018 International Conference on Asian Language Processing (IALP). IEEE (2019)
Xie, R., Liu, Z., Jia, J., et al.: Representation learning of knowledge graphs with entity descriptions. In: Thirtieth AAAI Conference on Artificial Intelligence (2016)
Riedel, S., Yao, L., McCallum, A., et al.: Relation extraction with matrix factorization and universal schemas. In: Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies 2013, pp. 74–84 (2013)
Shen, W., Wang, J., Luo, P., et al.: Linden: linking named entities with knowledge base via semantic knowledge. In: Proceedings of the 21st International Conference on World Wide Web, pp. 449–458 (2012)
Zhu, J., Uren, V., Motta, E.: ESpotter: adaptive named entity recognition for web browsing. In: Althoff, K.-D., Dengel, A., Bergmann, R., Nick, M., Roth-Berghofer, T. (eds.) WM 2005. LNCS (LNAI), vol. 3782, pp. 518–529. Springer, Heidelberg (2005). https://doi.org/10.1007/11590019_59
Babych, B., Hartley, A.: Improving machine translation quality with automatic named entity recognition. In: Proceedings of the 7th International EAMT Work- shop on MT and Other Language Technology Tools, Improving MT Through Other Language Technology Tools, Resource and Tools for Building MT at EACL 2003 (2003)
Bordes, A., Usunier, N., Chopra, S., et al.: Large-scale simple question answering with memory networks. arXiv preprint arXiv:15060.02075 (2015)
Rau, L.F.: Extracting company names from text. In: IEEE Conference on Artificial Intelligence Application. IEEE (1991)
Bengio, Y., Frasconi, P.: An input output HMM architecture. Adv. Neural Inf. Process. Syst. 7(4), 427–434 (1995)
Sutton, C.: Dynamic conditional random fields: factorized probabilistic models for labeling and segmenting sequence data. In: Proceedings of the 21st International Conference on Machine Learning 2004 (2007)
Bishop, C.M.: Neural Networks or Pattern Recognition (2005)
Collobert, R.: Natural language processing from scratch (2011)
Lample, G., Ballesteros, M., Subramanian, S., et al.: Neural architectures for named entity recognition (2016)
Huang, Z., Xu, W., Yu, K.: Bidirectional LSTM-CRF Models for Sequence Tagging (2015)
Devlin, J., Chang, M.W., Lee, K., et al.: BERT: pre-training of deep bidirectional transformers for language understanding (2018)
Dong, C., Zhang, J., Zong, C., et al.: Character-based LSTM-CRF with radical-level features for Chinese named entity recognition (2016)
Cui, Y., Che, W., Liu, T., et al.: Pre-training with whole word masking for Chinese BERT (2019)
Vaswani, A., Shazeer, N., Parmar, N., et al.: Attention is all you need. arXiv (2017)
Gu, J., Wang, Z., Kuen, J., et al.: Recent advances in convolutional neural networks. Pattern Recogn. (2015)
Ye, J., Zou, B., Hong, Y., Shen, L., Zhu, Q., Zhou, Q.: Negation and speculation scope detection in Chinese. J. Comput. Res. Dev. 56(7), 1506–1516 (2019)
Pinto, D., McCallum, A., Wei, X., et al.: Table extraction using conditional random fields. In: 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 235–242 (2003)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
About this paper
Cite this paper
Lv, Q., Zheng, L., Wang, M. (2021). Chinese Named Entity Recognition Based on Dynamically Adjusting Feature Weights. In: Gao, H., Wang, X. (eds) Collaborative Computing: Networking, Applications and Worksharing. CollaborateCom 2021. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 406. Springer, Cham. https://doi.org/10.1007/978-3-030-92635-9_1
Download citation
DOI: https://doi.org/10.1007/978-3-030-92635-9_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-92634-2
Online ISBN: 978-3-030-92635-9
eBook Packages: Computer ScienceComputer Science (R0)