Abstract
Named Entity Recognition, as one of the typical tasks of information extraction, has a wide range of applications. However, the difficulty of data collection and the lack of data annotation are common problems in real scenarios. It is difficult for classical named entity recognition methods to fully obtain hidden information when the dataset and data labels are insufficient. In this case, the recognition accuracy will drop significantly. In resource-poor scenarios, the use of multi-task learning, data augmentation, and transfer learning methods can effectively utilize limited data resources or introduce data from other fields to optimize model performance. In this survey, firstly, we introduce the reader to the state of research on named entity recognition, and outline the reasons for the survey and the contribution of this paper. Then, we comprehensively describe the datasets and evaluation methods included in named entity recognition, and propose a Classification of named entity recognition in the case of lack of resources. Finally, we analyze the existing problems on the basis of the above, and further look forward to the future development direction.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Etzioni, O., Cafarella, M., Downey, D., Popescu, A. M., Shaked, T., Soderland, S., Weld, D., S., Yates, A.: Unsupervised named-entity extraction from the Web: An experimental study. Artificial Intelligence. 165.1:91–134(2005)
Mollá, D., Van Zaanen, M., Smith, D.: Named entity recognition for question answering. In: Proceedings of the Australasian language technology workshop, pp. 51–58 (2006)
Guo, J., Xu, G., Cheng, X., Li, H.: Named entity recognition in query. In: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, pp. 267–274 (2009)
Ganea, O.E., Hofmann, T.: Deep joint entity disambiguation with local neural attention. In: Proceedings of the 2017 Conference on empirical methods in natural language processing, pp. 2619–2629. Association for Computational Linguistics (2017)
Le, P., Titov, I.: Improving entity linking by modeling latent relations between mentions. In: Proceedings of the 56th Annual meeting of the association for computational linguistics, vol. 1, Long Papers, pp. 1595–1604 (2018)
Zhang, Z., Han, X., Liu, Z., Jiang, X., Sun, M., Liu, Q.: Ernie: enhanced language representation with informative entities. In: Proceedings of the 57th Annual meeting of the association for computational linguistics (2019)
Cheng, P., Erk, K.: Attending to entities for better text understanding. Vol. 34, No. 05, pp. 7554–7561. In: Proceedings of the AAAI conference on artificial intelligence (2020)
Nadeau, D., Sekine, S.: A survey of named entity recognition and classification. Lingvisticae Investigationes 30(1), 3–26 (2007)
Marrero, M., Urbano, J., Sánchez-Cuadrado, S., Morato, J., Gómez-BerbÃs, J.M.: Named entity recognition: fallacies, challenges and opportunities. Comput. Stand. & Interfaces 35(5), 482–489 (2013)
Campos, D., Matos, S., Oliveira, J.L.: Biomedical named entity recognition: a survey of machine-learning tools. Theory Appl. Adv. Text Min. 11, 175–195 (2012)
Alshaikhdeeb, B., Ahmad, K.: Biomedical named entity recognition: a review. Int. J. Adv. Sci., Eng. Inf. Technol. 6(6), 889–895 (2016)
Li, J., Sun, A., Han, J., Li, C.: A survey on deep learning for named entity recognition. IEEE Trans. Knowl. Data Eng. 34(1), 50–70 (2020)
Yadav, V., Bethard, S.: A Survey on Recent Advances in Named Entity Recognition from Deep Learning models. In: Proceedings of the 27th International Conference on Computational Linguistics, pp. 2145–2158 (2018)
Gao, C., Zhang, X., Han, M., Liu, H.: A review on cyber security named entity recognition. Front. Inf. Technol. & Electron. Eng. 22(9), 1153–1168 (2021)
Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., Kuksa, P.: Natural language processing (almost) from scratch. J. Mach. Learn. Res. 12(ARTICLE), 2493–2537 (2011)
Passos, A., Kumar, V., McCallum, A.: Lexicon infused phrase embeddings for named entity resolution. In: Proceedings of the eighteenth conference on computational natural language learning, pp. 78–86 (2014)
Pennington, J., Socher, R., Manning, C. D.: Glove: Global vectors for word representation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pp. 1532–1543 (2014)
Yao, L., Liu, H., Liu, Y., Li, X., Anwar, M.W.: Biomedical named entity recognition based on deep neutral network. Int. J. Hybrid Inf. Technol. 8(8), 279–288 (2015)
Huang, Z., Xu, W., Yu, K.: Bidirectional L1STM-CRF models for sequence tagging. Computer Science (2015)
Strubell, E., Verga, P., Belanger, D., McCallum, A.: Fast and accurate entity recognition with iterated dilated convolutions. In: Proceedings of the 2017 Conference on empirical methods in natural language processing, pp. 2670–2680 (2017)
Shen, Y., Ma, X., Tan, Z., Zhang, S., Wang, W., Lu, W.: Locate and Label: A Two-stage Identifier for Nested Named Entity Recognition. In: Proceedings of the 59th Annual meeting of the association for computational linguistics and the 11th international joint conference on natural language processing, vol. 1, Long Papers, pp. 2782–2794 (2021)
Zheng, X., Chen, H., Xu, T.: Deep learning for chinese word segmentation and pos tagging. In: Proceedings of the 2013 conference on empirical methods in natural language processing, pp. 647–657 (2013)
Gillick, D., Brunk, C., Vinyals, O., Subramanya, A.: Multilingual language processing from bytes. In: Proceedings of NAACL-HLT, pp. 1296–1306 (2016)
Kim, Y., Jernite, Y., Sontag, D., Rush, A.: Character-aware neural language model, vol. 30, no. 1. In: Proceedings of the AAAI conference on artificial intelligence (2016)
Kuru, O., Can, O. A., Yuret, D.: Charner: Character-level named entity recognition. In: Proceedings of COLING 2016, the 26th International conference on computational linguistics: Technical Papers, pp. 911–921 (2016)
Dong, C., Zhang, J., Zong, C., Hattori, M., Di, H: Character-based LSTM-CRF with radical-level features for chinese named entity recognition. In Natural Language Understanding and Intelligent Applications, pages 239–250. Springer (2016)
Akbik, A., Blythe, D., Vollgraf, R.: Contextual string embeddings for sequence labeling. In: Proceedings of the 27th international conference on computational linguistics, pp. 1638–1649 (2018)
Peters, M., Neumann, M., Iyyer, M., Gardner, M., Zettlemoyer, L.: Deep contextualized word representations. (2018)
Peters, M.E., Ammar, W., Bhagavatula, C., Power, R.: Semi-supervised sequence tagging with bidirectional language models. In: Proceedings of the 55th Annual meeting of the association for computational linguistics, vol. 1, Long Papers, pp. 1756–1765, Vancouver, Canada, July. Association for Computational Linguistics (2017)
Zhu, Y., Wang, G.: CAN-NER: Convolutional attention network for chinese named entity recognition. In: Proceedings of NAACL-HLT, pp. 3384–3393 (2019)
Ma, X., Hovy, E.: End-to-end sequence labeling via Bi-directional LSTM-CNNs-CRF. In: Proceedings of the 54th Annual meeting of the association for computational linguistics, vol.1, Long Papers, pp. 1064–1074 (2016)
Chiu, J.P., Nichols, E.: Named entity recognition with bidirectional LSTM-CNNs. Trans. Assoc. Comput. Linguist. 4, 357–370 (2016)
Lample, G., Ballesteros, M., Subramanian, S., Kawakami, K., Dyer, C.: Neural Architectures for named entity recognition. In: Proceedings of NAACL-HLT, pp. 260–270 (2016)
Bharadwaj, A., Mortensen, D. R., Dyer, C., Carbonell, J. G.: Phonologically aware neural model for named entity recognition in low resource transfer settings. In: Proceedings of the conference on empirical methods in natural language processing, pp. 1462–1472 (2016)
Lin, B.Y., Xu, F. F., Luo, Z., Zhu, K.: Multi-channel bilstm-crf model for emerging named entity recognition in social media. In: Proceedings of the 3rd Workshop on Noisy User-generated Text, pp. 160–165 (2017)
Gui, T., Zou, Y., Zhang, Q., Peng, M., Fu, J., Wei, Z., Huang, X. J.: A lexicon-based graph neural network for Chinese NER. In: Proceedings of the 2019 Conference on empirical methods in natural language Processing and the 9th International joint conference on natural language processing, pp.1040–1050 (2019)
Liu, T., Yao, J. G., Lin, C. Y.: Towards improving neural named entity recognition with gazetteers. In: Proceedings of the 57th annual meeting of the association for computational linguistics, pp. 5301–5307 (2019)
Ghaddar, A., Langlais, P: Robust lexical features for improved neural network named-entity recognition. In: Proceedings of the 27th International conference on computational linguistics, pp. 1896–1907 (2018)
Zhang, Y., Yang, J.: Chinese NER Using Lattice LSTM. In: Proceedings of the 56th Annual meeting of the association for computational linguistics, vol. 1: Long Papers, pp. 1554–1564 (2018)
Cao, P., Chen, Y., Liu, K., Zhao, J., Liu, S.: Adversarial transfer learning for Chinese named entity recognition with self-attention mechanism. In: Proceedings of the 2018 conference on empirical methods in natural language processing, pp. 182–192 (2018)
Liu, W., Xu, T., Xu, Q., Song, J., Zu, Y.: An encoding strategy based word-character LSTM for Chinese NER. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol. 1. Long and Short Papers. pp. 2379–2389 (2019)
Ding, R., Xie, P., Zhang, X., Lu, W., Li, L., Si, L.: A neural multi-digraph model for Chinese NER with gazetteers. In: Proceedings of the 57th Annual meeting of the association for computational linguistics. pp. 1462–1467 (2019)
Wu, F., Liu, J., Wu, C., Huang, Y., Xie, X.: Neural Chinese named entity recognition via CNN-LSTM-CRF and joint training with word segmentation. In: The World Wide Web Conference. pp. 3342–3348 (2019)
Song, C.H., Sehanobish, A.: Using chinese glyphs for named entity recognition (student abstract). In Proceedings of the AAAI Conference on Artificial Intelligence. 34(10), 13921–13922 (2020)
Peng, N., Dredze, M.: Improving named entity recognition for Chinese social media with word segmentation representation learning. In: Proceedings of the 54th Annual meeting of the association for computational linguistics, vol. 2: Short Papers. pp. 149–155 (2016)
Zhao, S., Liu, T., Zhao, S., Wang, F.: A neural multi-task learning framework to jointly model medical named entity recognition and normalization. In Proceedings of the AAAI Conference on Artificial Intelligence. 33(01), 817–824 (2019)
Liu, Z., Winata, G.I., Fung, P.: Zero-resource cross-domain named entity recognition. In: Proceedings of the 5th workshop on representation learning for NLP. pp. 1–6 (2020)
Zhou, J.T., Zhang, H., Jin, D., Zhu, H., Fang, M., Goh, R. S. M., Kwok, K.: Dual adversarial neural transfer for low-resource named entity recognition. In: Proceedings of the 57th Annual meeting of the association for computational linguistics, pp. 3461–3471 (2019)
Zhang, H., Guo, Y., Li, T.: Domain named entity recognition combining GAN and BiLSTM-attention-CRF. J. Comput. Res. Dev. 56(9), 1851–1858 (2019)
Ding, B., Liu, L., Bing, L., Kruengkrai, C., Nguyen, T. H., Joty, S., Miao, C.: DAGA: Data augmentation with a generation approach for low-resource tagging tasks. In: Proceedings of the 2020 conference on empirical methods in natural language processing, pp. 6045–6057 (2020)
Zhou, R., Li, X., He, R., Bing, L., Cambria, E., Si, L., Miao, C.: MELM: Data augmentation with masked entity language modeling for low-resource NER. In: Proceedings of the 60th annual meeting of the association for computational linguistics, vol. 1: Long Papers, pp. 2251–2262 (2022)
Tsygankova, T., Marini, F., Mayhew, S., Roth, D.: Building low-resource NER models using non-speaker annotations. In: Proceedings of the second workshop on data science with human in the loop: language advances, pp. 62–69 (2021)
Zhou, R., Li, X., He, R., Bing, L., Cambria, E., Si, L.: MulDA: A multilingual data augmentation framework for low-resource cross-lingual NER (2021)
Pan, S.J., Yang, Q.: A survey on transfer learning. IEEE Trans. Knowl. Data Eng. 22(10), 1345–1359 (2010)
Giorgi, J.M., Bader, G.D.: Transfer learning for biomedical named entity recognition with neural networks. Bioinform. 34(23), 4087–4094 (2018)
Xie, J., Yang, Z., Neubig, G., Smith, N. A., Carbonell, J. G.: Neural cross-lingual named entity recognition with minimal resources. In: Proceedings of the 2018 Conference on empirical methods in natural language processing. pp. 369–379 (2018)
Johnson, A., Karanasou, P., Gaspers, J., Klakow, D.: Cross-lingual transfer learning for Japanese named entity recognition. In: Proceedings of the 2019 conference of the North American Chapter of the association for computational linguistics: Human language technologies, vol. 2. Industry Papers. pp. 182–189 (2019)
Chaudhary, A., Xie, J., Sheikh, Z., Neubig, G., Carbonell, J. G.: A little annotation does a lot of good: a study in bootstrapping low-resource named entity recognizers. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th International joint conference on natural language processing. pp. 5164–5174 (2019)
Bari, M.S., Joty, S., Jwalapuram, P.: Zero-resource cross-lingual named entity recognition. In Proceedings of the AAAI conference on artificial intelligence. 34(05), 7415–7423 (2020)
Wu, Q., Lin, Z., Karlsson, B. F., Lou, J. G., Huang, B.: Single-/Multi-source cross-lingual NER via teacher-student learning on unlabeled data in target language. (2020)
Chen, W., Jiang, H., Wu, Q., Karlsson, B., Guan, Y.: AdvPicker: Effectively leveraging Unlabeled data via adversarial discriminator for cross-lingual NER (2021)
Acknowledgment
This work was supported by Hainan Provincial Natural Science Foundation of China (Grant No. 620MS021, 621QN211), National Natural Science Foundation of China (NSFC) (Grant No. 62162024, 62162022), the Key Research and Development Program of Hainan Province (Grant No. ZDYF2020040, ZDYF2021GXJS003), the Major science and technology project of Hainan Province (Grant No. ZDKJ2020012), Science and Technology Development Center of the Ministry of Education Industry-university-Research Innovation Fund (2021JQR017), the Key Laboratory of PK System Technologies Research of Hainan.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Tang, X., Xia, D., Li, Y., Xu, T., Xiong, N.N. (2024). A Survey of Low-Resource Named Entity Recognition. In: Qiu, X., Xiao, Y., Wu, Z., Zhang, Y., Tian, Y., Liu, B. (eds) The 7th International Conference on Information Science, Communication and Computing. ISCC2023 2023. Smart Innovation, Systems and Technologies, vol 350. Springer, Singapore. https://doi.org/10.1007/978-981-99-7161-9_19
Download citation
DOI: https://doi.org/10.1007/978-981-99-7161-9_19
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-7160-2
Online ISBN: 978-981-99-7161-9
eBook Packages: EngineeringEngineering (R0)