Skip to main content

A Survey of Low-Resource Named Entity Recognition

  • Conference paper
  • First Online:
The 7th International Conference on Information Science, Communication and Computing (ISCC2023 2023)

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 350))

  • 135 Accesses

Abstract

Named Entity Recognition, as one of the typical tasks of information extraction, has a wide range of applications. However, the difficulty of data collection and the lack of data annotation are common problems in real scenarios. It is difficult for classical named entity recognition methods to fully obtain hidden information when the dataset and data labels are insufficient. In this case, the recognition accuracy will drop significantly. In resource-poor scenarios, the use of multi-task learning, data augmentation, and transfer learning methods can effectively utilize limited data resources or introduce data from other fields to optimize model performance. In this survey, firstly, we introduce the reader to the state of research on named entity recognition, and outline the reasons for the survey and the contribution of this paper. Then, we comprehensively describe the datasets and evaluation methods included in named entity recognition, and propose a Classification of named entity recognition in the case of lack of resources. Finally, we analyze the existing problems on the basis of the above, and further look forward to the future development direction.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 249.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Etzioni, O., Cafarella, M., Downey, D., Popescu, A. M., Shaked, T., Soderland, S., Weld, D., S., Yates, A.: Unsupervised named-entity extraction from the Web: An experimental study. Artificial Intelligence. 165.1:91–134(2005)

    Google Scholar 

  2. Mollá, D., Van Zaanen, M., Smith, D.: Named entity recognition for question answering. In: Proceedings of the Australasian language technology workshop, pp. 51–58 (2006)

    Google Scholar 

  3. Guo, J., Xu, G., Cheng, X., Li, H.: Named entity recognition in query. In: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, pp. 267–274 (2009)

    Google Scholar 

  4. Ganea, O.E., Hofmann, T.: Deep joint entity disambiguation with local neural attention. In: Proceedings of the 2017 Conference on empirical methods in natural language processing, pp. 2619–2629. Association for Computational Linguistics (2017)

    Google Scholar 

  5. Le, P., Titov, I.: Improving entity linking by modeling latent relations between mentions. In: Proceedings of the 56th Annual meeting of the association for computational linguistics, vol. 1, Long Papers, pp. 1595–1604 (2018)

    Google Scholar 

  6. Zhang, Z., Han, X., Liu, Z., Jiang, X., Sun, M., Liu, Q.: Ernie: enhanced language representation with informative entities. In: Proceedings of the 57th Annual meeting of the association for computational linguistics (2019)

    Google Scholar 

  7. Cheng, P., Erk, K.: Attending to entities for better text understanding. Vol. 34, No. 05, pp. 7554–7561. In: Proceedings of the AAAI conference on artificial intelligence (2020)

    Google Scholar 

  8. Nadeau, D., Sekine, S.: A survey of named entity recognition and classification. Lingvisticae Investigationes 30(1), 3–26 (2007)

    Article  Google Scholar 

  9. Marrero, M., Urbano, J., Sánchez-Cuadrado, S., Morato, J., Gómez-Berbís, J.M.: Named entity recognition: fallacies, challenges and opportunities. Comput. Stand. & Interfaces 35(5), 482–489 (2013)

    Article  Google Scholar 

  10. Campos, D., Matos, S., Oliveira, J.L.: Biomedical named entity recognition: a survey of machine-learning tools. Theory Appl. Adv. Text Min. 11, 175–195 (2012)

    Google Scholar 

  11. Alshaikhdeeb, B., Ahmad, K.: Biomedical named entity recognition: a review. Int. J. Adv. Sci., Eng. Inf. Technol. 6(6), 889–895 (2016)

    Article  Google Scholar 

  12. Li, J., Sun, A., Han, J., Li, C.: A survey on deep learning for named entity recognition. IEEE Trans. Knowl. Data Eng. 34(1), 50–70 (2020)

    Article  Google Scholar 

  13. Yadav, V., Bethard, S.: A Survey on Recent Advances in Named Entity Recognition from Deep Learning models. In: Proceedings of the 27th International Conference on Computational Linguistics, pp. 2145–2158 (2018)

    Google Scholar 

  14. Gao, C., Zhang, X., Han, M., Liu, H.: A review on cyber security named entity recognition. Front. Inf. Technol. & Electron. Eng. 22(9), 1153–1168 (2021)

    Article  Google Scholar 

  15. Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., Kuksa, P.: Natural language processing (almost) from scratch. J. Mach. Learn. Res. 12(ARTICLE), 2493–2537 (2011)

    Google Scholar 

  16. Passos, A., Kumar, V., McCallum, A.: Lexicon infused phrase embeddings for named entity resolution. In: Proceedings of the eighteenth conference on computational natural language learning, pp. 78–86 (2014)

    Google Scholar 

  17. Pennington, J., Socher, R., Manning, C. D.: Glove: Global vectors for word representation. In: Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), pp. 1532–1543 (2014)

    Google Scholar 

  18. Yao, L., Liu, H., Liu, Y., Li, X., Anwar, M.W.: Biomedical named entity recognition based on deep neutral network. Int. J. Hybrid Inf. Technol. 8(8), 279–288 (2015)

    Google Scholar 

  19. Huang, Z., Xu, W., Yu, K.: Bidirectional L1STM-CRF models for sequence tagging. Computer Science (2015)

    Google Scholar 

  20. Strubell, E., Verga, P., Belanger, D., McCallum, A.: Fast and accurate entity recognition with iterated dilated convolutions. In: Proceedings of the 2017 Conference on empirical methods in natural language processing, pp. 2670–2680 (2017)

    Google Scholar 

  21. Shen, Y., Ma, X., Tan, Z., Zhang, S., Wang, W., Lu, W.: Locate and Label: A Two-stage Identifier for Nested Named Entity Recognition. In: Proceedings of the 59th Annual meeting of the association for computational linguistics and the 11th international joint conference on natural language processing, vol. 1, Long Papers, pp. 2782–2794 (2021)

    Google Scholar 

  22. Zheng, X., Chen, H., Xu, T.: Deep learning for chinese word segmentation and pos tagging. In: Proceedings of the 2013 conference on empirical methods in natural language processing, pp. 647–657 (2013)

    Google Scholar 

  23. Gillick, D., Brunk, C., Vinyals, O., Subramanya, A.: Multilingual language processing from bytes. In: Proceedings of NAACL-HLT, pp. 1296–1306 (2016)

    Google Scholar 

  24. Kim, Y., Jernite, Y., Sontag, D., Rush, A.: Character-aware neural language model, vol. 30, no. 1. In: Proceedings of the AAAI conference on artificial intelligence (2016)

    Google Scholar 

  25. Kuru, O., Can, O. A., Yuret, D.: Charner: Character-level named entity recognition. In: Proceedings of COLING 2016, the 26th International conference on computational linguistics: Technical Papers, pp. 911–921 (2016)

    Google Scholar 

  26. Dong, C., Zhang, J., Zong, C., Hattori, M., Di, H: Character-based LSTM-CRF with radical-level features for chinese named entity recognition. In Natural Language Understanding and Intelligent Applications, pages 239–250. Springer (2016)

    Google Scholar 

  27. Akbik, A., Blythe, D., Vollgraf, R.: Contextual string embeddings for sequence labeling. In: Proceedings of the 27th international conference on computational linguistics, pp. 1638–1649 (2018)

    Google Scholar 

  28. Peters, M., Neumann, M., Iyyer, M., Gardner, M., Zettlemoyer, L.: Deep contextualized word representations. (2018)

    Google Scholar 

  29. Peters, M.E., Ammar, W., Bhagavatula, C., Power, R.: Semi-supervised sequence tagging with bidirectional language models. In: Proceedings of the 55th Annual meeting of the association for computational linguistics, vol. 1, Long Papers, pp. 1756–1765, Vancouver, Canada, July. Association for Computational Linguistics (2017)

    Google Scholar 

  30. Zhu, Y., Wang, G.: CAN-NER: Convolutional attention network for chinese named entity recognition. In: Proceedings of NAACL-HLT, pp. 3384–3393 (2019)

    Google Scholar 

  31. Ma, X., Hovy, E.: End-to-end sequence labeling via Bi-directional LSTM-CNNs-CRF. In: Proceedings of the 54th Annual meeting of the association for computational linguistics, vol.1, Long Papers, pp. 1064–1074 (2016)

    Google Scholar 

  32. Chiu, J.P., Nichols, E.: Named entity recognition with bidirectional LSTM-CNNs. Trans. Assoc. Comput. Linguist. 4, 357–370 (2016)

    Article  Google Scholar 

  33. Lample, G., Ballesteros, M., Subramanian, S., Kawakami, K., Dyer, C.: Neural Architectures for named entity recognition. In: Proceedings of NAACL-HLT, pp. 260–270 (2016)

    Google Scholar 

  34. Bharadwaj, A., Mortensen, D. R., Dyer, C., Carbonell, J. G.: Phonologically aware neural model for named entity recognition in low resource transfer settings. In: Proceedings of the conference on empirical methods in natural language processing, pp. 1462–1472 (2016)

    Google Scholar 

  35. Lin, B.Y., Xu, F. F., Luo, Z., Zhu, K.: Multi-channel bilstm-crf model for emerging named entity recognition in social media. In: Proceedings of the 3rd Workshop on Noisy User-generated Text, pp. 160–165 (2017)

    Google Scholar 

  36. Gui, T., Zou, Y., Zhang, Q., Peng, M., Fu, J., Wei, Z., Huang, X. J.: A lexicon-based graph neural network for Chinese NER. In: Proceedings of the 2019 Conference on empirical methods in natural language Processing and the 9th International joint conference on natural language processing, pp.1040–1050 (2019)

    Google Scholar 

  37. Liu, T., Yao, J. G., Lin, C. Y.: Towards improving neural named entity recognition with gazetteers. In: Proceedings of the 57th annual meeting of the association for computational linguistics, pp. 5301–5307 (2019)

    Google Scholar 

  38. Ghaddar, A., Langlais, P: Robust lexical features for improved neural network named-entity recognition. In: Proceedings of the 27th International conference on computational linguistics, pp. 1896–1907 (2018)

    Google Scholar 

  39. Zhang, Y., Yang, J.: Chinese NER Using Lattice LSTM. In: Proceedings of the 56th Annual meeting of the association for computational linguistics, vol. 1: Long Papers, pp. 1554–1564 (2018)

    Google Scholar 

  40. Cao, P., Chen, Y., Liu, K., Zhao, J., Liu, S.: Adversarial transfer learning for Chinese named entity recognition with self-attention mechanism. In: Proceedings of the 2018 conference on empirical methods in natural language processing, pp. 182–192 (2018)

    Google Scholar 

  41. Liu, W., Xu, T., Xu, Q., Song, J., Zu, Y.: An encoding strategy based word-character LSTM for Chinese NER. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol. 1. Long and Short Papers. pp. 2379–2389 (2019)

    Google Scholar 

  42. Ding, R., Xie, P., Zhang, X., Lu, W., Li, L., Si, L.: A neural multi-digraph model for Chinese NER with gazetteers. In: Proceedings of the 57th Annual meeting of the association for computational linguistics. pp. 1462–1467 (2019)

    Google Scholar 

  43. Wu, F., Liu, J., Wu, C., Huang, Y., Xie, X.: Neural Chinese named entity recognition via CNN-LSTM-CRF and joint training with word segmentation. In: The World Wide Web Conference. pp. 3342–3348 (2019)

    Google Scholar 

  44. Song, C.H., Sehanobish, A.: Using chinese glyphs for named entity recognition (student abstract). In Proceedings of the AAAI Conference on Artificial Intelligence. 34(10), 13921–13922 (2020)

    Article  Google Scholar 

  45. Peng, N., Dredze, M.: Improving named entity recognition for Chinese social media with word segmentation representation learning. In: Proceedings of the 54th Annual meeting of the association for computational linguistics, vol. 2: Short Papers. pp. 149–155 (2016)

    Google Scholar 

  46. Zhao, S., Liu, T., Zhao, S., Wang, F.: A neural multi-task learning framework to jointly model medical named entity recognition and normalization. In Proceedings of the AAAI Conference on Artificial Intelligence. 33(01), 817–824 (2019)

    Article  Google Scholar 

  47. Liu, Z., Winata, G.I., Fung, P.: Zero-resource cross-domain named entity recognition. In: Proceedings of the 5th workshop on representation learning for NLP. pp. 1–6 (2020)

    Google Scholar 

  48. Zhou, J.T., Zhang, H., Jin, D., Zhu, H., Fang, M., Goh, R. S. M., Kwok, K.: Dual adversarial neural transfer for low-resource named entity recognition. In: Proceedings of the 57th Annual meeting of the association for computational linguistics, pp. 3461–3471 (2019)

    Google Scholar 

  49. Zhang, H., Guo, Y., Li, T.: Domain named entity recognition combining GAN and BiLSTM-attention-CRF. J. Comput. Res. Dev. 56(9), 1851–1858 (2019)

    Google Scholar 

  50. Ding, B., Liu, L., Bing, L., Kruengkrai, C., Nguyen, T. H., Joty, S., Miao, C.: DAGA: Data augmentation with a generation approach for low-resource tagging tasks. In: Proceedings of the 2020 conference on empirical methods in natural language processing, pp. 6045–6057 (2020)

    Google Scholar 

  51. Zhou, R., Li, X., He, R., Bing, L., Cambria, E., Si, L., Miao, C.: MELM: Data augmentation with masked entity language modeling for low-resource NER. In: Proceedings of the 60th annual meeting of the association for computational linguistics, vol. 1: Long Papers, pp. 2251–2262 (2022)

    Google Scholar 

  52. Tsygankova, T., Marini, F., Mayhew, S., Roth, D.: Building low-resource NER models using non-speaker annotations. In: Proceedings of the second workshop on data science with human in the loop: language advances, pp. 62–69 (2021)

    Google Scholar 

  53. Zhou, R., Li, X., He, R., Bing, L., Cambria, E., Si, L.: MulDA: A multilingual data augmentation framework for low-resource cross-lingual NER (2021)

    Google Scholar 

  54. Pan, S.J., Yang, Q.: A survey on transfer learning. IEEE Trans. Knowl. Data Eng. 22(10), 1345–1359 (2010)

    Article  Google Scholar 

  55. Giorgi, J.M., Bader, G.D.: Transfer learning for biomedical named entity recognition with neural networks. Bioinform. 34(23), 4087–4094 (2018)

    Article  Google Scholar 

  56. Xie, J., Yang, Z., Neubig, G., Smith, N. A., Carbonell, J. G.: Neural cross-lingual named entity recognition with minimal resources. In: Proceedings of the 2018 Conference on empirical methods in natural language processing. pp. 369–379 (2018)

    Google Scholar 

  57. Johnson, A., Karanasou, P., Gaspers, J., Klakow, D.: Cross-lingual transfer learning for Japanese named entity recognition. In: Proceedings of the 2019 conference of the North American Chapter of the association for computational linguistics: Human language technologies, vol. 2. Industry Papers. pp. 182–189 (2019)

    Google Scholar 

  58. Chaudhary, A., Xie, J., Sheikh, Z., Neubig, G., Carbonell, J. G.: A little annotation does a lot of good: a study in bootstrapping low-resource named entity recognizers. In: Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th International joint conference on natural language processing. pp. 5164–5174 (2019)

    Google Scholar 

  59. Bari, M.S., Joty, S., Jwalapuram, P.: Zero-resource cross-lingual named entity recognition. In Proceedings of the AAAI conference on artificial intelligence. 34(05), 7415–7423 (2020)

    Article  Google Scholar 

  60. Wu, Q., Lin, Z., Karlsson, B. F., Lou, J. G., Huang, B.: Single-/Multi-source cross-lingual NER via teacher-student learning on unlabeled data in target language. (2020)

    Google Scholar 

  61. Chen, W., Jiang, H., Wu, Q., Karlsson, B., Guan, Y.: AdvPicker: Effectively leveraging Unlabeled data via adversarial discriminator for cross-lingual NER (2021)

    Google Scholar 

Download references

Acknowledgment

This work was supported by Hainan Provincial Natural Science Foundation of China (Grant No. 620MS021, 621QN211), National Natural Science Foundation of China (NSFC) (Grant No. 62162024, 62162022), the Key Research and Development Program of Hainan Province (Grant No. ZDYF2020040, ZDYF2021GXJS003), the Major science and technology project of Hainan Province (Grant No. ZDKJ2020012), Science and Technology Development Center of the Ministry of Education Industry-university-Research Innovation Fund (2021JQR017), the Key Laboratory of PK System Technologies Research of Hainan.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dongwan Xia .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Tang, X., Xia, D., Li, Y., Xu, T., Xiong, N.N. (2024). A Survey of Low-Resource Named Entity Recognition. In: Qiu, X., Xiao, Y., Wu, Z., Zhang, Y., Tian, Y., Liu, B. (eds) The 7th International Conference on Information Science, Communication and Computing. ISCC2023 2023. Smart Innovation, Systems and Technologies, vol 350. Springer, Singapore. https://doi.org/10.1007/978-981-99-7161-9_19

Download citation

  • DOI: https://doi.org/10.1007/978-981-99-7161-9_19

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-99-7160-2

  • Online ISBN: 978-981-99-7161-9

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics