Skip to main content

Attending to Entity Class Attributes for Named Entity Recognition with Few-Shot Learning

  • Conference paper
  • First Online:
Intelligent Systems and Applications (IntelliSys 2023)

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 824))

Included in the following conference series:

  • 135 Accesses

Abstract

Named Entity Recognition (NER) serves as the foundation for several natural language applications like question answering, chatbots and intent classification. Identification of entity boundaries and its categorization into entity types poses a significant challenge in domain-dependent and low-resource settings, with limited training data availability. To this end, we propose AtEnA, a novel NER framework utilizing entity class attributes from external knowledge source for few-shot learning. We use a two-stage fine-tuning process, wherein a language model is initially trained to “attend” to the different entity class attributes along with the textual context, and is then fine-tuned for the downstream application data with few annotated training examples. Experiments on benchmark NER datasets depict AtEnA to perform around 10 F1 score points better than the existing NER methodologies, specifically for few-shot limited training scenarios.

Work done while the author was at Huawei Research, Ireland

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 249.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://spacy.io/ and https://deeppavlov.ai/.

  2. 2.

    www.luis.ai & www.cloud.google.com/dialogflow.

  3. 3.

    https://metatext.io/datasets/wikiner.

  4. 4.

    https://www.kaggle.com/datasets/abhinavwalia95/entity-annotated-corpus.

References

  1. Aliod, D.M., van Zaanen, M., Smith, D.: Named entity recognition for question answering. In: ALTA, pp. 51–58 (2006)

    Google Scholar 

  2. Aone, C., Halverson, L., Hampton, T., Ramos-Santacruz, M.: SRA: description of the IE2 system used for MUC-7. In: MUC-7 (1998)

    Google Scholar 

  3. Aone, C., Okurowski, M.E., Gorlinsky, J.: A trainable summarizer with knowledge acquired from robust NLP techniques, pp. 71–80 (2022)

    Google Scholar 

  4. Bocklisch, T., Faulkner, J., Pawlowski, N., Nichol, A.: Rasa: open source language understanding and dialogue management. In: NIPS Workshop on Conversational AI (2017)

    Google Scholar 

  5. Bouarroudj, W., Boufaida, Z., Bellatreche, L.: Named entity disambiguation in short texts over knowledge graphs. Knowl. Inf. Syst. 64, 325–351 (2022)

    Article  Google Scholar 

  6. Cheng, P., Erk, K.: Attending to entities for better text understanding. In: AAAI, pp. 7554–7561 (2020)

    Google Scholar 

  7. Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., Kuksa, P.: Natural language processing (almost) from scratch. J. Mach. Learn. Res. 12, 2493–2537 (2011)

    Google Scholar 

  8. Eddy, S.R.: Hidden markov models. Curr. Opin. Struct. Biol. 6(3), 361–365 (1996)

    Article  Google Scholar 

  9. Etzioni, O., Cafarella, M., Downey, D., Popescu, A.M., Shaked, T., Soderland, S., Weld, D.S., Yates, A.: Unsupervised named entity extraction from the web: an experimental study. Artif. Intell. 165(1), 91–134 (2005)

    Article  Google Scholar 

  10. Goyal, A., Gupta, V., Kumar, M.: Recent named entity recognition and classification techniques: a systematic review. Comput. Sci. Rev. 29, 21–43 (2018)

    Article  Google Scholar 

  11. Hearst, M.A., Dumais, S.T., Osuna, E., Platt, J., Scholkopf, B.: Support vector machines. IEEE Intell. Syst. & Their Appl. 13(4), 18–28 (1998)

    Article  Google Scholar 

  12. Huang, Z., Xu, W., Yu, K.: Bidirectional LSTM-CRF models for sequence tagging (2015). arXiv:1508.01991

  13. Krupka, G., IsoQuest, K.: Description of the NetOWL extractor system as used for MUC-7. In: MUC-7, pp. 21–28 (2005)

    Google Scholar 

  14. Lafferty, J.D., McCallum, A., Pereira, F.C.N.: Conditional random fields: probabilistic models for segmenting and labeling sequence data. In: ICML, pp. 282–289 (2001)

    Google Scholar 

  15. Lample, G., Ballesteros, M., Subramanian, S., Kawakami, K., Dyer, C.: Neural architectures for named entity recognition. In: NAACL, pp. 260–270 (2016)

    Google Scholar 

  16. Li, J., Sun, A., Han, J., Li, C.: A Survey on Deep Learning for Named Entity Recognition. IEEE Trans. Knowl, Data Eng (2020)

    Google Scholar 

  17. Li, X., Feng, J., Meng, Y., Han, Q., Wu, F., Li, J.: A unified MRC framework for named entity recognition. In: ACL, pp. 5849–5859 (2020)

    Google Scholar 

  18. Lin, B.Y., Xu, F., Luo, Z., Zhu, K.: Multi-channel BiLSTM-CRF model for emerging named entity recognition in social media. In: W-NUT, pp. 160–165 (2017)

    Google Scholar 

  19. Liu, W., Zhou, P., Zhao, Z., Wang, Z., Qi, J., Deng, H.: and Ping Wang. Enabling Language Representation with Knowledge Graph. In: AAAI, K-BERT (2020)

    Google Scholar 

  20. Nadeau, D., Sekine, S.: A survey of named entity recognition and classification. Lingvisticae Investig. 30(1), 3–26 (2007)

    Article  Google Scholar 

  21. Panchendrarajan, R., Amaresan, A.: Bidirectional LSTM-CRF for named entity recognition. In: PACLIC, pp. 531–540 (2018)

    Google Scholar 

  22. Peters, M.E., Ammar, W., Bhagavatula, C., Power, R.: Semisupervised sequence tagging with bidirectional language models. In: ACL, pp. 1756–1765 (2017)

    Google Scholar 

  23. Peters, M.E., Neumann, M., Logan, R., Schwartz, R., Joshi, V., Singh, S., Smith, N.A.: Knowledge enhanced contextual word representations. In: EMNLP-IJCNLP, pp. 43–54 (2019)

    Google Scholar 

  24. Ri, R., Yamada, I., Tsuruoka, Y.: mLUKE: the power of entity representations in multilingual pretrained language models. In: ACL, pp. 7316–7330 (2022)

    Google Scholar 

  25. Sang, E.F., De Meulder, F.: Introduction to the CoNLL-2003 shared task: language-independent named entity recognition (2003). arXiv:cs/0306050

  26. Swietojanski, P., Liu, X., Eshghi, A., Rieser, V.: Benchmarking natural language understanding services for building conversational agents. In: Proceedings of the Tenth International Workshop on Spoken Dialogue Systems Technology (IWSDS), Ortigia, Siracusa (SR), Italy, April 2019. Springer

    Google Scholar 

  27. Yamada, I., Asai, A., Shindo, H., Takeda, H., Matsumoto, Y.: LUKE: deep contextualized entity representations with entity-aware self-attention. In: EMNLP, pp. 6442–6454 (2020)

    Google Scholar 

  28. Yosef, M.A., Hoffart, J., Bordino, I., Spaniol, M., Weikum, G.: AIDA: an online tool for accurate disambiguation of named entities in text and tables. Proc. VLDB Endow. 4(12), 1450–1453 (2011)

    Article  Google Scholar 

  29. Zhang, S., Elhadad, N.: Unsupervised biomedical named entity recognition: experiments with clinical and biological texts. J. Biomed. Inform. 46(6), 1088–1098 (2013)

    Article  Google Scholar 

  30. Zhang, Z., Han, X., Liu, Z., Jiang, X., Sun, M., Liu, Q.: ERNIE: enhanced language representation with informative entities. In: ACL, pp. 1441–1451 (2019)

    Google Scholar 

  31. Zhou, G., Su, J.: Named entity recognition using an HMM based chunk tagger. In: ACL, pp. 473–480 (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Raj Nath Patel .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Patel, R.N., Dutta, S., Assem, H. (2024). Attending to Entity Class Attributes for Named Entity Recognition with Few-Shot Learning. In: Arai, K. (eds) Intelligent Systems and Applications. IntelliSys 2023. Lecture Notes in Networks and Systems, vol 824. Springer, Cham. https://doi.org/10.1007/978-3-031-47715-7_57

Download citation

Publish with us

Policies and ethics