Improving Candidate Generation for Entity Linking

  • Yuhang Guo
  • Bing Qin
  • Yuqin Li
  • Ting Liu
  • Sheng Li
Conference paper

DOI: 10.1007/978-3-642-38824-8_19

Part of the Lecture Notes in Computer Science book series (LNCS, volume 7934)
Cite this paper as:
Guo Y., Qin B., Li Y., Liu T., Li S. (2013) Improving Candidate Generation for Entity Linking. In: Métais E., Meziane F., Saraee M., Sugumaran V., Vadera S. (eds) Natural Language Processing and Information Systems. NLDB 2013. Lecture Notes in Computer Science, vol 7934. Springer, Berlin, Heidelberg

Abstract

Entity linking is the task of linking names in free text to the referent entities in a knowledge base. Most recently proposed linking systems can be broken down into two steps: candidate generation and candidate ranking. The first step searches candidates from the knowledge base and the second step disambiguates them. Previous works have been focused on the recall of the generation because if the target entity is absent in the candidate set, no ranking method can return the correct result. Most of the recall-driven generation strategies will increase the number of the candidates. However, with large candidate sets, memory/time consuming systems are impractical for online applications. In this paper, we propose a novel candidate generation approach to generate high recall candidate set with small size. Experimental results on two KBP data sets show that the candidate generation recall achieves more than 93%. By leveraging our approach, the candidate number is reduced from hundreds to dozens, the system runtime is saved by 70.3% and 76.6% over the baseline and the highest micro-averaged accuracy in the evaluation is improved by 2.2% and 3.4%.

Keywords

Natural Language Processing Information Extraction Entity Linking Candidate Generation Candidate Pruning 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Yuhang Guo
    • 1
  • Bing Qin
    • 1
  • Yuqin Li
    • 2
  • Ting Liu
    • 1
  • Sheng Li
    • 1
  1. 1.School of Computer Science and TechnologyHarbin Institute of TechnologyHarbinChina
  2. 2.Beijing Information Science and Technology UniversityBeijingChina

Personalised recommendations