Semantic Web Evaluation Challenge

Semantic Web Evaluation Challenges pp 28-39 | Cite as

A Hybrid Approach for Entity Recognition and Linking

Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 548)

Abstract

Numerous research efforts are tackling the entity recognition and entity linking tasks resulting in a large body of literature. One could roughly categorize the proposed approaches in two different strategies: linguistic-based and semantic-based methods. In this paper, we present our participation to the OKE challenge, where we experiment with a hybrid approach, which combines the strength of a linguistic-based method augmented by a high coverage in the annotation obtained by using a large knowledge base as entity dictionary. The main goal of this hybrid approach is to improve the extraction and recognition level to get the best recall in order to apply a pruning step. On the training set, the results are promising and the breakdown figures are comparable with the state of the art performance of top ranked systems. Our hybrid approach has been ranked first to the OKE Challenge on the test set.

Keywords

Entity recognition Entity linking Entity filtering Learning to rank OKE challenge 

References

  1. 1.
    Aha, D., Kibler, D.: Instance-based learning algorithms. Mach. Learn. 6, 37–66 (1991)MATHGoogle Scholar
  2. 2.
    Cano, A.E., Rizzo, G., Varga, A., Rowe, M., Milan, S., Dadzie, A.-S.: Making sense of microposts (#microposts2014) named entity extraction & linking challenge. In: 4th International Workshop on Making Sense of Microposts, Seoul, South Korea (2014)Google Scholar
  3. 3.
    Finkel, J.R., Grenager, T., Manning, C.: Incorporating non-local information into information extraction systems by gibbs sampling. In: 43rd Annual Meeting on Association for Computational Linguistics (ACL), pp. 363–370, Stroudsburg, PA, USA (2005)Google Scholar
  4. 4.
    Hoffart, J., Altun, Y., Weikum, G.: Discovering emerging entities with ambiguous names. In: 23rd International Conference on World Wide Web (WWW), pp. 385–396, Seoul, Korea (2014)Google Scholar
  5. 5.
    Hoffart, J., Yosef, M.A., Bordino, I., Fürstenau, H., Pinkal, M., Spaniol, M., Taneva, B., Thater, S., Weikum, G.: Robust disambiguation of named entities in text. In: 8th Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 782–792, Stroudsburg, PA, USA (2011)Google Scholar
  6. 6.
    Ji, H., Nothman, J., Hachey, B.: Overview of TAC-KBP2014 entity discovery and linking tasks. In: Text Analysis Conference (TAC), Gaithersburg, USA (2014)Google Scholar
  7. 7.
    Mendes, P.N., Jakob, M., García-Silva, A., Bizer, C.: DBpedia spotlight: shedding light on the web of documents. In: 7th International Conference on Semantic Systems (I-Semantics), pp. 1–8 (2011)Google Scholar
  8. 8.
    Moro, A., Raganato, A., Navigli, R.: Entity linking meets word sense disambiguation: a unified approach. TACL 2, 231–244 (2014)Google Scholar
  9. 9.
    Piccinno, F., Ferragina, P.: From TagME to WAT: a new entity annotator. In: 1st ACM International Workshop on Entity Recognition & Disambiguation (ERD), pp. 55–62, Gold Coast, Australia (2014)Google Scholar
  10. 10.
    Toutanova, K., Christopher, D.: Enriching the knowledge sources used in a maximum entropy part-of-speech tagger. In: North American Chapter of the Association for Computational Linguistics (HLT-NAACL), pp. 252–259, Edmond, Canada (2003)Google Scholar
  11. 11.
    Usbeck, R., Röder, M., Ngonga Ngomo, A.-C., Baron, C., Both, A., Brümmer, M., Ceccarelli, D., Cornolti, M., Cherix, D., Eickmann, B., Ferragina, P., Lemke, C., Moro, A., Navigli, R., Piccinno, F., Rizzo, G., Sack, H., Speck, R., Troncy, R., Waitelonis, J., Wesemann, L.: GERBIL - general entity annotation benchmark Framework. In: 24th World Wide Web Conference (WWW), Florence, Italy (2015)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  1. 1.EURECOMSophia AntipolisFrance

Personalised recommendations