Abstract
Numerous research efforts are tackling the entity recognition and entity linking tasks resulting in a large body of literature. One could roughly categorize the proposed approaches in two different strategies: linguistic-based and semantic-based methods. In this paper, we present our participation to the OKE challenge, where we experiment with a hybrid approach, which combines the strength of a linguistic-based method augmented by a high coverage in the annotation obtained by using a large knowledge base as entity dictionary. The main goal of this hybrid approach is to improve the extraction and recognition level to get the best recall in order to apply a pruning step. On the training set, the results are promising and the breakdown figures are comparable with the state of the art performance of top ranked systems. Our hybrid approach has been ranked first to the OKE Challenge on the test set.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
- 10.
- 11.
- 12.
- 13.
References
Aha, D., Kibler, D.: Instance-based learning algorithms. Mach. Learn. 6, 37ā66 (1991)
Cano, A.E., Rizzo, G., Varga, A., Rowe, M., Milan, S., Dadzie, A.-S.: Making sense of microposts (#microposts2014) named entity extraction & linking challenge. In: 4th International Workshop on Making Sense of Microposts, Seoul, South Korea (2014)
Finkel, J.R., Grenager, T., Manning, C.: Incorporating non-local information into information extraction systems by gibbs sampling. In: 43rd Annual Meeting on Association for Computational Linguistics (ACL), pp. 363ā370, Stroudsburg, PA, USA (2005)
Hoffart, J., Altun, Y., Weikum, G.: Discovering emerging entities with ambiguous names. In: 23rd International Conference on World Wide Web (WWW), pp. 385ā396, Seoul, Korea (2014)
Hoffart, J., Yosef, M.A., Bordino, I., FĆ¼rstenau, H., Pinkal, M., Spaniol, M., Taneva, B., Thater, S., Weikum, G.: Robust disambiguation of named entities in text. In: 8th Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 782ā792, Stroudsburg, PA, USA (2011)
Ji, H., Nothman, J., Hachey, B.: Overview of TAC-KBP2014 entity discovery and linking tasks. In: Text Analysis Conference (TAC), Gaithersburg, USA (2014)
Mendes, P.N., Jakob, M., GarcĆa-Silva, A., Bizer, C.: DBpedia spotlight: shedding light on the web of documents. In: 7th International Conference on Semantic Systems (I-Semantics), pp. 1ā8 (2011)
Moro, A., Raganato, A., Navigli, R.: Entity linking meets word sense disambiguation: a unified approach. TACL 2, 231ā244 (2014)
Piccinno, F., Ferragina, P.: From TagME to WAT: a new entity annotator. In: 1st ACM International Workshop on Entity Recognition & Disambiguation (ERD), pp. 55ā62, Gold Coast, Australia (2014)
Toutanova, K., Christopher, D.: Enriching the knowledge sources used in a maximum entropy part-of-speech tagger. In: North American Chapter of the Association for Computational Linguistics (HLT-NAACL), pp. 252ā259, Edmond, Canada (2003)
Usbeck, R., Rƶder, M., Ngonga Ngomo, A.-C., Baron, C., Both, A., BrĆ¼mmer, M., Ceccarelli, D., Cornolti, M., Cherix, D., Eickmann, B., Ferragina, P., Lemke, C., Moro, A., Navigli, R., Piccinno, F., Rizzo, G., Sack, H., Speck, R., Troncy, R., Waitelonis, J., Wesemann, L.: GERBIL - general entity annotation benchmark Framework. In: 24th World Wide Web Conference (WWW), Florence, Italy (2015)
Acknowledgments
This work was partially supported by the EIT Digital 3cixty project and by French National Research Agency (ANR) within the WAVE Project, under grant number ANR-12-CORD-0027.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
Ā© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Plu, J., Rizzo, G., Troncy, R. (2015). A Hybrid Approach for Entity Recognition and Linking. In: Gandon, F., Cabrio, E., Stankovic, M., Zimmermann, A. (eds) Semantic Web Evaluation Challenges. SemWebEval 2015. Communications in Computer and Information Science, vol 548. Springer, Cham. https://doi.org/10.1007/978-3-319-25518-7_3
Download citation
DOI: https://doi.org/10.1007/978-3-319-25518-7_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25517-0
Online ISBN: 978-3-319-25518-7
eBook Packages: Computer ScienceComputer Science (R0)