Implicit Entity Linking in Tweets

  • Sujan Perera
  • Pablo N. Mendes
  • Adarsh Alex
  • Amit P. Sheth
  • Krishnaprasad Thirunarayan
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9678)

Abstract

Over the years, Twitter has become one of the largest communication platforms, providing key data to applications such as brand monitoring and trend detection. Entity linking is one of the major tasks in natural language understanding from tweets: it associates entity mentions in text with the corresponding entries in knowledge bases in order to provide an unambiguous interpretation and additional context. State-of-the-art techniques have focused on linking explicitly mentioned entities in tweets, with reasonable success. However, we argue that in addition to explicit mentions (e.g. ‘The movie Gravity was more expensive than the mars orbiter mission’), entities (here, the movie Gravity) can also be mentioned implicitly (e.g. ‘This new space movie is crazy. you must watch it!’). This paper introduces the problem of implicit entity linking in tweets. We propose an approach that models entities by exploiting their factual and contextual knowledge. We demonstrate how to use these models to perform implicit entity linking on a ground truth dataset of 397 tweets from two domains, namely Movie and Book. Specifically, we show (1) the importance of linking implicit entities and the value it adds to the standard entity linking task, and (2) the importance of exploiting the contextual knowledge associated with an entity when linking its implicit mentions. We also make the ground truth dataset publicly available to foster research in this new research area.
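To make the idea of linking an implicit mention more concrete, the following is a minimal sketch, not the authors' actual model. It assumes each entity is represented by two hypothetical bags of terms (one standing in for factual knowledge, one for contextual knowledge) and ranks candidates by weighted term overlap with the tweet. The entity entries, term sets, weights, threshold, and function names are all illustrative assumptions.

```python
import re

# Hypothetical toy entity models (illustrative only, not data from the paper).
# "factual" stands in for knowledge-base facts about the entity;
# "contextual" stands in for how the entity is currently talked about.
ENTITY_MODELS = {
    "Gravity (film)": {
        "factual": {"movie", "film", "sandra", "bullock", "cuaron"},
        "contextual": {"space", "astronaut", "new", "3d"},
    },
    "The Hunger Games (novel)": {
        "factual": {"book", "novel", "suzanne", "collins"},
        "contextual": {"katniss", "dystopian", "trilogy"},
    },
}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def score(tweet, model, w_factual=1.0, w_contextual=1.0):
    """Weighted count of tweet tokens found in the entity's term sets."""
    tokens = set(tokenize(tweet))
    return (w_factual * len(tokens & model["factual"])
            + w_contextual * len(tokens & model["contextual"]))


def link_implicit_entity(tweet, models=ENTITY_MODELS, threshold=2.0):
    """Return the best-scoring entity, or None if no score reaches the threshold."""
    ranked = sorted(((score(tweet, m), name) for name, m in models.items()),
                    reverse=True)
    best_score, best_name = ranked[0]
    return best_name if best_score >= threshold else None


if __name__ == "__main__":
    print(link_implicit_entity("This new space movie is crazy. you must watch it!"))
```

In this toy setup the tweet from the abstract never names the film, yet the terms "movie", "new", and "space" are enough to select the hypothetical Gravity entry. A tweet with no overlapping terms yields no link.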

Keywords

Implicit entities · Entity modeling · Entity linking · Contextual knowledge

Copyright information

© Springer International Publishing Switzerland 2016

Authors and Affiliations

  • Sujan Perera (1)
  • Pablo N. Mendes (2)
  • Adarsh Alex (1)
  • Amit P. Sheth (1)
  • Krishnaprasad Thirunarayan (1)

  1. Kno.e.sis Center, Wright State University, Dayton, USA
  2. IBM Research, San Jose, USA
