Co-mention and Context-Based Entity Linking

Conference paper
Part of the Springer Proceedings in Complexity book series (SPCOM)

Abstract

Recently, online news has become one of the most important resources from which people get useful information. Linking named entities in news articles to existing knowledge bases is a critical task to facilitate readers to understand the news well. In this paper, we propose an approach for linking entities in Chinese news articles to Chinese knowledge bases. Our approach first recognizes three types of named entities (i.e., person, location, and organization) and then uses a disambiguation method to link entities occurring in news articles to entities in knowledge bases. In the disambiguation process, co-mentioned entities are used as features to compute the context similarities between entities in news and entities in knowledge bases; the disambiguation results are decided by a threshold-filtering method on the context similarities. Experiments on linking entities in Sina news to Hudong knowledge base validate the effectiveness of our approach; it achieves 84.39%, 84.02%, and 86.16% F1-scores in the task of linking person entities, location entities, and organization entities, respectively.

Keywords

Coherence Extractor Harness 

Notes

Acknowledgements

The work is supported by the Natural Science Foundation of China (No. 61035004, No. 60973102), 863 High Technology Program (2011AA01A207), European Union 7th Framework Project FP7-288342, and THU-NUS NExT Co-Lab and the project cooperated with Chongqing research institute of science and technology.

References

  1. 1.
    Bunescu, R.C., Pasca, M.: Using encyclopedic knowledge for named entity disambiguation. In: EACL’06 pp. 9–16 (2006)Google Scholar
  2. 2.
    Dill, S., Eiron, N., Gibson, D., Gruhl, D., Guha, R.V., Jhingran, A., Kanungo, T., Rajagopalan, S., Tomkins, A., Tomlin, J.A., Zien, J.Y.: Semtag and seeker: bootstrapping the semantic web via automated semantic annotation. In: WWW’03, pp. 178–186 (2003)Google Scholar
  3. 3.
    Mihalcea, R., Csomai, A.: Wikify!: linking documents to encyclopedic knowledge. In: CIKM’07, pp. 233–242 (2007)Google Scholar
  4. 4.
    Milne, D.N., Witten, I.H.: Learning to link with wikipedia. In: CIKM’08, pp. 509–518 (2008)Google Scholar
  5. 5.
    Cucerzan, S.: Large-scale named entity disambiguation based on wikipedia data. In: EMNLP-CoNLL’07, pp. 708–716 (2007)Google Scholar
  6. 6.
    Kulkarni, S., Singh, A., Ramakrishnan, G., Chakrabarti, S.: Collective annotation of wikipedia entities in web text. In: KDD’09, pp. 457–466 (2009)Google Scholar
  7. 7.
    Hoffart, J., Yosef, M.A., Bordino, I., Fürstenau, H., Pinkal, M., Spaniol, M., Taneva, B., Thater, S., Weikum, G.: Robust disambiguation of named entities in text. In: EMNLP’11, pp. 782–792 (2011)Google Scholar
  8. 8.
    Han, X., Sun, L., Zhao, J.: Collective entity linking in web text: a graph-based method. In: SIGIR’11, pp. 765–774 (2011)Google Scholar
  9. 9.
    Han, X., Zhao, J.: Named entity disambiguation by leveraging wikipedia semantic knowledge. In: CIKM’09, pp. 215–224 (2009)Google Scholar
  10. 10.
    Chakaravarthy, V.T., Gupta, H., Roy, P., Mohania, M.K.: Efficiently linking text documents with relevant structured information. In: VLDB’06, pp. 667–678 (2006)Google Scholar
  11. 11.
    Wang, C., Chakrabarti, K., Cheng, T., Chaudhuri, S.: Targeted disambiguation of ad-hoc, homogeneous sets of named entities. In: WWW’12, pp. 719–728 (2012)Google Scholar
  12. 12.
    Shen, W., Wang, J., Luo, P., Wang, M.: Linden: linking named entities with knowledge base via semantic knowledge. In: WWW’12, pp. 449–458 (2012)Google Scholar
  13. 13.
    Bizer, C., Lehmann, J., Kobilarov, G., Auer, S., Becker, C., Cyganiak, R., Hellmann, S.: Dbpedia – a crystallization point for the web of data. JWS' 2009, 154–165 (2009)Google Scholar
  14. 14.
    Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.G.: Dbpedia: a nucleus for a web of open data. In: ISWC/ASWC’07, pp. 722–735 (2007)Google Scholar
  15. 15.
    Suchanek, F.M., Kasneci, G., Weikum, G.: Yago: a large ontology from Wikipedia and wordnet. JWS' 2008, 203–217 (2008)Google Scholar
  16. 16.
    Suchanek, F.M., Kasneci, G., Weikum, G.: Yago: a core of semantic knowledge. In: WWW’07, pp. 697–706 (2007)Google Scholar
  17. 17.
    Wang, Z., Wang, Z., Li, J., Pan, J.Z.: Building a large scale knowledge base from Chinese wiki encyclopedia. In: JIST’11, pp. 80–95 (2011)Google Scholar
  18. 18.
    Zhang, H., Liu, Q., Zhao, J.: Chinese name entity recognition using role model. Comput. Linguist. Chin. Lang. Process., 29–602 (2003)Google Scholar

Copyright information

© Springer Science+Business Media New York 2013

Authors and Affiliations

  1. 1.Department of Computer Science and TechnologyTsinghua UniversityBeijingPeople’s Republic of China
  2. 2.College of Information Science and TechnologyBeijing Normal UniversityBeijingPeople’s Republic of China

Personalised recommendations