Skip to main content

Emerging Entity Discovery Using Web Sources

Part of the Communications in Computer and Information Science book series (CCIS,volume 1134)


The rapidly increasing amount of entities in knowledge bases (KBs) can be beneficial for many applications, where the key issue is to link entity mentions in text with entities in the KB, also called entity linking (EL). Many methods have been proposed to tackle this problem. However, the KB can never be complete, such that emerging entity discovery (EED) is essential for detecting emerging entities (EEs) that are mentioned in text but not yet contained in the KB. In this paper, we propose a new topic-driven approach to EED by representing EEs using the context harvested from online Web sources. Experimental results show that our solution outperforms the state-of-the-art methods in terms of F1 measure for the EED task as well as Micro Accuracy and Macro Accuracy in the full EL setting.

This is a preview of subscription content, access via your institution.

Buying options

USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions


  1. 1.

    We choose Microsoft Bing as the Web search engine in this work.


  1. Färber, M., Rettinger, A., Asmar, B.E.: On emerging entity detection. In: EKAW, pp. 223–238 (2016)

    Google Scholar 

  2. Fetahu, B., Anand, A., Anand, A.: How much is Wikipedia lagging behind news? In: WebSci, pp. 28:1–28:9 (2015)

    Google Scholar 

  3. Hoffart, J., Altun, Y., Weikum, G.: Discovering emerging entities with ambiguous names. In: WWW, pp. 385–396 (2014)

    Google Scholar 

  4. Hoffart, J., et al.: Robust disambiguation of named entities in text. In: EMNLP, pp. 782–792 (2011)

    Google Scholar 

  5. Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.G.: Dbpedia: a nucleus for a web of open data. In: ISWC, pp. 722–735 (2007)

    Google Scholar 

  6. Suchanek, F.M., Kasneci, G., Weikum, G.: Yago: a core of semantic knowledge. In: WWW, pp. 697–706 (2007)

    Google Scholar 

  7. Finkel, J.R., Grenager, T., Manning, C.D.: Incorporating non-local information into information extraction systems by gibbs sampling. In: ACL, pp. 363–370 (2005)

    Google Scholar 

  8. Kulkarni, S., Singh, A., Ramakrishnan, G., Chakrabarti, S.: Collective annotation of wikipedia entities in web text. In: KDD, pp. 363–370 (2009)

    Google Scholar 

  9. Ratinov, L., Roth, D., Downey, D., Anderson, M.: Local and global algorithms for disambiguation to Wikipedia. In: ACL, pp. 1375–1384 (2011)

    Google Scholar 

  10. Han, X., Sun, L., Zhao, J.: Collective entity linking in web text: a graph-based method. In: SIGIR, pp. 765–774 (2011)

    Google Scholar 

  11. Raghunathan, K., et al.: A multi-pass sieve for coreference resolution. In: EMNLP, pp. 492–501 (2010)

    Google Scholar 

  12. Toutanova, K., Klein, D., Manning, C.D., Singer, Y.: Feature-rich part-of-speech tagging with a cyclic dependency network. In: HLT-NAACL (2003)

    Google Scholar 

  13. Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)

    MATH  Google Scholar 

  14. Griffiths, T.L., Steyvers, M.: Finding scientific topics. In: PNAS, vol. 101, suppl. 1, pp. 5228–5235 (2004)

    CrossRef  Google Scholar 

  15. Parker, R.: English gigaword fifth edition. Technical report (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations


Corresponding author

Correspondence to Lei Zhang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and Permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zhang, L., Wu, T., Xu, L., Wang, M., Qi, G., Sack, H. (2019). Emerging Entity Discovery Using Web Sources. In: Zhu, X., Qin, B., Zhu, X., Liu, M., Qian, L. (eds) Knowledge Graph and Semantic Computing: Knowledge Computing and Language Understanding. CCKS 2019. Communications in Computer and Information Science, vol 1134. Springer, Singapore.

Download citation

  • DOI:

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-15-1955-0

  • Online ISBN: 978-981-15-1956-7

  • eBook Packages: Computer ScienceComputer Science (R0)