TRank: Ranking Entity Types Using the Web of Data

Tonon, Alberto; Catasta, Michele; Demartini, Gianluca; Cudré-Mauroux, Philippe; Aberer, Karl

doi:10.1007/978-3-642-41335-3_40

Alberto Tonon²⁶,
Michele Catasta²⁷,
Gianluca Demartini²⁶,
Philippe Cudré-Mauroux²⁶ &
…
Karl Aberer²⁷

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8218))

Included in the following conference series:

International Semantic Web Conference

4646 Accesses
22 Citations

Abstract

Much of Web search and browsing activity is today centered around entities. For this reason, Search Engine Result Pages (SERPs) increasingly contain information about the searched entities such as pictures, short summaries, related entities, and factual information. A key facet that is often displayed on the SERPs and that is instrumental for many applications is the entity type. However, an entity is usually not associated to a single generic type in the background knowledge bases but rather to a set of more specific types, which may be relevant or not given the document context. For example, one can find on the Linked Open Data cloud the fact that Tom Hanks is a person, an actor, and a person from Concord, California. All those types are correct but some may be too general to be interesting (e.g., person), while other may be interesting but already known to the user (e.g., actor), or may be irrelevant given the current browsing context (e.g., person from Concord, California). In this paper, we define the new task of ranking entity types given an entity and its context. We propose and evaluate new methods to find the most relevant entity type based on collection statistics and on the graph structure interconnecting entities and types. An extensive experimental evaluation over several document collections at different levels of granularity (e.g., sentences, paragraphs, etc.) and different type hierarchies (including DBPedia, Freebase, and schema.org) shows that hierarchy-based approaches provide more accurate results when picking entity types to be displayed to the end-user while still being highly scalable.

Download to read the full chapter text

Chapter PDF

Example Based Entity Search in the Web of Data

A Review of Graph-Based Models for Entity-Oriented Search

Article 30 August 2021

Entity Linking for Web Search Queries

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Bizer, C., Lehmann, J., Kobilarov, G., Auer, S., Becker, C., Cyganiak, R., Hellmann, S.: DBpedia - A crystallization point for the Web of Data. Journal of Web Semantics, 154–165 (2009)
Google Scholar
Bollacker, K., Evans, C., Paritosh, P., Sturge, T., Taylor, J.: Freebase: a collaboratively created graph database for structuring human knowledge. In: SIGMOD, pp. 1247–1250 (2008)
Google Scholar
Campinas, S., Ceccarelli, D., Perry, T.E., Delbru, R., Balog, K., Tummarello, G.: The Sindice-2011 dataset for entity-oriented search in the web of data. In: 1st International Workshop on Entity-Oriented Search (EOS), pp. 26–32 (2011)
Google Scholar
Ciaramita, M., Altun, Y.: Broad-coverage sense disambiguation and information extraction with a supersense sequence tagger. In: EMNLP, pp. 594–602 (2006)
Google Scholar
Cunningham, H., Humphreys, K., Gaizauskas, R., Wilks, Y.: GATE: a general architecture for text engineering. In: ANLC, pp. 29–30 (1997)
Google Scholar
Demartini, G., Difallah, D.E., Cudré-Mauroux, P.: ZenCrowd: leveraging probabilistic reasoning and crowdsourcing techniques for large-scale entity linking. In: WWW, pp. 469–478 (2012)
Google Scholar
Fang, Y., Si, L., Yu, Z., et al.: Purdue at TREC 2010 Entity Track: A Probabilistic Framework for Matching Types Between Candidate and Target Entities. In: TREC (2010)
Google Scholar
Finkel, J.R., Grenager, T., Manning, C.: Incorporating non-local information into information extraction systems by gibbs sampling. In: ACL, pp. 363–370 (2005)
Google Scholar
Finkel, J.R., Manning, C.D.: Joint parsing and named entity recognition. In: NAACL, pp. 326–334 (2009)
Google Scholar
Gangemi, A., Nuzzolese, A.G., Presutti, V., Draicchio, F., Musetti, A., Ciancarini, P.: Automatic typing of dbpedia entities. In: Cudré-Mauroux, P., Heflin, J., Sirin, E., Tudorache, T., Euzenat, J., Hauswirth, M., Parreira, J.X., Hendler, J., Schreiber, G., Bernstein, A., Blomqvist, E. (eds.) ISWC 2012, Part I. LNCS, vol. 7649, pp. 65–81. Springer, Heidelberg (2012)
Chapter Google Scholar
Holmes, G., Hall, M., Frank, E.: Generating rule sets from model trees. In: Foo, N.Y. (ed.) AI 1999. LNCS, vol. 1747, pp. 1–12. Springer, Heidelberg (1999)
Chapter Google Scholar
Järvelin, K., Kekäläinen, J.: Cumulated gain-based evaluation of IR techniques. In: TOIS, pp. 422–446 (2002)
Google Scholar
Kalyanpur, A., Murdock, J.W., Fan, J., Welty, C.: Leveraging community-built knowledge for type coercion in question answering. In: Aroyo, L., Welty, C., Alani, H., Taylor, J., Bernstein, A., Kagal, L., Noy, N., Blomqvist, E. (eds.) ISWC 2011, Part II. LNCS, vol. 7032, pp. 144–156. Springer, Heidelberg (2011)
Chapter Google Scholar
Kumar, R., Tomkins, A.: A characterization of online search behavior. IEEE Data Eng. Bull. (2009)
Google Scholar
Liu, T.-Y.: Learning to rank for information retrieval. In: FTIR, pp. 225–331 (2009)
Google Scholar
Matuszek, C., Cabral, J., Witbrock, M., DeOliveira, J.: An introduction to the syntax and content of cyc. In: AAAI Spring Symposium (2006)
Google Scholar
Mühleisen, H., Bizer, C.: Web data commons - extracting structured data from two large web corpora. In: LDOW (2012)
Google Scholar
Nadeau, D.: Semi-supervised named entity recognition: learning to recognize 100 entity types with little supervision. PhD thesis (2007)
Google Scholar
Nadeau, D., Turney, P.D., Matwin, S.: Unsupervised named-entity recognition: generating gazetteers and resolving ambiguity. In: Lamontagne, L., Marchand, M. (eds.) Canadian AI 2006. LNCS (LNAI), vol. 4013, pp. 266–277. Springer, Heidelberg (2006)
Chapter Google Scholar
Pound, J., Mika, P., Zaragoza, H.: Ad-hoc object retrieval in the web of data. In: WWW, pp. 771–780 (2010)
Google Scholar
Quinlan, J.R.: Learning with continuous classes. In: AI, pp. 343–348 (1992)
Google Scholar
Suchanek, F.M., Abiteboul, S., Senellart, P.: Paris: Probabilistic alignment of relations, instances, and schema. In: PVLDB, pp. 157–168 (2011)
Google Scholar
Suchanek, F.M., Kasneci, G., Weikum, G.: Yago: a core of semantic knowledge. In: WWW, pp. 697–706 (2007)
Google Scholar
Tonon, A., Demartini, G., Cudré-Mauroux, P.: Combining inverted indices and structured search for ad-hoc object retrieval. In: SIGIR, pp. 125–134 (2012)
Google Scholar
Tylenda, T., Sozio, M., Weikum, G.: Einstein: physicist or vegetarian? summarizing semantic type graphs for knowledge discovery. In: WWW, pp. 273–276 (2011)
Google Scholar
Welty, C., Murdock, J.W., Kalyanpur, A., Fan, J.: A comparison of hard filters and soft evidence for answer typing in watson. In: Cudré-Mauroux, P., et al. (eds.) ISWC 2012, Part II. LNCS, vol. 7650, pp. 243–256. Springer, Heidelberg (2012)
Chapter Google Scholar
Whang, S.E., Garcia-Molina, H.: Joint Entity Resolution on Multiple Datasets. The VLDB Journal (2013)
Google Scholar

Download references

Author information

Authors and Affiliations

eXascale Infolab, University of Fribourg, Switzerland
Alberto Tonon, Gianluca Demartini & Philippe Cudré-Mauroux
EPFL, Lausanne, Switzerland
Michele Catasta & Karl Aberer

Authors

Alberto Tonon
View author publications
You can also search for this author in PubMed Google Scholar
Michele Catasta
View author publications
You can also search for this author in PubMed Google Scholar
Gianluca Demartini
View author publications
You can also search for this author in PubMed Google Scholar
Philippe Cudré-Mauroux
View author publications
You can also search for this author in PubMed Google Scholar
Karl Aberer
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Knowledge Media Institute, The Open University, Milton Keynes, UK
Harith Alani
Massachusetts Institute of Technology, Cambridge, MA, USA
Lalana Kagal
IBM Research, Hawthorne, NY, USA
Achille Fokoue
Free University Amsterdam, The Netherlands
Paul Groth
Technical University Darmstadt, Germany
Chris Biemann
Digital Enterprise Research Institute, National University of Ireland, Galway, Ireland
Josiane Xavier Parreira
VU Amsterdam, The Netherlands
Lora Aroyo
Stanford University, CA, USA
Natasha Noy
IBM Research, Yorktown Heights, NY, USA
Chris Welty
University of California, Santa Barbara, CA, USA
Krzysztof Janowicz

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tonon, A., Catasta, M., Demartini, G., Cudré-Mauroux, P., Aberer, K. (2013). TRank: Ranking Entity Types Using the Web of Data. In: Alani, H., et al. The Semantic Web – ISWC 2013. ISWC 2013. Lecture Notes in Computer Science, vol 8218. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41335-3_40

Download citation

DOI: https://doi.org/10.1007/978-3-642-41335-3_40
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41334-6
Online ISBN: 978-3-642-41335-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

TRank: Ranking Entity Types Using the Web of Data

Abstract

Chapter PDF

Similar content being viewed by others

Example Based Entity Search in the Web of Data

A Review of Graph-Based Models for Entity-Oriented Search

Entity Linking for Web Search Queries

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

TRank: Ranking Entity Types Using the Web of Data

Abstract

Chapter PDF

Similar content being viewed by others

Example Based Entity Search in the Web of Data

A Review of Graph-Based Models for Entity-Oriented Search

Entity Linking for Web Search Queries

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation