LinkSUM: Using Link Analysis to Summarize Entity Data

  • Andreas Thalhammer
  • Nelia Lasierra
  • Achim Rettinger
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9671)

Abstract

The amount of structured data published on the Web is constantly growing. A significant part of this data is published in accordance with the Linked Data principles. The explicit graph structure enables machines and humans to retrieve descriptions of entities and discover information about relations to other entities. In many cases, descriptions of single entities include thousands of statements, and it becomes difficult for human users to comprehend the data unless a selection of the most relevant facts is provided.

In this paper we introduce LinkSUM, a lightweight link-based approach for the relevance-oriented summarization of knowledge graph entities. LinkSUM optimizes the combination of the PageRank algorithm with an adaptation of the Backlink method, together with new approaches for predicate selection. Both quantitative and qualitative evaluations have been conducted to study the performance of the method in comparison to an existing entity summarization approach. The results show a significant improvement over the state of the art and lead us to conclude that prioritizing the selection of related resources leads to better summaries.
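
The abstract does not state how the PageRank and Backlink signals are combined, so the following is only a minimal sketch of the general idea, not the paper's formula. It assumes a linear mix controlled by a hypothetical weight `alpha`, treats a backlink as a simple yes/no signal (the linked resource also links back to the entity), and leaves out predicate selection entirely. The function names and the toy graph are made up for illustration.

```python
from typing import Dict, List, Set

Graph = Dict[str, Set[str]]  # node -> set of nodes it links to


def pagerank(graph: Graph, damping: float = 0.85, iterations: int = 50) -> Dict[str, float]:
    """Plain power-iteration PageRank; dangling nodes spread their rank uniformly."""
    nodes = set(graph) | {t for targets in graph.values() for t in targets}
    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        # Rank mass held by nodes without outgoing links.
        dangling = sum(rank[v] for v in nodes if not graph.get(v))
        new_rank = {v: (1.0 - damping) / n + damping * dangling / n for v in nodes}
        for source, targets in graph.items():
            if targets:
                share = damping * rank[source] / len(targets)
                for target in targets:
                    new_rank[target] += share
        rank = new_rank
    return rank


def rank_linked_resources(graph: Graph, entity: str, alpha: float = 0.5) -> List[str]:
    """Order the resources that `entity` links to by a mixed score.

    Assumed scoring (hypothetical, not taken from the paper):
    alpha * normalized PageRank + (1 - alpha) * backlink indicator.
    """
    candidates = graph.get(entity, set())
    if not candidates:
        return []
    pr = pagerank(graph)
    max_pr = max(pr[c] for c in candidates)

    def score(candidate: str) -> float:
        backlink = 1.0 if entity in graph.get(candidate, set()) else 0.0
        return alpha * pr[candidate] / max_pr + (1.0 - alpha) * backlink

    # Highest score first; ties broken alphabetically for a stable order.
    return sorted(candidates, key=lambda c: (-score(c), c))


if __name__ == "__main__":
    # Toy graph: "e" is the entity being summarized.
    toy: Graph = {"e": {"a", "b", "c"}, "a": {"e"}, "b": set(), "c": {"b"}, "d": {"b"}}
    print(rank_linked_resources(toy, "e", alpha=0.5))
```

With `alpha` close to 1 the ordering favors globally important resources; with `alpha` close to 0 it favors resources that link back to the entity. A summary would then keep only the top few resources from this ordering.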

Keywords

  • Entity summarization
  • Linked data
  • Knowledge graph
  • Information filtering

Copyright information

© Springer International Publishing Switzerland 2016

Authors and Affiliations

  • Andreas Thalhammer (1)
  • Nelia Lasierra (2)
  • Achim Rettinger (1)

  1. AIFB, Karlsruhe Institute of Technology, Karlsruhe, Germany
  2. University for Health Sciences, Medical Informatics and Technology, Hall in Tirol, Austria
