Towards Generating Text Summaries for Entity Chains

Chhabra, Shruti; Bedathur, Srikanta

doi:10.1007/978-3-319-06028-6_12

Shruti Chhabra²² &
Srikanta Bedathur²²

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8416))

Included in the following conference series:

European Conference on Information Retrieval

2957 Accesses
2 Citations

Abstract

Given a large knowledge graph, discovering meaningful relationships between a given pair of entities has gained a lot of attention in the recent times. Most existing algorithms focus their attention on identifying one or more structures –such as relationship chains or subgraphs– between the entities. The burden of interpreting these results, after combining with contextual information and description of relationships, lies with the user. In this paper, we present a framework that eases this burden by generating a textual summary which incorporates the context and description of individual (dyadic) relationships, and combines them to generate a ranked list of summaries. We develop a model that captures key properties of a well-written text, such as coherence and information content. We focus our attention on a special class of relationship structures, two-length entity chains, and show that the generated ranked list of summaries have 79% precision at rank-1. Our results demonstrate that the generated summaries are quite useful to users.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Agrawal, R., Chakraborty, S., Gollapudi, S., Kannan, A., Kenthapadi, K.: Empowering authors to diagnose comprehension burden in textbooks. In: KDD, pp. 967–975 (2012)
Google Scholar
Anyanwu, K., Sheth, A.: The ρ operator: discovering and ranking associations on the semantic web. ACM SIGMOD Record 31(4), 42–47 (2002)
Article Google Scholar
Bizer, C., Lehmann, J., Kobilarov, G., Auer, S., Becker, C., Cyganiak, R., Hellmann, S.: Dbpedia - a crystallization point for the web of data. Web Semant. 7(3), 154–165 (2009)
Article Google Scholar
Blanco, R., Zaragoza, H.: Finding support sentences for entities. In: SIGIR, pp. 339–346 (2010)
Google Scholar
Bollacker, K., Evans, C., Paritosh, P., Sturge, T., Taylor, J.: Freebase: A collaboratively created graph database for structuring human knowledge. In: SIGMOD, pp. 1247–1250 (2008)
Google Scholar
Carlson, A., Betteridge, J., Kisiel, B., Settles, B., Hruschka Jr., E.R., Mitchell, T.M.: Toward an architecture for never-ending language learning. In: AAAI Conf. on Artifical Intelligence (2010)
Google Scholar
Chhabra, S., Bedathur, S.: Generating text summaries of graph snippets. In: COMAD, pp. 121–124 (2013)
Google Scholar
Cohen, T., Whitfield, G., Schvaneveldt, R., Mukund, K., Rindflesch, T.: Epiphanet: An interactive tool to support biomedical discoveries. Journal of Biomedical Discovery and Collaboration 5, 21–49 (2010)
Google Scholar
Etzioni, O., Fader, A., Christensen, J., Soderland, S., Mausam, M.: Open information extraction: The second generation. In: IJCAI, pp. 3–10 (2011)
Google Scholar
Fang, L., Sarma, A.D., Yu, C., Bohannon, P.: Rex: explaining relationships between entity pairs. Proc. VLDB Endow. 5(3) (2011)
Google Scholar
Finkel, J.R., Grenager, T., Manning, C.: Incorporating non-local information into information extraction systems by gibbs sampling. In: ACL, pp. 363–370 (2005)
Google Scholar
Foltz, P.W.: Latent semantic analysis for text-based research. Behavior Research Methods, Instruments, & Computers 28(2), 197–202 (1996)
Article Google Scholar
Foltz, P.W., Kintsch, W., Landauer, T.K.: The measurement of textual coherence with latent semantic analysis. Discourse Processes 25(2-3), 285–307 (1998)
Article Google Scholar
Gray, W.S., Leary, B.E.: What makes a book readable. Univ. Chicago Press (1935)
Google Scholar
Halaschek, C., Aleman-Meza, B., Arpinar, I.B., Sheth, A.P.: Discovering and ranking semantic associations over a large rdf metabase. In: VLDB, pp. 1317–1320 (2004)
Google Scholar
Hoffart, J., Suchanek, F.M., Berberich, K., Lewis-Kelham, E., de Melo, G., Weikum, G.: Yago2: Exploring and querying world knowledge in time, space, context, and many languages. In: WWW, pp. 229–232 (2011)
Google Scholar
Hristovski, D., Friedman, C., Rindflesch, T.C., Peterlin, B.: Exploiting semantic relations for literature-based discovery. In: AMIA Annual Symp., vol. 2006, pp. 349–353 (2006)
Google Scholar
Hristovski, D., Kastrin, A., Peterlin, B., Rindflesch, T.C.: Combining semantic relations and dna microarray data for novel hypotheses generation. In: Proceedings of the 2009 Workshop of the BioLink Special Interest Group, International Conference on Linking Literature, Information, and Knowledge for Biology, pp. 53–61 (2010)
Google Scholar
Jin, W., Srihari, R.K., Ho, H.H., Wu, X.: Improving knowledge discovery in document collections through combining text retrieval and link analysis techniques. In: ICDM, pp. 193–202 (2007)
Google Scholar
Kasneci, G., Ramanath, M., Sozio, M., Suchanek, F.M., Weikum, G.: Star: Steiner-tree approximation in relationship graphs. In: ICDE, pp. 868–879 (2009)
Google Scholar
Kintsch, W., Van Dijk, T.A.: Toward a model of text comprehension and production. Psychological Review 85(5), 363–394 (1978)
Article Google Scholar
Laham, T.K., Laham, D., Foltz, P.W.: Learning human-like knowledge by singular value decomposition: A progress report. In: NIPS, vol. 10, pp. 45–51 (1998)
Google Scholar
Lapata, M., Barzilay, R.: Automatic evaluation of text coherence: Models and representations. In: IJCAI, pp. 1085–1090 (2005)
Google Scholar
Lin, C.-Y.: Rouge: A package for automatic evaluation of summaries. In: Text Summarization Branches Out: Proceedings of the ACL 2004 Workshop, pp. 74–81 (2004)
Google Scholar
Nakashole, N., Weikum, G., Suchanek, F.: Patty: a taxonomy of relational patterns with semantic types. In: EMNLP, pp. 1135–1145 (2012)
Google Scholar
Pitler, E., Nenkova, A.: Revisiting readability: a unified framework for predicting text quality. In: EMNLP, pp. 186–195 (2008)
Google Scholar
Řehůřek, R., Sojka, P.: Software framework for topic modelling with large corpora. In: LREC Workshop on New Challenges for NLP Frameworks, pp. 45–50 (2010)
Google Scholar
Smalheiser, N.R., Swanson, D.R.: Indomethacin and alzheimer’s disease. Neurology 46(2), 583–583 (1996)
Article Google Scholar
Smalheiser, N.R., Swanson, D.R.: Linking estrogen to alzheimer’s disease an informatics approach. Neurology 47(3), 809–810 (1996)
Article Google Scholar
Srihari, R.K., Xu, L., Saxena, T.: Use of ranked cross document evidence trails for hypothesis generation. In: KDD, pp. 677–686 (2007)
Google Scholar
Srinivasan, P.: Text mining: generating hypotheses from medline. Journal of American Society for Information Science and Technology 55(5), 396–413 (2004)
Article Google Scholar
Swanson, D.R.: Two medical literatures that are logically but not bibliographically connected. Journal of the American Society for Information Science 38(4), 228–233 (1987)
Article Google Scholar
Swanson, D.R., Smalheiser, N.R.: An interactive system for finding complementary literatures: a stimulus to scientific discovery. Artificial Intelligence 91(2), 183–203 (1997)
Article MATH Google Scholar

Download references

Author information

Authors and Affiliations

Indraprastha Institute of Information Technology, New Delhi, India
Shruti Chhabra & Srikanta Bedathur

Authors

Shruti Chhabra
View author publications
You can also search for this author in PubMed Google Scholar
Srikanta Bedathur
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Amsterdam, Amsterdam, The Netherlands
Maarten de Rijke & Tom Kenter &
Centrum Wiskunde en Informatica, Amsterdam, The Netherlands and Delft University of Technology, Delft, The Netherlands
Arjen P. de Vries
University of Illinois at Urbana-Champaign, Urbana, IL, USA
ChengXiang Zhai
University of Twente, Twente, The Netheralnds and Erasmus University Rotterdam, Rotterdam, The Netherlands
Franciska de Jong
SalesPredict, Haifa, Israel
Kira Radinsky
Microsoft Research, Cambridge, UK
Katja Hofmann

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chhabra, S., Bedathur, S. (2014). Towards Generating Text Summaries for Entity Chains. In: de Rijke, M., et al. Advances in Information Retrieval. ECIR 2014. Lecture Notes in Computer Science, vol 8416. Springer, Cham. https://doi.org/10.1007/978-3-319-06028-6_12

Download citation

DOI: https://doi.org/10.1007/978-3-319-06028-6_12
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-06027-9
Online ISBN: 978-3-319-06028-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics