Abstract
Given a large knowledge graph, discovering meaningful relationships between a given pair of entities has gained a lot of attention in the recent times. Most existing algorithms focus their attention on identifying one or more structures –such as relationship chains or subgraphs– between the entities. The burden of interpreting these results, after combining with contextual information and description of relationships, lies with the user. In this paper, we present a framework that eases this burden by generating a textual summary which incorporates the context and description of individual (dyadic) relationships, and combines them to generate a ranked list of summaries. We develop a model that captures key properties of a well-written text, such as coherence and information content. We focus our attention on a special class of relationship structures, two-length entity chains, and show that the generated ranked list of summaries have 79% precision at rank-1. Our results demonstrate that the generated summaries are quite useful to users.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agrawal, R., Chakraborty, S., Gollapudi, S., Kannan, A., Kenthapadi, K.: Empowering authors to diagnose comprehension burden in textbooks. In: KDD, pp. 967–975 (2012)
Anyanwu, K., Sheth, A.: The ρ operator: discovering and ranking associations on the semantic web. ACM SIGMOD Record 31(4), 42–47 (2002)
Bizer, C., Lehmann, J., Kobilarov, G., Auer, S., Becker, C., Cyganiak, R., Hellmann, S.: Dbpedia - a crystallization point for the web of data. Web Semant. 7(3), 154–165 (2009)
Blanco, R., Zaragoza, H.: Finding support sentences for entities. In: SIGIR, pp. 339–346 (2010)
Bollacker, K., Evans, C., Paritosh, P., Sturge, T., Taylor, J.: Freebase: A collaboratively created graph database for structuring human knowledge. In: SIGMOD, pp. 1247–1250 (2008)
Carlson, A., Betteridge, J., Kisiel, B., Settles, B., Hruschka Jr., E.R., Mitchell, T.M.: Toward an architecture for never-ending language learning. In: AAAI Conf. on Artifical Intelligence (2010)
Chhabra, S., Bedathur, S.: Generating text summaries of graph snippets. In: COMAD, pp. 121–124 (2013)
Cohen, T., Whitfield, G., Schvaneveldt, R., Mukund, K., Rindflesch, T.: Epiphanet: An interactive tool to support biomedical discoveries. Journal of Biomedical Discovery and Collaboration 5, 21–49 (2010)
Etzioni, O., Fader, A., Christensen, J., Soderland, S., Mausam, M.: Open information extraction: The second generation. In: IJCAI, pp. 3–10 (2011)
Fang, L., Sarma, A.D., Yu, C., Bohannon, P.: Rex: explaining relationships between entity pairs. Proc. VLDB Endow. 5(3) (2011)
Finkel, J.R., Grenager, T., Manning, C.: Incorporating non-local information into information extraction systems by gibbs sampling. In: ACL, pp. 363–370 (2005)
Foltz, P.W.: Latent semantic analysis for text-based research. Behavior Research Methods, Instruments, & Computers 28(2), 197–202 (1996)
Foltz, P.W., Kintsch, W., Landauer, T.K.: The measurement of textual coherence with latent semantic analysis. Discourse Processes 25(2-3), 285–307 (1998)
Gray, W.S., Leary, B.E.: What makes a book readable. Univ. Chicago Press (1935)
Halaschek, C., Aleman-Meza, B., Arpinar, I.B., Sheth, A.P.: Discovering and ranking semantic associations over a large rdf metabase. In: VLDB, pp. 1317–1320 (2004)
Hoffart, J., Suchanek, F.M., Berberich, K., Lewis-Kelham, E., de Melo, G., Weikum, G.: Yago2: Exploring and querying world knowledge in time, space, context, and many languages. In: WWW, pp. 229–232 (2011)
Hristovski, D., Friedman, C., Rindflesch, T.C., Peterlin, B.: Exploiting semantic relations for literature-based discovery. In: AMIA Annual Symp., vol. 2006, pp. 349–353 (2006)
Hristovski, D., Kastrin, A., Peterlin, B., Rindflesch, T.C.: Combining semantic relations and dna microarray data for novel hypotheses generation. In: Proceedings of the 2009 Workshop of the BioLink Special Interest Group, International Conference on Linking Literature, Information, and Knowledge for Biology, pp. 53–61 (2010)
Jin, W., Srihari, R.K., Ho, H.H., Wu, X.: Improving knowledge discovery in document collections through combining text retrieval and link analysis techniques. In: ICDM, pp. 193–202 (2007)
Kasneci, G., Ramanath, M., Sozio, M., Suchanek, F.M., Weikum, G.: Star: Steiner-tree approximation in relationship graphs. In: ICDE, pp. 868–879 (2009)
Kintsch, W., Van Dijk, T.A.: Toward a model of text comprehension and production. Psychological Review 85(5), 363–394 (1978)
Laham, T.K., Laham, D., Foltz, P.W.: Learning human-like knowledge by singular value decomposition: A progress report. In: NIPS, vol. 10, pp. 45–51 (1998)
Lapata, M., Barzilay, R.: Automatic evaluation of text coherence: Models and representations. In: IJCAI, pp. 1085–1090 (2005)
Lin, C.-Y.: Rouge: A package for automatic evaluation of summaries. In: Text Summarization Branches Out: Proceedings of the ACL 2004 Workshop, pp. 74–81 (2004)
Nakashole, N., Weikum, G., Suchanek, F.: Patty: a taxonomy of relational patterns with semantic types. In: EMNLP, pp. 1135–1145 (2012)
Pitler, E., Nenkova, A.: Revisiting readability: a unified framework for predicting text quality. In: EMNLP, pp. 186–195 (2008)
Řehůřek, R., Sojka, P.: Software framework for topic modelling with large corpora. In: LREC Workshop on New Challenges for NLP Frameworks, pp. 45–50 (2010)
Smalheiser, N.R., Swanson, D.R.: Indomethacin and alzheimer’s disease. Neurology 46(2), 583–583 (1996)
Smalheiser, N.R., Swanson, D.R.: Linking estrogen to alzheimer’s disease an informatics approach. Neurology 47(3), 809–810 (1996)
Srihari, R.K., Xu, L., Saxena, T.: Use of ranked cross document evidence trails for hypothesis generation. In: KDD, pp. 677–686 (2007)
Srinivasan, P.: Text mining: generating hypotheses from medline. Journal of American Society for Information Science and Technology 55(5), 396–413 (2004)
Swanson, D.R.: Two medical literatures that are logically but not bibliographically connected. Journal of the American Society for Information Science 38(4), 228–233 (1987)
Swanson, D.R., Smalheiser, N.R.: An interactive system for finding complementary literatures: a stimulus to scientific discovery. Artificial Intelligence 91(2), 183–203 (1997)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Chhabra, S., Bedathur, S. (2014). Towards Generating Text Summaries for Entity Chains. In: de Rijke, M., et al. Advances in Information Retrieval. ECIR 2014. Lecture Notes in Computer Science, vol 8416. Springer, Cham. https://doi.org/10.1007/978-3-319-06028-6_12
Download citation
DOI: https://doi.org/10.1007/978-3-319-06028-6_12
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-06027-9
Online ISBN: 978-3-319-06028-6
eBook Packages: Computer ScienceComputer Science (R0)