Incorporating Cross-Document Relationships Between Sentences for Single Document Summarizations

  • Xiaojun Wan
  • Jianwu Yang
  • Jianguo Xiao
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4172)


Graph-based ranking algorithms have recently been proposed for single document summarizations and such algorithms evaluate the importance of a sentence by making use of the relationships between sentences in the document in a recursive way. In this paper, we investigate using other related or relevant documents to improve summarization of one single document based on the graph-based ranking algorithm. In addition to the within-document relationships between sentences in the specified document, the cross-document relationships between sentences in different documents are also taken into account in the proposed approach. We evaluate the performance of the proposed approach on DUC 2002 data with the ROUGE metric and results demonstrate that the cross-document relationships between sentences in different but related documents can significantly improve the performance of single document summarization.


Latent Semantic Analysis Related Document Single Document Affinity Matrix Longe Common Subsequence 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Amini, M.R., Gallinari, P.: The Use of Unlabeled Data to Improve Supervised Learning for Text Summarization. In: Proceedings of SIGIR 2002, pp. 105–112 (2002)Google Scholar
  2. 2.
    Brin, S., Page, L.: The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems 30, 1–7 (1998)CrossRefGoogle Scholar
  3. 3.
    Carbonell, J., Goldstein, J.: The Use of MMR, Diversity-based Reranking for Reordering Documents and Producing Summaries. In: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 335–336 (1998)Google Scholar
  4. 4.
    Conroy, J.M., O’Leary, D.P.: Text Summarization via Hidden Markov Models. In: Proceedings of SIGIR 2001, pp. 406–407 (2001)Google Scholar
  5. 5.
    Edmundson, H.P.: New Methods in Automatic Abstracting. Journal of the Association for computing Machinery 16(2), 264–285 (1969)zbMATHGoogle Scholar
  6. 6.
    ErKan, G., Radev, D.: LexPageRank: Prestige in Multi-Document Text Summarization. In: Proceedings of EMNLP 2004 (2004)Google Scholar
  7. 7.
    Gong, Y.H., Liu, X.: Generic Text Summarization Using Relevance Measure and Latent Semantic Analysis. In: Proceedings of SIGIR 2001, pp. 19–25 (2001)Google Scholar
  8. 8.
    Hovy, E., Lin, C.Y.: Automated Text Summarization in SUMMARIST. In: Proceeding of ACL 1997/EACL 1997 Worshop on Intelligent Scalable Text Summarization (1997)Google Scholar
  9. 9.
    Jing, H.: Sentence Reduction for Automatic Text Summarization. In: Proceedings of ANLP 2000 (2000)Google Scholar
  10. 10.
    Jing, H., McKeown, K.R.: Cut and Paste Based Text Summarization. In: Proceedings of NAACL 2000, pp. 178–185 (2000)Google Scholar
  11. 11.
    Jones, W.P., Furnas, G.W.: Pictures of relevance: a geometric analysis of similarity measure. Journal of the American Society for Information Science 38(6), 420–442 (1987)CrossRefGoogle Scholar
  12. 12.
    Kleinberg, J.M.: Authoritative sources in a hyperlinked environment. Journal of the ACM 46(5), 604–632 (1999)zbMATHCrossRefMathSciNetGoogle Scholar
  13. 13.
    Knight, K., Marcu, D.: Summarization beyond Sentence Extraction: A Probabilistic Approach to Sentence Compression. Artificial Intelligence 139(1), 91–107 (2002)zbMATHCrossRefMathSciNetGoogle Scholar
  14. 14.
    Kupiec, J., Pedersen, J., Chen, F.: A Trainable Document Summarizer. In: Proceedings of SIGIR 1995, pp. 68–73 (1995)Google Scholar
  15. 15.
    Lin, C.Y., Hovy, E.: Automatic Evaluation of Summaries Using N-Gram Co-Occurrence Statistics. In: Proceedings of HLT-NAACL 2003 (2003)Google Scholar
  16. 16.
    Lin, C.Y., Hovy, E.: The Automated Acquisition of Topic Signatures for Text Summarization. In: Proceedings of the 17th Conference on Computational Linguistics, pp. 495–501 (2000)Google Scholar
  17. 17.
    Luhn, H.P.: The Automatic Creation of literature Abstracts. IBM Journal of Research and Development 2(2) (1969)Google Scholar
  18. 18.
    McDonald, D., Chen, H.: Using Sentence-Selection Heuristics to Rank Text Segment in TXTRACTOR. In: Proceedings of JCDL 2002, pp. 28–35 (2002)Google Scholar
  19. 19.
    Mihalcea, R., Tarau, P.: TextRank: Bringing Order into Texts. In: Proceedings of EMNLP 2004 (2004)Google Scholar
  20. 20.
    Mihalcea, R., Tarau, P.: A language independent algorithm for single and multiple document summarization. In: Proceedings of IJCNLP 2005 (2005)Google Scholar
  21. 21.
    Nomoto, T., Matsumoto, Y.: A New Approach to Unsupervised Text Summarization. In: Proceedings of SIGIR 2001, pp. 26–34 (2001)Google Scholar
  22. 22.
    Silber, H.G., McCoy, K.: Efficient Text Summarization Using Lexical Chains. In: Proceedings of the 5th International Conference on Intelligent User Interfaces, pp. 252–255 (2000)Google Scholar
  23. 23.
    Zha, H.Y.: Generic Summarization and Keyphrase Extraction Using Mutual Reinforcement Principle and Sentence Clustering. In: Proceedings of SIGIR 2002, pp. 113–120 (2002)Google Scholar
  24. 24.
    Zhang, B., Li, H., Liu, Y., Ji, L., Xi, W., Fan, W., Chen, Z., Ma, W.-Y.: Improving web search results using affinity graph. In: Proceedings of SIGIR 2005 (2005)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Xiaojun Wan
    • 1
  • Jianwu Yang
    • 1
  • Jianguo Xiao
    • 1
  1. 1.Institute of Computer Science and TechnologyPeking UniversityBeijingChina

Personalised recommendations