Building Document Graphs for Multiple News Articles Summarization: An Event-Based Approach

  • Wei Xu
  • Chunfa Yuan
  • Wenjie Li
  • Mingli Wu
  • Kam-Fai Wong
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4285)


Since most of news articles report several events and these events are referred in many related documents, we propose an event-based approach to visualize documents as graph on different conceptual granularities. With graph-based ranking algorithm, we illustrate the application of document graph to multi-document summarization. Experiments on DUC data indicate that our approach is competitive with state-of-the-art summarization techniques. This graphical representation which does not require training corpora can be potentially adapted to other languages.


Event Element News Article Ranking Algorithm Document Graph Name Entity 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Page, L., Brin, S.: The Anatomy of a Large-scale Hypertextual Web Search Engine. Computer Networks and ISDN Systems 30, 107–117 (1998)CrossRefGoogle Scholar
  2. 2.
    Dom, B., Eiron, I., Cozzi, A., Shang, Y.: Graph-based ranking algorithms for e-mail expertise analysis. In: Proceedings of the 8th ACM SIGMOD workshop on Research Issues in Data Mining and Knowledge Discovery, pp. 42–48 (2003)Google Scholar
  3. 3.
    Mihalcea, R.: Graph-based Ranking Algorithms for Sentence Extraction, Applied to Text Summarization. In: Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, pp. 170–173 (2004)Google Scholar
  4. 4.
    Erkan, G., Radev, D.R.: LexRank: Graph-based lexical as Salience in Text Summarization. Journal of Artificial Intelligence Research 22, 457–479 (2004)Google Scholar
  5. 5.
    Vanderwende, L., Banko, M., Menezes, A.: Event-Centric Summary Generation. In: Proceedings of the Document Understanding Conference Workshop (2004)Google Scholar
  6. 6.
    Yoshioka, M., Haraguchi, M.: Multiple News Articles Summarization based on Event Reference Information. In: Working Notes of the 4th NTCIR Workshop (2004)Google Scholar
  7. 7.
    Filatova, E., Hatzivassiloglou, V.: Event-based Extractive Summarization. In: Proceedings of ACL Workshop on Summarization, pp. 104–111 (2004)Google Scholar
  8. 8.
    Bradley, J., Rockwell, G.: What Scientific Visualization Teaches Us about Text Analysis. In: ALLC/ACH Conference (1994)Google Scholar
  9. 9.
    Li, W., Xu, W., Wu, M., Yuan, C., Lu, Q.: Extractive Summarization using Inter- and Intra- Event Relevance. In: Proceedings of COLING-ACL (2006)Google Scholar
  10. 10.
    Lin, C., Hovy, E.: Automatic Evaluation of Summaries Using N-gram Co-occurrence Statistics. In: Proceeding of HLT-NAACL, pp. 71–78 (2003)Google Scholar
  11. 11.
    Radev, D.R., Jing, H., Stys, M., Tam, D.: Centroid-based Summarization of Multiple Documents. Information Processing and Management 40, 919–938 (2004)MATHCrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Wei Xu
    • 1
  • Chunfa Yuan
    • 1
  • Wenjie Li
    • 2
  • Mingli Wu
    • 2
  • Kam-Fai Wong
    • 3
  1. 1.Department of Computer Science and TechnologyTsinghua UniversityChina
  2. 2.Department of ComputingThe Hong Kong Polytechnic UniversityHong Kong
  3. 3.Department of System EngineeringThe Chinese University of Hong KongHong Kong

Personalised recommendations