A Hybrid Model for Sentence Ordering in Extractive Multi-document Summarization

  • Dexi Liu
  • Zengchang Zhang
  • Yanxiang He
  • Donghong Ji
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4182)


Ordering information is a critical task for multi-document summarization because it heavily influent the coherence of the generated summary. In this paper, we propose a hybrid model for sentence ordering in extractive multi-document summarization that combines four relations between sentences. This model regards sentence as vertex and combined relation as edge of a directed graph on which the approximately optimal ordering can be generated with PageRank analysis. Evaluation of our hybrid model shows a significant improvement of the ordering over strategies losing some relations and the results also indicate that this hybrid model is robust for articles with different genre.


Hybrid Model Combine Relation Precedence Graph Sentence Order Chronological Relation 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Mani, I.: Automatic Summarization. John Benjamins, Amsterdam (2001)MATHGoogle Scholar
  2. 2.
    Radev, D.R., Hovy, E.H., McKeown, K.: Introduction to the Special Issue on Summarization. Computational Linguistics 28(4), 399–408 (2002)CrossRefGoogle Scholar
  3. 3.
    Okazaki, N., Matsuo, Y., Ishizuka, M.: Coherent Arrangement of Sentences Extracted from Multiple Newspaper Articles. In: Zhang, C., W. Guesgen, H., Yeap, W.-K. (eds.) PRICAI 2004. LNCS (LNAI), vol. 3157, pp. 882–891. Springer, Heidelberg (2004)CrossRefGoogle Scholar
  4. 4.
    Barzilay, R., Elhadad, E., McKeown, K.: Inferring strategies for sentence ordering in multidocument summarization. Journal of Artificial Intelligence Research 17, 35–55 (2002)MATHGoogle Scholar
  5. 5.
    Lapata, M.: Probabilistic text structuring: experiments with sentence ordering. In: Proceedings of the 41st Meeting of the Association of Computational Linguistics, pp. 545–552 (2003)Google Scholar
  6. 6.
    Brin, S., Page, L.: The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems 30, 1–7 (1998)CrossRefGoogle Scholar
  7. 7.
    Paul, O., James, Y.: An Introduction to DUC-2004. In: Proceedings of the 4th Document Understanding Conference, DUC 2004 (2004)Google Scholar
  8. 8.
    Lin, C.Y., Hovy, E.: Automatic Evaluation of Summaries Using N-gram Co-Occurrence Statistics. In: Proceedings of the Human Technology Conference (HLTNAACL 2003), Edmonton, Canada (2003)Google Scholar
  9. 9.
    Lebanon, G., Lafferty, J.: Combining rankings using conditional probability models on permutations. In: Proceedings of the 19th International Conference on Machine Learning, pp. 363–370. Morgan Kaufmann Publishers, San Francisco (2002)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Dexi Liu
    • 1
    • 2
  • Zengchang Zhang
    • 1
  • Yanxiang He
    • 2
  • Donghong Ji
    • 3
  1. 1.School of PhysicsXiangfan UniversityXiangfanP.R. China
  2. 2.School of ComputerWuhan UniversityWuhanP.R. China
  3. 3.Institute for Infocomm ResearchSingapore

Personalised recommendations