Advertisement

A Hybrid Sentence Ordering Strategy in Multi-document Summarization

  • Yanxiang He
  • Dexi Liu
  • Hua Yang
  • Donghong Ji
  • Chong Teng
  • Wenqing Qi
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4255)

Abstract

In extractive summarization, a proper arrangement of extracted sentences must be found if we want to generate a logical, coherent and readable summary. This issue is special in multi-document summarization. In this paper, several existing methods each of which generate a reference relation are combined through linear combination of the resulting relations. We use 4 types of relationships between sentences (chronological relation, positional relation, topical relation and dependent relation) to build a graph model where the vertices are sentences and edges are weighed relationships of the 4 types. And then apply a variation of page rank to get the ordering of sentences for multi-document summaries. We tested our hybrid model with two automatic methods: distance to manual ordering and ROUGE score. Evaluation results show a significant improvement of the ordering over strategies losing some relations. The results also indicate that this hybrid model is robust for articles with different genre which were used on DUC2004 and DUC2005.

Keywords

Hybrid Model Dependent Relation Precedence Graph Chronological Relation Document Summarization 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Mani, I.: Automatic Summarization. John Benjamins (2001) Google Scholar
  2. 2.
    Radev, D.R., Hovy, E.H., McKeown, K.: Introduction to the Special Issue on Summarization. Computational Linguistics 28(4), 399–408 (2002)CrossRefGoogle Scholar
  3. 3.
    Okazaki, N., Matsuo, Y., Ishizuka, M.: Coherent Arrangement of Sentences Extracted from Multiple Newspaper Articles. In: Zhang, C., Guesgen, H.W., Yeap, W.-K. (eds.) PRICAI 2004. LNCS (LNAI), vol. 3157, pp. 882–891. Springer, Heidelberg (2004)CrossRefGoogle Scholar
  4. 4.
    Barzilay, R., Elhadad, E., McKeown, K.: Inferring strategies for sentence ordering in multidocument summarization. Journal of Artificial Intelligence Research (JAIR) 17, 35–55 (2002)MATHGoogle Scholar
  5. 5.
    Lapata, M.: Probabilistic text structuring: experiments with sentence ordering. In: Proceedings of the 41st Meeting of the Association of Computational Linguistics, pp. 545–552 (2003)Google Scholar
  6. 6.
    Asher, N., Lascarides, A.: Logics of Conversation. Cambridge University Press, Cambridge (2003)Google Scholar
  7. 7.
    Grosz, B., Joshi, A., Weinstein, S.: Centering: A framework for modeling the local coherence of discourse. Computational Linguistics 21(2), 203–225 (1995)Google Scholar
  8. 8.
    Cohen, W., Schapire, R., Singer, Y.: Learning to order things. Journal of Artificial Intelligence 10, 243–270 (1999)MathSciNetMATHGoogle Scholar
  9. 9.
    Galil, Z., Megido, N.: Cyclic ordering is np-complete. Theoretical Computer Science 5, 179–182 (1977)CrossRefMathSciNetGoogle Scholar
  10. 10.
    Brin, S., Page, L.: The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems 30, 1–7 (1998)CrossRefGoogle Scholar
  11. 11.
    Mihalcea, R., Tarau, P.: A Language Independent Algorithm for Single and Multiple Document Summarization (2005), http://acl.ldc.upenn.edu/I/I05/
  12. 12.
    Paul, O., James, Y.: An Introduction to DUC 2004. In: Proceedings of the 4th Document Understanding Conference (DUC 2004) (2004)Google Scholar
  13. 13.
    Lin, C.Y., Hovy, E.: Automatic Evaluation of Summaries Using N-gram Co-Occurrence Statistics. In: Proceedings of the Human Technology Conference (HLTNAACL 2003), Edmonton, Canada (2003)Google Scholar
  14. 14.
    Dang, H.T.: Overview of DUC 2005 (DRAFT). In: Proceedings of the 5th Document Understanding Conference (DUC 2005) (2005)Google Scholar
  15. 15.
    Lebanon, G., Lafferty, J.: Combining rankings using conditional probability models on permutations. In: Proceedings of the 19th International Conference on Machine Learning, pp. 363–370. Morgan Kaufmann Publishers, San Francisco (2002)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Yanxiang He
    • 1
  • Dexi Liu
    • 1
    • 2
  • Hua Yang
    • 1
    • 2
  • Donghong Ji
    • 2
    • 3
  • Chong Teng
    • 1
    • 2
  • Wenqing Qi
    • 1
    • 4
  1. 1.School of ComputerWuhan UniversityWuhanP.R. China
  2. 2.Center for Study of Language and InformationWuhan UniversityWuhanP.R. China
  3. 3.Institute for Infocomm Research, Heng Mui Keng TerraceSingapore
  4. 4.Huangshi Institute of TechnologyHuangshiP.R. China

Personalised recommendations