A Hybrid Model for Sentence Ordering in Extractive Multi-document Summarization
Ordering information is a critical task for multi-document summarization because it heavily influent the coherence of the generated summary. In this paper, we propose a hybrid model for sentence ordering in extractive multi-document summarization that combines four relations between sentences. This model regards sentence as vertex and combined relation as edge of a directed graph on which the approximately optimal ordering can be generated with PageRank analysis. Evaluation of our hybrid model shows a significant improvement of the ordering over strategies losing some relations and the results also indicate that this hybrid model is robust for articles with different genre.
KeywordsHybrid Model Combine Relation Precedence Graph Sentence Order Chronological Relation
Unable to display preview. Download preview PDF.
- 5.Lapata, M.: Probabilistic text structuring: experiments with sentence ordering. In: Proceedings of the 41st Meeting of the Association of Computational Linguistics, pp. 545–552 (2003)Google Scholar
- 7.Paul, O., James, Y.: An Introduction to DUC-2004. In: Proceedings of the 4th Document Understanding Conference, DUC 2004 (2004)Google Scholar
- 8.Lin, C.Y., Hovy, E.: Automatic Evaluation of Summaries Using N-gram Co-Occurrence Statistics. In: Proceedings of the Human Technology Conference (HLTNAACL 2003), Edmonton, Canada (2003)Google Scholar
- 9.Lebanon, G., Lafferty, J.: Combining rankings using conditional probability models on permutations. In: Proceedings of the 19th International Conference on Machine Learning, pp. 363–370. Morgan Kaufmann Publishers, San Francisco (2002)Google Scholar