Soft Computing, Volume 21, Issue 7, pp 1785–1801

Query-based multi-documents summarization using linguistic knowledge and content word expansion

  • Asad Abdi
  • Norisma Idris
  • Rasim M. Alguliyev
  • Ramiz M. Aliguliyev
Methodologies and Application

DOI: 10.1007/s00500-015-1881-4

Cite this article as:
Abdi, A., Idris, N., Alguliyev, R.M. et al. Soft Comput (2017) 21: 1785. doi:10.1007/s00500-015-1881-4

Abstract

This paper introduces a query-based summarization method that combines semantic relations between words with their syntactic composition to extract meaningful sentences from document sets. Current statistical methods fail to capture meaning when comparing a sentence with a user query, so the extracted sentences often conflict with users’ requirements. The proposed method can improve the quality of document summaries because it avoids extracting a sentence whose similarity with the query is high but whose meaning is different. It works by computing semantic and syntactic similarity for both sentence-to-sentence and sentence-to-query pairs. To reduce redundancy in the summary, the method uses a greedy algorithm to impose a diversity penalty on the sentences. In addition, the proposed method expands the words in both the query and the sentences to address the problem of limited information, bridging the lexical gaps between semantically similar contexts that are expressed in different wording. The experimental results show that the proposed method improves performance compared with the participating systems in DUC 2006, and that it performs better than other existing techniques on the DUC 2005 and DUC 2006 datasets.
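The greedy selection with a diversity penalty mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the scoring formula, so the function names (`jaccard`, `greedy_select`), the `penalty` weight, and the use of token-set overlap as a stand-in for the paper's semantic and syntactic similarity are all assumptions made for illustration.

```python
from typing import Callable, List


def jaccard(a: str, b: str) -> float:
    """Stand-in similarity: token-set overlap (an assumption, not the paper's measure)."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def greedy_select(
    sentences: List[str],
    query: str,
    k: int,
    penalty: float = 0.5,  # hypothetical weight; the abstract does not specify one
    sim: Callable[[str, str], float] = jaccard,
) -> List[str]:
    """Pick up to k sentences, discounting candidates similar to those already chosen."""
    # Initial score: relevance of each sentence to the query.
    scores = [sim(s, query) for s in sentences]
    remaining = set(range(len(sentences)))
    chosen: List[int] = []
    while remaining and len(chosen) < k:
        # Take the highest-scoring remaining sentence (ties go to the earlier one).
        best = max(remaining, key=lambda i: (scores[i], -i))
        chosen.append(best)
        remaining.remove(best)
        # Diversity penalty: lower the score of sentences that resemble the pick.
        for i in remaining:
            scores[i] -= penalty * sim(sentences[i], sentences[best])
    return [sentences[i] for i in chosen]
```

With a near-duplicate pair of sentences in the input, the penalty pushes the second copy below an unrelated but relevant sentence; setting `penalty=0.0` reduces the procedure to plain relevance ranking. A real system would pass a semantic similarity function through the `sim` parameter instead of token overlap.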

Keywords

  • Query-based multi-document summarization
  • Graph-based sentence ranking
  • Query expansion
  • Extractive summarization

Copyright information

© Springer-Verlag Berlin Heidelberg 2015

Authors and Affiliations

  • Asad Abdi (1)
  • Norisma Idris (1)
  • Rasim M. Alguliyev (2)
  • Ramiz M. Aliguliyev (2)
  1. Department of Artificial Intelligence, Faculty of Computer Science and Information Technology, University of Malaya, Kuala Lumpur, Malaysia
  2. Institute of Information Technology, Azerbaijan National Academy of Sciences, Baku, Azerbaijan
