Abstract
Multi document summarization is the process of automatic creation of a summary of one or more text documents. We developed a multi-document summarization system which generate an extractive generic summary with maximum relevance and minimum redundancy. To achieve this, four features associated with sentences, that can influence the summarization process are extracted. It is difficult to find the appropriate weights corresponding to the features, which leads to good results. We propose a metaheuristic optimization based on solution population with multiple objective functions. The objective functions used takes care of both the statistical and semantic aspects of the documents. Our population based optimization converges rapidly to produce candidate sentences for summary. Evaluation of the proposed system is performed on DUC 2002 dataset using ROGUE tool kit. Experimental results shows that our system outperforms the state of the art works in terms of Recall and Precision.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Alguliev, R.M., Aliguliyev, R.M., Isazade, N.R.: Multiple documents summarization based on evolutionary optimization algorithm. Expert Syst. Appl. 40(5), 1675–1689 (2013)
Dunlavy, D.M., OLeary, D.P., Conroy, J.M., Schlesinger, J.D.: Qcs: a system for querying, clustering and summarizing documents. Inf. Process. Manage. 43(6), 1588–1605 (2007)
Erkan, G., Radev, D.R.: Lexrank: graph-based lexical centrality as salience in text summarization. J. Artif. Intell. Res. 22, 457–479 (2004)
Fattah, M.A., Ren, F.: Ga, mr, ffnn, pnn and gmm based models for automatic text summarization. Comput. Speech Lang. 23(1), 126–144 (2009)
Gong, Y., Liu, X.: Generic text summarization using relevance measure and latent semantic analysis. In: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 19–25. ACM (2001)
Gupta, A., Kaur, M., Singh, A., Goel, A., Mirkin, S.: Text summarization through entailment-based minimum vertex cover. In: Lexical and Computational Semantics (* SEM 2014), p. 75 (2014)
Hammouda, K.M., Kamel, M.S.: Models of distributed data clustering in peer-to-peer environments. Knowl. Inf. Syst. 38(2), 303–329 (2014)
Hennig, L., Labor, D.: Topic-based multi-document summarization with probabilistic latent semantic analysis. In: RANLP, pp. 144–149 (2009)
Jing, H.: Sentence reduction for automatic text summarization. In: Proceedings of the Sixth Conference on Applied Natural Language Processing, pp. 310–315. Association for Computational Linguistics (2000)
Knight, K., Marcu, D.: Summarization beyond sentence extraction: a probabilistic approach to sentence compression. Artif. Intell. 139(1), 91–107 (2002)
Ledeneva, Y., GarcÃa-Hernández, R.A., Gelbukh, A.: Graph ranking on maximal frequent sequences for single extractive text summarization. In: Gelbukh, A. (ed.) CICLing 2014, Part II. LNCS, vol. 8404, pp. 466–480. Springer, Heidelberg (2014)
Lin, C.-Y.: Rouge: a package for automatic evaluation of summaries. In: Proceedings of the ACL-2004 Workshop on Text Summarization Branches Out, vol. 8 (2004)
Lovins, J.B.: Development of a Stemming Algorithm. MIT Information Processing Group, Electronic Systems Laboratory (1968)
Mishra, R., Bian, J., Fiszman, M., Weir, C.R., Jonnalagadda, S., Mostafa, J., Fiol, D.G.: Text summarization in the biomedical domain: a systematic review of recent research. J. Biomed. Inf. 52, 457–467 (2014)
Nishino, M., Yasuda, N., Hirao, T., Minato, S., Nagata, M.: A dynamic programming algorithm for tree trimming-based text summarization. In: Proceedings of NAACL HLT, pp. 462–471 (2015)
Patil, A., Pharande, K., Nale, D., Agrawal, R.: Automatic text summarization. Int. J. Comput. Appl. 109(17), 975–8887 (2015)
Silva, G., Ferreira, R., Lins, R.D., Cabral, L., Oliveira, H., Simske, S.J., Riss, M.: Automatic text document summarization based on machine learning. In: Proceedings of the 2015 ACM Symposium on Document Engineering, DocEng 2015, pp. 191–194 (2015)
Steinberger, J., Ježek, K.: Text summarization and singular value decomposition. In: Yakhno, T. (ed.) ADVIS 2004. LNCS, vol. 3261, pp. 245–254. Springer, Heidelberg (2004)
Van Rijsbergen, C.J., Robertson, S.E., Porter, M.F.: New models in probabilistic information retrieval. Computer Laboratory, University of Cambridge (1980)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Premjith, P.S., John, A., Wilscy, M. (2015). Metaheuristic Optimization Using Sentence Level Semantics for Extractive Document Summarization. In: Prasath, R., Vuppala, A., Kathirvalavakumar, T. (eds) Mining Intelligence and Knowledge Exploration. MIKE 2015. Lecture Notes in Computer Science(), vol 9468. Springer, Cham. https://doi.org/10.1007/978-3-319-26832-3_33
Download citation
DOI: https://doi.org/10.1007/978-3-319-26832-3_33
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26831-6
Online ISBN: 978-3-319-26832-3
eBook Packages: Computer ScienceComputer Science (R0)