Metaheuristic Optimization Using Sentence Level Semantics for Extractive Document Summarization

Premjith, P. S.; John, Ansamma; Wilscy, M.

doi:10.1007/978-3-319-26832-3_33

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 9468)

Included in the following conference series:

International Conference on Mining Intelligence and Knowledge Exploration

Abstract

Multi-document summarization is the automatic creation of a summary of one or more text documents. We developed a multi-document summarization system that generates an extractive, generic summary with maximum relevance and minimum redundancy. To achieve this, four sentence-level features that can influence the summarization process are extracted. Finding feature weights that lead to good results is difficult. We propose a population-based metaheuristic optimization with multiple objective functions. The objective functions take into account both the statistical and the semantic aspects of the documents. Our population-based optimization converges rapidly to produce candidate sentences for the summary. The proposed system is evaluated on the DUC 2002 dataset using the ROUGE toolkit. Experimental results show that our system outperforms state-of-the-art methods in terms of recall and precision.
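
The recall and precision figures mentioned in the abstract come from ROUGE, which compares n-grams in a system summary against a human reference summary. The following is a minimal sketch of that idea, not the ROUGE toolkit itself: the function names `ngrams` and `rouge_n` are illustrative, and the sketch assumes lowercased whitespace tokens, a single reference summary, and no stemming or stop-word removal.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return a multiset (Counter) of the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n(candidate, reference, n=1):
    """Return (recall, precision) from clipped n-gram overlap.

    Assumption: whitespace tokenization and one reference summary;
    the real toolkit offers more options than this sketch covers.
    """
    cand = ngrams(candidate.lower().split(), n)
    ref = ngrams(reference.lower().split(), n)
    # Counter intersection keeps the minimum count per n-gram (clipping).
    overlap = sum((cand & ref).values())
    recall = overlap / sum(ref.values()) if ref else 0.0
    precision = overlap / sum(cand.values()) if cand else 0.0
    return recall, precision


# Hypothetical example sentences, not taken from DUC 2002.
recall, precision = rouge_n("the cat sat on the mat", "the cat lay on the mat")
```

Recall divides the overlap by the number of n-grams in the reference, so it rewards coverage; precision divides by the number of n-grams in the system summary, so it penalizes padding.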

Author information

Authors and Affiliations

Department of CSE, TKM College of Engineering, Kollam, India
P. S. Premjith & Ansamma John
Department of CSE, University of Kerala, Kariavattom Campus, Trivandrum, India
M. Wilscy

Corresponding author

Correspondence to Ansamma John.

Editor information

Editors and Affiliations

Norwegian Univ. of Science & Technology, Trondheim, Norway
Rajendra Prasath
Intl Inst of Info Tech Hyderabad, Hyderabad, India
Anil Kumar Vuppala
V.H.N.S.N.College (Autonomous), Virudhunagar, Tamil Nadu, India
T. Kathirvalavakumar

About this paper

Cite this paper

Premjith, P.S., John, A., Wilscy, M. (2015). Metaheuristic Optimization Using Sentence Level Semantics for Extractive Document Summarization. In: Prasath, R., Vuppala, A., Kathirvalavakumar, T. (eds) Mining Intelligence and Knowledge Exploration. MIKE 2015. Lecture Notes in Computer Science, vol 9468. Springer, Cham. https://doi.org/10.1007/978-3-319-26832-3_33

DOI: https://doi.org/10.1007/978-3-319-26832-3_33
Published: 03 January 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26831-6
Online ISBN: 978-3-319-26832-3
eBook Packages: Computer Science, Computer Science (R0)
