Skip to main content

Metaheuristic Optimization Using Sentence Level Semantics for Extractive Document Summarization

  • Conference paper
  • First Online:
Mining Intelligence and Knowledge Exploration (MIKE 2015)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9468))

Abstract

Multi document summarization is the process of automatic creation of a summary of one or more text documents. We developed a multi-document summarization system which generate an extractive generic summary with maximum relevance and minimum redundancy. To achieve this, four features associated with sentences, that can influence the summarization process are extracted. It is difficult to find the appropriate weights corresponding to the features, which leads to good results. We propose a metaheuristic optimization based on solution population with multiple objective functions. The objective functions used takes care of both the statistical and semantic aspects of the documents. Our population based optimization converges rapidly to produce candidate sentences for summary. Evaluation of the proposed system is performed on DUC 2002 dataset using ROGUE tool kit. Experimental results shows that our system outperforms the state of the art works in terms of Recall and Precision.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Alguliev, R.M., Aliguliyev, R.M., Isazade, N.R.: Multiple documents summarization based on evolutionary optimization algorithm. Expert Syst. Appl. 40(5), 1675–1689 (2013)

    Article  Google Scholar 

  2. Dunlavy, D.M., OLeary, D.P., Conroy, J.M., Schlesinger, J.D.: Qcs: a system for querying, clustering and summarizing documents. Inf. Process. Manage. 43(6), 1588–1605 (2007)

    Article  Google Scholar 

  3. Erkan, G., Radev, D.R.: Lexrank: graph-based lexical centrality as salience in text summarization. J. Artif. Intell. Res. 22, 457–479 (2004)

    Google Scholar 

  4. Fattah, M.A., Ren, F.: Ga, mr, ffnn, pnn and gmm based models for automatic text summarization. Comput. Speech Lang. 23(1), 126–144 (2009)

    Article  Google Scholar 

  5. Gong, Y., Liu, X.: Generic text summarization using relevance measure and latent semantic analysis. In: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 19–25. ACM (2001)

    Google Scholar 

  6. Gupta, A., Kaur, M., Singh, A., Goel, A., Mirkin, S.: Text summarization through entailment-based minimum vertex cover. In: Lexical and Computational Semantics (* SEM 2014), p. 75 (2014)

    Google Scholar 

  7. Hammouda, K.M., Kamel, M.S.: Models of distributed data clustering in peer-to-peer environments. Knowl. Inf. Syst. 38(2), 303–329 (2014)

    Article  Google Scholar 

  8. Hennig, L., Labor, D.: Topic-based multi-document summarization with probabilistic latent semantic analysis. In: RANLP, pp. 144–149 (2009)

    Google Scholar 

  9. Jing, H.: Sentence reduction for automatic text summarization. In: Proceedings of the Sixth Conference on Applied Natural Language Processing, pp. 310–315. Association for Computational Linguistics (2000)

    Google Scholar 

  10. Knight, K., Marcu, D.: Summarization beyond sentence extraction: a probabilistic approach to sentence compression. Artif. Intell. 139(1), 91–107 (2002)

    Article  MathSciNet  MATH  Google Scholar 

  11. Ledeneva, Y., García-Hernández, R.A., Gelbukh, A.: Graph ranking on maximal frequent sequences for single extractive text summarization. In: Gelbukh, A. (ed.) CICLing 2014, Part II. LNCS, vol. 8404, pp. 466–480. Springer, Heidelberg (2014)

    Chapter  Google Scholar 

  12. Lin, C.-Y.: Rouge: a package for automatic evaluation of summaries. In: Proceedings of the ACL-2004 Workshop on Text Summarization Branches Out, vol. 8 (2004)

    Google Scholar 

  13. Lovins, J.B.: Development of a Stemming Algorithm. MIT Information Processing Group, Electronic Systems Laboratory (1968)

    Google Scholar 

  14. Mishra, R., Bian, J., Fiszman, M., Weir, C.R., Jonnalagadda, S., Mostafa, J., Fiol, D.G.: Text summarization in the biomedical domain: a systematic review of recent research. J. Biomed. Inf. 52, 457–467 (2014)

    Article  Google Scholar 

  15. Nishino, M., Yasuda, N., Hirao, T., Minato, S., Nagata, M.: A dynamic programming algorithm for tree trimming-based text summarization. In: Proceedings of NAACL HLT, pp. 462–471 (2015)

    Google Scholar 

  16. Patil, A., Pharande, K., Nale, D., Agrawal, R.: Automatic text summarization. Int. J. Comput. Appl. 109(17), 975–8887 (2015)

    Google Scholar 

  17. Silva, G., Ferreira, R., Lins, R.D., Cabral, L., Oliveira, H., Simske, S.J., Riss, M.: Automatic text document summarization based on machine learning. In: Proceedings of the 2015 ACM Symposium on Document Engineering, DocEng 2015, pp. 191–194 (2015)

    Google Scholar 

  18. Steinberger, J., Ježek, K.: Text summarization and singular value decomposition. In: Yakhno, T. (ed.) ADVIS 2004. LNCS, vol. 3261, pp. 245–254. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  19. Van Rijsbergen, C.J., Robertson, S.E., Porter, M.F.: New models in probabilistic information retrieval. Computer Laboratory, University of Cambridge (1980)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ansamma John .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Premjith, P.S., John, A., Wilscy, M. (2015). Metaheuristic Optimization Using Sentence Level Semantics for Extractive Document Summarization. In: Prasath, R., Vuppala, A., Kathirvalavakumar, T. (eds) Mining Intelligence and Knowledge Exploration. MIKE 2015. Lecture Notes in Computer Science(), vol 9468. Springer, Cham. https://doi.org/10.1007/978-3-319-26832-3_33

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-26832-3_33

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-26831-6

  • Online ISBN: 978-3-319-26832-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics