A Graph Based Approach on Extractive Summarization

  • Conference paper
Emerging Technologies in Data Mining and Information Security

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 813)

Abstract

With the advent of information technology and the Internet, the world is producing several terabytes of information every second. Several online news feeds have emerged in the past decade that report incidents almost instantly. This has created a pressing need to condense content and present the user with only what is necessary, namely a summary. In this paper, an extractive summarization technique based on graph theory is proposed. The method creates a representative summary of the entire document by first representing the document as a graph and then finding the most informative sentences by means of Infomap clustering.
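The pipeline outlined in the abstract (represent the document as a graph, cluster it, keep the most informative sentences) can be sketched roughly as follows. This is a minimal illustration under several assumptions and not the authors' implementation: the abstract does not specify the similarity measure, so bag-of-words cosine similarity with a hypothetical `threshold` is used for edges; connected components stand in for Infomap clustering so that the sketch needs only the standard library; and picking the sentence with the highest weighted degree from each cluster is an assumed selection rule. All function names are hypothetical.

```python
import math
import re
from collections import Counter


def split_sentences(text):
    # Naive splitter on ., ! or ? followed by whitespace (an assumption for brevity).
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def bag_of_words(sentence):
    # Lowercased word counts; no stop-word removal or stemming.
    return Counter(re.findall(r"[a-z0-9]+", sentence.lower()))


def cosine(a, b):
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_graph(sentences, threshold=0.2):
    # Nodes are sentence indices; an edge links two sentences whose
    # cosine similarity reaches the (assumed) threshold.
    vectors = [bag_of_words(s) for s in sentences]
    graph = {i: {} for i in range(len(sentences))}
    for i in range(len(sentences)):
        for j in range(i + 1, len(sentences)):
            weight = cosine(vectors[i], vectors[j])
            if weight >= threshold:
                graph[i][j] = weight
                graph[j][i] = weight
    return graph


def find_clusters(graph):
    # Stand-in for Infomap: connected components found by depth-first search.
    seen, clusters = set(), []
    for start in graph:
        if start in seen:
            continue
        seen.add(start)
        stack, component = [start], []
        while stack:
            node = stack.pop()
            component.append(node)
            for neighbour in graph[node]:
                if neighbour not in seen:
                    seen.add(neighbour)
                    stack.append(neighbour)
        clusters.append(sorted(component))
    return clusters


def summarize(text, threshold=0.2):
    sentences = split_sentences(text)
    graph = build_graph(sentences, threshold)
    picked = []
    for component in find_clusters(graph):
        # Assumed rule: keep the sentence with the largest weighted degree;
        # ties go to the earliest sentence.
        best = max(component, key=lambda i: (sum(graph[i].values()), -i))
        picked.append(best)
    # Return the chosen sentences in document order.
    return [sentences[i] for i in sorted(picked)]
```

Calling `summarize(document_text)` returns one sentence per cluster in document order. To get closer to the method the abstract names, `find_clusters` could be replaced by a call to an Infomap implementation from a graph library, with the rest of the sketch left unchanged.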

Author information

Corresponding authors

Correspondence to Madhurima Dutta or Ajit Kumar Das.

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

Cite this paper

Dutta, M., Das, A.K., Mallick, C., Sarkar, A., Das, A.K. (2019). A Graph Based Approach on Extractive Summarization. In: Abraham, A., Dutta, P., Mandal, J., Bhattacharya, A., Dutta, S. (eds) Emerging Technologies in Data Mining and Information Security. Advances in Intelligent Systems and Computing, vol 813. Springer, Singapore. https://doi.org/10.1007/978-981-13-1498-8_16
