A Graph Based Approach on Extractive Summarization

Dutta, Madhurima; Das, Ajit Kumar; Mallick, Chirantana; Sarkar, Apurba; Das, Asit K.

doi:10.1007/978-981-13-1498-8_16

Madhurima Dutta¹⁹,
Ajit Kumar Das¹⁹,
Chirantana Mallick¹⁹,
Apurba Sarkar¹⁹ &
…
Asit K. Das¹⁹

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 813))

1202 Accesses
13 Citations

Abstract

With the advent of Information technology and the Internet, the world is producing several terabytes of information every second. Several online news feeds have popped up in the past decade that reports an incident almost instantly. This has led to a dire need to reduce content and present the user only with what is necessary, called the summary. In this paper an Extractive Summarization technique based on graph theory is proposed. The method tries to create a representative summary or abstract of the entire document, by finding the most informative sentences by means of infomap clustering after a graphical representation of the entire document.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 259.00; Price excludes VAT (USA)

Softcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Nenkova, A., McKeown, K.: A Survey of Text Summarization Techniques. Springer Science+Business Media (2012)
Google Scholar
Meena, Y.K., Gopalani, D.: Evolutionary algorithms for extractive automatic text summarization. In: Procedia Comput. Sci., 48(Suppl. C), 244 – 249 (2015). (International Conference on Computer, Communication and Convergence (ICCC 2015))
Google Scholar
Saggion, H., Lapalme, G.: Generating indicative-informative summaries with sumum. Comput. Linguist. 28(4), 497–526 (2002)
Google Scholar
Gong, Y., Liu, X.: Generic text summarization using relevance measure and latent semantic analysis. In: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 19–25. ACM (2001)
Google Scholar
Dunning, T.: Accurate methods for the statistics of surprise and coincidence. Comput. Linguist. 19(1), 61–74 (1993)
Google Scholar
Hovy, E., Lin, C.-Y.: Automated text summarization and the summarist system. In: Proceedings of a Workshop on Held at Baltimore, Maryland: 13–15 Oct 1998, TIPSTER’98, pp. 197–214, Stroudsburg, PA, USA, 1998. Association for Computational Linguistics
Google Scholar
Lin, C.-Y., Hovy, E.: The automated acquisition of topic signatures for text summarization. In: COLING’00 Proceedings of the 18th conference on Computational linguistics, pp. 495–501. Association for Computational Linguistics Stroudsburg, PA, USA (2000)
Google Scholar
Wei, T., Lu, Y., Chang, H., Zhou, Q., Bao, X.: A semantic approach for text clustering using wordnet and lexical chains. Expert Syst. Appl. 42(4), 2264–2275 (2015)
Google Scholar
Alpaslan, F.N., Cicekli, I.: Text summarization using latent semantic analysis. J. Inf. Sci. 37(4), 405–417 (2011)
Google Scholar
Kan, M.-Y., McKeown, K.R., Klavans, J.L.: Applying natural language generation to indicative summarization. In: Proceedings of the 8th European Workshop on Natural Language Generation, vol. 8, pp. 1–9. Association for Computational Linguistics (2001)
Google Scholar
Erkan, G., Radev, D.R.: Lexrank: graph-based lexical centrality as salience in text summarization. J. Artif. Intell. Res. 457–479 (2004)
Google Scholar
Harabagiu, S., Lacatusu, F.: Topic themes for multi-document summarization. In: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 202–209. ACM, New York, NY, USA (2005)
Google Scholar
Radev, D.R., Jing, H., Stys, M., Tam, D.: Centroid-based summarization of multiple documents. Inf. Process. Manag. 40, 919–938 (2003)
Google Scholar
Landauer, T.K., Foltz, P.W., Laham, D.: An introduction to latent semantic analysis. Discourse Process. 25(2–3), 259–284 (1998)
Google Scholar
Parveen, D., Strube, M.: Integrating importance, non-redundancy and coherence in graph-based extractive summarization. In: Proceedings of the 24th International Conference on Artificial Intelligence, IJCAI’15, pp. 1298–1304. AAAI Press (2015)
Google Scholar
Kleinberg, J.M.: Authoritative sources in a hyperlinked environment. J. ACM 46(5), 604–632 (1999)
Google Scholar
Agrawal, N., Sharma, S., Sinha, P., Bagai, S.: A graph based ranking strategy for automated text summarization. DU J. Undergrad. Res. Innov. 1(1) (2015)
Google Scholar
Mihalcea, R., Tarau, P.: TextRank: bringing order into texts. In: Proceedings of EMNLP-04 and the 2004 Conference on Empirical Methods in Natural Language Processing, July 2004
Google Scholar
Dutta, S., Ghatak, S., Roy, M., Ghosh, S., Das, A.K.: A graph based clustering technique for tweet summarization. In: 2015 4th International Conference on Reliability, Infocom Technologies and Optimization (ICRITO) (Trends and Future Directions), pp. 1–6. IEEE (2015)
Google Scholar
Beautifulsoup documentation. https://www.crummy.com/software/BeautifulSoup/bs4/doc/. Accessed 29 Nov 2017
Python 2.7.14 documentation. https://docs.python.org/2/index.html. Accessed 29 Nov 2017
Bird, S., Klein, E., Loper, E.: Natural Language Processing with Python. O’Reilly (2009)
Google Scholar
Lin, C.-Y.: Rouge: a package for automatic evaluation of summaries. In: Proceedings of the ACL Workshop: Text Summarization Braches Out 2004, pp. 10, 01 2004
Google Scholar

Download references

Author information

Authors and Affiliations

Indian Institute of Engineering Science and Technology, Shibpur, Shibpur, India
Madhurima Dutta, Ajit Kumar Das, Chirantana Mallick, Apurba Sarkar & Asit K. Das

Authors

Madhurima Dutta
View author publications
You can also search for this author in PubMed Google Scholar
Ajit Kumar Das
View author publications
You can also search for this author in PubMed Google Scholar
Chirantana Mallick
View author publications
You can also search for this author in PubMed Google Scholar
Apurba Sarkar
View author publications
You can also search for this author in PubMed Google Scholar
Asit K. Das
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Madhurima Dutta or Ajit Kumar Das .

Editor information

Editors and Affiliations

Machine Intelligence Research Labs, Auburn, WA, USA
Ajith Abraham
Department of Computer and System Sciences, Visva-Bharati University, Santiniketan, West Bengal, India
Paramartha Dutta
Department of Computer Science and Engineering, University of Kalyani, Kalyani, India
Jyotsna Kumar Mandal
Institute of Engineering and Management, Kolkata, West Bengal, India
Abhishek Bhattacharya
Institute of Engineering and Management, Kolkata, West Bengal, India
Soumi Dutta

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dutta, M., Das, A.K., Mallick, C., Sarkar, A., Das, A.K. (2019). A Graph Based Approach on Extractive Summarization. In: Abraham, A., Dutta, P., Mandal, J., Bhattacharya, A., Dutta, S. (eds) Emerging Technologies in Data Mining and Information Security. Advances in Intelligent Systems and Computing, vol 813. Springer, Singapore. https://doi.org/10.1007/978-981-13-1498-8_16

Download citation

DOI: https://doi.org/10.1007/978-981-13-1498-8_16
Published: 02 September 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1497-1
Online ISBN: 978-981-13-1498-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics