Abstract
With the advent of Information technology and the Internet, the world is producing several terabytes of information every second. Several online news feeds have popped up in the past decade that reports an incident almost instantly. This has led to a dire need to reduce content and present the user only with what is necessary, called the summary. In this paper an Extractive Summarization technique based on graph theory is proposed. The method tries to create a representative summary or abstract of the entire document, by finding the most informative sentences by means of infomap clustering after a graphical representation of the entire document.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Nenkova, A., McKeown, K.: A Survey of Text Summarization Techniques. Springer Science+Business Media (2012)
Meena, Y.K., Gopalani, D.: Evolutionary algorithms for extractive automatic text summarization. In: Procedia Comput. Sci., 48(Suppl. C), 244 – 249 (2015). (International Conference on Computer, Communication and Convergence (ICCC 2015))
Saggion, H., Lapalme, G.: Generating indicative-informative summaries with sumum. Comput. Linguist. 28(4), 497–526 (2002)
Gong, Y., Liu, X.: Generic text summarization using relevance measure and latent semantic analysis. In: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 19–25. ACM (2001)
Dunning, T.: Accurate methods for the statistics of surprise and coincidence. Comput. Linguist. 19(1), 61–74 (1993)
Hovy, E., Lin, C.-Y.: Automated text summarization and the summarist system. In: Proceedings of a Workshop on Held at Baltimore, Maryland: 13–15 Oct 1998, TIPSTER’98, pp. 197–214, Stroudsburg, PA, USA, 1998. Association for Computational Linguistics
Lin, C.-Y., Hovy, E.: The automated acquisition of topic signatures for text summarization. In: COLING’00 Proceedings of the 18th conference on Computational linguistics, pp. 495–501. Association for Computational Linguistics Stroudsburg, PA, USA (2000)
Wei, T., Lu, Y., Chang, H., Zhou, Q., Bao, X.: A semantic approach for text clustering using wordnet and lexical chains. Expert Syst. Appl. 42(4), 2264–2275 (2015)
Alpaslan, F.N., Cicekli, I.: Text summarization using latent semantic analysis. J. Inf. Sci. 37(4), 405–417 (2011)
Kan, M.-Y., McKeown, K.R., Klavans, J.L.: Applying natural language generation to indicative summarization. In: Proceedings of the 8th European Workshop on Natural Language Generation, vol. 8, pp. 1–9. Association for Computational Linguistics (2001)
Erkan, G., Radev, D.R.: Lexrank: graph-based lexical centrality as salience in text summarization. J. Artif. Intell. Res. 457–479 (2004)
Harabagiu, S., Lacatusu, F.: Topic themes for multi-document summarization. In: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 202–209. ACM, New York, NY, USA (2005)
Radev, D.R., Jing, H., Stys, M., Tam, D.: Centroid-based summarization of multiple documents. Inf. Process. Manag. 40, 919–938 (2003)
Landauer, T.K., Foltz, P.W., Laham, D.: An introduction to latent semantic analysis. Discourse Process. 25(2–3), 259–284 (1998)
Parveen, D., Strube, M.: Integrating importance, non-redundancy and coherence in graph-based extractive summarization. In: Proceedings of the 24th International Conference on Artificial Intelligence, IJCAI’15, pp. 1298–1304. AAAI Press (2015)
Kleinberg, J.M.: Authoritative sources in a hyperlinked environment. J. ACM 46(5), 604–632 (1999)
Agrawal, N., Sharma, S., Sinha, P., Bagai, S.: A graph based ranking strategy for automated text summarization. DU J. Undergrad. Res. Innov. 1(1) (2015)
Mihalcea, R., Tarau, P.: TextRank: bringing order into texts. In: Proceedings of EMNLP-04 and the 2004 Conference on Empirical Methods in Natural Language Processing, July 2004
Dutta, S., Ghatak, S., Roy, M., Ghosh, S., Das, A.K.: A graph based clustering technique for tweet summarization. In: 2015 4th International Conference on Reliability, Infocom Technologies and Optimization (ICRITO) (Trends and Future Directions), pp. 1–6. IEEE (2015)
Beautifulsoup documentation. https://www.crummy.com/software/BeautifulSoup/bs4/doc/. Accessed 29 Nov 2017
Python 2.7.14 documentation. https://docs.python.org/2/index.html. Accessed 29 Nov 2017
Bird, S., Klein, E., Loper, E.: Natural Language Processing with Python. O’Reilly (2009)
Lin, C.-Y.: Rouge: a package for automatic evaluation of summaries. In: Proceedings of the ACL Workshop: Text Summarization Braches Out 2004, pp. 10, 01 2004
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Dutta, M., Das, A.K., Mallick, C., Sarkar, A., Das, A.K. (2019). A Graph Based Approach on Extractive Summarization. In: Abraham, A., Dutta, P., Mandal, J., Bhattacharya, A., Dutta, S. (eds) Emerging Technologies in Data Mining and Information Security. Advances in Intelligent Systems and Computing, vol 813. Springer, Singapore. https://doi.org/10.1007/978-981-13-1498-8_16
Download citation
DOI: https://doi.org/10.1007/978-981-13-1498-8_16
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1497-1
Online ISBN: 978-981-13-1498-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)