Graph-Based Text Summarization Using Modified TextRank

Mallick, Chirantana; Das, Ajit Kumar; Dutta, Madhurima; Das, Asit Kumar; Sarkar, Apurba

doi:10.1007/978-981-13-0514-6_14

Chirantana Mallick¹⁹,
Ajit Kumar Das¹⁹,
Madhurima Dutta¹⁹,
Asit Kumar Das¹⁹ &
…
Apurba Sarkar¹⁹

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 758))

1502 Accesses
54 Citations

Abstract

Nowadays, the efficient access of enormous amounts of information has become more difficult due to the rapid growth of the Internet. To manage the vast information, we need efficient and effective methods and tools. In this paper, a graph-based text summarization method has been described which captures the aboutness of a text document. The method has been developed using modified TextRank computed based on the concept of PageRank defined for each page in the Web pages. The proposed method constructs a graph with sentences as the nodes and similarity between two sentences as the weight of the edge between them. Modified inverse sentence frequency-cosine similarity is used to give different weightage to different words in the sentence, whereas traditional cosine similarity treats the words equally. The graph is made sparse and partitioned into different clusters with the assumption that the sentences within a cluster are similar to each other and sentences of different cluster represent their dissimilarity. The performance evaluation of proposed summarization technique shows the effectiveness of the method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Beautifulsoup documentation. https://www.crummy.com/software/BeautifulSoup/bs4/doc/ (2017). Accessed 30 Nov 2017
Dutta, S., Ghatak, S., Roy, M., Ghosh, S., Das, A.K.: A graph based clustering technique for tweet summarization. In: 2015 4th International Conference on Reliability, Infocom Technologies and Optimization (ICRITO) (Trends and Future Directions), pp. 1–6. IEEE (2015)
Google Scholar
Erkan, G., Radev, D.R.: Lexrank: graph-based lexical centrality as salience in text summarization. J. Artif. Intell. Res. 22, 457–479 (2004)
Article Google Scholar
Goldstein, J., Mittal, V., Carbonell, J., Kantrowitz, M.: Multi-document summarization by sentence extraction. In: Proceedings of the 2000 NAACL-ANLPWorkshop on Automatic Summarization, vol. 4, pp. 40–48. Association for Computational Linguistics (2000)
Google Scholar
Gong, Y., Liu, X.: Generic text summarization using relevance measure and latent semantic analysis. In: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 19–25. ACM (2001)
Google Scholar
Harabagiu, S., Lacatusu, F.: Topic themes for multi-document summarization. In: Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 202–209. ACM (2005)
Google Scholar
Kan, M.-Y., McKeown, K.R., Klavans, J.L.: Applying natural language generation to indicative summarization. In: Proceedings of the 8th European workshop on Natural Language Generation, vol. 8, pp. 1–9. Association for Computational Linguistics (2001)
Google Scholar
Khan, A., Salim, N.: A review on abstractive summarization methods. J. Theor. Appl. Inf. Technol. 59(1), 64–72 (2014)
Google Scholar
Lin, C.-Y.: Rouge: a package for automatic evaluation of summaries. In: Text Summarization Branches Out: Proceedings of the ACL-04 Workshop, Barcelona, Spain, vol. 8 (2004)
Google Scholar
Litvak, M., Last, M.: Graph-based keyword extraction for single-document summarization. In: Proceedings of the workshop on Multi-source Multilingual Information Extraction and Summarization, pp. 17–24. Association for Computational Linguistics (2008)
Google Scholar
Loaiciga Sanchez, S.: Pronominal anaphora and verbal tenses in machine translation. Ph.D. thesis, University of Geneva (2017)
Google Scholar
Mihalcea, R.: Graph-based ranking algorithms for sentence extraction, applied to text summarization. In: Proceedings of the ACL 2004 on Interactive Poster and Demonstration Sessions, pp. 20. Association for Computational Linguistics (2004)
Google Scholar
Moens, M.-F., Uyttendaele, C., Dumortier, J.: Abstracting of legal cases: the potential of clustering based on the selection of representative objects. J. Assoc. Inf. Sci. Technol. 50(2), 151 (1999)
Google Scholar
Nenkova, A., McKeown, K.: A survey of text summarization techniques. Mining Text Data, pp. 43–76 (2012)
Google Scholar
Python 2.7.14 documentation. https://docs.python.org/2/index.html (2017). Accessed 30 Nov 2017
Radev, D.R., Jing, H., Styś, M., Tam, D.: Centroid-based summarization of multiple documents. Inf. Process. Manag. 40(6), 919–938 (2004)
Article Google Scholar
Saggion, H., Lapalme, G.: Generating indicative-informative summaries with sumum. Comput. linguist. 28(4), 497–526 (2002)
Article Google Scholar
Salton, G., Singhal, A., Mitra, M., Buckley, C.: Automatic text structuring and summarization. Inf. Process. Manag. 33(2), 193–207 (1997)
Article Google Scholar
Seki, Y.: Sentence extraction by tf/idf and position weighting from newspaper articles (2002)
Google Scholar
Tang, J., Yao, L., Chen, D.: Multi-topic based query-oriented summarization. In: Proceedings of the 2009 SIAM International Conference on Data Mining, pp. 1148–1159. SIAM (2009)
Google Scholar
Wong, K.-F., Wu, M., Li, W.: Extractive summarization using supervised and semi-supervised learning. In: Proceedings of the 22nd International Conference on Computational Linguistics, vol. 1, pp. 985–992. Association for Computational Linguistics (2008)
Google Scholar
Yeh, J.-Y., Ke, H.-R., Yang, W.-P., Meng, I.-H.: Text summarization using a trainable summarizer and latent semantic analysis. Inf. process. Manag. 41(1), 75–95 (2005)
Google Scholar
Zha, H.: Generic summarization and keyphrase extraction using mutual reinforcement principle and sentence clustering. In: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 113–120. ACM (2002)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Technology, Indian Institute of Engineering Science and Technology, Shibpur, Howrah, India
Chirantana Mallick, Ajit Kumar Das, Madhurima Dutta, Asit Kumar Das & Apurba Sarkar

Authors

Chirantana Mallick
View author publications
You can also search for this author in PubMed Google Scholar
Ajit Kumar Das
View author publications
You can also search for this author in PubMed Google Scholar
Madhurima Dutta
View author publications
You can also search for this author in PubMed Google Scholar
Asit Kumar Das
View author publications
You can also search for this author in PubMed Google Scholar
Apurba Sarkar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Chirantana Mallick or Ajit Kumar Das .

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, Sri Sivani College of Engineering, Srikakulam, Andhra Pradesh, India
Janmenjoy Nayak
Machine Intelligence Research Labs (MIR Labs), Scientific Network for Innovation and Research Excellence, Washington, USA
Ajith Abraham
Department of Mechanical Engineering, Sri Sivani College of Engineering, Srikakulam, Andhra Pradesh, India
B. Murali Krishna
Department of Electrical and Electronics Engineering, Sri Sivani College of Engineering, Srikakulam, Andhra Pradesh, India
G. T. Chandra Sekhar
Department of Computer Science and Technology, Indian Institute of Engineering Science and Technology (IIEST), Shibpur, Howrah, West Bengal, India
Asit Kumar Das

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mallick, C., Das, A.K., Dutta, M., Das, A.K., Sarkar, A. (2019). Graph-Based Text Summarization Using Modified TextRank. In: Nayak, J., Abraham, A., Krishna, B., Chandra Sekhar, G., Das, A. (eds) Soft Computing in Data Analytics . Advances in Intelligent Systems and Computing, vol 758. Springer, Singapore. https://doi.org/10.1007/978-981-13-0514-6_14

Download citation

DOI: https://doi.org/10.1007/978-981-13-0514-6_14
Published: 22 August 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-0513-9
Online ISBN: 978-981-13-0514-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics