Context-Based Multi-document Summarization

Sonawane, Sheetal; Ghotkar, Archana; Hinge, Sonam

doi:10.1007/978-981-13-1540-4_16

Sheetal Sonawane¹⁷,
Archana Ghotkar¹⁷ &
Sonam Hinge¹⁷

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 812))

243 Accesses
1 Citations

Abstract

Automatic text summarization is leading topic of information retrieval research due to increasing online transfer of information. The large volume of information is limited due to constraint of memory devices and access time. The existing summarization system uses the sentence extraction technique where the important sentences are extracted and presented as summary. Various summarization methods are used which do not take context into consideration. The proposed system focuses on multi-document summarization which is based on context score. Bernoulli model of randomness is used to provide an informative score of bi-gram terms based on lexical association. The resulting weight is then used in the graph-based iterative algorithm to generate a summary. Experiments have been conducted over the self-generated 100 document and benchmark DUC data sets. It has been shown that proposed system outperforms the existing methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Das, D., Martins, A.F.T.: A survey on automatic text summarization. In: Literature Survey for the Language and Statistics II course at CMU 4, pp. 192–195 (2007)
Google Scholar
Zhang, J., Sun, L., Zhou, q.: A cue-based hub-authority approach for multi-document text summarization. In: International Conference on Natural Language Processing and Knowledge Engineering, pp. 642–645 (2005)
Google Scholar
Weu, F., He, Y., Li, W., Lu, Q.: A query-sensitive graph-based sentence ranking algorithm for query-oriented multi-document summarization. In: International Symposiums on Information Processing, pp. 9–13 (2008)
Google Scholar
Thakkar, K.S., Dharaskar, R.V., Chandak, M.: Graph-based algorithms for text summarization. In: 3rd IEEE International Conference in Emerging Trends in Engineering and Technology (lCETET), pp. 516– 519 (2010)
Google Scholar
Chatterjee, N., Mittal, A., Goyal, S.: Single document extractive texts summarization using genetic algorithms. In: Third International Conference on Emerging Applications of Information Technology (EAIT), pp. 19–23 (2012)
Google Scholar
Sornil, O.. Gree-ut, K.: An Automatic text summarization approach using content-based and graph-based characteristics. In: IEEE Conference on Cybernetics and Intelligent Systems, pp. 1–6 (2006)
Google Scholar
Ge, S.S., Zhang, Z., He, H.: Weighted graph model based sentence clustering and ranking for document summarization. In: 4th IEEE International Conference on in Interaction Sciences (ICIS), pp. 90–95 (2011)
Google Scholar
Liu, D.-X., Hi, D.-X., Ji, D.-H., Yang, H.: A novel Chinese multi-document summarization using clustering based sentence extraction. In: Proceedings of the Fifth International Conference on Machine Learning and Cybernetics, Dalian, pp. 2592–2597 (2006)
Google Scholar
Sonawane, S.S: Graph based information retrieval. IJACKD J. Res. 3(1) (2014)
Google Scholar
Sonawane, S.S., Kulkarni, P.A.: Graph based representation and analysis of text document: a survey of techniques. Int. J. Comput. Appl. 96(19) (2014)
Google Scholar
Ramesh, A., Srinivasa, K.G,, Pramod, N.: SentenceRank—a graph based approach to summarize text. In: Fifth International Conference on Applications of Digital Information and Web Technologies (ICADIWT), pp. 177–182 (2014)
Google Scholar
Wei, Y.: Document summarization method based on heterogeneous graph. In: 9th IEEE International Conference on Fuzzy Systems and Knowledge Discovery, pp. 1285–1289 (2012)
Google Scholar
Lin, Y.-S., Jiang, J.-Y., Lee, S.J.: A similarity measure for text classification and clustering. IEEE Trans. Knowl. Data Eng. 26(7), 1575–1590 (2014)
Article Google Scholar
Ren, Pengjie, Zhumin Chen, Zhaochun Ren, Furu Wei, Jun Ma, and Maarten de Rijke. Leveraging contextual sentence relations for extractive summarization using a neural attention model. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 95–104 (2017).
Google Scholar
Bhakkad, A., Dharamadhikari, S.C., Kulkarni, P.: Efficient approach to find bigram frequency in text document using E-VSM. Int. J. Comput. Appl. 68 (19), 9–11 (2013)
Article Google Scholar
Erkan, G., Ramdev, D.R.: Lexrank: graph-based lexical centrality as salience in text summarization. J. Artif. Intell. Res. 457–479 (2004)
Article Google Scholar
Amati, G., Van Rijsbergen, C.J.: Probabilistic Models of Information Retrieval Based on Measuring the Divergence from Randomness. ACM Trans. Inf. Syst. 20, 357–389 (2002)
Article Google Scholar
Berberich, K., Bedathur, S., Weikum, G., Vazirgiannis, M.: Comparing Apples and oranges: normalized PageRank for evolving graphs. In: Proceedings of the 16th International Conference on World Wide Web, 1145–1146 (2007)
Google Scholar
Dubey, H., Roy, B.N.: An improved page rank algorithm based on optimized normalization technique,. Int. J. Comput. Sci. Inf. Technol. 2(5), 2183-2188 (2011)
Google Scholar
Over, P., Liggett, W.: Introduction to DUC: an intrinsic evaluation of generic news text summarization systems. In: Proceedings of DUC Workshop Text Summarization (2002)
Google Scholar
Lin, C.Y.: ROUGH: a package for automatic evaluation of summaries. In: Proceedings of the Workshop on Text Summarization Branches Out, (2004)
Google Scholar
Steinberger, J., Jezek, K.: Using latent semantic analysis in text summarization and summary evaluation. In: Proceedings of ISIM, 93–100 (2014)
Google Scholar
Mihalcea, R., Tarau, P.: Textrank: bringing order into texts. In: Proceedings of EMNLP, pp. 404–411 (2004)
Google Scholar
Goyal, P., Behera, L., & McGinnity, T. M. A context-based word indexing model for document summarization. IEEE Transactions on Knowledge and Data Engineering, 25(8), 1693–1705 (2013)
Article Google Scholar
Sonawane, S.: Extractivd Summarization dataset. Mendeley Data 1 (2018). http://dx.doi.org/10.17632/z59vy3rb2r.1

Download references

Author information

Authors and Affiliations

Pune Institute of Computer Technology, Savitribai Phule Pune University, Pune, Maharashtra, India
Sheetal Sonawane, Archana Ghotkar & Sonam Hinge

Authors

Sheetal Sonawane
View author publications
You can also search for this author in PubMed Google Scholar
Archana Ghotkar
View author publications
You can also search for this author in PubMed Google Scholar
Sonam Hinge
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sheetal Sonawane .

Editor information

Editors and Affiliations

Department Computer Science and Engineering, University of Kalyani, Kalyani, West Bengal, India
Jyotsna Kumar Mandal
Department Computer Science and Engineering, University of Calcutta, Kolkata, West Bengal, India
Devadatta Sinha
Institute of Radio Physics and Electronics, University of Calcutta, Kolkata, West Bengal, India
J.P. Bandopadhyay

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sonawane, S., Ghotkar, A., Hinge, S. (2019). Context-Based Multi-document Summarization. In: Mandal, J., Sinha, D., Bandopadhyay, J. (eds) Contemporary Advances in Innovative and Applicable Information Technology. Advances in Intelligent Systems and Computing, vol 812. Springer, Singapore. https://doi.org/10.1007/978-981-13-1540-4_16

Download citation

DOI: https://doi.org/10.1007/978-981-13-1540-4_16
Published: 02 October 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1539-8
Online ISBN: 978-981-13-1540-4
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics