Skip to main content

Literature Study on Multi-document Text Summarization Techniques

  • Conference paper
  • First Online:
Smart Trends in Information Technology and Computer Communications (SmartCom 2016)

Abstract

Text summarization is a method which generates a shorter and a preciseform of one or more text documents. Automatic text summarization plays an essential role in finding information from large text corpus or an internet. What had actually started as a single document Text Summarization has now evolved and developed into generating multi-document summarization. There are a number of approaches to multi-document summarization such as Graph, Cluster, Term-Frequency, Latent Semantic Analysis (LSA) based etc. In this paper we have started with introduction of multi-document summarization and then have further discussed comparison and analysis of various approaches which comes under the multi-document summarization. The paper also contains details about the benefits and problems in the existing methods. This would especially be helpful for researchers working in this field of text data mining. By using this data, researchers can build new or mixed based approaches for multidocument summarization.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

References

  1. Kan, M.-Y., Klavans, J.L.: Using librarian techniques in automatic text summarization for information retrieval. In: Proceedings of the 2nd ACMlIEEE-CS Joint Conference on Digital Libraries, pp. 36–45. ACM (2002)

    Google Scholar 

  2. Meena, Y.K., Jain, A., Gopalani, D.: Survey on graph and cluster based approaches in multi-document text summarization. In: Recent Advances and Innovations in Engineering (ICRAIE), Jaipur, pp. 1–5 (2014). doi:10.1109/ICRAIE.2014.6909126

  3. Haque, M., Pervin, S., Begum, Z., et al.: Literature review of automatic multiple documents text summarization. Int. J. Innov. Appl. Stud. 3(1), 121–129 (2013)

    Google Scholar 

  4. Mihalcea, R., Tarau, P.: Textrank: bringing order into texts. In: Proceedings of EMNLP, vol. 4, Barcelona, Spain (2004)

    Google Scholar 

  5. Zhang, J., Sun, L., Zhou, Q.: A cue-based hub-authority approach for multi-document text summarization. In: Proceedings of 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering, IEEE NLP-KE 2005, pp. 642–645. IEEE (2005)

    Google Scholar 

  6. Hariharan, S., Srinivasan, R.: Studies on graph based approaches for single and multi-document summarizations. Int. Comput. Theory Eng 1, 1793–8201 (2009)

    Google Scholar 

  7. Thakkar, K.S., Dharaskar, R.V., Chandak, M.: Graph-based algorithms for text summarization. In: 3rd International Conference on Emerging Trends in Engineering and Technology (lCETET) 2010, pp. 516– 519. IEEE (2010)

    Google Scholar 

  8. Ge, S.S., Zhang, Z., He, H.: Weighted graph model based sentence clustering and ranking for document summarization. In: 2011 4th International Conference on Interaction Sciences (ICIS), pp. 90–95. IEEE (2011)

    Google Scholar 

  9. Nguyen-Hoang, T.-A., Nguyen, K., Tran, Q.-V.: Tsgvi: a graphbased summarization system for vietnamese documents. J. Ambient Intell. Humanized Comput. 3(4), 305–313 (2012)

    Article  Google Scholar 

  10. Schlesinger, J.D., O’Leary, D.P., Conroy, J.M.: Arabic/English multi-document summarization with CLASSY—the past and the future. In: Gelbukh, A. (ed.) CICLing 2008. LNCS, vol. 4919, pp. 568–581. Springer, Heidelberg (2008). doi:10.1007/978-3-540-78135-6_49

    Chapter  Google Scholar 

  11. Ma, X.-C., Yu, G.-B., Ma, L.: Multi-document summarization using clustering algorithm. In: International Workshop on Intelligent Systems and Applications, ISA 2009, pp. 1–4. IEEE (2009)

    Google Scholar 

  12. Gupta, V.K., Siddiqui, T.J.: Multi-document summarization using sentence clustering. In: 4th International Conference on Intelligent Human Computer Interaction (IHCI), pp. 1–5. IEEE (2012)

    Google Scholar 

  13. Salton, G.: Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer. Addison Wesley Publishing Company, USA (1989)

    Google Scholar 

  14. Fukumoto, J.I.: Multi-document summarization using document set type classification. In: Proceedings of NTCIR-2004, Tokyo, pp. 412–416 (2004)

    Google Scholar 

  15. Xiong, S., Luo, Y.: A new approach for multi-document summarization based on latent semantic analysis. In: 2014 Seventh International Symposium on Computational Intelligence and Design (ISCID), Hangzhou, pp. 177–180 (2014)

    Google Scholar 

  16. Steinberger, J., Jezek, K.: Using latent semantic analysis in text summarization and summary evaluation. In: Proceeding ISIM 2004, pp. 93–100 (2004)

    Google Scholar 

  17. Lioret, E., Palomar, M.: Text summarization in progress: a literature review. Artif. Intell. Rev. 37(I), 1–41 (2012)

    Article  Google Scholar 

  18. Das,D., Martins, A.F.: A survey on automatic text summarization. In: Literature Survey for the Language and Statistics II course at CMU, vol. 4, pp. 192–195 (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Chintan Shah or Anjali Jivani .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Shah, C., Jivani, A. (2016). Literature Study on Multi-document Text Summarization Techniques. In: Unal, A., Nayak, M., Mishra, D.K., Singh, D., Joshi, A. (eds) Smart Trends in Information Technology and Computer Communications. SmartCom 2016. Communications in Computer and Information Science, vol 628. Springer, Singapore. https://doi.org/10.1007/978-981-10-3433-6_53

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-3433-6_53

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-3432-9

  • Online ISBN: 978-981-10-3433-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics