Extraction Based Automatic Text Summarization System with HMM Tagger

Manne, Suneetha; Shaik Mohd., Zaheer Parvez; Sameen Fatima, S.

doi:10.1007/978-3-642-27443-5_48

Suneetha Manne⁵,
Zaheer Parvez Shaik Mohd.⁵ &
S. Sameen Fatima⁶

Part of the book series: Advances in Intelligent and Soft Computing ((AINSC,volume 132))

1253 Accesses
2 Citations

Abstract

A rough estimation of world’s famous search engine Google in year 2010 revealed that the total size of internet has now turned to 2 petabytes. The increase in the performance and fast accessing of web resources has made a new challenge of browsing among huge data on internet. It is hence browsing on web is an under laid topic for researchers. The research on web has turned its steps towards Browsing among Information (BAI) rather than Browsing for Information (BFI).The field of Information Extraction (IE) is offering a huge scope to concise and compact the information enabling the user to decide by mere check at snippets of each link. Automatic text summarization is the process of condensing the source text into a shorter version preserving its information content and overall meaning. In this paper, we propose a frequent term based text summarization technique based on the analysis of Parts of Speech for generating effective and efficient summary.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 259.00; Price excludes VAT (USA)

Softcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Luhn, H.P.: The Automatic Creation of Literature Abstracts. IBM Journal, 159–165 (April 1958)
Google Scholar
Edmundson, H.P.: New Methods in Automatic Extracting. Journal of the Association for Computing Machinery 16(2), 264–285 (1969)
MATH Google Scholar
Pollock, J.J., Zamora, A.: Automatic Abstracting Research at Chemical Abstracts Service. Journal of Chemical Information and Computer Sciences 15(4), 226–232 (1975)
Article Google Scholar
Brown Tagset, http://www.scs.leeds.ac.uk/amalgam/tagsets/brown.html
McKeown, K.R.: Discourse Strategies for Generating Natural Language Text. Department of Computer Science, Columbia University, New York (1982)
Google Scholar
Brandow, R., Mitze, K., Rau, L.F.: Automatic condensation of electronic publications by sentence selection. Information Processing Management 31(5), 675–685 (1995)
Article Google Scholar
Barzilay, R., Elhadad, M., Boguraev, Kennedy, M.: Using Lexical Chains for Text Summarization. In: Workshop on Intelligent Scalable Text Summarization, Ben Gurion University of the Negev, Be’er Sheva (1997)
Google Scholar
Radev, R., Blair-goldensohn, S., Zhang, Z.: Experiments in Single and Multi-Docuemtn Summarization using MEAD. In: First Document Understanding Conference, New Orleans, LA (2001)
Google Scholar
Karthik Kumar, G., Sudheer, K., Avinesh, P.V.S.: Comparative Study of Various Machine Learning Methods for Telugu Part of Speech Tagging. In: Proceeding of the NLPAI Machine Learning Competition (2006)
Google Scholar
Bahl, L., Mercer, R.L.: Part-Of-Speech assignment by a statistical decision algorithm. In: IEEE International Symposium on Information Theory, pp. 88–89 (1976)
Google Scholar
Gupta, V., Lehal, G.S.: A Survey of Text Summarization Extractive Techniques. Journal of Emerging Technologies In Web Intelligence 2(3) (August 2010)
Google Scholar
Radev, D.R., Hovy, E., McKeown, K.: Introduction to the special issue on summarization. Computational Linguistics 28(4), 399–408 (2002)
Article Google Scholar
Nahm, U.Y., Mooney, R.J.: Text mining with information extraction. In: AAAI 2002, Spring Symposium on Mining Answers from Texts and Knowledge Bases (2002)
Google Scholar
Nou, C.: Khmer Part-of-Speech Tagging. Global Information and Telecommunication Studies. Waseda University
Google Scholar
Suneetha, M., Sameen Fatima, S.: Corpus based Automatic Text Summarization System with HMM Tagger. International Journal of Soft Computing and Engineering (IJSCE) 1(3), 118–123 (2011) ISSN: 2231-2307
Google Scholar

Download references

Author information

Authors and Affiliations

Department of IT, VRSEC, Vijayawada, India
Suneetha Manne & Zaheer Parvez Shaik Mohd.
Department of CSE, Osmania University, Hyderabad, India
S. Sameen Fatima

Authors

Suneetha Manne
View author publications
You can also search for this author in PubMed Google Scholar
Zaheer Parvez Shaik Mohd.
View author publications
You can also search for this author in PubMed Google Scholar
S. Sameen Fatima
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dept of Computer Science and Engineering ANITS, Andhra University, Sangivalasa, 530003, Vishakapatnam, India
Suresh Chandra Satapathy
College of Engineering Dept. of CS&SE ANITS, Andhra University, Sangivalasa, 530003, Vishakapatnam, India
P. S. Avadhani
Machine Intelligence Research Labs, Auburn, WA, USA
Ajith Abraham

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Manne, S., Shaik Mohd., Z.P., Sameen Fatima, S. (2012). Extraction Based Automatic Text Summarization System with HMM Tagger. In: Satapathy, S.C., Avadhani, P.S., Abraham, A. (eds) Proceedings of the International Conference on Information Systems Design and Intelligent Applications 2012 (INDIA 2012) held in Visakhapatnam, India, January 2012. Advances in Intelligent and Soft Computing, vol 132. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27443-5_48

Download citation

DOI: https://doi.org/10.1007/978-3-642-27443-5_48
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-27442-8
Online ISBN: 978-3-642-27443-5
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics