An Automatic Process to Convert Documents into Abstracts by Using Natural Language Processing Techniques

Jayaraju, Ch.; Basha, Zareena Noor; Madhavarao, E.; Kalyani, M.

doi:10.1007/978-3-319-03107-1_4

Ch. Jayaraju⁶,
Zareena Noor Basha⁶,
E. Madhavarao⁶ &
…
M. Kalyani⁶

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 248))

2264 Accesses

Abstract

Now a days each and every people using internet and collects the information. At the same time the internet is growing exponentially, huge amount of information is available online. That’s why the information overload problem is faced by every end user. So Automatic Process of Document Abstracts is recognized as an important task. For this intention we used various approaches these are Anaphora resolution, mining methods and TFxIDF. However these techniques have some limitations and mainly the drawback is from the end user’s perspective, the requestor may not be aware of all the knowledge that constitutes the methods. That’s why in this paper we focussed on developing Abstracts, that is Summarization method based on Natural Language Processing Techniques. At the same time it is also useful to multi-documents summarization. We explore some of the metrics and evaluation strategies, features in document abstracts or summarization.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Tanasa, D.: Advanced Data Preprocessing Mining. IEEE Intelligent Systems 19(2)
Google Scholar
Luhn, H.P.: The Automatic Creation of Literature Abstracts. IBM Journal of Research and Development 2(2), 159–165 (1958)
Article MathSciNet Google Scholar
Buckley, C., Cardie, C.: SMART Summarization System. In: Hand, T.F., Sundheim, B, eds. (1997)
Google Scholar
Radev, D.R., Jing, H., Budzikowska, M.: Summarization of multiple documents: clustering, sentence extraction, and evaluation. In: ANLP/NAACL Workshop on Summarization, Seattle, WA (April 2000)
Google Scholar
Dhillon, I.S.: A divisive information theoretic feature clustering algorithm for text classification. Journal of Machine Learning Research 3 (2003)
Google Scholar
Mihalcea, R., Tarau, P.: A Language Independent Algorithm for Single and Multiple Document Summarization. University of North Texas
Google Scholar
Alam, H., Kumar, A., Nakamura, M.: Structured and Unstructured Document Summarization: Design of a Commercial Summarizer using Lexical Chains
Google Scholar
Santorini, B.: Part-of-Speech Tagging Guidelines for the Penn Treebank Project
Google Scholar
World of computing.net/pos-tagging/markov-models.html
Google Scholar
A survey of named entity recognition and classification David Nadeau. Satoshi Sekine National Research Council Canada / New York University
Google Scholar
dictionary generation for low-resourced language pairs Varga István Yamagata uNIVERSITY, Graduate School of Science and Engineering dyn36150@dip.yz.yamagata-u.ac.jp
Google Scholar
Ontology’s, Web 2.0 and Beyond. Keynote presentation at the Ontology Summit 2007 – Ontology, Taxonomy, Folksonomy: Understanding theDistinctions (March 1, 2007)
Google Scholar
Detecting Opinions Using Deep Syntactic Analysis Caroline Brun Xerox Research Centre Europe Meylan, France
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science & Engineering, Vignan’s Lara Institute of Technology and Science, Guntur, A.P., India
Ch. Jayaraju, Zareena Noor Basha, E. Madhavarao & M. Kalyani

Authors

Ch. Jayaraju
View author publications
You can also search for this author in PubMed Google Scholar
Zareena Noor Basha
View author publications
You can also search for this author in PubMed Google Scholar
E. Madhavarao
View author publications
You can also search for this author in PubMed Google Scholar
M. Kalyani
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ch. Jayaraju .

Editor information

Editors and Affiliations

Dept. of Computer Science and Engineering, Anil Neerukonda Institute of Technology and Sciences, Vishakapatnam, India
Suresh Chandra Satapathy
College of Engineering(A), Andhra University, Vishakapatnam, India
P. S. Avadhani
University of Hyderabad, Hyderabad, India
Siba K. Udgata
CSIR-National Institute of Oceanography, Visakhapatnam, India
Sadasivuni Lakshminarayana

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jayaraju, C., Basha, Z.N., Madhavarao, E., Kalyani, M. (2014). An Automatic Process to Convert Documents into Abstracts by Using Natural Language Processing Techniques. In: Satapathy, S., Avadhani, P., Udgata, S., Lakshminarayana, S. (eds) ICT and Critical Infrastructure: Proceedings of the 48th Annual Convention of Computer Society of India- Vol I. Advances in Intelligent Systems and Computing, vol 248. Springer, Cham. https://doi.org/10.1007/978-3-319-03107-1_4

Download citation

DOI: https://doi.org/10.1007/978-3-319-03107-1_4
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-03106-4
Online ISBN: 978-3-319-03107-1
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics