Abstract
Nowadays, with increasing volume of electronic text information, the need for production of summary systems becomes essential. Summary systems capture and summarize the most important concepts of the documents and help the user to go through the main points of the text faster and make the processing of information much easier. An important class of such systems is the ones that produce extractive summaries. This summary is produced by selecting most important parts of the document without doing any modification on the main text. One approach for producing this kind of summary is using the graph theory. In this paper a new algorithm based on the graph theory is introduced to select the most important sentences of the document. In this algorithm the nodes and edges will be assigned with different weights and then the final weight of each one will be defined by combining these values. This final weight indicates the importance of the sentence and the probability of appearing this sentence in the final summary. The results show that considering simultaneous different criteria generate a summary which is more similar to human one.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Frankel, David S (2003) Model driven architecture: applying MDA to enterprise computing. OMG Press, Wiley, New York
Mani I (2001) Automatic summarization John Benjamin’s publishing Co, pp 1–22
Shamsfard M (2007) Processing persian texts and its challenges. In: The second workshop on Persian language and computer. pp 172–189. (in Persian)
Lin CY, Hovy EH (1997) Identify topic by position. In: Proceedings of 5th conference on applied natural language processing, March 1997
Mazdak N (2004) A Persian text summarizer, master thesis, department of linguistics, Stockholm University, Jan 2004
Kupiec, Jullian M, Schuetze, Hinrich (2004) System for genre specific summarization of documents, Xerox corporation
Rada M (2004) Graph-based ranking algorithms for sentence extraction, applied to text summarization, annual meeting of the ACL 2004, pp 170–173
Patil K, Brazdil P (2007) Sumgraph: Text summarization using centrality in the pathfinder network. IADIS Int J Comput Sci Info Sys 2:18–32
Wills RS (2006) Google’s pagerank: the math behind the search engine
Saeedeh G, Mohsen AS, Bahareh G (2009) A comprehensive survey on text summarization systems”. CSA 2:462–467
Martin H, Nima M (2004) A Persian text summarizer. In: International conference on computational linguistics
Zohre K, Mehrnoush S (2007) A system for automatic persian text summarization. In: 12th international CSI computer conference, (in Persian)
Azadeh Z, Behrouz M-B, Mohsen S (2008) A new hybrid farsi text summarization technique based on term co-occurrence and conceptual property of the text, In: 9th ACIS international conference on software engineering, artificial intelligence, networking and parallel/distributed computing
Dalianis H (2000) SweSum—A text summarizer for Swedish, Technical report, TRITA-NA-P0015, IPLab-174, NADA, KTH, Oct 2000
Erkan G, Radev DR (2004) LexRank: graph-based centrality as salience in text summarization, J Artif Intell Res 22, pp 457–459
Rada M, Tarau P (2004) TextRank: bringing order into texts. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP 2004)
Lin Z (2006–07) Graph-Based methods for automatic text summarization, Ph.D. thesis, school of computing National University of Singapore 2006–07
Nenkova A (2006) summarization evaluation for text and speech: issues and approaches, Stanford University
Norshuhani Z, Arian G (2010) A hybrid approach for malay text summarizer, The 3rd international multi-conference on engineering and technological innovation 2010
Lin C (2004) Rouge: a package for automatic evaluation of summaries. In: proceedings of the workshop on text summarization branches out, 42nd annual meeting of the association for computational linguistics. 25–26 July, Barcelona, Spain, pp 74–81
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer Science+Business Media B.V.
About this paper
Cite this paper
Shakeri, H., Gholamrezazadeh, S., Salehi, M.A., Ghadamyari, F. (2012). A New Graph-Based Algorithm for Persian Text Summarization. In: J. (Jong Hyuk) Park, J., Chao, HC., S. Obaidat, M., Kim, J. (eds) Computer Science and Convergence. Lecture Notes in Electrical Engineering, vol 114. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-2792-2_3
Download citation
DOI: https://doi.org/10.1007/978-94-007-2792-2_3
Published:
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-007-2791-5
Online ISBN: 978-94-007-2792-2
eBook Packages: EngineeringEngineering (R0)