A New Graph-Based Algorithm for Persian Text Summarization

Shakeri, Hassan; Gholamrezazadeh, Saeedeh; Salehi, Mohsen Amini; Ghadamyari, Fatemeh

doi:10.1007/978-94-007-2792-2_3

Hassan Shakeri⁵,
Saeedeh Gholamrezazadeh⁵,
Mohsen Amini Salehi⁵ &
…
Fatemeh Ghadamyari⁵

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 114))

1385 Accesses
7 Citations

Abstract

Nowadays, with increasing volume of electronic text information, the need for production of summary systems becomes essential. Summary systems capture and summarize the most important concepts of the documents and help the user to go through the main points of the text faster and make the processing of information much easier. An important class of such systems is the ones that produce extractive summaries. This summary is produced by selecting most important parts of the document without doing any modification on the main text. One approach for producing this kind of summary is using the graph theory. In this paper a new algorithm based on the graph theory is introduced to select the most important sentences of the document. In this algorithm the nodes and edges will be assigned with different weights and then the final weight of each one will be defined by combining these values. This final weight indicates the importance of the sentence and the probability of appearing this sentence in the final summary. The results show that considering simultaneous different criteria generate a summary which is more similar to human one.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 259.00; Price excludes VAT (USA)

Softcover Book: USD 329.99; Price excludes VAT (USA)

Hardcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Frankel, David S (2003) Model driven architecture: applying MDA to enterprise computing. OMG Press, Wiley, New York
Google Scholar
Mani I (2001) Automatic summarization John Benjamin’s publishing Co, pp 1–22
Google Scholar
Shamsfard M (2007) Processing persian texts and its challenges. In: The second workshop on Persian language and computer. pp 172–189. (in Persian)
Google Scholar
Lin CY, Hovy EH (1997) Identify topic by position. In: Proceedings of 5th conference on applied natural language processing, March 1997
Google Scholar
Mazdak N (2004) A Persian text summarizer, master thesis, department of linguistics, Stockholm University, Jan 2004
Google Scholar
Kupiec, Jullian M, Schuetze, Hinrich (2004) System for genre specific summarization of documents, Xerox corporation
Google Scholar
Rada M (2004) Graph-based ranking algorithms for sentence extraction, applied to text summarization, annual meeting of the ACL 2004, pp 170–173
Google Scholar
Patil K, Brazdil P (2007) Sumgraph: Text summarization using centrality in the pathfinder network. IADIS Int J Comput Sci Info Sys 2:18–32
Google Scholar
Wills RS (2006) Google’s pagerank: the math behind the search engine
Google Scholar
Saeedeh G, Mohsen AS, Bahareh G (2009) A comprehensive survey on text summarization systems”. CSA 2:462–467
Google Scholar
Martin H, Nima M (2004) A Persian text summarizer. In: International conference on computational linguistics
Google Scholar
Zohre K, Mehrnoush S (2007) A system for automatic persian text summarization. In: 12th international CSI computer conference, (in Persian)
Google Scholar
Azadeh Z, Behrouz M-B, Mohsen S (2008) A new hybrid farsi text summarization technique based on term co-occurrence and conceptual property of the text, In: 9th ACIS international conference on software engineering, artificial intelligence, networking and parallel/distributed computing
Google Scholar
Dalianis H (2000) SweSum—A text summarizer for Swedish, Technical report, TRITA-NA-P0015, IPLab-174, NADA, KTH, Oct 2000
Google Scholar
Erkan G, Radev DR (2004) LexRank: graph-based centrality as salience in text summarization, J Artif Intell Res 22, pp 457–459
Google Scholar
Rada M, Tarau P (2004) TextRank: bringing order into texts. In: Proceedings of the conference on empirical methods in natural language processing (EMNLP 2004)
Google Scholar
Lin Z (2006–07) Graph-Based methods for automatic text summarization, Ph.D. thesis, school of computing National University of Singapore 2006–07
Google Scholar
Nenkova A (2006) summarization evaluation for text and speech: issues and approaches, Stanford University
Google Scholar
Norshuhani Z, Arian G (2010) A hybrid approach for malay text summarizer, The 3rd international multi-conference on engineering and technological innovation 2010
Google Scholar
Lin C (2004) Rouge: a package for automatic evaluation of summaries. In: proceedings of the workshop on text summarization branches out, 42nd annual meeting of the association for computational linguistics. 25–26 July, Barcelona, Spain, pp 74–81
Google Scholar

Download references

Author information

Authors and Affiliations

Islamic Azad University, Mashhad Branch, Mashhad, Iran
Hassan Shakeri, Saeedeh Gholamrezazadeh, Mohsen Amini Salehi & Fatemeh Ghadamyari

Authors

Hassan Shakeri
View author publications
You can also search for this author in PubMed Google Scholar
Saeedeh Gholamrezazadeh
View author publications
You can also search for this author in PubMed Google Scholar
Mohsen Amini Salehi
View author publications
You can also search for this author in PubMed Google Scholar
Fatemeh Ghadamyari
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hassan Shakeri .

Editor information

Editors and Affiliations

SeoulTech, Computer Science and Engineering, Seoul University of Science & Technology, Gongreung 2-dong 172, Seoul, 139-742, Korea, Republic of (South Korea)
James J. (Jong Hyuk) Park
Inst. Computer Science & Information, Engineering, National Ilan University, 1 Sec. 1, Shen-Lung Rd., I-Lan, 260, Taiwan R.O.C.
Han-Chieh Chao
, Computer Science & Software Engineering, Monmouth University, W. Long Branch, 07764, USA
Mohammad S. Obaidat
, Division of e-Business, Kyungnam University, Changwon, Korea, Republic of (South Korea)
Jongsung Kim

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Shakeri, H., Gholamrezazadeh, S., Salehi, M.A., Ghadamyari, F. (2012). A New Graph-Based Algorithm for Persian Text Summarization. In: J. (Jong Hyuk) Park, J., Chao, HC., S. Obaidat, M., Kim, J. (eds) Computer Science and Convergence. Lecture Notes in Electrical Engineering, vol 114. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-2792-2_3

Download citation

DOI: https://doi.org/10.1007/978-94-007-2792-2_3
Published: 10 December 2011
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-007-2791-5
Online ISBN: 978-94-007-2792-2
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics