Automatic Text Summarization of Scientific Articles Based on Classification of Extract’s Population

Jaoua, Maher; Hamadou, Abdelmajid Ben

doi:10.1007/3-540-36456-0_70

Maher Jaoua⁵ &
Abdelmajid Ben Hamadou⁵

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2588))

Included in the following conference series:

International Conference on Intelligent Text Processing and Computational Linguistics

915 Accesses
9 Citations

Abstract

We propose in this paper a summarization method that creates indicative summaries from scientific papers. Unlike conventional methods that extract important sentences, our method considers the extract as the minimal unit for extraction and uses two steps: the generation and the classification. The first step combines text sentences to produce a population of extracts. The second step evaluates each extract using global criteria in order to select the best one. In this case, the criteria are defined according to the whole extract rather than sentences. We have developed a prototype of the summarization system for French language called ExtraGen that implements a genetic algorithm simulating the mechanism of generation and classification.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Luhn, H.P.: The Automatic creation of literature Abstracts. In: IBM J.R & D2-2 (1958) 156–165
Google Scholar
Edmundson, H.P.: New methods in automatic extracting. In: Newspaper of ACM tea, 16–2 (1969) 264–85
MATH Google Scholar
Paice, C.D., and Al.: The identification of important concepts in highly structures technical papers. In: proceeding of sixteenth annual international ACM SIGIR Conference, ACM PRESS (1993) 69–78
Google Scholar
Boguraev, B., Kennedy, C.: Salience-based Content Characterization of Text Documents. In: Proceedings of the Workshop on Intelligent Scalable Text Summarization. ACL/EACL Conference Madrid Spain (1997) 2–9
Google Scholar
Gerard, S., Allan, J., Singhal, A.: Automatic text decomposition and structuring. In: Information Procssing & Management, 32(2) (1996) 127–138.
Article Google Scholar
Kan, M.Y., Klavans, J., McKeown, K.: Using the Annotated Bibliography as a Resource for Indicative Summarization. In: Proc LREC 2002 Las Palmas Spain (2002)
Google Scholar
Goldstein, J., Kantrowitz, M., Mittal, V., Carbonell, J.(1999).: Summarizing text Documents:sentence selection and evaluation metrics. In: proceeding of SIGIR’99 (1999)
Google Scholar
Kupiec, J., and Al.: A trainable document summarizer. In: SIGIR 95 Sattle Wa, USA, (1995)
Google Scholar
Conroy, J., Leary, D.P.O.: Text Summarization via Hidden Markov Models and Pivoted QR Matrix Decomposition. Technical Report, Dept.Comp.Sci. CS-TR-221. Univ. Maryland ( 2001)
Google Scholar
Minel, J.L., et Al.: Seraphin, système pour l’extraction automatique d’énoncés importants. dans: les actes du colloque IA 95-Quinzièmes journés internationales de gńie linguistiques Montpellier France (1995)
Google Scholar
Marcu, D.: Discourse-based summarization in duc-200. In: Proceedings of the Document Understanding,Conference DUC’01 (2001)
Google Scholar
Teufel, S., Moens, M.: Sentence Extraction as a Classification Task. In: Proceedings of the Workshop on Intelligent Scalable Summarization ACL/EACL Conference Madrid Spain (1997) 58–65
Google Scholar
Mckeown, R.K., and Al.: Generating summaries of multiple news Articles. In: proceeding of the Seventeenth Annual International ACM/SIGIR Washington (1995) 74–82
Google Scholar
Strzalkowski, T., Wand, J., Wise, B.: A robust practical text summarization. In: AAAI 98 Spring Symposium on Intelligent Text Summarization (1998) 26–33
Google Scholar
Holland, J.H. and Al.: Classified systems and genetic algorithms. In: revue Artificial Intelligence N°40 (1989) 235–282
Google Scholar
Srivinas, N., Deb, K.: Multiobjective optimization using nondominated sorting in genetic Algorithms. Technical report, Department of Mechanical Engineering, Institute of Technology India (1993)
Google Scholar
Marcu, D.: The automatic construction of large-scale corpora for summarization research. In: The 22nd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’99). Berkeley, CA, (1999) 137–144
Google Scholar
Orasan, C.: Building annotated resources for automatic text summarisation. In: Proceedings of LREC-2002. Las Palmas, Spain 2002.
Google Scholar

Download references

Author information

Authors and Affiliations

Faculté des Sciences Economiques et de Gestion de Sfax, Laboratoire LARIS, B.P. 1088- 3018, Sfax, Tunisie
Maher Jaoua & Abdelmajid Ben Hamadou

Authors

Maher Jaoua
View author publications
You can also search for this author in PubMed Google Scholar
Abdelmajid Ben Hamadou
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Centro de Investigación en Computación (CIC), Instituto Politécnico Nacional (IPN), Col. Zacatenco, CP 07738, Mexico D.F., Mexico
Alexander Gelbukh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jaoua, M., Hamadou, A.B. (2003). Automatic Text Summarization of Scientific Articles Based on Classification of Extract’s Population. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2003. Lecture Notes in Computer Science, vol 2588. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36456-0_70

Download citation

DOI: https://doi.org/10.1007/3-540-36456-0_70
Published: 30 April 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00532-2
Online ISBN: 978-3-540-36456-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics