Abstract
We propose in this paper a summarization method that creates indicative summaries from scientific papers. Unlike conventional methods that extract important sentences, our method considers the extract as the minimal unit for extraction and uses two steps: the generation and the classification. The first step combines text sentences to produce a population of extracts. The second step evaluates each extract using global criteria in order to select the best one. In this case, the criteria are defined according to the whole extract rather than sentences. We have developed a prototype of the summarization system for French language called ExtraGen that implements a genetic algorithm simulating the mechanism of generation and classification.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Luhn, H.P.: The Automatic creation of literature Abstracts. In: IBM J.R & D2-2 (1958) 156–165
Edmundson, H.P.: New methods in automatic extracting. In: Newspaper of ACM tea, 16–2 (1969) 264–85
Paice, C.D., and Al.: The identification of important concepts in highly structures technical papers. In: proceeding of sixteenth annual international ACM SIGIR Conference, ACM PRESS (1993) 69–78
Boguraev, B., Kennedy, C.: Salience-based Content Characterization of Text Documents. In: Proceedings of the Workshop on Intelligent Scalable Text Summarization. ACL/EACL Conference Madrid Spain (1997) 2–9
Gerard, S., Allan, J., Singhal, A.: Automatic text decomposition and structuring. In: Information Procssing & Management, 32(2) (1996) 127–138.
Kan, M.Y., Klavans, J., McKeown, K.: Using the Annotated Bibliography as a Resource for Indicative Summarization. In: Proc LREC 2002 Las Palmas Spain (2002)
Goldstein, J., Kantrowitz, M., Mittal, V., Carbonell, J.(1999).: Summarizing text Documents:sentence selection and evaluation metrics. In: proceeding of SIGIR’99 (1999)
Kupiec, J., and Al.: A trainable document summarizer. In: SIGIR 95 Sattle Wa, USA, (1995)
Conroy, J., Leary, D.P.O.: Text Summarization via Hidden Markov Models and Pivoted QR Matrix Decomposition. Technical Report, Dept.Comp.Sci. CS-TR-221. Univ. Maryland ( 2001)
Minel, J.L., et Al.: Seraphin, système pour l’extraction automatique d’énoncés importants. dans: les actes du colloque IA 95-Quinzièmes journés internationales de gńie linguistiques Montpellier France (1995)
Marcu, D.: Discourse-based summarization in duc-200. In: Proceedings of the Document Understanding,Conference DUC’01 (2001)
Teufel, S., Moens, M.: Sentence Extraction as a Classification Task. In: Proceedings of the Workshop on Intelligent Scalable Summarization ACL/EACL Conference Madrid Spain (1997) 58–65
Mckeown, R.K., and Al.: Generating summaries of multiple news Articles. In: proceeding of the Seventeenth Annual International ACM/SIGIR Washington (1995) 74–82
Strzalkowski, T., Wand, J., Wise, B.: A robust practical text summarization. In: AAAI 98 Spring Symposium on Intelligent Text Summarization (1998) 26–33
Holland, J.H. and Al.: Classified systems and genetic algorithms. In: revue Artificial Intelligence N°40 (1989) 235–282
Srivinas, N., Deb, K.: Multiobjective optimization using nondominated sorting in genetic Algorithms. Technical report, Department of Mechanical Engineering, Institute of Technology India (1993)
Marcu, D.: The automatic construction of large-scale corpora for summarization research. In: The 22nd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’99). Berkeley, CA, (1999) 137–144
Orasan, C.: Building annotated resources for automatic text summarisation. In: Proceedings of LREC-2002. Las Palmas, Spain 2002.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jaoua, M., Hamadou, A.B. (2003). Automatic Text Summarization of Scientific Articles Based on Classification of Extract’s Population. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2003. Lecture Notes in Computer Science, vol 2588. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36456-0_70
Download citation
DOI: https://doi.org/10.1007/3-540-36456-0_70
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00532-2
Online ISBN: 978-3-540-36456-6
eBook Packages: Springer Book Archive