COMPENDIUM: A Text Summarization System for Generating Abstracts of Research Papers

Lloret, Elena; Romá-Ferri, María Teresa; Palomar, Manuel

doi:10.1007/978-3-642-22327-3_2

Elena Lloret¹⁹,
María Teresa Romá-Ferri²⁰ &
Manuel Palomar¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6716))

Included in the following conference series:

International Conference on Application of Natural Language to Information Systems

1908 Accesses
6 Citations

Abstract

This paper presents compendium, a text summarization system, which has achieved good results in extractive summarization. Therefore, our main goal in this research is to extend it, suggesting a new approach for generating abstractive-oriented summaries of research papers. We conduct a preliminary analysis where we compare the extractive version of compendium (\(\textsc{compendium}_{E}\)) with the new abstractive-oriented approach (\(\textsc{compendium}_{E-A}\)). The final summaries are evaluated according to three criteria (content, topic, and user satisfaction) and, from the results obtained, we can conclude that the use of compendium is appropriate for producing summaries of research papers automatically, going beyond the simple selection of sentences.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Belz, A.: Automatic Generation of Weather Forecast Texts Using Comprehensive Probabilistic Generation-space Models. Natural Language Engineering 14(4), 431–455 (2008)
Article Google Scholar
Bouras, C., Tsogkas, V.: Noun retrieval effect on text summarization and delivery of personalized news articles to the user’s desktop. Data Knowledge Engineering 69, 664–677 (2010)
Article Google Scholar
Brin, S., Page, L.: The anatomy of a large-scale hypertextual web search engine. Computer Networks ISDN Systems 30, 107–117 (1998)
Article Google Scholar
Carenini, G., Cheung, J.C.K.: Extractive vs. NLG-based abstractive summarization of evaluative text: The effect of corpus controversiality. In: Proc. of the 5th International Natural Language Generation Conference, pp. 33–40 (2008)
Google Scholar
Erkan, G., Radev, D.R.: Lexrank: Graph-based lexical centrality as salience in text summarization. Journal of Artificial Intelligence Research (JAIR) 22, 457–479 (2004)
Google Scholar
Ferrández, Ó., Micol, D., Muñoz, R., Palomar, M.: A perspective-based approach for solving textual entailment recognition. In: Proc. of the ACL-PASCAL Workshop on Textual Entailment and Paraphrasing, pp. 66–71 (2007)
Google Scholar
Filippova, K.: Multi-sentence compression: Finding shortest paths in word graphs. In: Proc. of the 23rd International Conference on Computational Linguistics, pp. 322–330 (2010)
Google Scholar
Givón, T.: A functional-typological introduction, vol. II. John Benjamins, Amsterdam (1990)
Google Scholar
Kumar, M., Das, D., Agarwal, S., Rudnicky, A.: Non-textual Event Summarization by Applying Machine Learning to Template-based Language Generation. In: Proc. of the 2009 Workshop on Language Generation and Summarisation, pp. 67–71 (2009)
Google Scholar
Lal, P., Rüger, S.: Extract-based summarization with simplification. In: Workshop on Text Summarization in Conjunction with the ACL (2002)
Google Scholar
Lin, C.Y.: ROUGE: a package for automatic evaluation of summaries. In: Proc. of ACL Text Summarization Workshop, pp. 74–81 (2004)
Google Scholar
Liu, M., Li, W., Wu, M., Lu, Q.: Extractive summarization based on event term clustering. In: Proc. of the 45th ACL, pp. 185–188 (2007)
Google Scholar
Luhn, H.P.: The automatic creation of literature abstracts. In: Mani, I., Maybury, M. (eds.) Advances in Automatic Text Summarization, pp. 15–22. MIT Press, Cambridge (1958)
Google Scholar
Mohammad, S., Dorr, B., Egan, M., Hassan, A., Muthukrishan, P., Qazvinian, V., Radev, D., Zajic, D.: Using citations to generate surveys of scientific paradigms. In: Proc. of the North American Chapter of the ACL, pp. 584–592 (2009)
Google Scholar
Pollock, J.J., Zamora, A.: Automatic abstracting research at chemical abstracts. In: Mani, I., Maybury, M. (eds.) Advances in Automatic Text Summarization, pp. 43–49. MIT Press, Cambridge (1999)
Google Scholar
Saggion, H.: A classification algorithm for predicting the structure of summaries. In: Proc. of the Workshop on Language Generation and Summarisation, pp. 31–38 (2009)
Google Scholar
Saggion, H., Lapalme, G.: Selective analysis for automatic abstracting: Evaluating indicativeness and acceptability. In: Proceedings of Content-Based Multimedia Information Access, pp. 747–764 (2000)
Google Scholar
Sauper, C., Barzilay, R.: Automatically generating wikipedia articles: A structure-aware approach. In: Proc. of the 47th Association of Computational Linguistics, pp. 208–216 (2009)
Google Scholar
Shen, D., Yang, Q., Chen, Z.: Noise Reduction through Summarization for Web-page Classification. Information Processing and Management 43(6), 1735–1747 (2007)
Article Google Scholar
Spärck Jones, K.: Automatic summarising: The state of the art. Information Processing & Management 43(6), 1449–1481 (2007)
Article Google Scholar
Wong, K.F., Wu, M., Li, W.: Extractive summarization using supervised and semi-supervised learning. In: Proc. of the 22nd International Conference on Computational Linguistics, pp. 985–992 (2008)
Google Scholar
Yu, J., Reiter, E., Hunter, J., Mellish, C.: Choosing the Content of Textual Summaries of Large Time-series Data Sets. Natural Language Engineering 13(1), 25–49 (2007)
Article Google Scholar
Zhou, L., Ticrea, M., Hovy, E.: Multi-document biography summarization. In: Proc. of the International Conference on Empirical Methods in NLP, pp. 434–441 (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

Dept. Lenguajes y Sistemas Informáticos, Universidad de Alicante, Apdo. de correos, 99, E-03080, Alicante, Spain
Elena Lloret & Manuel Palomar
Department of Nursing, University of Alicante, Spain
María Teresa Romá-Ferri

Authors

Elena Lloret
View author publications
You can also search for this author in PubMed Google Scholar
María Teresa Romá-Ferri
View author publications
You can also search for this author in PubMed Google Scholar
Manuel Palomar
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computing, University of Alicante, 03080, Alicante, Spain
Rafael Muñoz
Department of Software and Computing Systems, University of Alicante, Aptdo. de Correos 99, 03080, Alicante, Spain
Andrés Montoyo
CNAM- Laboratoire Cédric, 292 Rue St. Martin, 75141, Paris Cedex 03, France
Elisabeth Métais

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lloret, E., Romá-Ferri, M.T., Palomar, M. (2011). COMPENDIUM: A Text Summarization System for Generating Abstracts of Research Papers. In: Muñoz, R., Montoyo, A., Métais, E. (eds) Natural Language Processing and Information Systems. NLDB 2011. Lecture Notes in Computer Science, vol 6716. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22327-3_2

Download citation

DOI: https://doi.org/10.1007/978-3-642-22327-3_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22326-6
Online ISBN: 978-3-642-22327-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics