Advertisement

A Multilingual Study of Compressive Cross-Language Text Summarization

  • Elvys Linhares PontesEmail author
  • Stéphane Huet
  • Juan-Manuel Torres-Moreno
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11289)

Abstract

Cross-Language Text Summarization generates summaries in a language different from the language of the source documents. Recent methods use information from both languages to generate summaries with the most informative sentences. However, these methods have performance that can vary according to languages, which can reduce the quality of summaries. In this paper, we propose a compressive framework to generate cross-language summaries. In order to analyze performance and especially stability, we tested our system and extractive baselines on a dataset available in four languages (English, French, Portuguese, and Spanish) to generate English and French summaries. An automatic evaluation showed that our method outperformed extractive state-of-art CLTS methods with better and more stable ROUGE scores for all languages.

Notes

Acknowledgement

This work was granted by the European Project CHISTERA-AMIS ANR-15-CHR2-0001. We also like to acknowledge the support given by the Laboratoire VERIFORM from the École Polytechnique de Montréal and her coordinator Hanifa Boucheneb.

References

  1. 1.
    Yao, J., Wan, X., Xiao, J.: Phrase-based compressive cross-language summarization. In: EMNLP, pp. 118–127 (2015)Google Scholar
  2. 2.
    Wan, X., Luo, F., Sun, X., Huang, S., Yao, J.G.: Cross-language document summarization via extraction and ranking of multiple summaries. Knowl. Inf. Syst. (2018).  https://doi.org/10.1007/s10115-018-1152-7
  3. 3.
    Linhares Pontes, E., Huet, S., Torres-Moreno, J.M., Linhares, A.C.: Cross-language text summarization using sentence and multi-sentence compression. In: Silberztein, M., Atigui, F., Kornyshova, E., Métais, E., Meziane, F. (eds.) NLDB 2018. Lecture Notes in Computer Science, vol. 10859. Springer, Cham (2018).  https://doi.org/10.1007/978-3-319-91947-8_48CrossRefGoogle Scholar
  4. 4.
    Giannakopoulos, G., El-Haj, M., Favre, B., Litvak, M., Steinberger, J., Varma, V.: TAC2011 multiling pilot overview. In: 4th Text Analysis Conference TAC (2011)Google Scholar
  5. 5.
    Wan, X.: Using bilingual information for cross-language document summarization. In: ACL, pp. 1546–1555 (2011)Google Scholar
  6. 6.
    Moirón, B.V., Tiedemann, J.: Identifying idiomatic expressions using automatic word-alignment. In: EACL 2006 Workshop on Multiword Expressions in a Multilingual Context (2006)Google Scholar
  7. 7.
    de Caseli, H.M., Ramisch, C., das Graças Volpe Nunes, M., Villavicencio, A.: Alignment-based extraction of multiword expressions. Lang. Resour. Eval. 44, 59–77 (2010)CrossRefGoogle Scholar
  8. 8.
    Manning, C., Surdeanu, M., Bauer, J., Finkel, J., Bethard, S., McClosky, D.: The Stanford CoreNLP natural language processing toolkit. In: 52nd Annual Meeting of the Association for Computational Linguistics (ACL): System Demonstrations, pp. 55–60 (2014)Google Scholar
  9. 9.
    Linhares Pontes, E., Huet, S., Gouveia da Silva, T., Linhares, A.C., Torres-Moreno, J.M.: Multi-sentence compression with word vertex-labeled graphs and integer linear programming. In: TextGraphs-12: The Workshop on Graph-based Methods for Natural Language Processing, Association for Computational Linguistics (2018)Google Scholar
  10. 10.
    Filippova, K.: Multi-sentence compression: finding shortest paths in word graphs. In: COLING, pp. 322–330 (2010)Google Scholar
  11. 11.
    Wan, X., Li, H., Xiao, J.: Cross-language document summarization based on machine translation quality prediction. In: ACL, pp. 917–926 (2010)Google Scholar
  12. 12.
    Boudin, F., Huet, S., Torres-Moreno, J.M.: A graph-based approach to cross-language multi-document summarization. Polibits 43, 113–118 (2011)CrossRefGoogle Scholar
  13. 13.
    Zhang, J., Zhou, Y., Zong, C.: Abstractive cross-language summarization via translation model enhanced predicate argument structure fusing. IEEE/ACM Trans. Audio Speech Lang. Process. 24, 1842–1853 (2016)CrossRefGoogle Scholar
  14. 14.
    Bing, L., Li, P., Liao, Y., Lam, W., Guo, W., Passonneau, R.J.: Abstractive multi-document summarization via phrase selection and merging. In: ACL, The Association for Computer Linguistics, pp. 1587–1597 (2015)Google Scholar
  15. 15.
    Lin, C.Y.: ROUGE: a package for automatic evaluation of summaries. In: Workshop Text Summarization Branches Out (ACL 2004), pp. 74–81 (2004)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2018

Authors and Affiliations

  • Elvys Linhares Pontes
    • 1
    • 2
    • 3
    Email author
  • Stéphane Huet
    • 1
  • Juan-Manuel Torres-Moreno
    • 1
    • 2
    • 3
  1. 1.LIA, Université d’Avignon et des Pays de VaucluseAvignonFrance
  2. 2.Département de GIGL/VERIFORM, École Polytechnique de MontréalMontréalCanada
  3. 3.Laboratoire GDAC, Université du Québec à MontréalMontrélCanada

Personalised recommendations