Update Summarization Based on Latent Semantic Analysis

  • Josef Steinberger
  • Karel Ježek
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5729)

Abstract

This paper describes our recent research in text summarization, which progressed from single-document summarization through multi-document summarization to update summarization. We describe the development of our summarizer, which is based on latent semantic analysis (LSA), and propose an update summarization component that determines the redundancy and novelty of each topic discovered by LSA. The final part of the paper presents the results of our participation in the Text Analysis Conference (TAC) 2008 experiment.
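The abstract gives no implementation details, so the following is only a minimal sketch of the general idea, not the authors' system. It assumes a binary term-by-sentence matrix, NumPy's SVD as the LSA step, sentence scores taken as the length of each sentence vector in the singular-value-weighted latent space, and topic redundancy measured as the largest absolute cosine similarity between a topic of the new document set and any topic of the previously read set. All function names (`build_vocab`, `term_sentence_matrix`, `lsa_topics`, `sentence_scores`, `topic_novelty`) are hypothetical.

```python
import numpy as np

def build_vocab(*sentence_sets):
    """Shared vocabulary so that topics from different sets are comparable."""
    words = {w for sents in sentence_sets for s in sents for w in s.lower().split()}
    return {w: i for i, w in enumerate(sorted(words))}

def term_sentence_matrix(sentences, vocab):
    """Binary term-by-sentence matrix (rows: terms, columns: sentences).
    Binary weighting is an assumption made for this sketch."""
    A = np.zeros((len(vocab), len(sentences)))
    for j, s in enumerate(sentences):
        for w in s.lower().split():
            A[vocab[w], j] = 1.0
    return A

def lsa_topics(A, k):
    """Truncated SVD: returns the first k topics (U), their singular values (s),
    and the sentence coordinates in topic space (Vt)."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    k = min(k, len(s))
    return U[:, :k], s[:k], Vt[:k, :]

def sentence_scores(s, Vt):
    """Length of each sentence vector in the latent space, scaled by singular values."""
    return np.sqrt(((s[:, None] * Vt) ** 2).sum(axis=0))

def topic_novelty(U_old, U_new):
    """Novelty of each new topic = 1 - redundancy, where redundancy is the largest
    absolute cosine similarity to any old topic (an assumed measure).
    Columns of U are unit vectors, so dot products are cosines; the absolute
    value is used because the sign of a singular vector is arbitrary."""
    sim = np.abs(U_old.T @ U_new)      # shape: (old topics, new topics)
    redundancy = sim.max(axis=0)
    return np.clip(1.0 - redundancy, 0.0, 1.0)

# Usage: documents already read (old) versus newly arrived documents (new).
old = ["the summit opened on monday", "leaders discussed trade"]
new = ["leaders discussed trade", "protests erupted outside the venue"]
vocab = build_vocab(old, new)
U_old, _, _ = lsa_topics(term_sentence_matrix(old, vocab), k=2)
U_new, s_new, Vt_new = lsa_topics(term_sentence_matrix(new, vocab), k=2)
print("novelty per new topic:", topic_novelty(U_old, U_new))
print("sentence scores:", sentence_scores(s_new, Vt_new))
```

In an update summarizer along these lines, topics with high novelty would be favored when selecting sentences, while topics that largely repeat earlier material would be down-weighted; how the two quantities are actually combined is not stated in the abstract.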



Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Josef Steinberger (1)
  • Karel Ježek (1)
  1. University of West Bohemia, Plzeň, Czech Republic
