Combining Sources of Evidence for Recognition of Relevant Passages in Texts

  • Alexander Gelbukh
  • NamO Kang
  • SangYong Han
Conference paper

DOI: 10.1007/11533962_25

Part of the Lecture Notes in Computer Science book series (LNCS, volume 3563)
Cite this paper as:
Gelbukh A., Kang N., Han S. (2005) Combining Sources of Evidence for Recognition of Relevant Passages in Texts. In: Ramos F.F., Larios Rosillo V., Unger H. (eds) Advanced Distributed Systems. ISSADS 2005. Lecture Notes in Computer Science, vol 3563. Springer, Berlin, Heidelberg

Abstract

Automatically recognizing in large electronic texts short selfcontained passages relevant for a user query is necessary for fast and accurate information access to large text archives. Surprisingly, most search engines practically do not provide any help to the user in this tedious task, just presenting a list of whole documents supposedly containing the requested information. We show how different sources of evidence can be combined in order to assess the quality of different passages in a document and present the highest ranked ones to the user. Specifically, we take into account the relevance of a passage to the user query, structural integrity of the passage with respect to paragraphs and sections of the document, and topic integrity with respect to topic changes and topic threads in the text. Our experiments show that the results are promising.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • Alexander Gelbukh
    • 1
    • 2
  • NamO Kang
    • 1
  • SangYong Han
    • 1
  1. 1.Chung-Ang UniversityKorea
  2. 2.National Polytechnic InstituteMexico

Personalised recommendations