Abstract
Information retrieval is based on the assumption that occurrences of indexing features in a document will tell us something about the relevance of this document. This assumption implies the cluster hypothesis: closely associated documents tend to be relevant to the same requests (van Rijsbergen 1979, p. 45). In this section, we will describe how texts can be characterized by the distribution of occurrences of textual indexing features. For consistency, we will use the more general terminology of multimedia information retrieval. In particular, we will use the notion of indexing features rather than indexing terms and feature frequency rather than term frequency, even though throughout this chapter about text retrieval, every indexing feature ϕi denotes a term and the feature frequency f f(ϕ i ,d j ) denotes the corresponding term frequency.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Notes
As usual iff means if and only if.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 1997 Springer Science+Business Media New York
About this chapter
Cite this chapter
Schäuble, P. (1997). Text Retrieval. In: Multimedia Information Retrieval. The Springer International Series in Engineering and Computer Science, vol 397. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-6163-7_3
Download citation
DOI: https://doi.org/10.1007/978-1-4615-6163-7_3
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-7825-9
Online ISBN: 978-1-4615-6163-7
eBook Packages: Springer Book Archive