Abstract
Once the processing tokens and other metadata associated with items have been identified, the system needs to construct the searchable index that will be used with the user’s queries to return results. Before the detailed alternatives are discussed, the concept of indexing is introduced along with its objective. The automatic textual indexing techniques are discussed in the context of statistical, natural language, and concept indexing approaches. Statistical indexing is the major technique used in current systems, and its major algorithms are discussed (e.g., term frequency/inverse document frequency with item length normalization). Finally, the indexing techniques associated with multimedia items (audio, image, and video) are described.
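As a rough illustration of the statistical approach named above, the following minimal sketch computes term frequency/inverse document frequency weights and then normalizes each item's weight vector by its length. The function name `tfidf_index`, the input layout, and the specific variant used here (raw term counts, `log(N / df)`, cosine normalization) are assumptions chosen for illustration; the chapter itself may define the weighting differently.

```python
import math
from collections import Counter


def tfidf_index(items):
    """Return {item_id: {term: weight}} with length-normalized TF-IDF weights.

    `items` maps a hypothetical item id to its list of processing tokens.
    """
    n_items = len(items)
    term_counts = {item_id: Counter(tokens) for item_id, tokens in items.items()}

    # Document frequency: the number of items in which each term occurs.
    doc_freq = Counter()
    for counts in term_counts.values():
        doc_freq.update(counts.keys())

    index = {}
    for item_id, counts in term_counts.items():
        # Raw term frequency times inverse document frequency.
        weights = {
            term: tf * math.log(n_items / doc_freq[term])
            for term, tf in counts.items()
        }
        # Item length normalization: divide by the Euclidean norm of the vector.
        norm = math.sqrt(sum(w * w for w in weights.values()))
        index[item_id] = {
            term: (w / norm if norm else 0.0) for term, w in weights.items()
        }
    return index


items = {
    "d1": ["index", "search", "index"],
    "d2": ["search", "query"],
    "d3": ["audio", "index"],
}
index = tfidf_index(items)
```

With this variant, a term that occurs in every item receives a weight of zero, and long items do not outrank short ones merely because they contain more tokens.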
Copyright information
© 2011 Springer US
About this chapter
Cite this chapter
Kowalski, G. (2011). Indexing. In: Information Retrieval Architecture and Algorithms. Springer, Boston, MA. https://doi.org/10.1007/978-1-4419-7716-8_4
DOI: https://doi.org/10.1007/978-1-4419-7716-8_4
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4419-7715-1
Online ISBN: 978-1-4419-7716-8
eBook Packages: Computer Science (R0)