Web Information Retrieval

  • Stefano Ceri
  • Alessandro Bozzon
  • Marco Brambilla
  • Emanuele Della Valle
  • Piero Fraternali
  • Silvia Quarteroni

Part of the Data-Centric Systems and Applications book series (DCSA)

Table of contents

  1. Front Matter
    Pages I-XIV
  2. Principles of Information Retrieval

    1. Front Matter
      Pages 1-1
    2. Stefano Ceri, Alessandro Bozzon, Marco Brambilla, Emanuele Della Valle, Piero Fraternali, Silvia Quarteroni
      Pages 3-11
    3. Stefano Ceri, Alessandro Bozzon, Marco Brambilla, Emanuele Della Valle, Piero Fraternali, Silvia Quarteroni
      Pages 13-26
    4. Stefano Ceri, Alessandro Bozzon, Marco Brambilla, Emanuele Della Valle, Piero Fraternali, Silvia Quarteroni
      Pages 27-37
    5. Stefano Ceri, Alessandro Bozzon, Marco Brambilla, Emanuele Della Valle, Piero Fraternali, Silvia Quarteroni
      Pages 39-56
    6. Stefano Ceri, Alessandro Bozzon, Marco Brambilla, Emanuele Della Valle, Piero Fraternali, Silvia Quarteroni
      Pages 57-68
  3. Information Retrieval for the Web

    1. Front Matter
      Pages 69-69
    2. Stefano Ceri, Alessandro Bozzon, Marco Brambilla, Emanuele Della Valle, Piero Fraternali, Silvia Quarteroni
      Pages 71-90
    3. Stefano Ceri, Alessandro Bozzon, Marco Brambilla, Emanuele Della Valle, Piero Fraternali, Silvia Quarteroni
      Pages 91-110
    4. Stefano Ceri, Alessandro Bozzon, Marco Brambilla, Emanuele Della Valle, Piero Fraternali, Silvia Quarteroni
      Pages 111-120
    5. Stefano Ceri, Alessandro Bozzon, Marco Brambilla, Emanuele Della Valle, Piero Fraternali, Silvia Quarteroni
      Pages 121-133
  4. Advanced Aspects of Web Search

    1. Front Matter
      Pages 135-135
    2. Stefano Ceri, Alessandro Bozzon, Marco Brambilla, Emanuele Della Valle, Piero Fraternali, Silvia Quarteroni
      Pages 137-159
    3. Stefano Ceri, Alessandro Bozzon, Marco Brambilla, Emanuele Della Valle, Piero Fraternali, Silvia Quarteroni
      Pages 161-179
    4. Stefano Ceri, Alessandro Bozzon, Marco Brambilla, Emanuele Della Valle, Piero Fraternali, Silvia Quarteroni
      Pages 181-206
    5. Stefano Ceri, Alessandro Bozzon, Marco Brambilla, Emanuele Della Valle, Piero Fraternali, Silvia Quarteroni
      Pages 207-221
    6. Stefano Ceri, Alessandro Bozzon, Marco Brambilla, Emanuele Della Valle, Piero Fraternali, Silvia Quarteroni
      Pages 223-234
    7. Stefano Ceri, Alessandro Bozzon, Marco Brambilla, Emanuele Della Valle, Piero Fraternali, Silvia Quarteroni
      Pages 235-257
  5. Back Matter
    Pages 259-284

About this book

Introduction

With the proliferation of huge amounts of (heterogeneous) data on the Web, the importance of information retrieval (IR) has grown considerably over the last few years. Big players in the computer industry, such as Google, Microsoft and Yahoo!, are the primary contributors of technology for fast access to Web-based information; and searching capabilities are now integrated into most information systems, ranging from business management software and customer relationship systems to social networks and mobile phone applications.

Ceri and his co-authors aim at taking their readers from the foundations of modern information retrieval to the most advanced challenges of Web IR. To this end, their book is divided into three parts. The first part addresses the principles of IR and provides a systematic and compact description of basic information retrieval techniques (including binary, vector space and probabilistic models as well as natural language search processing) before focusing on its application to the Web. Part two addresses the foundational aspects of Web IR by discussing the general architecture of search engines (with a focus on the crawling and indexing processes), describing link analysis methods (specifically Page Rank and HITS), addressing recommendation and diversification, and finally presenting advertising in search (the main source of revenues for search engines). The third and final part describes advanced aspects of Web search, each chapter providing a self-contained, up-to-date survey on current Web research directions. Topics in this part include meta-search and multi-domain search, semantic search, search in the context of multimedia data, and crowd search.

The book is ideally suited to courses on information retrieval, as it covers all Web-independent foundational aspects. Its presentation is self-contained and does not require prior background knowledge. It can also be used in the context of classic courses on data management, allowing the instructor to cover both structured and unstructured data in various formats. Its classroom use is facilitated by a set of slides, which can be downloaded from www.search-computing.org.

Keywords

Web advertizing Web crawler Web search crowd search information retrieval multimedia search natural language processing recommender systems search computing semantic search

Authors and affiliations

  • Stefano Ceri
    • 1
  • Alessandro Bozzon
    • 2
  • Marco Brambilla
    • 3
  • Emanuele Della Valle
    • 4
  • Piero Fraternali
    • 5
  • Silvia Quarteroni
    • 6
  1. 1.e InformazionePolitecnico di Milano Dipartimento di ElettronicaMilanItaly
  2. 2.e InformazionePolitecnico di Milano Dipartimento di ElettronicaMilanItaly
  3. 3.e InformazionePolitecnico di Milano Dipartimento di ElettronicaMilanItaly
  4. 4.e InformazionePolitecnico di Milano Dipartimento di ElettronicaMilanItaly
  5. 5.e InformazionePolitecnico di Milano Dipartimento di ElettronicaMilanItaly
  6. 6.e InformazionePolitecnico di Milano Dipartimento di ElettronicaMilanItaly

Bibliographic information

  • DOI https://doi.org/10.1007/978-3-642-39314-3
  • Copyright Information Springer-Verlag Berlin Heidelberg 2013
  • Publisher Name Springer, Berlin, Heidelberg
  • eBook Packages Computer Science
  • Print ISBN 978-3-642-39313-6
  • Online ISBN 978-3-642-39314-3
  • About this book