Concepts and Methods for a Librarian of the Web

  • Mario Kubek

Part of the Studies in Big Data book series (SBD, volume 62)

Table of contents

  1. Front Matter
    Pages i-x
  2. Mario Kubek
    Pages 1-6
  3. Mario Kubek
    Pages 7-14
  4. Mario Kubek
    Pages 15-34
  5. Mario Kubek
    Pages 73-101
  6. Mario Kubek
    Pages 117-140
  7. Mario Kubek
    Pages 141-161
  8. Mario Kubek
    Pages 163-166
  9. Back Matter
    Pages 167-173

About this book


The World Wide Web can be considered a huge library that in consequence needs a capable librarian responsible for the classification and retrieval of documents as well as the mediation between library resources and users. Based on this idea, the concept of the “Librarian of the Web” is introduced which comprises novel, librarian-inspired methods and technical solutions to decentrally search for text documents in the web using peer-to-peer technology.

The concept’s implementation in the form of an interactive peer-to-peer client, called “WebEngine”, is elaborated on in detail. This software extends and interconnects common web servers creating a fully integrated, decentralised and self-organising web search system on top of the existing web structure. Thus, the web is turned into its own powerful search engine without the need for any central authority.

This book is intended for researchers and practitioners having a solid background in the fields of Information Retrieval and Web Mining.



Web Engine Web Search Engine P2P-system Co-occurrence Graph Librarian of the Web

Authors and affiliations

  • Mario Kubek
    • 1
  1. 1.FernUniversität in HagenLehrgebiet KommunikationsnetzeHagenGermany

Bibliographic information