Web Communities

Analysis and Construction

  • Yanchun Zhang
  • Jeffrey Xu Yu
  • Jingyu Hou

Table of contents

  1. Front Matter
    Pages I-XI
  2. Pages 1-6
  3. Pages 7-16
  4. Pages 111-143
  5. Pages 169-172
  6. Back Matter
    Pages 173-187

About this book

Introduction

Because Web documents lack a uniform schema and Web data is both vast and highly dynamic, the effectiveness and the efficiency of information management and retrieval are often unsatisfactory when conventional data management techniques are used.

A Web community, defined as a set of Web-based documents with its own logical structure, offers a flexible and efficient way to support information retrieval and to implement various applications. Zhang and his co-authors explain how to construct and analyse Web communities based on information such as Web document contents, hyperlinks, or user access logs. Their approaches combine results from Web search algorithms, Web clustering methods, and Web usage mining. They also detail the preliminaries needed to understand the algorithms presented, and they discuss several successful existing applications.

Researchers and students in information retrieval and Web search will find in this book all the necessary basics and methods to create and understand Web communities. Professionals developing Web applications will additionally benefit from the samples presented for their own designs and implementations.

Keywords

ALS, Content, DOM, Origin, Web, Web Clustering, Web Data Management, Web Search, Web Usage Mining, WebP, organization, search engine marketing (SEM)

Authors and affiliations

  • Yanchun Zhang (1)
  • Jeffrey Xu Yu (2)
  • Jingyu Hou (3)

  1. School of Computer Science and Mathematics, Victoria University of Technology, Melbourne City, Australia
  2. Dept. of Systems Engineering and Engineering Management, Chinese University of Hong Kong, Shatin, N.T., Hong Kong, China
  3. School of Information Technology, Deakin University, Burwood, Australia

Bibliographic information

  • DOI: https://doi.org/10.1007/3-540-27739-0
  • Copyright Information: Springer-Verlag Berlin Heidelberg 2006
  • Publisher Name: Springer, Berlin, Heidelberg
  • eBook Packages: Computer Science
  • Print ISBN: 978-3-540-27737-8
  • Online ISBN: 978-3-540-27739-2