Detecting Search Engine Spam from a Trackback Network in Blogspace

  • Masahiro Kimura
  • Kazumi Saito
  • Kazuhiro Kazama
  • Shin-ya Sato
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3684)

Abstract

We aim to develop a technique for detecting search engine optimization (SEO) spam websites. Specifically, we propose four methods, each based on a fundamental network metric, for extracting SEO spam entries from a given trackback network in blogspace. Using real trackback network data from blogspace, we evaluate the proposed methods experimentally and demonstrate that ranking entries by the average degree of their nearest neighbors can be a very promising approach for extracting SEO spam entries.
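
To make the metric named in the abstract concrete, the following is a minimal sketch of ranking entries by the average degree of their nearest neighbors, k_nn(i) = (1 / k_i) Σ_{j ∈ N(i)} k_j, where k_i is the degree of entry i and N(i) is its set of neighbors. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not specify whether trackback links are treated as directed, nor which end of the ranking indicates spam, so the undirected edge list, the function names, and the `descending` parameter are all hypothetical choices.

```python
from collections import defaultdict


def average_neighbor_degree(edges):
    """Return {node: mean degree of its neighbors} for an undirected graph.

    Assumption: trackback links are treated as undirected edges and
    duplicate links between the same pair of entries are collapsed.
    """
    neighbors = defaultdict(set)
    for u, v in edges:
        if u == v:
            continue  # ignore self-loops
        neighbors[u].add(v)
        neighbors[v].add(u)
    degree = {node: len(adj) for node, adj in neighbors.items()}
    # Every node stored here has at least one neighbor, so degree[node] > 0.
    return {
        node: sum(degree[m] for m in adj) / degree[node]
        for node, adj in neighbors.items()
    }


def rank_entries(edges, descending=True):
    """Rank entries by average neighbor degree, breaking ties by node name.

    Assumption: the ranking direction is a free parameter here, because
    the abstract does not state which end of the ranking flags spam.
    """
    scores = average_neighbor_degree(edges)
    sign = -1 if descending else 1
    return sorted(scores, key=lambda node: (sign * scores[node], node))


if __name__ == "__main__":
    # Hypothetical toy network of five entries.
    toy_edges = [("a", "b"), ("a", "c"), ("a", "d"), ("b", "c"), ("d", "e")]
    print(average_neighbor_degree(toy_edges))
    print(rank_entries(toy_edges))
```

On real data, the ranked list would be compared against entries labeled as spam to judge how well the metric separates them.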

Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • Masahiro Kimura (1)
  • Kazumi Saito (2)
  • Kazuhiro Kazama (3)
  • Shin-ya Sato (3)

  1. Department of Electronics and Informatics, Ryukoku University, Otsu, Shiga, Japan
  2. NTT Communication Science Laboratories, NTT Corporation, Seika-cho, Kyoto, Japan
  3. NTT Network Innovation Laboratories, NTT Corporation, Musashino, Tokyo, Japan
