ACM/IFIP/USENIX International Conference on Distributed Systems Platforms and Open Distributed Processing

Middleware 2003: Middleware 2003 pp 1-20

Approximate Object Location and Spam Filtering on Peer-to-Peer Systems

  • Feng Zhou
  • Li Zhuang
  • Ben Y. Zhao
  • Ling Huang
  • Anthony D. Joseph
  • John Kubiatowicz
Conference paper

DOI: 10.1007/3-540-44892-6_1

Volume 2672 of the book series Lecture Notes in Computer Science (LNCS)

Abstract

Recent work in P2P overlay networks allow for decentralized object location and routing (DOLR) across networks based on unique IDs. In this paper, we propose an extension to DOLR systems to publish objects using generic feature vectors instead of content-hashed GUIDs, which enables the systems to locate similar objects.We discuss the design of a distributed text similarity engine, named Approximate Text Addressing (ATA), built on top of this extension that locates objects by their text descriptions. We then outline the design and implementation of a motivating application on ATA, a decentralized spam-filtering service. We evaluate this system with 30,000 real spam email messages and 10,000 non-spam messages, and find a spam identification ratio of over 97% with zero false positives.

Download to read the full conference paper text

Copyright information

© Springer-Verlag Berlin Heidelberg 2003

Authors and Affiliations

  • Feng Zhou
    • 1
  • Li Zhuang
    • 1
  • Ben Y. Zhao
    • 1
  • Ling Huang
    • 1
  • Anthony D. Joseph
    • 1
  • John Kubiatowicz
    • 1
  1. 1.Computer Science Division, U. C.Berkeley