Approximate Object Location and Spam Filtering on Peer-to-Peer Systems
- Feng Zhou (Computer Science Division, U. C., Berkeley)
- Li Zhuang (Computer Science Division, U. C., Berkeley)
- Ben Y. Zhao (Computer Science Division, U. C., Berkeley)
- Ling Huang (Computer Science Division, U. C., Berkeley)
- Anthony D. Joseph (Computer Science Division, U. C., Berkeley)
- John Kubiatowicz (Computer Science Division, U. C., Berkeley)
Recent work in P2P overlay networks allows for decentralized object location and routing (DOLR) across networks based on unique IDs. In this paper, we propose an extension to DOLR systems that publishes objects using generic feature vectors instead of content-hashed GUIDs, which enables the systems to locate similar objects. We discuss the design of a distributed text similarity engine, named Approximate Text Addressing (ATA), built on top of this extension, that locates objects by their text descriptions. We then outline the design and implementation of a motivating application built on ATA: a decentralized spam-filtering service. We evaluate this system with 30,000 real spam email messages and 10,000 non-spam messages, and find a spam identification ratio of over 97% with zero false positives.
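The idea of publishing objects under feature vectors rather than a single content hash can be illustrated with a minimal sketch. This is not the paper's implementation: the in-memory `FeatureIndex` is a hypothetical stand-in for a DOLR overlay, and `feature_vector` (hashes of overlapping word shingles, keeping the smallest few) is an assumed feature scheme chosen only for illustration. An object is considered a match when it shares at least `threshold` features with the query.

```python
import hashlib
from collections import defaultdict


def feature_vector(text, shingle_words=3, size=8):
    """Hypothetical feature scheme: hash overlapping word shingles
    and keep the `size` smallest hashes as the feature set."""
    words = text.lower().split()
    count = max(1, len(words) - shingle_words + 1)
    shingles = {" ".join(words[i:i + shingle_words]) for i in range(count)}
    hashes = sorted(
        int.from_bytes(hashlib.sha1(s.encode("utf-8")).digest()[:8], "big")
        for s in shingles
    )
    return frozenset(hashes[:size])


class FeatureIndex:
    """In-memory stand-in for a DOLR overlay (assumption): each feature
    acts as a key under which object IDs are published."""

    def __init__(self):
        self._by_feature = defaultdict(set)

    def publish(self, object_id, features):
        # Publish the object once per feature, not once per content hash.
        for feature in features:
            self._by_feature[feature].add(object_id)

    def locate(self, features, threshold):
        # Count how many query features each published object shares.
        hits = defaultdict(int)
        for feature in features:
            for object_id in self._by_feature.get(feature, ()):
                hits[object_id] += 1
        return {oid for oid, n in hits.items() if n >= threshold}


if __name__ == "__main__":
    index = FeatureIndex()
    spam = "buy cheap pills now limited offer click here today"
    index.publish("spam-1", feature_vector(spam))
    # A lightly edited copy would miss an exact content-hash lookup,
    # but may still share enough features to be located.
    variant = "buy cheap pills now limited offer click here tonight"
    print(index.locate(feature_vector(variant), threshold=3))
```

With a content-hashed GUID, any single-word change produces an unrelated key; matching on a threshold of shared features is what lets near-duplicates be found. The threshold and feature count here are arbitrary illustrative values.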
- Book Title
- Middleware 2003
- Book Subtitle
- ACM/IFIP/USENIX International Middleware Conference Rio de Janeiro, Brazil, June 16–20, 2003 Proceedings
- pp 1-20
- Series Title
- Lecture Notes in Computer Science
- Publisher
- Springer Berlin Heidelberg
- Copyright Holder
- Springer-Verlag Berlin Heidelberg
- Editor Affiliations
- 4. Departamento de Informática, PUC-Rio
- 5. Department of Electrical Engineering and Computer Science, Vanderbilt University
- Author Affiliations
- 6. Computer Science Division, U. C., Berkeley