An Efficient and Spam-Robust Proximity Measure Between Communication Entities

Jeon, Joo Hyuk; Song, Jihwan; Kwon, Jeong Eun; Lee, Yoon Joon; Park, Man Ho; Kim, Myoung Ho

doi:10.1007/s11390-013-1339-z

An Efficient and Spam-Robust Proximity Measure Between Communication Entities

Short Paper
Published: 12 March 2013

Volume 28, pages 394–400, (2013)
Cite this article

Journal of Computer Science and Technology Aims and scope Submit manuscript

Joo Hyuk Jeon¹,
Jihwan Song¹,
Jeong Eun Kwon²,
Yoon Joon Lee¹,
Man Ho Park³ &
…
Myoung Ho Kim¹

154 Accesses
1 Citation
Explore all metrics

Abstract

Electronic communication service providers are obliged to retain communication data for a certain amount of time by their local laws. The retained communication data or the communication logs are used in various applications such as crime detection, viral marketing, analytical study, and so on. Many of these applications rely on effective techniques for analyzing communication logs. In this paper, we focus on measuring the proximity between two communication entities, which is a fundamental and important step toward further analysis of communication logs, and propose a new proximity measure called ESP (Efficient and Spam-Robust Proximity measure). Our proposed measure considers only the (graph-theoretically) shortest paths between two entities and gives small values to those between spam-like entities and others. Thus, it is not only computationally efficient but also spam-robust. By conducting several experiments on real and synthetic datasets, we show that our proposed proximity measure is more accurate, computationally efficient and spam-robust than the existing measures in most cases.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Understanding SMS Spam in a Large Cellular Network: Characteristics, Strategies and Defenses

Neighborhoods and bands: an analysis of the origins of spam

Article Open access 11 May 2015

Osvaldo Fonseca, Elverton Fazzion, … Marcelo HP Chaves

SOCIO-LENS: Spotting Unsolicited Caller Through Network Analysis

References

Kotzanikolaou P (2008) Data retention and privacy in electronic communications. IEEE Security and Privacy 6(5):46–52
Article Google Scholar
Canter D, Alison L J. The Social Psychology of Crime: Groups, Teams and Networks. Aldershot, UK: Ashgate, 1999.
Aery M, Chakravarthy S. eMailSift: Email classification based on structure and content. In Proc. the 15th ICDM, November 2005, pp.18–25.
Yu B, Xu Z (2008) A comparative study for content-based dynamic spam classification using four machine learning algorithms. Knowledge-Based Systems 21(4):355–362
Article Google Scholar
Layfield R, Thuraisingham B, Khan L, Kantarcioglu M (2009) Design and implementation of a secure social network system. International Journal of Computer Systems Science & Engineering 24(2):71–84
Google Scholar
Song H H, Cho T W, Dave V, Zhang Y, Qiu L. Scalable proximity estimation and link prediction in online social networks. In Proc. the 9th IMC, November 2009, pp.322–335.
Pan J Y, Yang H J, Faloutsos C, Duygulu P. Automatic multimedia crossmodal correlation discovery. In Proc. the 10th SIGKDD, August 2004, pp.653–658.
Sozio M, Gionis A. The community-search problem and how to plan a successful cocktail party. In Proc. the 16th SIGKDD, July 2010, pp.939–948.
Pirmez L, Carmo LFRC, Bacellar LF (2010) Enhancing Levenshtein distance algorithm for assessing behavioral trust. Int J Computer Systems Science & Engineering 25(1):5–14
Google Scholar
Tong H, Faloutsos C. Center-piece subgraphs: Problem definition and fast solutions. In Proc. the 12th SIGKDD, August 2006, pp.404–413.
Tong H, Faloutsos C, Pan JY (2008) Random walk with restart: Fast solutions and applications. Knowledge of Information Systems 14(3):327–346
Article MATH Google Scholar
Tong H, Qu H, Jamjoom H. Measuring proximity on graphs with side information. In Proc. ICDM, December 2008, pp.598–607.
Koren Y, North S C, Volinsky C. Measuring and extracting proximity graphs in networks. ACM Trans. Knowledge Discovery from Data, 2007, 1(3), Article No.12.
Faloutsos C, McCurley K S, Tomkins A. Fast discovery of connection subgraphs. In Proc. the 10th SIGKDD, August 2004, pp.118–127.
Airoldi EM, Blei DM, Fienberg SE, Xing EP (2008) Mixed membership stochastic blockmodels. Journal of Machine Learning Research 9:1981–2014
MATH Google Scholar
Kemp C, Tenenbaum J B, Griffiths T L, Yamada T, Ueda N. Learning systems of concepts with an infinite relational model. In Proc. the 21st AAAI, July 2006, pp.381–388.
Kubica J, Moore A, Schneider J, Yang Y. Stochastic link and group detection. In Proc. the 18th AAAI, July 28-August 1, 2002, pp.798–806.
Kurihara K, Kameya Y, Sato T. A frequency-based stochastic blockmodel. In Proc. Workshop on Information-Based Induction Sciences, October 2006.
Lantuejoul C, Maisonneuve F (1984) Geodesic methods in quantitative image analysis. Pattern Recognition 17(2):177–187
Article MathSciNet MATH Google Scholar
Grazzini J, Soille P, Bielskiy C. On the use of geodesic distances for spatial interpolation. In Proc. GeoComputation, September 2007.
Borgatti SP, Everett MG (2006) A graph-theoretic perspective on centrality. Social Networks 28(4):466–484
Article Google Scholar
Shetty J, Adibi J. The Enron email dataset database schema and brief statistical report. Technical Report, Information Sciences Institute, University of Southern California, 2004.

Download references

Author information

Authors and Affiliations

Department of Computer Science, Korea Advanced Institute of Science and Technology, Daejeon, 305-701, Korea
Joo Hyuk Jeon, Jihwan Song, Yoon Joon Lee (Member, ACM, IEEE) & Myoung Ho Kim
Biz Solution Team, SK Telecom Information Technology R&D Center, Seoul, 100-999, Korea
Jeong Eun Kwon
Mobile Communication Convergence Research Team, Electronics and Telecommunications Research Institute, Daejeon, 305-700, Korea
Man Ho Park

Authors

Joo Hyuk Jeon
View author publications
You can also search for this author in PubMed Google Scholar
Jihwan Song
View author publications
You can also search for this author in PubMed Google Scholar
Jeong Eun Kwon
View author publications
You can also search for this author in PubMed Google Scholar
Yoon Joon Lee
View author publications
You can also search for this author in PubMed Google Scholar
Man Ho Park
View author publications
You can also search for this author in PubMed Google Scholar
Myoung Ho Kim
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Joo Hyuk Jeon.

Electronic Supplementary Material

Below is the link to the electronic supplementary material.

(DOC 28.0 KB)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Jeon, J.H., Song, J., Kwon, J.E. et al. An Efficient and Spam-Robust Proximity Measure Between Communication Entities. J. Comput. Sci. Technol. 28, 394–400 (2013). https://doi.org/10.1007/s11390-013-1339-z

Download citation

Received: 05 March 2012
Revised: 29 September 2012
Published: 12 March 2013
Issue Date: March 2013
DOI: https://doi.org/10.1007/s11390-013-1339-z

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

An Efficient and Spam-Robust Proximity Measure Between Communication Entities

Abstract

Access this article

Similar content being viewed by others

Understanding SMS Spam in a Large Cellular Network: Characteristics, Strategies and Defenses

Neighborhoods and bands: an analysis of the origins of spam

SOCIO-LENS: Spotting Unsolicited Caller Through Network Analysis

References

Author information

Authors and Affiliations

Corresponding author

Electronic Supplementary Material

(DOC 28.0 KB)

Rights and permissions

About this article

Cite this article

Keywords

Navigation

An Efficient and Spam-Robust Proximity Measure Between Communication Entities

Abstract

Access this article

Similar content being viewed by others

Understanding SMS Spam in a Large Cellular Network: Characteristics, Strategies and Defenses

Neighborhoods and bands: an analysis of the origins of spam

SOCIO-LENS: Spotting Unsolicited Caller Through Network Analysis

References

Author information

Authors and Affiliations

Corresponding author

Electronic Supplementary Material

(DOC 28.0 KB)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation