Subspace Clustering on Mobile Data for Discovering Circle of Friends

  • Tao Wu
  • Yujie Fan
  • Zhiling Hong
  • Lifei Chen
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9403)


The discovery of circle of friends has risen rapidly in recent years. Traditional methods are mainly based on social network analysis which relies heavily on self-report data, such that these methods have isolated successes with limited accuracy, breadth, and depth. In this paper, we propose a new method which combines clustering technique to automatically discover the circle of friends on mobile data. In our method, the circle of friends is modeled as non-overlapping subspace clusters on mobile data with a Vector Space Model (VSM) based representation, for which a new subspace clustering algorithm is proposed to mine the underlying friend-relationship. The experimental studies on real mobile data demonstrate the effectiveness of the new method, and the results show that our clustering algorithm achieves better performance than the existing clustering algorithms.


Circle of friends Mobile data Non-overlapping subspaces Subspace clustering 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Schwartz, M.F., Wood, D.: Discovering shared interests using graph analysis. Communications of the ACM 36, 78–89 (1993)CrossRefGoogle Scholar
  2. 2.
    Nakajima, S., Tatemura, J., Hino, Y., et al.: Discovering important bloggers based on analyzing blog threads. Annual Workshop on the Weblogging Ecosystem. Springer, Heidelberg (2005)Google Scholar
  3. 3.
    Domingos, P., Richardson, M.: Mining the network value of customers. In: Proceedings of the Seventh International Conference on Knowledge Discovery and Data Mining, pp. 57–66. ACM Press, New York (2001)Google Scholar
  4. 4.
    Shen, D., Sun, J.T., Yang, Q., et al.: Latent friend mining from blog data. In: International Conference on Data Mining, pp. 552–561. IEEE Press, Piscataway (2006)Google Scholar
  5. 5.
    Kuan, S.T., Wu, B.Y., Lee, W.J.: Finding friend groups in blogosphere. In: 22nd International Conference on Advanced Information Networking and Applications-Workshops, pp. 1046–1050. IEEE Press, Piscataway (2008)Google Scholar
  6. 6.
    Java, A., Song, X., Finin, T., et al.: Why we twitter: understanding microblogging usage and communities. In: Proceedings of the 9th WebKDD and 1st SNA-KDD Workshop on Web Mining and Social Network Analysis, pp. 56–65. ACM Press, New York (2007)Google Scholar
  7. 7.
    Eagle, N., Pentland, A.S., Lazer, D.: Inferring friendship network structure by using mobile phone data. Proceedings of the National Academy of Sciences. 106, 15274–15278 (2009)CrossRefGoogle Scholar
  8. 8.
    Park, H.S., Jun, C.H.: A simple and fast algorithm for K-medoids clustering. Expert Systems with Applications 36, 3336–3341 (2009)CrossRefGoogle Scholar
  9. 9.
    Aggarwal, C.C., Wolf, J.L., Yu, P.S., et al.: Fast algorithms for projected clustering. In: ACM SIGMOD Record, pp. 61–72. ACM Press, New York (1999)Google Scholar
  10. 10.
    Jing, L., Ng, M.K., Huang, J.Z.: An entropy weighting k-means algorithm for subspace clustering of high-dimensional sparse data. IEEE Transactions on Knowledge and Data Engineering. 19, 1026–1041 (2007)CrossRefGoogle Scholar
  11. 11.
    Gan, G., Wu, J., Yang, Z.-J.: A Fuzzy Subspace Algorithm for Clustering High Dimensional Data. In: Li, X., Zaïane, O.R., Li, Z. (eds.) ADMA 2006. LNCS (LNAI), vol. 4093, pp. 271–278. Springer, Heidelberg (2006) CrossRefGoogle Scholar
  12. 12.
    Chen, L., Jiang, Q., Wang, S.: Model-based method for projective clustering. IEEE Transactions on Knowledge and Data Engineering 24, 1291–1305 (2012)CrossRefGoogle Scholar
  13. 13.
    Han, J., Kamber, M.: Data mining: concepts and techniques, 2nd edn. China Machine Press, Beijing (2007). Fan Ming, Meng Xiaofeng zbMATHGoogle Scholar
  14. 14.
    Strehl, A., Ghosh, J.: Cluster ensembles-a knowledge reuse framework for combining multiple partitions. The Journal of Machine Learning Research 3, 583–617 (2003)MathSciNetzbMATHGoogle Scholar
  15. 15.
    Zhao, Y., Karypis, G.: Comparison of Agglomerative and Partitional Document Clustering Algorithms. Technical Report#02-014. University of Minnesota (2002)Google Scholar
  16. 16.
    Hubert, L., Arabie, P.: Comparing partitions. Journal of Classification 2(1), 193–218 (1985)CrossRefzbMATHGoogle Scholar
  17. 17.
    Iam-On, N., Boongoen, T., Garrett, S., et al.: A Link-Based Approach to the Cluster Ensemble Problem. IEEE Transactions on Pattern Analysis & Machine Intelligence 33, 2396–2409 (2011)CrossRefGoogle Scholar
  18. 18.
    Eagle, N., De Montjoye, Y., Bettencourt, L.M.A.: Community computing: Comparisons between rural and urban societies using mobile phone data. In: International Conference on Computational Science and Engineering, pp. 144–150. IEEE, Piscataway (2009)Google Scholar
  19. 19.
    Papalexakis, E.E.: Automatic Unsupervised Tensor Mining with Quality Assessment (2015). arXiv:1503.0335
  20. 20.
    Farrahi, K., Gatica-Perez, D.: Daily routine classification from mobile phone data. Machine Learning for Multimodal Interaction. Springer, Heidelberg (2008)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  • Tao Wu
    • 1
    • 2
  • Yujie Fan
    • 1
    • 2
  • Zhiling Hong
    • 3
  • Lifei Chen
    • 1
    • 2
  1. 1.School of Mathematics and Computer ScienceFujian Normal UniversityFuzhouChina
  2. 2.Fujian Provincial Key Laboratory of Network Security and CryptologyFujian Normal UniversityFuzhouChina
  3. 3.Software SchoolXiamen UniversityXiamenChina

Personalised recommendations