Comprehensive Graph and Content Feature Based User Profiling

  • Peihao Tong
  • Junjie Yao
  • Liping Wang
  • Shiyu Yang
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9877)


Nowadays, users post a lot of their ordinary life records to online social sites. Rich social content covers discussion, interaction and communication activities etc. The social data provides insights into users’ interest, preference and communication aspects. An interesting problem is how to profile users’ occupation, i.e., professional categories. It has great values for users’ recommendation and personalized delivery services. However, it is very challenging, compared to gender or age prediction, due to the multiple categories and complex scenarios.

This paper takes a new perspective to tackle the occupation prediction. We propose novel methods to transfer the commonly used social network feature and textual content feature into vector space representation. Specifically, we use the embedding method to transfer the social network feature into a low dimensional space. We then propose an integrated framework that combines the graph and content feature for the occupation classification problem. Empirical study on a large real social dataset verifies the effectiveness and usefulness of the proposed approach.


User profiling Graph embedding Prediction model 



The research is supported by the National Natural Science Foundation of China under Grant No. 61502169, 61401155 and NSFC-Zhejiang Joint Fund for the Integration of Industrialization and Informatization Grant No. U1509219.


  1. 1.
    Abou-Rjeili, A., Karypis, G.: Multilevel algorithms for partitioning power-law graphs. In: 20th International Parallel and Distributed Processing Symposium, IPDPS 2006, p. 10-pp. IEEE (2006)Google Scholar
  2. 2.
    Bengio, Y., Courville, A., Vincent, P.: Representation learning: a review and new perspectives. IEEE Trans. Pattern Anal. Mach. Intell. 35(8), 1798–1828 (2013)CrossRefGoogle Scholar
  3. 3.
    Cao, S., Lu, W., Xu, Q.: GraRep: Learning graph representations with global structural information. In: Proceeding of CIKM, pp. 891–900 (2015)Google Scholar
  4. 4.
    Cha, M., Haddadi, H., Benevenuto, F., Gummadi, P.K.: Measuring user influence in twitter: The million follower fallacy. In: Proceeding of ICWSM, pp. 10–17 (2010)Google Scholar
  5. 5.
    Cox, T.F., Cox, M.A.: Multidimensional Scaling. CRC Press, Boca Raton (2000)zbMATHGoogle Scholar
  6. 6.
    Farseev, A., Nie, L., Akbari, M., Chua, T.S.: Harvesting multiple sources for user profile learning: a big data study. In: Proceeding of ACM Multimedia, pp. 235–242 (2015)Google Scholar
  7. 7.
    Huang, Y., Yu, L., Wang, X., Cui, B.: A multi-source integration framework for user occupation inference in social media systems. World Wide Web 18(5), 1247–1267 (2015)CrossRefGoogle Scholar
  8. 8.
    Manning, C.D., Raghavan, P., Schütze, H., et al.: Introduction to Information Retrieval, vol. 1. Cambridge University Press, Cambridge (2008)CrossRefzbMATHGoogle Scholar
  9. 9.
    Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013)
  10. 10.
    Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: NIPS, pp. 3111–3119 (2013)Google Scholar
  11. 11.
    Perozzi, B., Al-Rfou, R., Skiena, S.: DeepWalk: Online learning of social representations. In: Proceeding of SIGKDD, pp. 701–710 (2014)Google Scholar
  12. 12.
    Roweis, S.T., Saul, L.K.: Nonlinear dimensionality reduction by locally linear embedding. Science 290(5500), 2323–2326 (2000)CrossRefGoogle Scholar
  13. 13.
    Sun, Y., Norick, B., Han, J., Yan, X., Yu, P.S., Yu, X.: Integrating meta-path selection with user-guided object clustering in heterogeneous information networks. In: Proceeding of SIGKDD, pp. 1348–1356 (2012)Google Scholar
  14. 14.
    Tang, L., Liu, H.: Relational learning via latent social dimensions. In: Proceeding of SIGKDD, pp. 817–826 (2009)Google Scholar
  15. 15.
    Yan, S., Xu, D., Zhang, B., Zhang, H.J., Yang, Q., Lin, S.: Graph embedding and extensions: a general framework for dimensionality reduction. IEEE Trans. Pattern Anal. Mach. Intell. 29(1), 40–51 (2007)CrossRefGoogle Scholar
  16. 16.
    Yang, S.H., Long, B., Smola, A., Sadagopan, N., Zheng, Z., Zha, H.: Like like alike joint friendship and interest propagation in social networks. In: Proceeding of WWW, pp. 537–546 (2011)Google Scholar

Copyright information

© Springer International Publishing AG 2016

Authors and Affiliations

  • Peihao Tong
    • 1
  • Junjie Yao
    • 1
  • Liping Wang
    • 1
  • Shiyu Yang
    • 2
  1. 1.School of Computer Science and Software EngineeringEast China Normal UniversityShanghaiChina
  2. 2.School of Computer Science and EngineeringThe University of New South WalesSydneyAustralia

Personalised recommendations