Skip to main content

Tag-Based User Interest Discovery Though Keywords Extraction in Social Network

  • Conference paper
  • First Online:
Big Data Computing and Communications (BigCom 2015)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9196))

Included in the following conference series:

Abstract

We consider the problem of exploiting to discover user interests from social network. User tags in social networks convey abundant implications of user interests,which great benefit various tasks ranging from user profile construction to user similarity calculation based recommendation. However,user interests extraction from social tags suffer from large diversity of word choices due to different user preference,especially the words that quite specific in minority knowledge domains. In addition,the deficiency of uniform concept hierarchy and lack of explicit semantic association between tags obscure the real interests of users. To obtain user interests from tags,we propose a tag normalization algorithm based on world knowledge to underpin the construction of common tags as well as the organization of user hierarchy interest. Experiments with Sina Micro-blog (http://weibo.com/) show that our algorithm can infer user’s interests better than traditional method based on contents.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Blei, D.M.: Probabilistic topic models. Communications of the ACM 55(4), 77–84 (2012)

    Article  MathSciNet  Google Scholar 

  2. Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. The Journal of Machine Learning Research 3, 993–1022 (2003)

    Google Scholar 

  3. Chan, P.K.: A non-invasive learning approach to building web user profiles (1999)

    Google Scholar 

  4. Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., Kuksa, P.: Natural language processing (almost) from scratch. The Journal of Machine Learning Research 12, 2493–2537 (2011)

    MATH  Google Scholar 

  5. Gabrilovich, E., Markovitch, S.: Computing semantic relatedness using wikipedia-based explicit semantic analysis. In: IJCAI, vol. 7, pp. 1606–1611 (2007)

    Google Scholar 

  6. Kim, H.R., Chan, P.K.: Learning implicit user interest hierarchy for context in personalization. In: Proceedings of the 8th International Conference on Intelligent User Interfaces, pp. 101–108. ACM (2003)

    Google Scholar 

  7. Li, M., Zhang, C., Sun, L., Shao, X.: Topic extraction based on knowledge cluster in the field of micro-blog. In: Huang, D.-S., Jo, K.-H., Wang, L. (eds.) ICIC 2014. LNCS, vol. 8589, pp. 542–550. Springer, Heidelberg (2014)

    Google Scholar 

  8. Li, W.: Random texts exhibit zipf’s-law-like word frequency distribution. IEEE Transactions on Information Theory 38(6), 1842–1845 (1992)

    Article  Google Scholar 

  9. Liu, Q., Niu, K., He, Z., He, X.: Microblog user interest modeling based on feature propagation. In: 2013 Sixth International Symposium on Computational Intelligence and Design (ISCID), vol. 1, pp. 383–386. IEEE (2013)

    Google Scholar 

  10. Liu, Z., Chen, X., Sun, M.: Mining the interests of chinese microbloggers via keyword extraction. Frontiers of Computer Science 6(1), 76–87 (2012)

    MathSciNet  Google Scholar 

  11. Ma, H., Zhou, D., Liu, C., Lyu, M.R., King, I.: Recommender systems with social regularization. In: Proceedings of the Fourth ACM International Conference on Web Search and Data Mining, pp. 287–296. ACM (2011)

    Google Scholar 

  12. Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, pp. 3111–3119 (2013)

    Google Scholar 

  13. Newman, M.E.: Power laws, pareto distributions and zipf’s law. Contemporary Physics 46(5), 323–351 (2005)

    Article  Google Scholar 

  14. Ramage, D., Hall, D., Nallapati, R., Manning, C.D.: Labeled lda: a supervised topic model for credit attribution in multi-labeled corpora. In: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, vol. 1, pp. 248–256. Association for Computational Linguistics (2009)

    Google Scholar 

  15. Wang, J., Li, L., Ren, F.: An improved method of keywords extraction based on short technology text. In: 2010 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE), pp. 1–6. IEEE (2010)

    Google Scholar 

  16. Wang, Q., Wu, W., Gu, Y.: The application of lucene in information leakage monitoring and querying system. In: 2010 2nd International Conference on Information Engineering and Computer Science (ICIECS), pp. 1–4. IEEE (2010)

    Google Scholar 

  17. Xu, H., Li, J.M.: Design and implementation of web search engine based on lucene. Journal of Hebei Software Institute 1, 024 (2009)

    Google Scholar 

  18. Xu, Z., Lu, R., Xiang, L., Yang, Q.: Discovering user interest on twitter with a modified author-topic model. In: 2011 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT), vol. 1, pp. 422–429. IEEE (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ping Yang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Yang, P., Song, Y., Ji, Y. (2015). Tag-Based User Interest Discovery Though Keywords Extraction in Social Network. In: Wang, Y., Xiong, H., Argamon, S., Li, X., Li, J. (eds) Big Data Computing and Communications. BigCom 2015. Lecture Notes in Computer Science(), vol 9196. Springer, Cham. https://doi.org/10.1007/978-3-319-22047-5_29

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-22047-5_29

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-22046-8

  • Online ISBN: 978-3-319-22047-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics