Abstract
We consider the problem of exploiting to discover user interests from social network. User tags in social networks convey abundant implications of user interests,which great benefit various tasks ranging from user profile construction to user similarity calculation based recommendation. However,user interests extraction from social tags suffer from large diversity of word choices due to different user preference,especially the words that quite specific in minority knowledge domains. In addition,the deficiency of uniform concept hierarchy and lack of explicit semantic association between tags obscure the real interests of users. To obtain user interests from tags,we propose a tag normalization algorithm based on world knowledge to underpin the construction of common tags as well as the organization of user hierarchy interest. Experiments with Sina Micro-blog (http://weibo.com/) show that our algorithm can infer user’s interests better than traditional method based on contents.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Blei, D.M.: Probabilistic topic models. Communications of the ACM 55(4), 77–84 (2012)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. The Journal of Machine Learning Research 3, 993–1022 (2003)
Chan, P.K.: A non-invasive learning approach to building web user profiles (1999)
Collobert, R., Weston, J., Bottou, L., Karlen, M., Kavukcuoglu, K., Kuksa, P.: Natural language processing (almost) from scratch. The Journal of Machine Learning Research 12, 2493–2537 (2011)
Gabrilovich, E., Markovitch, S.: Computing semantic relatedness using wikipedia-based explicit semantic analysis. In: IJCAI, vol. 7, pp. 1606–1611 (2007)
Kim, H.R., Chan, P.K.: Learning implicit user interest hierarchy for context in personalization. In: Proceedings of the 8th International Conference on Intelligent User Interfaces, pp. 101–108. ACM (2003)
Li, M., Zhang, C., Sun, L., Shao, X.: Topic extraction based on knowledge cluster in the field of micro-blog. In: Huang, D.-S., Jo, K.-H., Wang, L. (eds.) ICIC 2014. LNCS, vol. 8589, pp. 542–550. Springer, Heidelberg (2014)
Li, W.: Random texts exhibit zipf’s-law-like word frequency distribution. IEEE Transactions on Information Theory 38(6), 1842–1845 (1992)
Liu, Q., Niu, K., He, Z., He, X.: Microblog user interest modeling based on feature propagation. In: 2013 Sixth International Symposium on Computational Intelligence and Design (ISCID), vol. 1, pp. 383–386. IEEE (2013)
Liu, Z., Chen, X., Sun, M.: Mining the interests of chinese microbloggers via keyword extraction. Frontiers of Computer Science 6(1), 76–87 (2012)
Ma, H., Zhou, D., Liu, C., Lyu, M.R., King, I.: Recommender systems with social regularization. In: Proceedings of the Fourth ACM International Conference on Web Search and Data Mining, pp. 287–296. ACM (2011)
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, pp. 3111–3119 (2013)
Newman, M.E.: Power laws, pareto distributions and zipf’s law. Contemporary Physics 46(5), 323–351 (2005)
Ramage, D., Hall, D., Nallapati, R., Manning, C.D.: Labeled lda: a supervised topic model for credit attribution in multi-labeled corpora. In: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, vol. 1, pp. 248–256. Association for Computational Linguistics (2009)
Wang, J., Li, L., Ren, F.: An improved method of keywords extraction based on short technology text. In: 2010 International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE), pp. 1–6. IEEE (2010)
Wang, Q., Wu, W., Gu, Y.: The application of lucene in information leakage monitoring and querying system. In: 2010 2nd International Conference on Information Engineering and Computer Science (ICIECS), pp. 1–4. IEEE (2010)
Xu, H., Li, J.M.: Design and implementation of web search engine based on lucene. Journal of Hebei Software Institute 1, 024 (2009)
Xu, Z., Lu, R., Xiang, L., Yang, Q.: Discovering user interest on twitter with a modified author-topic model. In: 2011 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT), vol. 1, pp. 422–429. IEEE (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Yang, P., Song, Y., Ji, Y. (2015). Tag-Based User Interest Discovery Though Keywords Extraction in Social Network. In: Wang, Y., Xiong, H., Argamon, S., Li, X., Li, J. (eds) Big Data Computing and Communications. BigCom 2015. Lecture Notes in Computer Science(), vol 9196. Springer, Cham. https://doi.org/10.1007/978-3-319-22047-5_29
Download citation
DOI: https://doi.org/10.1007/978-3-319-22047-5_29
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-22046-8
Online ISBN: 978-3-319-22047-5
eBook Packages: Computer ScienceComputer Science (R0)