Abstract
Alongside the enormous volume of user-generated content posted to World Wide Web, there exists a thriving demand for search personalization services, especially those utilizing collaborative tagging data. To provide personalized services, a user model is usually required. We address the setting adopted by the majority of previous work, where a user model consists solely of the user’s past information. We construct an augmented user model from a number of tags and documents. These resources are further processed according to the user’s past information by exploring external knowledge base. A novel generative model is proposed for user model generation. This model leverages recent advances in neural language models such as Word Embeddings with latent semantic models such as Latent Dirichlet Allocation. We further present a new query expansion method to facilitate the desired personalized retrieval. Experiments conducted by utilizing real-world collaborative tagging data show that the methods proposed in the current paper outperform several non-personalized methods as well as existing personalized search methods by utilizing user models solely constructed from usage histories.
Keywords
- Personalized search
- Collaborative tagging systems
- Latent semantic models
- Word embeddings
- Query expansion
This is a preview of subscription content, access via your institution.
Buying options
References
Ghorab, M.R., Zhou, D., O’Connor, A., Wade, V.: Personalised information retrieval: survey and classification. User Model. User-Adap. Inter. 23, 381–443 (2013)
Zhou, D., Lawless, S., Wu, X., Zhao, W., Liu, J.: A study of user profile representation for personalized cross-language information retrieval. Aslib J. Inf. Manage. 68, 448–477 (2016)
Xu, S., Bao, S., Fei, B., Su, Z., Yu, Y.: Exploring folksonomy for personalized search. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 155–162. ACM (2008)
Bouadjenek, M.R., Hacid, H., Bouzeghoub, M.: Sopra: a new social personalized ranking function for improving web search. In: Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 861–864. ACM, Dublin (2013)
Xie, H., Li, X., Wang, T., Chen, L., Li, K., Wang, F.L., Cai, Y., Li, Q., Min, H.: Personalized search for social media via dominating verbal context. Neurocomputing 172, 27–37 (2016)
Xie, H., Li, X., Wang, T., Lau, R.Y.K., Wong, T.-L., Chen, L., Wang, F.L., Li, Q.: Incorporating sentiment into tag-based user profiles and resource profiles for personalized search in folksonomy. Inf. Process. Manage. 52, 61–72 (2016)
Bouadjenek, M.R., Hacid, H., Bouzeghoub, M., Daigremont, J.: Personalized social query expansion using social bookmarking systems. In: Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1113–1114. ACM, Beijing (2011)
Zhou, D., Lawless, S., Wade, V.: Improving search via personalized query expansion using social media. Inf. Retr. 15, 218–242 (2012)
Zhou, D., Lawless, S., Wade, V.: Web search personalization using social data. In: Zaphiris, P., Buchanan, G., Rasmussen, E., Loizides, F. (eds.) TPDL 2012. LNCS, vol. 7489, pp. 298–310. Springer, Heidelberg (2012). doi:10.1007/978-3-642-33290-6_32
Bender, M., Crecelius, T., Kacimi, M., Michel, S., Neumann, T., Parreira, J.X., Schenkel, R., Weikum, G.: Exploiting social relations for query expansion and result ranking. In: Proceedings of the IEEE 24th International Conference on Data Engineering Workshop, ICDEW 2008, pp. 501–506. IEEE (2008)
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, NIPS 2013, pp. 3111–3119 (2013)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
Dou, Z., Song, R., Wen, J.-R.: A large-scale evaluation and analysis of personalized search strategies. In: Proceedings of the 16th International Conference on World Wide Web, pp. 581–590. ACM, Banff (2007)
Zhou, D., Lawless, S., Liu, J., Zhang, S., Xu, Y.: Query expansion for personalized cross-language information retrieval. In: Proceedings of the 10th International Workshop on Semantic and Social Media Adaptation and Personalization, SMAP 2015, pp. 1–5. IEEE, Trento (2015)
Chirita, P.-A., Firan, C.S., Nejdl, W.: Personalized query expansion for the web. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 7–14. ACM, Amsterdam (2007)
Wang, Q., Jin, H.: Exploring online social activities for adaptive search personalization. In: Proceedings of the 19th ACM International Conference on Information and Knowledge Management, pp. 999–1008. ACM, Toronto (2010)
Cai, Y., Li, Q.: Personalized search by tag-based user profile and resource profile in collaborative tagging systems. In: Proceedings of the 19th ACM International Conference on Information and Knowledge Management, pp. 969–978. ACM, Toronto (2010)
Das, R., Zaheer, M., Dyer, C.: Gaussian LDA for topic models with word embeddings. In: Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, ACL 2015, pp. 795–804. ACL, Beijing (2015)
Liu, Y., Liu, Z., Chua, T.-S., Sun, M.: Topical word embeddings. In: Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, AAAI 2015, pp. 2418–2424. AAAI Press, Austin (2015)
Lavrenko, V., Croft, W.B.: Relevance based language models. In: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 120–127. ACM, New Orleans (2001)
Ganguly, D., Leveling, J., Jones, G.J.F.: Topical relevance model. In: Hou, Y., Nie, J.-Y., Sun, L., Wang, B., Zhang, P. (eds.) AIRS 2012. LNCS, vol. 7675, pp. 326–335. Springer, Heidelberg (2012). doi:10.1007/978-3-642-35341-3_28
Zubiaga, A., Garcia-Plaza, A.P., Fresno, V., Martinez, R.: Content-based clustering for tag cloud visualization. In: Proceedings of the International Conference on Advances in Social Network Analysis and Mining, ASONAM 2009, pp. 316–319. IEEE (2009)
Zubiaga, A., Fresno, V., Martinez, R., Garcia-Plaza, A.P.: Harnessing folksonomies to produce a social classification of resources. IEEE Trans. Knowl. Data Eng. 25, 1801–1813 (2013)
Zhai, C., Lafferty, J.: Model-based feedback in the language modeling approach to information retrieval. In: Proceedings of the Tenth International Conference on Information and Knowledge Management, pp. 403–410. ACM (2001)
Diaz, F., Metzler, D.: Improving the estimation of relevance models using large external corpora. In: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 154–161. ACM, Seattle (2006)
Acknowledgments
This research was supported by the National Natural Science Foundation of China (61300129, 61572187 and 61272063), Scientific Research Fund of Hunan Provincial Education Department of China (16K030), Hunan Provincial Innovation Foundation For Postgraduate (CX2016B575). This research was also supported by the ADAPT Centre for Digital Content Technology, which is funded under the Science Foundation Ireland Research Centres Programme (13/RC/2106) and is co-funded under the European Regional Development Fund.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
About this paper
Cite this paper
Zhou, D., Wu, X., Zhao, W., Lawless, S., Liu, J. (2017). Exploring External Knowledge Base for Personalized Search in Collaborative Tagging Systems. In: Wang, S., Zhou, A. (eds) Collaborate Computing: Networking, Applications and Worksharing. CollaborateCom 2016. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 201. Springer, Cham. https://doi.org/10.1007/978-3-319-59288-6_37
Download citation
DOI: https://doi.org/10.1007/978-3-319-59288-6_37
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-59287-9
Online ISBN: 978-3-319-59288-6
eBook Packages: Computer ScienceComputer Science (R0)