Abstract
In the common formulation, the recommendation problem is reduced to the problem of estimating the utilization for the items that have not been seen by a user [1]. Micro-blog recommendation will recommend micro-blogs interest users, mostly those related to the micro-blogs that a user had issued or trending topics. One indispensable step in realizing effective recommendation is to compute short text similarities between micro-blogs. In this paper, we utilize two kinds of approaches, traditional cosine-based approach and WordNet-based semantic approach, to compute similarities between micro-blogs and recommend top related ones to users. We conduct experimental study on the effectiveness of two approaches using a set of evaluation measures. The results show that semantic similarity based approach has relatively higher precision than that of traditional cosine-based method using 548 twitters as dataset.
This research was undertaken as part of Project 61003130 funded by National Natural Science Foundation of China.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Adomavicius, G., Tuzhilin, A.: Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions. IEEE Transactions on Knowledge and Data Engineering 17(6), 734–749 (2005)
Metzler, D., Dumais, S., Meek, C.: Similarity Measures for Short Segments of Text. In: Amati, G., Carpineto, C., Romano, G. (eds.) ECIR 2007. LNCS, vol. 4425, pp. 16–27. Springer, Heidelberg (2007)
Ramage, D., Dumais, S., Liebling, D.: Characterizing Microblogs with Topic Models. Association for the Advancement of Artificial Intelligence (2010)
Newman, M.E.J., Park, J.: Why social networks are different from other types of networks. Phys. Rev. EÂ 68(3), 036122 (2003)
Yin, D., Hong, L., Xiong, X., Davison, B.D.: Link Formation Analysis in Microblogs. In: Proceedings of SIGIR 2001, Beijing, China, pp. 24–28 (2001)
Krovetz, R.: Viewing morphology as an inference process. In: Proceedings of SIGIR 1993, pp. 191–202 (1993)
Porter, M.F.: An algorithm for suffix stripping. Program 14(3), 130–137 (1980)
Deerwester, S., Dumais, S., Landauer, T., Furnas, G., Harshman, R.: Indexing by latent semantic analysis. JASIST 41(6), 391–407 (1990)
Berger, A., Lafferty, J.: Information retrieval as statistical translation. In: Proceedings of SIGIR 1999, pp. 222–229 (1999)
Zhai, C., Lafferty, J.: Model-based feedback in the language modeling approach to information retrieval. In: Proceedings of CIKM 2001, pp. 403–410 (2001)
Lavrenko, V., Croft, W.B.: Relevance based language models. In: Proceedings of SIGIR 2001, pp. 120–127 (2001)
Rocchio, J.J.: Relevance Feedback in Information Retrieval, pp. 313–323. Prentice-Hall (1971)
Sahami, M., Heilman, T.: A web-based kernel function for measuring the similarity of short text snippets. In: Proceedings of WWW 2006, pp. 377–386 (2006)
Zhao, S., Du, N., Nauerz, A.: Improved recommendation based on collaborative tagging behaviors. In: Proceedings of the 13th International Conference on Intelligent User Interfaces, New York, NY, USA
Han, B., Baldwin, T.: Lexical normalisation of short text messages: Makn sens a #twitter. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, USA
Kendall tau distance, http://en.wikipedia.org/wiki/Kendall_tau_distance
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Li, L., Xiao, H., Xu, G. (2012). Finding Related Micro-blogs Based on WordNet. In: Yu, H., Yu, G., Hsu, W., Moon, YS., Unland, R., Yoo, J. (eds) Database Systems for Advanced Applications. DASFAA 2012. Lecture Notes in Computer Science, vol 7240. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29023-7_13
Download citation
DOI: https://doi.org/10.1007/978-3-642-29023-7_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-29022-0
Online ISBN: 978-3-642-29023-7
eBook Packages: Computer ScienceComputer Science (R0)