A Weighted Multi-factor Algorithm for Microblog Search

  • Lulin Zhao
  • Yi Zeng
  • Ning Zhong
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6890)

Abstract

As a fast and social information communication media, microblog, especially Twitter, has gained increasing popularity in recent years. Given the fact that a great volume of new tweets are being generated every second, ranking them to find the most relevant information is a challenging matter. The short length of tweets makes direct adoptions of traditional information retrieval algorithms to microblog search very hard. In this paper, we focus on the ranking strategies of microblogs, six factors are summarized to measure a user’s social influence, and each of them are highly relevant to the social network properties of the microblog authors and the properties of the microblog itself. Based on these factors, several ranking measures for Twitter search are examined. As a step forward, we propose a weighted multi-factor ranking algorithm (WMFR). By using a public Twitter search dataset, through Kendall’s τ correlation analysis on user selection and algorithm selection of tweets, we conclude that the proposed WMFR algorithm is more effective compared to several existing algorithms.

Keywords

Social Influence Weighted Coefficient Short Message Service Rank List Ranking Algorithm 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
  2. 2.
    Milstein, S., Chowdhury, A., Hochmuth, G., Lorica, B., Magoulas, R.: Twitter and the micro-messaging revolution: communication, connections, and immediacy-140 characters at a time. O’Reilly, Sebastopol (2008)Google Scholar
  3. 3.
    Cheng, A., Evans, M.: Inside Twitter: An in-depth look inside the Twitter world, http://www.sysomos.com/insidetwitter/
  4. 4.
    Efron, M.: Information search and retrieval in microblogs. Journal of the American Society for Information Science and Technology 62(6), 996–1008 (2011)CrossRefMathSciNetGoogle Scholar
  5. 5.
    Ye, S., Wu, S.F.: Measuring message propagation and social influence on twitter.com. In: Bolc, L., Makowski, M., Wierzbicki, A. (eds.) SocInfo 2010. LNCS, vol. 6430, pp. 216–231. Springer, Heidelberg (2010)CrossRefGoogle Scholar
  6. 6.
    Lampos, V., De Bie, T., Cristianini, N.: Flu Detector - Tracking Epidemics on Twitter. In: Balcázar, J.L., Bonchi, F., Gionis, A., Sebag, M. (eds.) ECML PKDD 2010. LNCS, vol. 6323, pp. 599–602. Springer, Heidelberg (2010)CrossRefGoogle Scholar
  7. 7.
    Sakaki, T., Okazaki, M., Matsuo, Y.: Earthquake shakes twitter users: real-time event detection by social sensors. In: Proceedings of the 19th International Conference on World Wide Web, pp. 851–860 (2010)Google Scholar
  8. 8.
    David, K., Jon, K., Eva, T.: Influential nodes in a diffusion model for social networks. In: Proceedings of the 32nd International Colloquium on Automata, Languages and Programming, pp. 1127–1138 (2005)Google Scholar
  9. 9.
    Page, L., Brin, S., Motwani, R., Winograd, T.: The PageRank citation ranking: bringing order to the Web, Technical Report. Stanford InfoLab (1999)Google Scholar
  10. 10.
    Daniel, G.A.: Nepotistic relationships in Twitter and their impact on rank prestige algorithms. CoRR abs/1004.0816 (2010)Google Scholar
  11. 11.
  12. 12.
    Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press, Cambridge (2008)CrossRefMATHGoogle Scholar
  13. 13.
    Kleinberg, J.M.: Authoritative sources in a hyperlinked environment. In: Proceedings of the 9th Annual ACM-SIAM Symposium on Discrete Algorithms, pp. 668–677 (1999)Google Scholar
  14. 14.
    Cha, M., Haddadi, H., Benevenuto, F., Gummadi, K.P.: Measuring user influence in Twitter: The million follower fallacy. In: Proceedings of the 4th International AAAI Conference on Weblogs and Social Media, pp. 10–17 (2010)Google Scholar
  15. 15.
    Weng, J., Lim, E., Jiang, J., He, Q.: TwitterRank: finding topic-sensitive influential Twitterers. In: Proceedings of the 3rd ACM International Conference on Web Search and Data Mining, pp. 261–270 (2010)Google Scholar
  16. 16.
    Kwak, H., Lee, C., Park, H., Sue, M.: What is Twitter, a social network or a news media? In: Proceedings of the 19th International Conference on World Wide Web, pp. 591–600 (2010)Google Scholar
  17. 17.
  18. 18.
    Hacker, S., Ahn, L.V.: Matchin: Eliciting user preferences with an online game. In: Proceedings of the 27th International Conference on Human Factors in Computing Systems, pp. 1207–1216 (2009)Google Scholar
  19. 19.
    Sarma, A.D., Gollapudi, S.: Ranking mechanisms in Twitter-like forums. In: Proceedings of the 3rd ACM International Conference on Web Search and Data Mining, pp. 21–30 (2010)Google Scholar
  20. 20.
    David, H.: The Method of Paired Comparisons. Oxford University Press, New York (1988)MATHGoogle Scholar
  21. 21.
    Nagmoti, R., Teredesai, A., De Cock, M.: Ranking approaches for microblog search. In: Proceedings of the 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, pp. 153–157 (2010)Google Scholar
  22. 22.
    Twitter Authority Based Search (TABS), http://cssgate.insttech.washington.edu:8184/TABS/
  23. 23.
    Hong, L., Davison, B.D.: Empirical study of topic modeling in Twitter. In: Proceedings of the 1st Workshop on Social Media Analytics, pp. 80–88 (2010)Google Scholar
  24. 24.
    Kendall, M.G.: A new measure of rank correlation. Biometrika 30(1/2), 81–93 (1938)CrossRefMATHGoogle Scholar
  25. 25.
    Michelson, M., Macskassy, S.A.: Discovering users’ topics of interest on Twitter: a first look. In: Proceedings of the 4th Workshop on Analytics for Noisy Unstructured Text Data, pp. 73–80 (2010)Google Scholar
  26. 26.
    Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. Journal of Machine Learning Research 3, 993–1022 (2003)MATHGoogle Scholar
  27. 27.
    Griffiths, T.L., Steyvers, M.: Finding scientific topics. Proceedings of the National Academy of Sciences of the United States of America 101, 5228–5235 (2004)CrossRefGoogle Scholar
  28. 28.
    Gao, J., An, B., Song, A., Wang, X.: A new topic influence model research in online community. In: Proceedings of the 2007 International Conference on Computational Intelligence and Security, pp. 466–469 (2007)Google Scholar
  29. 29.
    Ramage, D., Dumais, S., Liebling, D.: Characterizing microblogs with topic models. In: Proceedings of the 4th International AAAI Conference on Weblogs and Social Media, pp. 1–8 (2010)Google Scholar
  30. 30.

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Lulin Zhao
    • 1
  • Yi Zeng
    • 1
  • Ning Zhong
    • 1
    • 2
  1. 1.International WIC InstituteBeijing University of TechnologyBeijingChina
  2. 2.Department of Life Science and InformaticsMaebashi Institute of TechnologyMaebashi-CityJapan

Personalised recommendations