Evaluating Article Quality and Editor Reputation in Wikipedia

  • Yuqing Lu
  • Lei Zhang
  • Juanzi Li
Part of the Communications in Computer and Information Science book series (CCIS, volume 406)

Abstract

We study a novel problem of quality and reputation evaluation for Wikipedia articles. We propose a difficult and interesting question: How to generate reasonable article quality score and editor reputation in a framework at the same time? In this paper, We propose a dual wing factor graph(DWFG) model, which utilizes the mutual reinforcement between articles and editors to generate article quality and editor reputation. To learn the proposed factor graph model, we further design an efficient algorithm. We conduct experiments to validate the effectiveness of the proposed model. By leveraging the belief propagation between articles and editors, our approach obtains significant improvement over several alternative methods(SVM, LR, PR, CRF).

Keywords

factor graph quality evaluation editor reputation 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Smith, T.F., Waterman, M.S.: Identification of Common Molecular Subsequences. J. Mol. Biol. 147, 195–197 (1981)CrossRefGoogle Scholar
  2. 2.
    May, P., Ehrlich, H.-C., Steinke, T.: ZIB Structure Prediction Pipeline: Composing a Complex Biological Workflow Through Web Services. In: Nagel, W.E., Walter, W.V., Lehner, W. (eds.) Euro-Par 2006. LNCS, vol. 4128, pp. 1148–1158. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  3. 3.
    Foster, I., Kesselman, C.: The Grid: Blueprint for a New Computing Infrastructure. Morgan Kaufmann, San Francisco (1999)Google Scholar
  4. 4.
    Adler, B.T., Chatterjee, K., De Alfaro, L., Faella, M., Pye, I., Raman, V.: Assigning trust to Wikipedia content. In: Proceedings of the 4th International Symposium on Wikis. ACM Press (2008)Google Scholar
  5. 5.
    Adler, B.T., de Alfaro, L.: A Content-Driven Reputation System for the Wikipedia. ACM Press (2007)Google Scholar
  6. 6.
    Wu, Q., Irani, D., Pu, C., Ramaswamy, L.: Elusive vandalism detection in wikipedia: a text stability-based approach. In: Proceedings of the 19th ACM International Conference on Information and Knowledge Management, pp. 1797–1800. ACM Press (2010)Google Scholar
  7. 7.
    Zeng, H., Alhossaini, M.A., Ding, L., Fikes, R., McGuinness, D.L.: Computing trust from revision history. In: Proceedings of the 2006 International Conference on Privacy, Security and Trust: Bridge the Gap Between PST Technologies and Business Services, vol. 8, ACM Press (2006)Google Scholar
  8. 8.
    West, A.G., Kannan, S., Lee, I.: Detecting Wikipedia vandalism via spatio-temporal analysis of revision metadata. In: Proceedings of the Third European Workshop on System Security, pp. 22–28. ACM Press (2010)Google Scholar
  9. 9.
    Wang, W.Y., McKeown, K.R.: Got you!: automatic vandalism detection in Wikipedia with web-based shallow syntactic-semantic modeling. In: Proceedings of the 23rd International Conference on Computational Linguistics, pp. 1146–1154. Association for Computational Linguistic (2010)Google Scholar
  10. 10.
    Smets, K., Goethals, B., Verdonk, B.: Automatic vandalism detection in Wikipedia: Towards a machine learning approach. In: AAAI Workshop on Wikipedia and Artificial Intelligence: An Evolving Synergy, pp. 43–48. ACM Press (2008)Google Scholar
  11. 11.
    Itakura, K.Y., Clarke, C.L.A.: Using dynamic markov compression to detect vandalism in the wikipedia. In: Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 822–823. ACM Press (2009)Google Scholar
  12. 12.
    Rassbach, L., Pincock, T., Mingus, B.: Exploring the Feasibility of Automatically Rating Online Article Quality (2008)Google Scholar
  13. 13.
    Stvilia, B., Twidale, M.B., Smith, L.C., Gasser, L.: Assessing information quality of a community-based encyclopedia. In: Proceedings of the International Conference on Information Quality, vol. 11. Citeseer (2005)Google Scholar
  14. 14.
    McGuinness, D.L., Zeng, H., Da Silva, P.P., Ding, L., Narayanan, D., Bhaowal, M.: Investigations into trust for collaborative information repositories: A wikipedia case study. In: Proceedings of the Workshop on Models of Trust for the Web. Citeseer (2006)Google Scholar
  15. 15.
    Page, L., Brin, S., Motwani, R., Winograd, T.: The PageRank citation ranking: Bringing order to the web. Stanford InfoLab (1999)Google Scholar
  16. 16.
    West, A.G., Chang, J., Venkatasubramanian, K.K., Lee, I.: Trust in collaborative web applications. In: Future Generation Computer Systems. Elsevier (2011)Google Scholar
  17. 17.
    Kschischang, F.R., Frey, B.J., Loeliger, H.A.: Factor graphs and the sum-product algorithm. IEEE Transactions on Information Theory 47, 498–519 (2001)CrossRefMATHMathSciNetGoogle Scholar
  18. 18.
    Loeliger, H.A.: An introduction to factor graphs. IEEE Signal Processing Magazine 21, 28–41 (2004)CrossRefGoogle Scholar
  19. 19.
    Murphy, K.P., Weiss, Y., Jordan, M.I.: Loopy belief propagation for approximate inference: An empirical study. In: Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence, pp. 467–475. Morgan Kaufmann Publishers Inc. (1999)Google Scholar
  20. 20.
    Yang, Z., Cai, K., Tang, J., Zhang, L., Su, Z., Li, J.: Social context summarization. In: Proceedings of the 34th ACM SIGIR Conference (2011)Google Scholar
  21. 21.
    West, A.G.: Calculating and Presenting Trust in Collaborative Content. University of Pennsylvania (2010)Google Scholar
  22. 22.
    Blumenstock, J.E.: Size matters: word count as a measure of quality on wikipedia. In: Proceedings of the 17th International Conference on World Wide Web, pp. 1095–1096. ACM (2008)Google Scholar
  23. 23.
    Chin, S.C., Street, W.N., Srinivasan, P., Eichmann, D.: Detecting Wikipedia vandalism with active learning and statistical language models. In: Proceedings of the 4th Workshop on Information Credibility, pp. 3–10. ACM (2010)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Yuqing Lu
    • 1
  • Lei Zhang
    • 2
  • Juanzi Li
    • 1
  1. 1.Shenzhen Key Laboratory of Broadband Network and Multimedia, Graduate School at ShenzhenTsinghua UniversityChina
  2. 2.Department of Computer Science and TechnologyTsinghua UniversityChina

Personalised recommendations