Quality Evaluation of Wikipedia Articles through Edit History and Editor Groups

  • Se Wang
  • Mizuho Iwaihara
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6612)

Abstract

Wikipedia is well known as a free encyclopedia, which is a type of collaborative repository system that allows the viewer to create and edit articles directly in the web browser. The weakness of the Wikipedia system is the possibility of manipulation and vandalism cannot be ruled out, so that the quality of any given Wikipedia article is not guaranteed. It is an important work to establish a quality evaluation method to help users decide how much they should trust an article in Wikipedia. In this paper we investigate the edit history of Wikipedia articles and propose a model of network structure of editors. We propose an algorithm to calculate the network structural indicator restoreratio. We use the proposed indicator combined with existing metrics to predict the quality of Wikipedia articles through support vector machine technology. The experimental results show that the proposed indicator has better performance in quality evaluation than several existing metrics.

Keywords

Wikipedia quality evaluation web mining edit network web trust 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
  2. 2.
    Adler, B.T., Chatterjee, K., de Alfaro, L., Faella, M., Pye, I., Raman, V.: Assigning Trust To Wikipedia Content. In: Proc. 2008 Int’l Symp. Wikis., Porto (September 2008)Google Scholar
  3. 3.
    Blumenstock, J.E.: Size Matters: Word Count as a Measure of Quality on Wikipedia. In: Proc. 17th Int’l Conf. World Wide Web, Beijing, pp. 1095–1096 (April 2008)Google Scholar
  4. 4.
    Cross, T.: Puppy smoothies: Improving the reliability of open, collaborative wikis. First Monday  11(9) (September 2006)Google Scholar
  5. 5.
    Cunningham, W., Leuf, B.: The Wiki Way. Quick Collaboration on the Web. Addison-Wesley, Reading (2001)Google Scholar
  6. 6.
    Dondio, P., Barrett, S.: Computational Trust in Web Content Quality: A Comparative Evalutation on the Wikipedia Project. Informatica – An International Journal of Computing and Informatics 31(2), 151–160 (2007)Google Scholar
  7. 7.
    Giles, G.: Internet encyclopedias go head to head. Nature 438( 7070), 900–901 (2005)Google Scholar
  8. 8.
    Hoisl, B., Aigner, W., Miksch, S.: Social rewarding in wiki systems – motivating the community. In: Schuler, D. (ed.) HCII 2007 and OCSC 2007. LNCS, vol. 4564, pp. 362–371. Springer, Heidelberg (2007)CrossRefGoogle Scholar
  9. 9.
    Hunt, J., McIlroy, M.:: An algorithm for differential file comparison. Computer Science Technical Report 41. Bell Laboratories (1975)Google Scholar
  10. 10.
    Kittur, A., Suh, B., Pendleton, B.A., Chi, E.H.: He says, she says: Conflict and coordination in Wikipedia. In: Proc. 25th Annual ACM Conf. Human Factors in Computing Systems (CHI 2007), San Jose, USA, pp. 453–462 (April/May 2007)Google Scholar
  11. 11.
    Kramer, M., Gregorowicz, A., Iyer, B.: Wiki Trust Metrics based on Phrasal Analysis. In: Proc. 2008 Int’l Symp. Wikis, Porto, Portugal (September 2008)Google Scholar
  12. 12.
    Lih, A.: Wikipedia as participatory journalism: Reliable sources? metrics for evaluating collaborative media as a news resource. In: Proc. 5th Int’l Symp. Online Journalism, Austin, USA (April 2004)Google Scholar
  13. 13.
    Lim, E.P., Vuong, B.Q., Lauw, H.W., Sun, A.: Measuring Qualities of Articles Contributed by OnlineCommunities. In: Proc. 2006 IEEE/WIC/ACM Int’l Conf. Web Intelligence, Hong Kong, pp. 81–87 (December 2006)Google Scholar
  14. 14.
  15. 15.
    Priedhorsky, R., Chen, J., Lam, S.K., Panciera, K., Terveen, L., Riedl, J.: Creating, Destroying, and Restoring Value in Wikipedia. In: Proc. Int’l ACM Conf. Supporting Group Work, Sanibel Island, USA, pp. 259–268 (November 2007)Google Scholar
  16. 16.
    Sabel, M.: Structuring wiki revision history. In: Proc. 2007 Int’l Symp. On Wikis, Montreal, Canada, pp. 125–130 (October 2007)Google Scholar
  17. 17.
    Smets, K., Goethals, B., Verdonk, B.: Automatic Vandalism Detection in Wikipedia: Towards a Machine Learning Approach. In: Proc. AAAI Workshop, Wikipedia and Artificial Intelligence: An Evolving Synergy (WikiAI 2008), Chicago, USA (July 2008)Google Scholar
  18. 18.
    Stvilia, B., Twidale, M.B., Smith, L.C., Gasser, L.: Assessing information quality of a community-based encyclopedia. In: Proc. Int’l Conf. Information Quality, Cambridge, USA, pp. 442–454 (November 2005)Google Scholar
  19. 19.
    Potthast, M., Stein, B., Gerling, R.: Automatic Vandalism Detection in Wikipedia. In: Macdonald, C., Ounis, I., Plachouras, V., Ruthven, I., White, R.W. (eds.) ECIR 2008. LNCS, vol. 4956, pp. 663–668. Springer, Heidelberg (2008)CrossRefGoogle Scholar
  20. 20.
    Wöhner, T., Peters, R.: Assessing the quality of Wikipedia Articles with Lifecycle Based Metrics. In: Proc. 2009 Int’l Symp. Wikis, Orlando (October 2009)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  • Se Wang
    • 1
  • Mizuho Iwaihara
    • 1
  1. 1.Graduate School of Information, Production and SystemsWaseda UniversityFukuokaJapan

Personalised recommendations