Quality and Importance of Wikipedia Articles in Different Languages

  • Włodzimierz Lewoniewski
  • Krzysztof Węcel
  • Witold Abramowicz
Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 639)

Abstract

This article analyses the importance of Wikipedia articles in different languages (English, French, Russian, Polish) and the impact of importance on article quality. Based on a literature review and our own experience, we collected measures related to articles that capture various aspects of quality; these measures are used to build models of article importance. For each language version, we select the influential parameters that may allow automatic assessment of the validity of an article. Links between articles in different languages make it possible to compare and verify the quality of information provided by the various Wikipedia communities. Therefore, the model can be used not only for a relative assessment of the content of a whole article, but also for a relative assessment of the quality of the data contained in its structural parts, the so-called infoboxes.

Keywords

Wikipedia, DBpedia, Information quality, Data quality, WikiRank, Article importance

Copyright information

© Springer International Publishing Switzerland 2016

Authors and Affiliations

  • Włodzimierz Lewoniewski (1)
  • Krzysztof Węcel (1)
  • Witold Abramowicz (1)
  1. Poznań University of Economics and Business, Poznań, Poland
