Quality and Importance of Wikipedia Articles in Different Languages
Abstract
This article analyses the importance of Wikipedia articles in four language versions (English, French, Russian, Polish) and the impact of importance on article quality. Based on a literature review and our own experience, we collected article-level measures that capture various aspects of quality; these measures are used to build models of article importance. For each language version, we select the influential parameters that may allow the validity of an article to be assessed automatically. Links between articles in different languages make it possible to compare and verify the quality of information provided by the various Wikipedia communities. The model can therefore be used for a relative assessment not only of the content of a whole article, but also of the quality of the data in its structural parts, the so-called infoboxes.
Keywords
Wikipedia, DBpedia, Information quality, Data quality, WikiRank, Article importance