
Mining for Practices in Community Collections: Finds From Simple Wikipedia

  • Matthijs den Besten
  • Alessandro Rossi
  • Loris Gaio
  • Max Loubser
  • Jean-Michel Dalle
Part of the IFIP – The International Federation for Information Processing book series (IFIPAICT, volume 275)

Abstract

The challenges of commons-based peer production are usually associated with the development of complex software projects such as Linux and Apache. But open content production should not be treated as a trivial case. For instance, while the task of maintaining a collection of encyclopedic articles might seem negligible compared to that of holding together a software system with its many modules and interdependencies, it still poses quite demanding problems. In this paper, we describe the methods and practices adopted by Simple Wikipedia to keep its articles easy to read. Based on measurements of article readability and similarity, we conclude that while the mechanisms adopted by the community had some effect, in the long run more effort and new practices might be necessary to maintain an acceptable level of readability in the Simple Wikipedia collection.
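The abstract does not say which readability index or similarity measure was used, so the following is only a minimal sketch of the two kinds of measurement it mentions. It assumes the Flesch Reading Ease formula as the readability index and cosine similarity over raw term-frequency vectors as the similarity measure. The function names, the naive tokenizer, and the vowel-group syllable heuristic are all illustrative assumptions, not the authors' method.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Lowercase word tokens; a deliberately naive tokenizer (assumption).
    return re.findall(r"[a-z']+", text.lower())


def cosine_similarity(text_a, text_b):
    # Cosine of the angle between the two term-frequency vectors.
    a, b = Counter(tokenize(text_a)), Counter(tokenize(text_b))
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    norm = norm_a * norm_b
    return dot / norm if norm else 0.0


def count_syllables(word):
    # Heuristic (assumption): one syllable per group of consecutive vowels,
    # with a minimum of one syllable per word.
    return max(1, len(re.findall(r"[aeiouy]+", word)))


def flesch_reading_ease(text):
    # Flesch Reading Ease: higher scores indicate easier text.
    words = tokenize(text)
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    if not words or not sentences:
        return 0.0
    syllables = sum(count_syllables(w) for w in words)
    return (206.835
            - 1.015 * (len(words) / len(sentences))
            - 84.6 * (syllables / len(words)))
```

Under these assumptions, a Simple Wikipedia article could be scored with `flesch_reading_ease` and compared with its counterpart in the main Wikipedia using `cosine_similarity`. A real analysis would need a proper syllable counter and markup stripping first.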

Keywords

  • Cosine Similarity
  • Computational Linguistics
  • Community Collection
  • Readability Index
  • Open Source Development
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© International Federation for Information Processing 2008

Authors and Affiliations

  • Matthijs den Besten (1)
  • Alessandro Rossi (2)
  • Loris Gaio (2)
  • Max Loubser (3)
  • Jean-Michel Dalle (4)

  1. University of Oxford, Oxford e-Research Centre, Oxford, UK
  2. Department of Computer and Management Science, University of Trento, Trento, Italy
  3. Oxford Internet Institute, Oxford, UK
  4. Université Pierre et Marie Curie, Paris, France
