Estimation of the Locations of the Language-Versions of Wikipedia - a Case Study on Geographic Data Mining

  • Tobias Dahinden
Conference paper
Part of the Lecture Notes in Geoinformation and Cartography book series (LNGC, volume 6)


People write about things they believe they know, and in particular about things in the environment they live in. They also write in a language they know. There is therefore a relation between a writer's local environment and the language used to describe it.

In this paper the areas of several languages are estimated according to the geographic footprint of the language versions of Wikipedia. These estimated language areas are compared to those represented in linguistic maps. The results of this comparison are presented for a subset of Germanic languages.
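The abstract does not spell out the estimator. The keywords and the cited literature suggest kernel density estimation, so the following is only a minimal sketch under that assumption, not the author's implementation. The function names, the bandwidth value, and the coordinates are all hypothetical, and longitude/latitude are treated as planar coordinates for simplicity.

```python
import math


def kernel_density(point, samples, bandwidth):
    """Gaussian kernel density estimate at `point` from 2-D `samples`."""
    if not samples:
        return 0.0
    px, py = point
    # Normalising by the sample count makes each language version integrate
    # to 1, so a large Wikipedia does not win purely because of its size.
    norm = 1.0 / (2.0 * math.pi * bandwidth ** 2 * len(samples))
    total = 0.0
    for sx, sy in samples:
        d2 = (px - sx) ** 2 + (py - sy) ** 2
        total += math.exp(-d2 / (2.0 * bandwidth ** 2))
    return norm * total


def dominant_language(point, footprints, bandwidth=1.0):
    """Return the language whose article footprint is densest at `point`."""
    densities = {
        lang: kernel_density(point, pts, bandwidth)
        for lang, pts in footprints.items()
    }
    return max(densities, key=densities.get)


# Made-up (lon, lat) positions of geocoded articles per language version.
footprints = {
    "de": [(9.7, 52.4), (10.0, 53.5), (11.6, 48.1), (13.4, 52.5)],
    "nl": [(4.9, 52.4), (4.5, 51.9), (5.1, 52.1)],
}

print(dominant_language((5.0, 52.0), footprints))
print(dominant_language((11.0, 51.0), footprints))
```

Evaluating `dominant_language` over a regular grid would yield one estimated area per language, which could then be overlaid on a linguistic map for comparison.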


Kernel Density Estimation, Language Version, Language Area, Knowledge Repository, German Dialect
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.





Copyright information

© Springer-Verlag Berlin Heidelberg 2011

Authors and Affiliations

  1. Institut für Kartographie und Geoinformatik, Leibniz Universität Hannover, Hannover, Germany
