Skip to main content

Estimation of the Locations of the Language-Versions of Wikipedia - a Case Study on Geographic Data Mining

  • Conference paper
  • First Online:
Advances in Cartography and GIScience. Volume 2

Part of the book series: Lecture Notes in Geoinformation and Cartography ((ICA,volume 6))

Abstract

People write about things they believe to know, and in particular those things that are within the environment they live in. They also write in a language they know. Therefore, there is a relation between the individual local environment and the language used for the description.

In this paper the areas of several languages are estimated according to the geographic footprint of the language versions of Wikipedia. These estimated language areas are compared to those represented in linguistic maps. The results of this comparison are presented for a subset of Germanic languages.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Alder T (2009) WikiProjekt Vorlagenauswertung. Accessed 4 May 2009, fromhttp://de.wikipedia.org/wiki/Wikipedia:WikiProjekt\_Vorlagenauswertung.

  • Bausch K-H (2002) Die deutsche Sprache - eine Dialektlandschaft. Bildung und Kultur. In: (eds)Bildung und Kultur. Leibniz-Institut für Länderkunde: 94-95.

    Google Scholar 

  • Bragues G (2009) Wiki-philosophizing in a marketplace of ideas: Evaluating Wikipedia's entrieson seven great minds. MediaTropes eJournal 2(1): 117-158.

    Google Scholar 

  • Brockhaus FA (1894). Karte der deutschen Mundarten, Geogr.-artist. Anstalt, Leipzip. 14.Auflage.

    Google Scholar 

  • Clauson KA, Polen HH, Boulos MNK, Dzenowagis JH (2008) Scope, completeness, andaccuracy of drug information in Wikipedia. Ann Pharmacother 42(12): 1814-1821.

    Article  Google Scholar 

  • Dahinden T (2009) Localization of uncertain and fuzzy-bordered areas by geocoded articles of aknowledge repository. In: Garcia-Huidobro CJV (eds) Proceedings of the InternationalCartographic Conference, Santiago de Chile.

    Google Scholar 

  • Dahinden T, Sester M (2009) Categorization of Linear Objects for Map Generalization UsingGeocoded Articles of a Knowledge Repository. In: Winter S, Tenbrink T (eds) Workshop onPresenting spatial information: Granularity, relevance and integration, Conference on SpatialInformation Theory, Aber Wrac'h, France.

    Google Scholar 

  • Dolan S (2008) Six degrees of Wikipedia. Accessed 29 May 2008, fromhttp://www.netsoc.tcd.ie/~mu/wiki/.

  • ESRI Support Centre (2010) ArcGIS Desktop Help: Kernel Density. Accessed 13 April 2010,fromhttp://webhelp.esri.com/arcgisdesktop/9.3/index.cfm?id=6190&pid=6188&topicname=Kernel_Density.

  • Fagan SMB (2009) German: a lingustic introduction, Cambridge University Press.

    Google Scholar 

  • Halavais A, Lackaff D (2008) An Analysis of Topical Coverage of Wikipedia. Jounral ofComputer-Mediated Communication 13(2): 429-440.

    Article  Google Scholar 

  • Haspelmath M, Dryer MS, Gil D, Comrie B, Eds. (2005) World Atlas of Language Structures,Oxford University Press.

    Google Scholar 

  • Hecht B, Moxley E (2009) Terabytes of Tobler: Evaluating the First Law in a Massive, Domain-Neutral Representation of World Knowledge. Spatial Information Theory, 9th InternationalConference, COSIT 2009, Aber Wrac'h, France, September 21-25, 2009, Proceedings 5756:88-105.

    Google Scholar 

  • Hecht B, Raubal M (2008). GeoSR: Geographically Explore Semantic Relations in WorldKnowledge. AGILE Conf., Springer: 95-113.

    Google Scholar 

  • Kittur A, Chi E, Pendleton BA, Suh B, Mytkowicz T (2007) Power of the Few vs. Wisdom ofthe Crowd: Wikipedia and the Rise of the Bourgeoisie. In: (eds) 25th Annual ACMConference on Human Factors in Computing Systems (CHI), San Jose, CA.

    Google Scholar 

  • König W, Paul H-J (2007) dtv-Atlas Deutsche Sprache. Mit 155 farbigen Abbildungsseiten.,Deutscher Taschenbuch Verlag.

    Google Scholar 

  • Kühn S, Alder T (2009) WP : GEO / Wikipedia-World. Accessed 31 August 2009, fromhttp://de.wikipedia.org/wiki/Wikipedia:WikiProjekt_Georeferenzierung/Wikipedia-World.

  • Maurmann E (1888-1923) Karte der Deutschen Mundarten. Sprachatlas des Deutschen Reichs.Mediawiki (2010) Wikimedia downloads. Accessed 11 June 2010, fromhttp://download.wikimedia.org/.

  • Ortega F, Gonzales-Barahona JM, Robles G (2007) The Top-Ten Wikipedias, A QuantitativeAnalysis using WikiXRay. International Conference on Software and Data Technology: 46-53.

    Google Scholar 

  • Paelke V, Dahinden T, Eggert D, Mondzech J (2010) Location Based Context AwarenessThrough Tag-Cloud Visualization. In: (eds) Joint International Conference on Theory, DataHandling and Modelling in GeoSpatial Information Science, Hong Kong.

    Google Scholar 

  • Pfeil U, Zaphiris P, Ang CS (2006) Cultural differences in collaborative authoring of Wikipedia.Journal of Computer-Mediated Communication 12(1).

    Google Scholar 

  • Protze H (1969) Die deutschen Mundarten. Kleine Enzyklopädie Deutsche Sprache.Samuels ML (1972) Linguistic evolution : with special reference to English. London, CambridgeUniversity Press.

    Google Scholar 

  • Schöning J, Hecht B, Rohs M, Starosielski N (2007) WikEar: Automatically GeneratedLocation-Based Audio Stories between Public City Maps. Adjunct Proceedings of the 9thInternational Conference on Ubiquitous Computing.

    Google Scholar 

  • Silverman BW (1986) Density estimation for statistics and data analysis. London, Chapman andHall (CRC Press).

    Google Scholar 

  • Smith MJd, Goodchild MF, Longley PA (2007) Geospatial Analysis, Second Edition, TroubadorPublishing Ltd.

    Google Scholar 

  • Ther P, Siljak A (2001) Redrawing Nations: Ethnic Cleaning in East-Central Europe, 1944-1948., Rowman & Littlefield.

    Google Scholar 

  • Tobler WR (1970) A Computer Movie Simulating Urban Growth in the Detroit Region.Economic Geography 46: 234-240.

    Article  Google Scholar 

  • Wenker G (1888-1923) Sprachatlas des Deutschen Reichs. Marburg.

    Google Scholar 

  • Wikipedia (2007) Die deutschen und niederländischen Dialekte nach dem Jahr 1945. Accessed28 April 2007, from http://de.wikipedia.org/w/index.php?title=Datei:Deutsche_Dialekte.PNG.

  • Wikipedia (2009a) Translation. Accessed 27 December 2009, fromhttp://en.wikipedia.org/wiki/Wikipedia:Translation.

  • Wikipedia (2009b) WP : GEO / mehrere Artikel an einer Koordinate. Accessed 31 August2009, fromhttp://de.wikipedia.org/wiki/Wikipedia:WikiProjekt_Georeferenzierung/mehrere_Artikel_an_einer_Koordinate.

  • Wikipedia (2010a) Articles needing coordinates. Accessed 3 January 2010, fromhttp://en.wikipedia.org/wiki/Category:Articles\_needing\_coordinates.

  • Wikipedia (2010b) List of Wikipedias. Accessed 6 January 2010, fromhttp://en.wikipedia.org/wiki/List_of_Wikipedias.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tobias Dahinden .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Dahinden, T. (2011). Estimation of the Locations of the Language-Versions of Wikipedia - a Case Study on Geographic Data Mining. In: Ruas, A. (eds) Advances in Cartography and GIScience. Volume 2. Lecture Notes in Geoinformation and Cartography(), vol 6. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19214-2_32

Download citation

Publish with us

Policies and ethics