Multi-source Toponym Data Integration and Mediation for a Meta-Gazetteer Service

  • Philip D. Smart
  • Christopher B. Jones
  • Florian A. Twaroch
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6292)

Abstract

A variety of gazetteers exist based on administrative or user contributed data. Each of these data sources has benefits for particular geographical analysis and information retrieval tasks but none is a one fit all solution. We present a mediation framework to access and integrate distributed gazetteer resources to build a meta-gazetteer that generates augmented versions of place name information. The approach combines different aspects of place name data from multiple gazetteer sources that refer to the same geographic place and employs several similarity metrics to identify equivalent toponyms.

Keywords

gazetteers place names spatial data integration mediation architecture geo-web services 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Goodchild, M.F., Hill, L.L.: Introduction to Digital Gazetteer Research. International Journal of Geographic Information Science 22(10), 1039–1044 (2008)CrossRefGoogle Scholar
  2. 2.
    Goldberg, D.W., Wilson, J.P., Knoblock, C.A.: Extracting Geographic Features from the Internet to Automatically Build Detailed Regional Gazetteers. International Journal of Geographical Information Science 23(1), 93–128 (2009)CrossRefGoogle Scholar
  3. 3.
    Hastings, J.T.: Automated Conflation of Digital Gazetteer Data. International Journal of Geographical Information Science 22(10), 1109–1127 (2008)CrossRefMathSciNetGoogle Scholar
  4. 4.
    Keßler, C., Janowicz, K., Bishr, M.: An Agenda for the Next Generation Gazetteer: Geographic Information Contribution and Retrieval. In: 17th ACM SIGSPATIAL International Symposium on Advances in Geographic Information Systems, ACM-GIS 2009, Seattle, Washington, USA (2009)Google Scholar
  5. 5.
    Mikheev, A., Moens, M., Grover, C.: Named Entity Recognition without Gazetteers. In: Proceedings of the Ninth Conference of the European Chapter of the Association for Computational Linguistics, pp. 1–8 (1999)Google Scholar
  6. 6.
    Uryupina, O.: Semi-Supervised Learning of Geographical Gazetteers from the Internet. In: HLT-NAACL 2003, Workshop on Analysis of Geographic References, Alberta, Canada, pp. 18–25 (2003)Google Scholar
  7. 7.
    Christen, P., Churches, T.: A Probabilistic Deduplication, Record Linkage and Geocoding System. In: ARC Health Data Mining Workshop, Canberra, AU, The Australian National University (2005)Google Scholar
  8. 8.
    Hollenstein, L.: Capturing Vernacular Geography from Georeferenced Tags, Msc Thesis, in Department of Geography, University of Zurich (2008)Google Scholar
  9. 9.
    Rattenbury, T., Naaman, M.: Methods for Extracting Place Semantics from Flickr Tags. ACM Trans. Web 3(1), 1–30 (2009)CrossRefGoogle Scholar
  10. 10.
    Popescu, A., Grefenstette, G., Moëllic, P.-A.: Gazetiki: Automatic Creation of a Geographical Gazetteer. In: JCDL 2008: Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries, pp. 85–93. ACM, New York (2008)CrossRefGoogle Scholar
  11. 11.
    Wiederhold, G.: Mediators in the Architecture of Future Information Systems, vol. 25(3), pp. 38–49. Computer IEEE Computer Society Press, Los Alamitos (1992)Google Scholar
  12. 12.
    Callan, J.: Distributed Information Retrieval. In: Advances in Information Retrieval, pp. 127–150. Kluwer Academic Publishers, Dordrecht (2000)Google Scholar
  13. 13.
    Smith, J.M., Bernstein, P.A., Dayal, U., Goodman, N., Landers, T., Lin, K.W.T., Wong, E.: Multibase - Integrating Heterogeneous Distributed Database Systems. In: AFIPS National Computer Conference (1981)Google Scholar
  14. 14.
    Gupta, A., Marciano, R., Zaslavsky, I., Baru, C.K.: Integrating GIS and Imagery through XML-Based Information Mediation. In: Agouris, P., Stefanidis, A. (eds.) ISD 1999. LNCS, vol. 1737, pp. 211–234. Springer, Heidelberg (1999)CrossRefGoogle Scholar
  15. 15.
    Zaslavsky, I., Gupta, A., Marciano, R., Baru, C.: Xml-Based Spatial Data Mediation Infrastructure for Global Interoperability. In: 4th Global Spatial Data Infrastructure Conference (2000), http://www.gsdi.org/capetown/program.htm
  16. 16.
    Gupta, A., Memon, A., Tran, J., Bharadwaja, R.P., Zaslavsky, I.: Information Mediation across Heterogeneous Government Spatial Data Sources. In: Annual National Conference on Digital Government Research, pp. 1–6. Digital Government Society of North America, Los Angeles (2002)Google Scholar
  17. 17.
    Diggle, P.J., Besag, J., Gleaves, T.J.: Statistical Analysis of Spatial Point Patterns by Means of Distance Methods. Biometrics 32(3), 659–667 (1976)MATHCrossRefGoogle Scholar
  18. 18.
    Levenshtein, V.I.: Binary Codes Capable of Correcting Deletions, Insertions, and Reversals. Soviet Physics Doklady 10(8), 707–710 (1966)MathSciNetGoogle Scholar
  19. 19.
    Russell, R., Odell, M.: The Soundex Indexing System. National Archives and Records Administration (1918), http://www.nara.gov/genealogy/coding.html
  20. 20.
    Sinnott, R.W.: Virtues of the Haversine. Sky and Telescope 68(2), 159–162 (1984)MathSciNetGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Philip D. Smart
    • 1
  • Christopher B. Jones
    • 1
  • Florian A. Twaroch
    • 1
  1. 1.School of Computer ScienceCardiff University 

Personalised recommendations