Extracting Geographic Context from the Web: GeoReferencing in MyMoSe

  • Conference paper
Advances in Information Retrieval (ECIR 2009)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 5478)

Abstract

Many Web pages are clearly related to specific locations. Identifying this geographic focus is the cornerstone of the next generation of geographic context-aware search services. This paper presents a multistage method for assigning a geographic focus to Web pages (GeoReferencing), using several heuristics for toponym disambiguation and a scoring function for focus determination. We provide an experimental methodology for evaluating the accuracy of the system with Web pages in English and Spanish. The results are promising, with an accuracy of over 70% at town-level resolution.

Partially funded by Telefónica Investigación y Desarrollo (MyMobileSearch) and MCyT (TIN2006-15071-C03-02).
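
The abstract names three stages: finding toponyms, disambiguating them, and scoring candidate places to pick a focus. The following is only a minimal sketch of that general idea, not the method of the paper. The gazetteer entries, the overlap heuristic, the `decay` parameter, and all function names are assumptions made up for illustration.

```python
from collections import defaultdict

# Hypothetical toy gazetteer: toponym -> candidate places.
# Each candidate is (place_id, chain from the place up to its country).
GAZETTEER = {
    "cordoba": [
        ("cordoba_es", ["cordoba_es", "andalusia", "spain"]),
        ("cordoba_ar", ["cordoba_ar", "cordoba_province_ar", "argentina"]),
    ],
    "seville": [("seville", ["seville", "andalusia", "spain"])],
    "andalusia": [("andalusia", ["andalusia", "spain"])],
    "spain": [("spain", ["spain"])],
    "argentina": [("argentina", ["argentina"])],
}


def find_toponyms(text):
    """Stage 1: naive lookup of whitespace-separated tokens in the gazetteer."""
    tokens = [t.strip(".,;:!?()").lower() for t in text.split()]
    return [t for t in tokens if t in GAZETTEER]


def disambiguate(toponyms):
    """Stage 2 (assumed heuristic): prefer the candidate whose ancestor
    chain overlaps most with places implied by unambiguous toponyms."""
    context = set()
    for name in toponyms:
        candidates = GAZETTEER[name]
        if len(candidates) == 1:
            context.update(candidates[0][1])
    resolved = []
    for name in toponyms:
        # On ties, max() keeps the first candidate listed in the gazetteer.
        best = max(GAZETTEER[name], key=lambda c: len(context.intersection(c[1])))
        resolved.append(best)
    return resolved


def geographic_focus(resolved, decay=0.5):
    """Stage 3 (assumed scoring): each mention gives 1 point to its place
    and a geometrically decayed share to every ancestor."""
    scores = defaultdict(float)
    for _, chain in resolved:
        for depth, place in enumerate(chain):
            scores[place] += decay ** depth
    if not scores:
        return None, {}
    return max(scores, key=scores.get), dict(scores)


text = "Cordoba and Seville are cities in Andalusia."
focus, scores = geographic_focus(disambiguate(find_toponyms(text)))
print(focus, scores)
```

In this toy run the ambiguous "Cordoba" resolves to the Spanish city because the page also mentions Seville and Andalusia, and the region collects the highest score. A system aiming at town-level resolution would need a different weighting and a real gazetteer.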

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zubizarreta, Á. et al. (2009). Extracting Geographic Context from the Web: GeoReferencing in MyMoSe. In: Boughanem, M., Berrut, C., Mothe, J., Soule-Dupuy, C. (eds) Advances in Information Retrieval. ECIR 2009. Lecture Notes in Computer Science, vol 5478. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00958-7_50

  • DOI: https://doi.org/10.1007/978-3-642-00958-7_50

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-00957-0

  • Online ISBN: 978-3-642-00958-7

  • eBook Packages: Computer Science, Computer Science (R0)
