Geographic Expansion of Queries to Improve the Geographic Information Retrieval Task

  • José M. Perea-Ortega
  • L. Alfonso Ureña-López
Part of the Lecture Notes in Computer Science book series (LNCS, volume 7337)

Abstract

Geographic Information Retrieval (GIR) is concerned with improving the quality of geographically-specific Information Retrieval (IR), focusing on access to unstructured documents. Since GIR can be considered an extension of IR, applying Natural Language Processing (NLP) techniques such as query expansion can lead to significant improvements. In this paper we propose two NLP-based query expansion techniques that augment the geospatial part usually identified in a geographic query. The aim of both approaches is to retrieve potentially relevant documents that the original query does not retrieve. We then propose adding these new documents to the list of documents retrieved using the original query, so that the geo-reranking process takes more potentially relevant documents into account. We evaluated the proposed approaches using GeoCLEF as the evaluation framework for GIR systems. The results show that the proposed query expansion techniques can be a good strategy for improving the overall performance of a GIR system.
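
The step of adding newly retrieved documents to the original result list could be sketched roughly as follows. This is only an illustrative sketch, not the authors' implementation: the function name `merge_retrieved`, the document identifiers, and the choice to append new documents in the order they are found are all assumptions made for the example. The abstract does not specify the expansion techniques or the geo-reranking step, so neither is modelled here.

```python
from typing import Iterable, List


def merge_retrieved(original: List[str],
                    expanded_runs: Iterable[List[str]]) -> List[str]:
    """Append documents found only by expanded queries to the original list.

    Hypothetical helper: documents already retrieved by the original query
    keep their position; documents that appear only in an expanded-query
    run are appended once, in the order they are first encountered.
    """
    merged = list(original)
    seen = set(original)
    for run in expanded_runs:
        for doc_id in run:
            if doc_id not in seen:
                seen.add(doc_id)
                merged.append(doc_id)
    return merged


# Made-up document identifiers, for illustration only.
original_run = ["d1", "d2", "d3"]        # retrieved with the original query
expanded_run_a = ["d2", "d4"]            # retrieved with one expanded query
expanded_run_b = ["d5", "d1", "d4"]      # retrieved with another expanded query

# The merged list is what a later geo-reranking step would receive.
candidates = merge_retrieved(original_run, [expanded_run_a, expanded_run_b])
print(candidates)
```

Under these assumptions, the merged list is a superset of the original one, so a later reranking step can only gain candidate documents, never lose any.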

Keywords

Geographic Information Retrieval, Query Expansion, GeoCLEF

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • José M. Perea-Ortega (1)
  • L. Alfonso Ureña-López (2)
  1. Languages and Information Systems Department, University of Sevilla, Spain
  2. Computer Science Department, University of Jaén, Spain
