Volume 14, Issue 3, pp 307–331

Exploiting geographic references of documents in a geographical information retrieval system using an ontology-based index

  • Nieves R. Brisaboa
  • Miguel R. Luaces (email author)
  • Ángeles S. Places
  • Diego Seco


Both Geographic Information Systems and Information Retrieval have been very active research fields in recent decades. Lately, a new research field called Geographic Information Retrieval has emerged at the intersection of these two fields. The main goal of this field is to define index structures and techniques to efficiently store and retrieve documents using both the text and the geographic references contained within the text. In this paper we present two contributions to this research field. First, we propose a new index structure that combines an inverted index and a spatial index based on an ontology of geographic space. This structure improves on the query capabilities of other proposals. Second, we describe the architecture of a system for geographic information retrieval that defines a workflow for the extraction of the geographic references in documents. The architecture also uses the proposed index structure to solve pure spatial and pure textual queries as well as hybrid queries that combine a textual and a spatial component. Furthermore, query expansion can be performed on geographic references because the index structure is based on an ontology.
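The idea of combining an inverted index with an ontology-based spatial index can be illustrated with a minimal sketch. It is not the structure proposed in the paper: the names `PlaceNode`, `spatial_query`, `subtree_docs` and `hybrid_query`, the toy documents, and the rough bounding boxes are all assumptions made for illustration. The sketch assumes each place's bounding box contains those of its sub-places, so a whole branch can be pruned when its box misses the query window.

```python
from dataclasses import dataclass, field


@dataclass
class PlaceNode:
    """One node of a toy place ontology (continent > country > city)."""
    name: str
    bbox: tuple  # (min_x, min_y, max_x, max_y); rough, made-up coordinates
    children: list = field(default_factory=list)
    docs: set = field(default_factory=set)  # ids of documents mentioning this place


def intersects(a, b):
    """True if two axis-aligned bounding boxes overlap."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]


def find_place(node, name):
    """Depth-first lookup of a place by name; returns None if absent."""
    if node.name == name:
        return node
    for child in node.children:
        hit = find_place(child, name)
        if hit is not None:
            return hit
    return None


def subtree_docs(node):
    """Query expansion: a place also matches documents about its sub-places."""
    found = set(node.docs)
    for child in node.children:
        found |= subtree_docs(child)
    return found


def spatial_query(node, window):
    """Documents attached to places whose bounding box intersects the window."""
    if not intersects(node.bbox, window):
        return set()  # prune: assumes children lie inside the parent's box
    found = set(node.docs)
    for child in node.children:
        found |= spatial_query(child, window)
    return found


def build_inverted_index(docs):
    """Map each lower-cased term to the set of document ids containing it."""
    index = {}
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index.setdefault(term, set()).add(doc_id)
    return index


def textual_query(index, terms):
    """Documents containing every query term."""
    postings = [index.get(t.lower(), set()) for t in terms]
    return set.intersection(*postings) if postings else set()


def hybrid_query(index, root, terms, window):
    """Documents matching both the textual and the spatial component."""
    return textual_query(index, terms) & spatial_query(root, window)


# Toy collection; document 4 carries no geographic reference.
docs = {
    1: "flood damage in Madrid",
    2: "flood warnings across Spain",
    3: "wine festival in Porto",
    4: "flood relief funds",
}
madrid = PlaceNode("Madrid", (-3.9, 40.3, -3.5, 40.6), docs={1})
porto = PlaceNode("Porto", (-8.7, 41.1, -8.5, 41.2), docs={3})
spain = PlaceNode("Spain", (-9.3, 36.0, 3.3, 43.8), [madrid], {2})
portugal = PlaceNode("Portugal", (-9.5, 37.0, -6.2, 42.2), [porto])
europe = PlaceNode("Europe", (-10.0, 35.0, 30.0, 60.0), [spain, portugal])
index = build_inverted_index(docs)

madrid_window = (-4.0, 40.0, -3.0, 41.0)
print(sorted(hybrid_query(index, europe, ["flood"], madrid_window)))  # [1, 2]
print(sorted(subtree_docs(find_place(europe, "Spain"))))  # [1, 2]
```

In this sketch a window over Madrid also returns the document about Spain, because Spain's box intersects the window. Whether enclosing places should match a spatial query is a design choice of the sketch, not a claim about the system described in the paper.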


Geographic information retrieval · Spatial index · Textual index · Ontology · System architecture



Copyright information

© Springer Science+Business Media, LLC 2010

Authors and Affiliations

  • Nieves R. Brisaboa (1)
  • Miguel R. Luaces (1), email author
  • Ángeles S. Places (1)
  • Diego Seco (1)
  1. Database Laboratory, University of A Coruña, A Coruña, Spain
