Skip to main content
Log in

Exploiting geographic references of documents in a geographical information retrieval system using an ontology-based index

  • Published:
GeoInformatica Aims and scope Submit manuscript

Abstract

Both Geographic Information Systems and Information Retrieval have been very active research fields in the last decades. Lately, a new research field called Geographic Information Retrieval has appeared from the intersection of these two fields. The main goal of this field is to define index structures and techniques to efficiently store and retrieve documents using both the text and the geographic references contained within the text. We present in this paper two contributions to this research field. First, we propose a new index structure that combines an inverted index and a spatial index based on an ontology of geographic space. This structure improves the query capabilities of other proposals. Then, we describe the architecture of a system for geographic information retrieval that defines a workflow for the extraction of the geographic references in documents. The architecture also uses the index structure that we propose to solve pure spatial and textual queries as well as hybrid queries that combine both a textual and a spatial component. Furthermore, query expansion can be performed on geographic references because the index structure is based in an ontology.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

References

  1. Baeza-Yates R, Ribeiro-Neto B (1999) Modern information retrieval. Addison-Wesley, Harlow

    Google Scholar 

  2. Worboys MF (2004) GIS: a computing perspective. CRC, Boca Raton. ISBN: 0415283752

    Google Scholar 

  3. ISO/IEC (2002) Geographic information—reference model. International standard 19101, ISO/IEC

  4. Open GIS Consortium, Inc. (2003) OpenGIS reference model. OpenGIS Project Document 03-040, Open GIS Consortium

  5. Global Spatial Data Infrastructure Association (2008) Online documentation. Retrieved from http://www.gsdi.org/

  6. Lieberman MD, Samet H, Sankaranarayanan J, Sperling J (2007) STEWARD: architecture of a spatio-textual search engine. In: Proceedings of the 15th ACM Int. Symp. on Advances in GIS (ACMGIS’07). ACM, New York, pp 186–193

    Google Scholar 

  7. Chen YY, Suel T, Markowetz A (2006) Efficient query processing in geographic web search engines. In: SIGMOD conference, pp 277–288

  8. Martins B, Silva MJ, Andrade L (2005) Indexing and ranking in Geo-IR systems. In: GIR ’05: proceedings of the 2005 workshop on Geogr. Inform. Retrieval. ACM, New York, pp 31–34. doi:http://doi.acm.org/10.1145/1096985.1096993

    Chapter  Google Scholar 

  9. Gaede V, Günther O (1998) Multidimensional access methods. ACM Comput Surv 30(2):170–231. doi:http://doi.acm.org/10.1145/280277.280279

    Article  Google Scholar 

  10. Guttman A (1984) R-Trees: a dynamic index structure for spatial searching. In: Yormark B (ed) SIGMOD’84, proceedings of annual meeting, Boston, Massachusetts, June 18–21, 1984. ACM, New York, pp 47–57

    Google Scholar 

  11. Amitay E, Har’El N, Sivan R, Soffer A (2004) Web-a-where: geotagging web content. In: SIGIR ’04: proceedings of the 27th ACM SIGIR. ACM, New York, pp 273–280. doi:http://doi.acm.org/10.1145/1008992.1009040

    Google Scholar 

  12. Rauch E, Bukatin M, Baker K (2003) A confidence-based framework for disambiguating geographic terms. In: Proceedings of the HLT-NAACL 2003 workshop on analysis of geogr. references. Association for Computational Linguistics, Morristown, USA, pp 50–54. doi:10.3115/1119394.1119402

  13. Jones CB, Purves R, Ruas A, Sanderson M, Sester M, van Kreveld M, Weibel R et al (2002) Spatial information retrieval and geographical ontologies an overview of the SPIRIT project. In: Proceedings of the 25th ACM SIGIR conference, pp 387–388

  14. Jones CB, Abdelmoty AI, Fu G (2003) Maintaining ontologies for geographical information retrieval on the web. In: Proceedings of on the move to meaningful internet systems 2003: ODBASE 03, LNCS, vol 2888

  15. Jones CB, Abdelmoty AI, Fu G, Vaid S (2004) The SPIRIT spatial search engine: architecture, ontologies and spatial indexing. In: Proceedings of the 3rd int. conf. on geogr. inform. Science, LNCS, vol. 3234, pp. 125–139

  16. Vaid S, Jones CB, Joho H, Sanderson M (2005) Spatio-textual indexing for geographical search on the web. In: Proceedings of the 9th Int. Symp. on Spatial and Temporal Databases (SSTD). LNCS, vol 3633, pp 218–235

  17. Fu G, Jones CB, Abdelmoty AI (2005) Ontology-based spatial query expansion in information retrieval. In: Proceedings of In On the Move to Meaningful Internet Systems 2005: ODBASE 2005. LNCS, vol. 3761, pp. 1466–1482)

  18. Zhou Y, Xie X, Wang C, Gong Y, Ma WY (2005) Hybrid index structures for location-based web search. In: Proceedings of CIKM 05, pp. 155–162. ACM, New York. doi:http://doi.acm.org/10.1145/1099554.1099584

    Chapter  Google Scholar 

  19. Hariharan R, Hore B, Li C, Mehrotra S (2007) Processing Spatial-Keyword (SK) queries in Geographic Information Retrieval (GIR) systems. In: Proceedings of the 19th int. conf. on Scientific and Statistical Database Management (SSDBM07). IEEE Computer Society. doi:http://doi.ieeecomputersociety.org/10.1109/SSDBM.2007.22

  20. Gruber TR (1993) A translation approach to portable ontology specifications. Knowl Acquis 5(2):199–220

    Article  Google Scholar 

  21. Dellis E, Paliouras G (2006) Management of large spatial ontology bases. In: Proceedings of the workshop on Ontologies-based techniques for DataBases and Information Systems (ODBIS) of the 32nd int. conf. on Very Large Data Bases (VLDB 2006)

  22. Gruber TR (1993) Towards principles for the design of ontologies used for knowledge sharing. In: Guarino N, Poli R (eds) Formal ontology in conceptual analysis and knowledge representation. Kluwer Academic, Deventer

    Google Scholar 

  23. World Wide Consortium (2008) Owl web ontology language reference. Retrieved March 2008 from http://www.w3.org/TR/owl-ref/

  24. Open GIS Consortium, Inc. (2002) OpenGIS web map service implementation specification. OpenGIS Project Document 01-068r3, Open GIS Consortium

  25. Apache (2008) Lucene. Retrieved March 2008 from http://lucene.apache.org

  26. National Institute of Standards and Technology (NIST) (2008) TREC special database 22, TREC document database: disk 4. Retrieved March 2008 from http://www.nist.gov/srd/nistsd22.htm

  27. Alias-i (2008) LingPipe, natural language tool. Retrieved March 2008 from http://www.alias-i.com/lingpipe/

  28. Geonames (2008) Gazetteer. Retrieved March 2008 from http://www.geonames.org

  29. National Imagery and Mapping Agency (NIMA) (2008) Vector map level 0. Retrieved March 2008 from http://www.mapability.com

  30. FWTools (2008) Open source GIS binary kit for Windows and Linux. Retrieved March 2008 from http://fwtools.maptools.org

  31. Refractions Research (2008) PostGIS. Retrieved March 2008 from http://postgis.refractions.net

  32. Gamma E, Helm R, Johnson R, Vlissides J (1996) Design patterns: elements of reusable object-oriented software. Addison-Wesley, Harlow

    Google Scholar 

  33. Gospodnetić O, Hatcher E (2005) Lucene IN ACTION. Manning. ISBN: 1932394281

  34. Lucene A (2005) Scoring. Retrieved June 2008 from http://lucene.apache.org/java/2_2_0/scoring.html

  35. Van Kreveld M, Reinbacher I, Arampatzis A, Van Zwol R Multi-dimensional scattered ranking methods for geographic information retrieval. Geoinformatica 9(1):61–84. doi:10.1007/s10707-004-5622-6

  36. Godoy F, Rodríguez A (2004) Defining and comparing content measures of topological relations. GeoInformatica, pp 347–371

  37. Andrade L, Silva MJ (2006) Relevance ranking for geographic IR. In: GIR ’06: proceedings of the 2006 workshop on Geogr. Inform. Retrieval, SIGIR. ACM, New York

    Google Scholar 

  38. Jones C, Alani H, Tudhope D (2001) Geographical information retrieval with ontologies of place. In: COSIT’01: 3rd international conference on spatial information theory, pp 322–335

  39. Yu B, Cai G (2007) A query-aware document ranking method for geographic information retrieval. In: GIR ’07: proceedings of the 2007 workshop on Geogr. Inform. Retrieval. ACM, New York

    Google Scholar 

  40. OSGeo (2008) Open Layers API. Retrieved May 2008 from http://openlayers.org

  41. Beckmann N, Kriegel HP, Schneider R, Seeger B (1990) The R*-tree: an efficient and robust access method for points and rectangles. SIGMOD Rec. 19(2):322–331. doi:http://doi.acm.org/10.1145/93605.98741

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Miguel R. Luaces.

Additional information

This work has been partially supported by Ministerio de Educación y Ciencia (PGE and FEDER), grant TIN2009-14560-C03-02, and by “Xunta de Galicia” ref. 08SIN009CT.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Brisaboa, N.R., Luaces, M.R., Places, Á.S. et al. Exploiting geographic references of documents in a geographical information retrieval system using an ontology-based index. Geoinformatica 14, 307–331 (2010). https://doi.org/10.1007/s10707-010-0106-3

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10707-010-0106-3

Keywords

Navigation