Abstract
Geographic Information Systems and Information Retrieval have both been very active research fields in recent decades. Recently, a new research field called Geographic Information Retrieval has emerged at the intersection of these two fields. Its main goal is to define index structures and techniques to efficiently store and retrieve documents using both the text and the geographic references contained within the text. In this paper we present two contributions to this research field. First, we propose a new index structure that combines an inverted index and a spatial index based on an ontology of geographic space. This structure improves on the query capabilities of other proposals. Second, we describe the architecture of a system for geographic information retrieval that defines a workflow for the extraction of the geographic references in documents. The architecture also uses the proposed index structure to solve pure spatial and pure textual queries as well as hybrid queries that combine a textual and a spatial component. Furthermore, because the index structure is based on an ontology, query expansion can be performed on geographic references.
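To make the idea of combining an inverted index with an ontology-based spatial index more concrete, the following is a minimal sketch in Python. It is not the structure proposed in the paper: the class names (`PlaceNode`, `HybridIndex`), the bounding boxes, the sample documents, and the choice of "bounding box intersects the query window" as the spatial predicate are all assumptions made for illustration. It shows a textual query, a spatial query, a hybrid query, and query expansion over a place name by descending the place hierarchy.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class PlaceNode:
    """One node of a toy place ontology (hypothetical, for illustration only)."""
    name: str
    bbox: tuple  # (min_x, min_y, max_x, max_y), approximate values
    children: list = field(default_factory=list)
    docs: set = field(default_factory=set)  # ids of documents mentioning this place


def intersects(a, b):
    """True if two axis-aligned bounding boxes overlap."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]


class HybridIndex:
    """Inverted index over terms plus a place hierarchy with per-node postings."""

    def __init__(self, root):
        self.root = root
        self.postings = defaultdict(set)  # term -> document ids
        self.by_name = {}                 # lower-cased place name -> node
        self._register(root)

    def _register(self, node):
        self.by_name[node.name.lower()] = node
        for child in node.children:
            self._register(child)

    def add_document(self, doc_id, text, places):
        # Assumes geographic references were already extracted into `places`.
        for term in text.lower().split():
            self.postings[term].add(doc_id)
        for place in places:
            self.by_name[place.lower()].docs.add(doc_id)

    def textual(self, terms):
        """Documents containing every query term."""
        sets = [self.postings.get(t.lower(), set()) for t in terms]
        return set.intersection(*sets) if sets else set()

    def spatial(self, window, node=None):
        """Documents attached to any place whose bbox intersects the window."""
        if node is None:
            node = self.root
        if not intersects(node.bbox, window):
            return set()  # prune the whole subtree
        found = set(node.docs)
        for child in node.children:
            found |= self.spatial(window, child)
        return found

    def by_place(self, name):
        """Query expansion: a place name also matches all places it contains."""
        node = self.by_name[name.lower()]
        found = set(node.docs)
        for child in node.children:
            found |= self.by_place(child.name)
        return found

    def hybrid(self, terms, window):
        """Documents satisfying both the textual and the spatial component."""
        return self.textual(terms) & self.spatial(window)


# Toy ontology: Spain contains Galicia and Madrid; Galicia contains Vigo.
vigo = PlaceNode("Vigo", (-8.8, 42.15, -8.6, 42.3))
galicia = PlaceNode("Galicia", (-9.3, 41.8, -6.7, 43.8), children=[vigo])
madrid = PlaceNode("Madrid", (-3.9, 40.3, -3.5, 40.6))
spain = PlaceNode("Spain", (-9.3, 36.0, 3.3, 43.8), children=[galicia, madrid])

index = HybridIndex(spain)
index.add_document(1, "harbour festival", ["Vigo"])
index.add_document(2, "museum festival", ["Madrid"])
index.add_document(3, "harbour strike", ["Galicia"])

print(sorted(index.by_place("Galicia")))                              # [1, 3]
print(sorted(index.hybrid(["festival"], (-9.0, 42.0, -8.0, 43.5))))   # [1]
```

In this sketch, a query for "Galicia" also returns the document that only mentions Vigo, because the hierarchy records that Vigo lies within Galicia. A flat gazetteer lookup would miss that document, which is the kind of expansion an ontology-based index makes possible.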
Additional information
This work has been partially supported by Ministerio de Educación y Ciencia (PGE and FEDER), grant TIN2009-14560-C03-02, and by “Xunta de Galicia” ref. 08SIN009CT.
About this article
Cite this article
Brisaboa, N.R., Luaces, M.R., Places, Á.S. et al. Exploiting geographic references of documents in a geographical information retrieval system using an ontology-based index. Geoinformatica 14, 307–331 (2010). https://doi.org/10.1007/s10707-010-0106-3
DOI: https://doi.org/10.1007/s10707-010-0106-3