An Ontology-Based Index to Retrieve Documents with Geographic Information

  • Miguel R. Luaces
  • Jose R. Paramá
  • Oscar Pedreira
  • Diego Seco
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5069)


Both Geographic Information Systems and Information Retrieval have been very active research fields in the last decades. Lately, a new research field called Geographic Information Retrieval has appeared from the intersection of these two fields. The main goal of this field is to define index structures and techniques to efficiently store and retrieve documents using both the text and the geographic references contained within the text.

We present in this paper a new index structure that combines an inverted index, a spatial index, and an ontology-based structure. This structure improves the query capabilities of other proposals. In addition, we describe the architecture of a system for geographic information retrieval that uses this new index structure. This architecture defines a workflow for the extraction of the geographic references in the document.


Index Structure Query Expansion Inverted Index Query User Interface Spatial Data Infrastructure 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. Addison Wesley, Reading (1999)Google Scholar
  2. 2.
    Worboys, M.F.: GIS: A Computing Perspective. CRC, Boca Raton (2004)Google Scholar
  3. 3.
    ISO/IEC: Geographic Information – Reference Model. International Standard 19101, ISO/IEC (2002)Google Scholar
  4. 4.
    Open GIS Consortium, Inc.: OpenGIS Reference Model. OpenGIS Project Document 03-040, Open GIS Consortium, Inc.(2003)Google Scholar
  5. 5.
    Global Spatial Data Infrastructure Association: Online documentation (Retrieved May 2007),
  6. 6.
    Lieberman, M.D., Samet, H., Sankaranarayanan, J., Sperling, J.: STEWARD: Architecture of a Spatio-Textual Search Engine. In: Proceedings of the 15th ACM Int. Symp. on Advances in Geographic Information Systems (ACMGIS 2007), pp. 186–193. ACM Press, New York (2007)Google Scholar
  7. 7.
    Chen, Y.Y., Suel, T., Markowetz, A.: Efficient query processing in geographic web search engines. In: SIGMOD Conference, pp. 277–288 (2006)Google Scholar
  8. 8.
    Martins, B., Silva, M.J., Andrade, L.: Indexing and ranking in Geo-IR systems. In: GIR 2005: Proceedings of the 2005 workshop on Geographic information retrieval, pp. 31–34. ACM Press, New York (2005)CrossRefGoogle Scholar
  9. 9.
    Gaede, V., Günther, O.: Multidimensional access methods. ACM Comput. Surv. 30(2), 170–231 (1998)CrossRefGoogle Scholar
  10. 10.
    Guttman, A.: R-Trees: A Dynamic Index Structure for Spatial Searching. In: Yormark, B. (ed.) SIGMOD 1984, Proceedings of Annual Meeting, Boston, Massachusetts, June 18-21, 1984, pp. 47–57. ACM Press, New York (1984)CrossRefGoogle Scholar
  11. 11.
    Jones, C.B., Purves, R., Ruas, A., Sanderson, M., Sester, M., van Kreveld, M., Weibel, R.: Spatial information retrieval and geographical ontologies an overview of the SPIRIT project. In: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 387–388 (2002)Google Scholar
  12. 12.
    Jones, C.B., Abdelmoty, A.I., Fu, G.: Maintaining ontologies for geographical information retrieval on the web. In: Meersman, R., Tari, Z., Schmidt, D.C. (eds.) CoopIS 2003, DOA 2003, and ODBASE 2003. LNCS, vol. 2888, pp. 934–951. Springer, Heidelberg (2003)Google Scholar
  13. 13.
    Jones, C.B., Abdelmoty, A.I., Fu, G., Vaid, S.: The SPIRIT Spatial Search Engine: Architecture, Ontologies and Spatial Indexing. In: Egenhofer, M.J., Freksa, C., Miller, H.J. (eds.) GIScience 2004. LNCS, vol. 3234, pp. 125–139. Springer, Heidelberg (2004)Google Scholar
  14. 14.
    Vaid, S., Jones, C.B., Joho, H., Sanderson, M.: Spatio-Textual Indexing for Geographical Search on the Web. In: Bauzer Medeiros, C., Egenhofer, M.J., Bertino, E. (eds.) SSTD 2005. LNCS, vol. 3633, pp. 218–235. Springer, Heidelberg (2005)Google Scholar
  15. 15.
    Fu, G., Jones, C.B., Abdelmoty, A.I.: Ontology-Based Spatial Query Expansion in Information Retrieval. In: Meersman, R., Tari, Z. (eds.) OTM 2005. LNCS, vol. 3761, pp. 1466–1482. Springer, Heidelberg (2005)CrossRefGoogle Scholar
  16. 16.
    Zhou, Y., Xie, X., Wang, C., Gong, Y., Ma, W.Y.: Hybrid index structures for location-based web search. In: CIKM 2005: Proceedings of the 14th ACM international conference on Information and knowledge management, pp. 155–162. ACM, New York (2005)CrossRefGoogle Scholar
  17. 17.
    Hariharan, R., Hore, B., Li, C., Mehrotra, S.: Processing Spatial-Keyword (SK) Queries in Geographic Information Retrieval (GIR) Systems. In: Proceedings of the 19th Int. Conf. on Scientific and Statistical Database Management (SSDBM 2007). IEEE Computer Society, Los Alamitos (2007)Google Scholar
  18. 18.
    Gruber, T.R.: A Translation Approach to Portable Ontology Specifications. Knowledge Acquisition 5(2), 199–220 (1993)CrossRefGoogle Scholar
  19. 19.
    Dellis, E., Paliouras, G.: Management of Large Spatial Ontology Bases. In: Proceedings of the Workshop on Ontologies-based techniques for DataBases and Information Systems (ODBIS) of the 32nd International Conference on Very Large Data Bases (VLDB 2006) (September 2006)Google Scholar
  20. 20.
    Open GIS Consortium, Inc.: OpenGIS Web Map Service Implementation Specification. OpenGIS Project Document 01-068r3, Open GIS Consortium, Inc. (2002)Google Scholar
  21. 21.
    Apache: Lucene (Retrieved October 2007),
  22. 22.
    National Institute of Standards and Technology (NIST): TREC Special Database 22, TREC Document Database: Disk 4 (Retrieved November 2007),
  23. 23.
    Amitay, E., Har’El, N., Sivan, R., Soffer, A.: Web-a-where: geotagging web content. In: SIGIR 2004: Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 273–280. ACM, New York (2004)Google Scholar
  24. 24.
    Rauch, E., Bukatin, M., Baker, K.: A confidence-based framework for disambiguating geographic terms. In: Proceedings of the HLT-NAACL 2003 workshop on Analysis of geographic references, Morristown, NJ, USA, pp. 50–54. Association for Computational Linguistics (2003)Google Scholar
  25. 25.
    Alias-i: LingPipe, Natural Language Tool (Retrieved October 2007),
  26. 26.
    Geonames: Gazetteer (Retrieved September 2007),
  27. 27.
    National Imagery and Mapping Agency (NIMA): Vector Map Level 0 (Retrieved September 2007),
  28. 28.
    FWTools: Open Source GIS Binary Kit for Windows and Linux (Retrieved September 2007),
  29. 29.
    Refractions Research: PostGIS (Retrieved June 2007),
  30. 30.
    Gamma, E., Helm, R., Johnson, R., Vlissides, J.: Design Patterns: Elements of Reusable Object-oriented Software. Addison-Wesley, Reading (1996)zbMATHGoogle Scholar
  31. 31.
    Google: Google Maps API (Retrieved November 2007),
  32. 32.
    Beckmann, N., Kriegel, H.P., Schneider, R., Seeger, B.: The R*-tree: an efficient and robust access method for points and rectangles. SIGMOD Rec. 19(2), 322–331 (1990)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Miguel R. Luaces
    • 1
  • Jose R. Paramá
    • 1
  • Oscar Pedreira
    • 1
  • Diego Seco
    • 1
  1. 1.Database LaboratoryUniversity of A CoruñaCoruñaSpain

Personalised recommendations