The SPIRIT Spatial Search Engine: Architecture, Ontologies and Spatial Indexing

  • Christopher B. Jones
  • Alia I. Abdelmoty
  • David Finch
  • Gaihua Fu
  • Subodh Vaid
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3234)

Abstract

The SPIRIT search engine provides a test bed for the development of web search technology that is specialised for access to geographical information. Major components include the user interface, a geographical ontology, maintenance and retrieval functions for a test collection of web documents, textual and spatial indexes, relevance ranking and metadata extraction. Here we summarise the functionality and interaction between these components before focusing on the design of the geo-ontology and the development of spatio-textual indexing methods. The geo-ontology supports functionality for disambiguation, query expansion, relevance ranking and metadata extraction. Geographical place names are accompanied by multiple geometric footprints and qualitative spatial relationships. Spatial indexing of documents has been integrated with text indexing through the use of spatio-textual keys in which terms are concatenated with spatial cells to which they relate. Preliminary experiments demonstrate considerable performance benefits when compared with pure text indexing and with text indexing followed by a spatial filtering stage.
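The spatio-textual key idea described in the abstract can be illustrated with a minimal sketch. This is not the SPIRIT implementation: the regular one-degree grid, the `term@col:row` key format, the bounding-box footprints, and all names (`SpatioTextualIndex`, `cells_for_bbox`, `make_key`) are assumptions made only for illustration. Each term is concatenated with every grid cell that the document's footprint overlaps, so a single index lookup applies the textual and spatial constraints together.

```python
from collections import defaultdict

CELL_SIZE = 1.0  # grid resolution in degrees (assumed for this sketch)


def cells_for_bbox(min_lon, min_lat, max_lon, max_lat, cell_size=CELL_SIZE):
    """Return the (col, row) ids of all grid cells a bounding box overlaps."""
    col0, col1 = int(min_lon // cell_size), int(max_lon // cell_size)
    row0, row1 = int(min_lat // cell_size), int(max_lat // cell_size)
    return {(c, r) for c in range(col0, col1 + 1) for r in range(row0, row1 + 1)}


def make_key(term, cell):
    """Concatenate a term with a cell id to form a spatio-textual key."""
    return f"{term}@{cell[0]}:{cell[1]}"


class SpatioTextualIndex:
    """Inverted index whose keys are term+cell strings (hypothetical design)."""

    def __init__(self):
        self.postings = defaultdict(set)

    def add(self, doc_id, terms, footprint):
        """Index a document under every (term, overlapped cell) pair."""
        for cell in cells_for_bbox(*footprint):
            for term in terms:
                self.postings[make_key(term.lower(), cell)].add(doc_id)

    def search(self, terms, query_footprint):
        """Return documents that contain every term in some query cell."""
        cells = cells_for_bbox(*query_footprint)
        result = None
        for term in terms:
            docs = set()
            for cell in cells:
                docs |= self.postings.get(make_key(term.lower(), cell), set())
            result = docs if result is None else result & docs
        return result or set()


# Invented example documents with (min_lon, min_lat, max_lon, max_lat) footprints.
index = SpatioTextualIndex()
index.add("d1", ["castle", "hotel"], (-3.3, 51.4, -3.1, 51.6))
index.add("d2", ["castle"], (-1.7, 54.9, -1.5, 55.1))
index.add("d3", ["hotel"], (-3.25, 51.45, -3.15, 51.5))

south_wales = (-3.4, 51.3, -3.0, 51.7)
print(index.search(["castle"], south_wales))
print(index.search(["castle", "hotel"], south_wales))
```

Matching at cell granularity is coarse, so a fuller system would probably check candidate footprints against the exact query geometry afterwards. The potential benefit over text indexing followed by spatial filtering is that postings for a term outside the query cells are never read.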

Keywords

Search Engine; Query Time; Query Term; Query Expansion; Geographical Context
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Springer-Verlag Berlin Heidelberg 2004

Authors and Affiliations

  • Christopher B. Jones (1)
  • Alia I. Abdelmoty (1)
  • David Finch (1)
  • Gaihua Fu (1)
  • Subodh Vaid (1)

  1. School of Computer Science, Cardiff University, Cardiff, UK
