DCbot: Finding Spatial Information on the Web

  • Conference paper

Part of the Lecture Notes in Computer Science book series (LNISA, volume 3453)

Abstract

The WWW provides an overwhelming amount of information which, once spatially indexed, can be a valuable additional data source for location-based applications. A manually built spatial index can cover only a fraction of the available resources. This paper introduces a system for automatically mapping web pages to geographical locations. Our web robot uses several sets of domain-specific keywords, automatically learned lexical context rules, and a hierarchical catalogue of geographical locations that provides exact coordinates for each location. The spatially indexed web pages are used to construct Geographical Web Portals, which can be accessed by different location-based applications. In addition, we present experimental results demonstrating the quantity and quality of the automatically indexed web pages.
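To make the idea of mapping a page to a location more concrete, the following is a minimal sketch, not the authors' implementation. It assumes a tiny hypothetical catalogue (`CATALOGUE`, with approximate coordinates chosen for illustration) and a single hand-written context rule (`POSTAL_RULE`: a five-digit postal code followed by a capitalized place name). The paper's rules are learned automatically; the rule here is written by hand only to illustrate the lookup step.

```python
import re
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Place:
    name: str
    parent: Optional[str]  # enclosing place in the hierarchy, None for the root
    lat: float
    lon: float


# Hypothetical mini-catalogue; coordinates are approximate and illustrative only.
CATALOGUE = {
    "Germany": Place("Germany", None, 51.0, 10.0),
    "Stuttgart": Place("Stuttgart", "Germany", 48.78, 9.18),
    "Berlin": Place("Berlin", "Germany", 52.52, 13.40),
}

# Assumed hand-written context rule: five digits, whitespace, capitalized word.
POSTAL_RULE = re.compile(r"\b(\d{5})\s+([A-ZÄÖÜ][\w-]+)")


def locate(text: str) -> Optional[Place]:
    """Return the first catalogue place that follows a postal code, else None."""
    for match in POSTAL_RULE.finditer(text):
        place = CATALOGUE.get(match.group(2))
        if place is not None:
            return place
    return None


def ancestors(place: Place) -> List[str]:
    """Walk up the hierarchy and return the names of all enclosing places."""
    chain = []
    while place.parent is not None:
        place = CATALOGUE[place.parent]
        chain.append(place.name)
    return chain


if __name__ == "__main__":
    page = "Universität Stuttgart, Universitätsstraße 38, 70569 Stuttgart"
    hit = locate(page)
    if hit is not None:
        print(hit.name, hit.lat, hit.lon, ancestors(hit))
```

Requiring the postal-code context before accepting a place name is one simple way to avoid indexing pages that merely mention a city in passing; a page containing only "I like Berlin" yields no match under this rule.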

Keywords

  • Address Recognition
  • Resource Description Framework
  • Postal Code Area
  • Postal Address
  • Geographical Reference

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Jakob, M., Grossmann, M., Nicklas, D., Mitschang, B. (2005). DCbot: Finding Spatial Information on the Web. In: Zhou, L., Ooi, B.C., Meng, X. (eds) Database Systems for Advanced Applications. DASFAA 2005. Lecture Notes in Computer Science, vol 3453. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11408079_71

  • DOI: https://doi.org/10.1007/11408079_71

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-25334-1

  • Online ISBN: 978-3-540-32005-0

  • eBook Packages: Computer Science, Computer Science (R0)
