Skip to main content

Estimation of Geographic Relevance for Web Objects Using Probabilistic Models

  • Conference paper
Web and Wireless Geographical Information Systems (W2GIS 2008)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5373))

Abstract

The rapidly increasing use of geographically restrictive web search has made determination of the geographic relevance of web content an important task. We have developed a method that uses Gaussian mixture models to estimate the geographic relevance of web pages and arbitrary topics and have implemented a visualization interface that maps pages and topics to geographic space. The system enables the user to retrieve web pages and topics expressed on the Web that are relevant to an arbitrary geographic area.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Raper, J.: Geographic relevance. Journal of Documentation 63(6), 836–852 (2007)

    Article  Google Scholar 

  2. McCurley, K.S.: Geospatial mapping and navigation of the Web. In: Proceedings of the 10th International World Wide Web Conference, Hong Kong, China, pp. 221–229 (2001)

    Google Scholar 

  3. Gao, W., Lee, H.C., Miao, Y.: Geographically focused collaborative crawling. In: Proceedings of the 15th International World Wide Web Conference, Edinburgh, Scotland, pp. 287–296 (2006)

    Google Scholar 

  4. Zhou, Y., Xie, X., Wang, C., Gong, Y., Ma, W.Y.: Hybrid index structures for location-based web search. In: Proceedings of the 14th ACM International Conference on Information and Knowledge Management, Bremen, Germany, pp. 155–162 (2005)

    Google Scholar 

  5. Matsumoto, C., Ma, Q., Tanaka, K.: Web information retrieval based on the localness degree. In: Proceedings of the 13th International Conference on Database and Expert Systems Applications, Aix-en-Provence, France, pp. 172–181 (2004)

    Google Scholar 

  6. Chen, L., Zhang, L., Jing, F., Deng, K., Ma, W.Y.: Ranking web objects from multiple communities. In: Proceedings of the International Conference on Information and Knowledge Management, Arlington, Virginia, pp. 377–386 (2006)

    Google Scholar 

  7. Nie, Z., Ma, Y., Shi, S., Wen, J.R., Ma, W.Y.: Web object retrieval. In: Proceedings of the 16th International World Wide Web Conference, Banff, Canada, pp. 81–90 (2007)

    Google Scholar 

  8. Nie, Z., Wen, J.R., Ma, W.Y.: Object-level vertical search. In: Proceedings of the 3rd Biennial Conference on Innovative Data Systems Research, Asilomar, California, pp. 235–246 (2007)

    Google Scholar 

  9. Kwok, C., Etzioni, O., Weld, D.S.: Scaling question answering to the Web. In: Proceedings of the 10th International World Wide Web Conference, Hong Kong, pp. 150–161 (2001)

    Google Scholar 

  10. Radev, D., Fan, W., Qi, H., Wu, H., G.: Probabilistic question answering on the Web. Journal of the American Society for Information Science and Technology 56(6), 571–583 (2005)

    Article  Google Scholar 

  11. Buyukkokten, O., Cho, J., Garcia-Molina, H., Gravano, L., Shivakumar, N.: Exploiting geographical location information of Web pages. In: Proceedings of the ACM SIGMOD Workshop on the Web and Databases, Philadelphia, Pennsylvania (1999)

    Google Scholar 

  12. Gravano, L., Hatzivassiloglou, V., Lichtenstein, R.: Categorizing web queries according to geographical locality. In: Proceedings of the 12th International Conference on Information and Knowledge Management, New Orleans, Lousiana, pp. 325–333 (2003)

    Google Scholar 

  13. Mei, Q., Liu, C., Su, H., Zhai, C.: A probabilistic approach to spatiotemporal theme pattern mining on weblogs. In: Proceedings of the 15th International World Wide Web Conference, Edinburgh, Scotland, pp. 533–542 (2006)

    Google Scholar 

  14. Tezuka, T., Kurashima, T., Tanaka, K.: Toward tighter integration of web search with a geographic information system. In: Proceedings of the 15th World Wide Web Conference, Edinburgh, Scotland, pp. 277–286 (2006)

    Google Scholar 

  15. Davis, C.A., Fonseca, F.T.: Assessing the certainty of locations produced by an address geocoding system. Geoinformatica 11(1), 103–129 (2007)

    Article  Google Scholar 

  16. Amitay, E., Har’El, N., Sivan, R., Soffer, A.: Web-a-Where: geotagging web content. In: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Sheffield, United Kingdom, pp. 273–280 (2004)

    Google Scholar 

  17. Lieberman, M.D., Sperling, J.: STEWARD: Architecture of a spatio-textual search engine. In: Proceedings of the 15th Annual ACM International Symposium on Advances in Geographic Information Systems, Seattle, Washington, Article No. 25 (2007)

    Google Scholar 

  18. Sengar, V., Joshi, T., Joy, J., Prakash, S., Toyama, K.: Robust location search from text queries. In: Proceedings of the 15th Annual ACM International Symposium on Advances in Geographic Information Systems, Seattle, Washington, Article No. 24 (2007)

    Google Scholar 

  19. Schneider, M.: Geographic data modeling: Fuzzy topological predicates, their properties, and their integration into query languages. In: Proceedings of the 9th ACM international symposium on advances in geographic information systems, Atlanta, Georgia, pp. 9–14 (2001)

    Google Scholar 

  20. Shi, W., Liu, K.: A fuzzy topology for computing the interior, boundary, and exterior of spatial objects quantitatively in GIS. Computers & Geosciences 33(7), 898–915 (2007)

    Article  Google Scholar 

  21. Bishop, C.M.: Pattern recognition and machine learning. Springer, Heidelberg (2006)

    MATH  Google Scholar 

  22. Bilmes, J.A.: A gentle tutorial of the EM algorithm and its application to parameter estimation for Gaussian Mixture and Hidden Markov Models, Technical Report, University of Berkeley, ICSI-TR-97-021 (1997)

    Google Scholar 

  23. van Rijsbergen, C.J.: Information Retrieval - Second Edition. Butterworth & Co Publishers Ltd (1979)

    Google Scholar 

  24. Geographical Survey Institute of Japan, http://www.gsi.go.jp/ENGLISH/

  25. MeCab, http://mecab.sourceforge.net/

  26. Yahoo!, API, http://developer.yahoo.co.jp/

  27. Google Maps, API, http://google.com/apis/maps/

  28. Wikipedia, http://wikipedia.org/

  29. Google Maps, http://maps.google.com/

  30. Yahoo! Local Maps, http://map.yahoo.com/

  31. Live Search Maps, http://maps.live.com/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Tezuka, T., Kondo, H., Tanaka, K. (2008). Estimation of Geographic Relevance for Web Objects Using Probabilistic Models. In: Bertolotto, M., Ray, C., Li, X. (eds) Web and Wireless Geographical Information Systems. W2GIS 2008. Lecture Notes in Computer Science, vol 5373. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89903-7_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-89903-7_12

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-89902-0

  • Online ISBN: 978-3-540-89903-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics