Abstract
The rapidly increasing use of geographically restrictive web search has made determination of the geographic relevance of web content an important task. We have developed a method that uses Gaussian mixture models to estimate the geographic relevance of web pages and arbitrary topics and have implemented a visualization interface that maps pages and topics to geographic space. The system enables the user to retrieve web pages and topics expressed on the Web that are relevant to an arbitrary geographic area.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Raper, J.: Geographic relevance. Journal of Documentation 63(6), 836–852 (2007)
McCurley, K.S.: Geospatial mapping and navigation of the Web. In: Proceedings of the 10th International World Wide Web Conference, Hong Kong, China, pp. 221–229 (2001)
Gao, W., Lee, H.C., Miao, Y.: Geographically focused collaborative crawling. In: Proceedings of the 15th International World Wide Web Conference, Edinburgh, Scotland, pp. 287–296 (2006)
Zhou, Y., Xie, X., Wang, C., Gong, Y., Ma, W.Y.: Hybrid index structures for location-based web search. In: Proceedings of the 14th ACM International Conference on Information and Knowledge Management, Bremen, Germany, pp. 155–162 (2005)
Matsumoto, C., Ma, Q., Tanaka, K.: Web information retrieval based on the localness degree. In: Proceedings of the 13th International Conference on Database and Expert Systems Applications, Aix-en-Provence, France, pp. 172–181 (2004)
Chen, L., Zhang, L., Jing, F., Deng, K., Ma, W.Y.: Ranking web objects from multiple communities. In: Proceedings of the International Conference on Information and Knowledge Management, Arlington, Virginia, pp. 377–386 (2006)
Nie, Z., Ma, Y., Shi, S., Wen, J.R., Ma, W.Y.: Web object retrieval. In: Proceedings of the 16th International World Wide Web Conference, Banff, Canada, pp. 81–90 (2007)
Nie, Z., Wen, J.R., Ma, W.Y.: Object-level vertical search. In: Proceedings of the 3rd Biennial Conference on Innovative Data Systems Research, Asilomar, California, pp. 235–246 (2007)
Kwok, C., Etzioni, O., Weld, D.S.: Scaling question answering to the Web. In: Proceedings of the 10th International World Wide Web Conference, Hong Kong, pp. 150–161 (2001)
Radev, D., Fan, W., Qi, H., Wu, H., G.: Probabilistic question answering on the Web. Journal of the American Society for Information Science and Technology 56(6), 571–583 (2005)
Buyukkokten, O., Cho, J., Garcia-Molina, H., Gravano, L., Shivakumar, N.: Exploiting geographical location information of Web pages. In: Proceedings of the ACM SIGMOD Workshop on the Web and Databases, Philadelphia, Pennsylvania (1999)
Gravano, L., Hatzivassiloglou, V., Lichtenstein, R.: Categorizing web queries according to geographical locality. In: Proceedings of the 12th International Conference on Information and Knowledge Management, New Orleans, Lousiana, pp. 325–333 (2003)
Mei, Q., Liu, C., Su, H., Zhai, C.: A probabilistic approach to spatiotemporal theme pattern mining on weblogs. In: Proceedings of the 15th International World Wide Web Conference, Edinburgh, Scotland, pp. 533–542 (2006)
Tezuka, T., Kurashima, T., Tanaka, K.: Toward tighter integration of web search with a geographic information system. In: Proceedings of the 15th World Wide Web Conference, Edinburgh, Scotland, pp. 277–286 (2006)
Davis, C.A., Fonseca, F.T.: Assessing the certainty of locations produced by an address geocoding system. Geoinformatica 11(1), 103–129 (2007)
Amitay, E., Har’El, N., Sivan, R., Soffer, A.: Web-a-Where: geotagging web content. In: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Sheffield, United Kingdom, pp. 273–280 (2004)
Lieberman, M.D., Sperling, J.: STEWARD: Architecture of a spatio-textual search engine. In: Proceedings of the 15th Annual ACM International Symposium on Advances in Geographic Information Systems, Seattle, Washington, Article No. 25 (2007)
Sengar, V., Joshi, T., Joy, J., Prakash, S., Toyama, K.: Robust location search from text queries. In: Proceedings of the 15th Annual ACM International Symposium on Advances in Geographic Information Systems, Seattle, Washington, Article No. 24 (2007)
Schneider, M.: Geographic data modeling: Fuzzy topological predicates, their properties, and their integration into query languages. In: Proceedings of the 9th ACM international symposium on advances in geographic information systems, Atlanta, Georgia, pp. 9–14 (2001)
Shi, W., Liu, K.: A fuzzy topology for computing the interior, boundary, and exterior of spatial objects quantitatively in GIS. Computers & Geosciences 33(7), 898–915 (2007)
Bishop, C.M.: Pattern recognition and machine learning. Springer, Heidelberg (2006)
Bilmes, J.A.: A gentle tutorial of the EM algorithm and its application to parameter estimation for Gaussian Mixture and Hidden Markov Models, Technical Report, University of Berkeley, ICSI-TR-97-021 (1997)
van Rijsbergen, C.J.: Information Retrieval - Second Edition. Butterworth & Co Publishers Ltd (1979)
Geographical Survey Institute of Japan, http://www.gsi.go.jp/ENGLISH/
Yahoo!, API, http://developer.yahoo.co.jp/
Google Maps, API, http://google.com/apis/maps/
Wikipedia, http://wikipedia.org/
Google Maps, http://maps.google.com/
Yahoo! Local Maps, http://map.yahoo.com/
Live Search Maps, http://maps.live.com/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tezuka, T., Kondo, H., Tanaka, K. (2008). Estimation of Geographic Relevance for Web Objects Using Probabilistic Models. In: Bertolotto, M., Ray, C., Li, X. (eds) Web and Wireless Geographical Information Systems. W2GIS 2008. Lecture Notes in Computer Science, vol 5373. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89903-7_12
Download citation
DOI: https://doi.org/10.1007/978-3-540-89903-7_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89902-0
Online ISBN: 978-3-540-89903-7
eBook Packages: Computer ScienceComputer Science (R0)