Abstract
Web search engines have become extremely popular in providing requested information to the user. The result set effectiveness of Web search engines has been continuously improving over the years. However, the documents of the result set may also contain irrelevant information having no importance to the user. So, the user has to spend some effort in searching for relevant information in these result set documents. To overcome this searching overhead, Web object search engines have been proposed. Such systems are built by extracting object information from various Web documents and integrating them into object repository. The user is provided with the facility to submit object search queries and the required object information is retrieved. Unlike, Web search engines, providing results to geography-specific queries is still in nascent stage for Web object search engines. Recently, Gaussian Mixture Model based technique for geographical labeling of Web objects was proposed in the literature. However, there is significant scope to improve the labeling accuracy results obtained in this technique. In this chapter, maximum marginal classifier-based technique for Web object geographical labeling is proposed. The advantages of this proposed technique are empirically exhibited on a real-world data set. This proposed technique, outperforms the contemporary technique by at least 40% in labeling accuracy, and is twice better in execution efficiency.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
ZaiqingNie, Yunxiao Ma, Shuming Shi, Ji-Rong Wen, Wei-Ying Ma Web Object Retrieval WWW 2007, May8-12-2007, Banff, Alberta, Canada
J. Raper, Geographic relevance. Journal of Documentation 63(6), 836–852 (2007)
T. Tezuka, H. Kondo, K. Tanaka, Estimation of Geographic Relevance for Web objects Using Probabilistic Models (Springer, Berlin/Heidelberg, 2008)
K.S. McCurley, Geospatial mapping and navigation of the Web, in Proceedings of the 10th international world wide web conference, Hong kong, China, pp. 221–229 (2001)
W. Gao, H.C. Lee, Y. Miao, Geographically focused collaborative crawling, in Proceedings of the 15th international world wide web conference, Edinburgh, Scotland, pp. 287–296 (2006)
Q. Mei, C. Liu, H. Su, C. Zhai, A probabilistic approach to spatiotemporal theme pattern mining on weblogs, in Proceedings of the 15th international world wide web conference, Edinburgh, Scotland, pp. 533–542 (2006)
L. Gravano, V. Hatzivassiloglou, R. Litchenstein, Categorizing Web queries according to geographical locality, in Proceedings of the 12th international conference on information and knowledge management, New Orleans, Lousiana, pp. 325–333 (2003)
L. Chen, L. Zhang, F. Jing, K. Deng, W.Y. Ma, Ranking Web objects from multiple communities, in Proceedings of the international conference on information and knowledge management, Arlington, Virginia, pp. 377–386 (2006)
Z. Nie, Y. Ma, S. Shi, J.R. Wen, W.Y. Ma, Web object retrieval, in Proceedings of the 16th international world wide web conference, Banff, Canada, pp. 81–90 (2007)
Z. Nie, J.R. Wen, W.Y. Ma, Object-level vertical search, in Proceedings of the 3rd biennial conference on innovative data systems research, Asilomar, California, pp. 235–246 (2007)
O. Buyukkokten, J. Cho, H. Garcia-Molina, L. Gravano, N. Shivakumar, Exploiting geographical location information of Web pages, in proceedings of the ACM SIGMOD workshop on the web and databases, Philadelphia, Pennsylvania (1999)
T. Tezuka, T. Kurashima, K. Tanaka, Toward tighter integration of Web search with a geographic information system, in Proceedings of the 15th world wide Web conference, Edinburgh, Scotland, pp. 277–286 (2006)
C.A. Davis, F.T. Fonseca, Assessing the certainty of locations produced by an address geo coding system. Geoinformatica 11(1), 103–129 (2007)
E. Amitay, N. Har El, R. Sivan, A. Soffer, Geotagging web content, in Proceedings of the 27th annual international ACM SIGIR conference on research and development in information retrieval, Sheffield, United Kingdom, pp. 273–280
M.D. Lieberman, J. Sperling, STEWARD Architecture of a Spatio-textual Search Engine, in Proceedings of the 15th annual ACM international symposium on advances in geo-graphic information systems, Seattle, Washington, Article No.25 (2007)
M. Schneider, Geographic Data Modeling: Fuzzy Topological Predicates, their Properties and their Integration into Query Languages, in Proceedings of the 9th ACM international symposium on advances in geographic information systems, Atlanta, Georgia, (2001), pp. 9–14
J. Coffman, A.C. Weaver, A Framework for Evaluating Database Keyword Search Strategies, in Proceedings of the 19th ACM International Conference on Information and Knowledge Management (2010). 978-1-4503-0099-5
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Anjan Kumar, K.N., Satish Kumar, T., Reshma, J. (2021). Geographical Labeling of Web Objects Through Maximum Marginal Classification. In: Stahlbock, R., Weiss, G.M., Abou-Nasr, M., Yang, CY., Arabnia, H.R., Deligiannidis, L. (eds) Advances in Data Science and Information Engineering. Transactions on Computational Science and Computational Intelligence. Springer, Cham. https://doi.org/10.1007/978-3-030-71704-9_52
Download citation
DOI: https://doi.org/10.1007/978-3-030-71704-9_52
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-71703-2
Online ISBN: 978-3-030-71704-9
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)