Abstract
The language model for information retrieval has statistical background and can adapt previous text information retrieval model. Therefore, this model has attracted much attention in recent years. This retrieval model considers only text information. However, we focus on the Web page retrieval in one of the retrieval tasks. Web pages also have some kind of features, so that we should consider another information for the Web page retrieval. Especially, Web pages consist the hyperlink information that is beneficial information for Web page retrieval. In this paper, we propose new retrieval approach considering a feature of term in neighboring Web pages using the hyperlink information.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Charniak, E.: Statistical Language Learning. The MIT Press, Cambridge (1996)
Clarke, C.L.A., Craswell, N., Soboroff, I.: Overview of the trec 2009 web track. In: Text Retrieval Conference (TREC) (2009)
Hollander, M., Wolfe, D.A.: Nonparametric Statistical Methods. Wiley Interscience, Hoboken (1999)
Jelinek, F., Mercer, R.L.: Interpolated estimation of markov source parameters from sparse data. In: Proceeding of the Workshop on Pattern Recognition in Practice, pp. 381–397 (1980)
Kleinberg, J.M.: Authoritative sources in a hyperlinked environment. In: SODA 1998: Proceedings of the Ninth Annual ACM-SIAM Symposium on Discrete Algorithms, pp. 668–677. Society for Industrial and Applied Mathematics, Philadelphia (1998)
Lawrence, P., Sergey, B., Rajeev, M., Terry, W.: The PageRank Citation Ranking: Bringing Order to the Web. Technical Report 1999-66, Stanford InfoLab (1999), http://ilpubs.stanford.edu:8090/422/
Liu, X., Croft, W.B.: Cluster-based retrieval using language models. In: SIGIR 2004: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 186–193. ACM, New York (2004), doi: http://doi.acm.org/10.1145/1008992.1009026
Ponte, J.M., Croft, W.B.: A language modeling approach to information retrieval. In: SIGIR 1998: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 275–281. ACM, New York (1998), doi: http://doi.acm.org/10.1145/290941.291008
Song, F., Croft, W.: A general language model for information retrieval. In: CIKM 1999: Proceedings of the Eighth International Conference on Information and Knowledge Management, pp. 316–321. ACM, New York (1999), doi: http://doi.acm.org/10.1145/319950.320022
Tamura, K., Hatano, K., Yadohisa, H.: Characterizing web pages based on the query likelihoods of neighboring pages. In: Proceedings of the 5th International Conference on Digital Information Management (ICDIM 2010), pp. 392–397 (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tamura, K., Hatano, K., Yadohisa, H. (2011). Calculating Query Likelihoods Based on Web Data Analysis. In: Watada, J., Phillips-Wren, G., Jain, L.C., Howlett, R.J. (eds) Intelligent Decision Technologies. Smart Innovation, Systems and Technologies, vol 10. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22194-1_70
Download citation
DOI: https://doi.org/10.1007/978-3-642-22194-1_70
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22193-4
Online ISBN: 978-3-642-22194-1
eBook Packages: EngineeringEngineering (R0)