Skip to main content

Calculating Query Likelihoods Based on Web Data Analysis

  • Conference paper
Intelligent Decision Technologies

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 10))

  • 1751 Accesses

Abstract

The language model for information retrieval has statistical background and can adapt previous text information retrieval model. Therefore, this model has attracted much attention in recent years. This retrieval model considers only text information. However, we focus on the Web page retrieval in one of the retrieval tasks. Web pages also have some kind of features, so that we should consider another information for the Web page retrieval. Especially, Web pages consist the hyperlink information that is beneficial information for Web page retrieval. In this paper, we propose new retrieval approach considering a feature of term in neighboring Web pages using the hyperlink information.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Charniak, E.: Statistical Language Learning. The MIT Press, Cambridge (1996)

    Google Scholar 

  2. Clarke, C.L.A., Craswell, N., Soboroff, I.: Overview of the trec 2009 web track. In: Text Retrieval Conference (TREC) (2009)

    Google Scholar 

  3. Hollander, M., Wolfe, D.A.: Nonparametric Statistical Methods. Wiley Interscience, Hoboken (1999)

    MATH  Google Scholar 

  4. Jelinek, F., Mercer, R.L.: Interpolated estimation of markov source parameters from sparse data. In: Proceeding of the Workshop on Pattern Recognition in Practice, pp. 381–397 (1980)

    Google Scholar 

  5. Kleinberg, J.M.: Authoritative sources in a hyperlinked environment. In: SODA 1998: Proceedings of the Ninth Annual ACM-SIAM Symposium on Discrete Algorithms, pp. 668–677. Society for Industrial and Applied Mathematics, Philadelphia (1998)

    Google Scholar 

  6. Lawrence, P., Sergey, B., Rajeev, M., Terry, W.: The PageRank Citation Ranking: Bringing Order to the Web. Technical Report 1999-66, Stanford InfoLab (1999), http://ilpubs.stanford.edu:8090/422/

  7. Liu, X., Croft, W.B.: Cluster-based retrieval using language models. In: SIGIR 2004: Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 186–193. ACM, New York (2004), doi: http://doi.acm.org/10.1145/1008992.1009026

    Google Scholar 

  8. Ponte, J.M., Croft, W.B.: A language modeling approach to information retrieval. In: SIGIR 1998: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 275–281. ACM, New York (1998), doi: http://doi.acm.org/10.1145/290941.291008

    Chapter  Google Scholar 

  9. Song, F., Croft, W.: A general language model for information retrieval. In: CIKM 1999: Proceedings of the Eighth International Conference on Information and Knowledge Management, pp. 316–321. ACM, New York (1999), doi: http://doi.acm.org/10.1145/319950.320022

    Chapter  Google Scholar 

  10. Tamura, K., Hatano, K., Yadohisa, H.: Characterizing web pages based on the query likelihoods of neighboring pages. In: Proceedings of the 5th International Conference on Digital Information Management (ICDIM 2010), pp. 392–397 (2010)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Tamura, K., Hatano, K., Yadohisa, H. (2011). Calculating Query Likelihoods Based on Web Data Analysis. In: Watada, J., Phillips-Wren, G., Jain, L.C., Howlett, R.J. (eds) Intelligent Decision Technologies. Smart Innovation, Systems and Technologies, vol 10. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22194-1_70

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-22194-1_70

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-22193-4

  • Online ISBN: 978-3-642-22194-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics