Skip to main content

Mining Neighbors’ Topicality to Better Control Authority Flow

  • Conference paper
Advances in Information Retrieval (ECIR 2010)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5993))

Included in the following conference series:

  • 2149 Accesses

Abstract

Web pages are often recognized by others through contexts. These contexts determine how linked pages influence and interact with each other. When differentiating such interactions, the authority of web pages can be better estimated by controlling the authority flows among pages. In this work, we determine the authority distribution by examining the topicality relationship between associated pages. In addition, we find it is not enough to quantify the influence of authority propagation from only one type of neighbor, such as parent pages in PageRank algorithm, since web pages, like people, are influenced by diverse types of neighbors within the same network. We propose a probabilistic method to model authority flows from different sources of neighbor pages. In this way, we distinguish page authority interaction by incorporating the topical context and the relationship between associated pages. Experiments on the 2003 and 2004 TREC Web Tracks demonstrate that this approach outperforms other competitive topical ranking models and produces a more than 10% improvement over PageRank on the quality of top 10 search results. When increasing the types of incorporated neighbor sources, the performance shows stable improvements.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Cai, D., He, X., Wen, J.-R., Ma, W.-Y.: Block-level link analysis. In: Proc. 27th Annual Int’l ACM SIGIR Conf. on Research and Dev. in Information Retrieval (July 2004)

    Google Scholar 

  2. Haveliwala, T.H.: Topic-sensitive PageRank. In: Proc. of the 11th Int’l World Wide Web Conf., pp. 517–526. ACM Press, New York (2002)

    Google Scholar 

  3. McCallum, A.K.: Bow: A toolkit for statistical language modeling, text retrieval, classification and clustering (1996), http://www.cs.cmu.edu/~mccallum/bow

  4. Nie, L., Davison, B.D.: Separate and inequal: Preserving heterogeneity in topical authority flows. In: Proc. 31st Annual Int’l ACM SIGIR Conf. on Research and Dev. in Information Retrieval, July 2008, pp. 443–450 (2008)

    Google Scholar 

  5. Nie, L., Davison, B.D., Qi, X.: Topical link analysis for web search. In: Proc. 29th Annual Int’l ACM SIGIR Conf. on Research & Dev. in Info. Retrieval, August 2006, pp. 91–98 (2006)

    Google Scholar 

  6. The dmoz Open Directory Project, ODP (2009), http://www.dmoz.org/

  7. Qin, T., Liu, T.-Y., Zhang, X.-D., Chen, Z., Ma, W.-Y.: A study of relevance propagation for web search. In: Proc. 28th Annual Int’l ACM SIGIR Conf. on Research and Dev. in Information Retrieval, pp. 408–415 (2005)

    Google Scholar 

  8. Robertson, S.E.: Overview of the OKAPI projects. Journal of Documentation 53, 3–7 (1997)

    Article  Google Scholar 

  9. Shakery, A., Zhai, C.: A probabilistic relevance propagation model for hypertext retrieval. In: Proc. of the 15th ACM Int’l Conf. on Information and Knowledge Management (CIKM), pp. 550–558 (2006)

    Google Scholar 

  10. Shakery, A., Zhai, C.: Smoothing document language models with probabilistic term count propagation. Inf. Retr. 11(2), 139–164 (2008)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Dai, N., Davison, B.D., Wang, Y. (2010). Mining Neighbors’ Topicality to Better Control Authority Flow. In: Gurrin, C., et al. Advances in Information Retrieval. ECIR 2010. Lecture Notes in Computer Science, vol 5993. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12275-0_69

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-12275-0_69

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-12274-3

  • Online ISBN: 978-3-642-12275-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics