Abstract
We explore statistical properties of links within Wikipedia. We demonstrate that a simple algorithm can predict many of the links that would normally be added to a new article, without considering the topic of the article itself. We then explore a variant of topic-oriented PageRank, which can effectively identify topical links within existing articles, when compared with manual judgments of their topical relevance. Based on these results, we suggest that linkages within Wikipedia arise from a combination of structural requirements and topical relationships.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Adafre, S.F., de Rijke, M.: Discovering missing links in Wikipedia. In: 3rd International Workshop on Link Discovery, Chicago, pp. 90–97 (2005)
Büttcher, S., Clarke, C.L.A., Cormack, G.V.: Information Retrieval: Implementing and Evaluating Search Engines. MIT Press, Cambridge (2010)
Gardner, J.J., Xiong, L.: Automatic link detection: A sequence labeling approach. In: 18th CIKM, Hong Kong, pp. 1701–1704 (2009)
Huang, D.W.C., Xu, Y., Trotman, A., Geva, S.: Overview of INEX 2007 link the wiki track. In: Fuhr, N., Kamps, J., Lalmas, M., Trotman, A. (eds.) INEX 2007. LNCS, vol. 4862, pp. 373–387. Springer, Heidelberg (2008)
Huang, W.C., Geva, S., Trotman, A.: Overview of the INEX 2008 link the wiki track. In: Geva, S., Kamps, J., Trotman, A. (eds.) INEX 2008. LNCS, vol. 5631, pp. 314–325. Springer, Heidelberg (2009)
Itakura, K.Y., Clarke, C.L.A.: University of waterloo at INEX2007: Adhoc and link-the-wiki tracks. In: Fuhr, N., Kamps, J., Lalmas, M., Trotman, A. (eds.) INEX 2007. LNCS, vol. 4862, pp. 417–425. Springer, Heidelberg (2008)
Itakura, K.Y., Clarke, C.L.A.: University of waterloo at INEX 2009: Ad hoc, book, entity ranking, and link-the-wiki tracks. In: Geva, S., Kamps, J., Trotman, A. (eds.) INEX 2009. LNCS, vol. 6203, pp. 331–341. Springer, Heidelberg (2010)
Mihalcea, R., Csomai, A.: Wikify!: Linking documents to encyclopedic knowledge. In: 16th CIKM, Lisbon, pp. 233–242 (2007)
Milne, D., Witten, I.H.: Learning to link with Wikipedia. In: 17th CIKM, pp. 509–518. Napa Valley, California (2008)
Voorhees, E.M.: Variations in relevance judgments and the measurement of retrieval effectiveness. In: 21st SIGIR, pp. 315–323 (1998)
Zhang, J., Kamps, J.: Link detection in XML documents: What about repeated links. In: SIGIR 2008 Workshop on Focused Retrieval, pp. 59–66 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Itakura, K.Y., Clarke, C.L.A., Geva, S., Trotman, A., Huang, W.C. (2011). Topical and Structural Linkage in Wikipedia. In: Clough, P., et al. Advances in Information Retrieval. ECIR 2011. Lecture Notes in Computer Science, vol 6611. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20161-5_45
Download citation
DOI: https://doi.org/10.1007/978-3-642-20161-5_45
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20160-8
Online ISBN: 978-3-642-20161-5
eBook Packages: Computer ScienceComputer Science (R0)