Skip to main content

CSIR at INEX 2008 Link-the-Wiki Track

  • Conference paper
  • 407 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5631))

Abstract

In this paper, we describe methods taken by CSIR in the INEX 2008 Link-the-Wiki track. For the incoming link detection, we use p(d|t), the probability to generate a document, when given the topic file, to judge which documents are proper link sources for the given topic. For the file-to-file task of outgoing link detection, we take a two-step approach: first, we identify a group of candidate target documents by literally matching the topic file title and document content; then, candidate documents are ranked by the number of incoming links. For the anchor-to-BEP task, we use p(d|a,t), the probability to generate a document, when given the topic file and an anchor name, to select anchors and link targets for a given topic.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Huang, D.W.C., Xu, Y., Trotman, A.: Overview of INEX 2007 Link the Wiki Track. In: Fuhr, N., Kamps, J., Lalmas, M., Trotman, A. (eds.) INEX 2007. LNCS, vol. 4862, pp. 373–387. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  2. Jenkinson, D., Trotman, A.: Wikipedia Ad Hoc Passage Retrieval and Wikipedia Document Linking. In: Fuhr, N., Kamps, J., Lalmas, M., Trotman, A. (eds.) INEX 2007. LNCS, vol. 4862, pp. 426–439. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  3. Itakura, K.Y., Clarke, C.L.A.: University of Waterloo at INEX2007:Ad Hoc and Link-the-Wiki Tracks. In: Fuhr, N., Kamps, J., Lalmas, M., Trotman, A. (eds.) INEX 2007. LNCS, vol. 4862, pp. 417–425. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  4. Fachry, K.N., Kamps, J., Koolen, M., Zhang, J.: The University of Amsterdam at INEX 2007. In: Focused Access to XML Documents, 6th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2007, Dagstuhl Castle, Germany, December 17-19, pp. 388–402 (2007)

    Google Scholar 

  5. Geva, S.: GPX@INEX 2007:Ad-Hoc Queries and Automated Link Discovery in the Wikipedia. In: Fuhr, N., Kamps, J., Lalmas, M., Trotman, A. (eds.) INEX 2007. LNCS, vol. 4862, pp. 404–416. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  6. Huang, D.W.C., Trotman, A., Geva, S.: Experiments and Evaluation of link discovery in the wikipedia. In: Proceedings of SIGIR workshop on Focused Retrieval (2008)

    Google Scholar 

  7. Zhang, J., Kamps, J.: Link Detection in XML Documents: What about repeated links? In: Proceedings of SIGIR workshop on Focused Retrieval (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lu, W., Liu, D., Fu, Z. (2009). CSIR at INEX 2008 Link-the-Wiki Track. In: Geva, S., Kamps, J., Trotman, A. (eds) Advances in Focused Retrieval. INEX 2008. Lecture Notes in Computer Science, vol 5631. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03761-0_39

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-03761-0_39

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-03760-3

  • Online ISBN: 978-3-642-03761-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics