Abstract
We describe Dublin City University (DCU)’s participation in the VideoCLEF 2009 Linking Task. Two approaches were implemented using the Lemur information retrieval toolkit. Both approaches first extracted a search query from the transcriptions of the Dutch TV broadcasts. One method first performed search on a Dutch Wikipedia archive, then followed links to corresponding pages in the English Wikipedia. The other method first translated the extracted query using machine translation and then searched the English Wikipedia collection directly. We found that using the original Dutch transcription query for searching the Dutch Wikipedia yielded better results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Larson, M., Newman, E., Jones, G.J.F.: Overview of VideoCLEF 2009: New Perspectives on Speech-based Multimedia Content Enrichment. In: Peters, C., et al. (eds.) CLEF 2009 Workshop, Part II. LNCS, vol. 6242, pp. 354–368. Springer, Heidelberg (2010)
The Lemur Toolkit, http://www.lemurproject.org/
Jones, G.J.F., Fantino, F., Newman, E., Zhang, Y.: Domain-Specific Query Translation for Multilingual Information Access Using Machine Translation Augmented With Dictionaries Mined From Wikipedia. In: Proceedings of the 2nd International Workshop on Cross Lingual Information Access - Addressing the Information Need of Multilingual Societies (CLIA-2008), Hyderabad, India, pp. 34–41 (2008)
Don, M.: Indri Retrieval Model Overview, http://ciir.cs.umass.edu/~metzler/indriretmodel.html
Oleander Stemming Library, http://sourceforge.net/projects/porterstemmers/
Snowball, http://snowball.tartarus.org/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gyarmati, Á., Jones, G.J.F. (2010). When to Cross Over? Cross-Language Linking Using Wikipedia for VideoCLEF 2009. In: Peters, C., et al. Multilingual Information Access Evaluation II. Multimedia Experiments. CLEF 2009. Lecture Notes in Computer Science, vol 6242. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15751-6_53
Download citation
DOI: https://doi.org/10.1007/978-3-642-15751-6_53
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15750-9
Online ISBN: 978-3-642-15751-6
eBook Packages: Computer ScienceComputer Science (R0)