Abstract
The growing amount of archival multimedia content available online is creating more opportunities for users interested in exploratory search behaviour such as browsing. The user experience with online collections could therefore be improved by enabling navigation and recommendation within multimedia archives, for example by allowing a user to follow hyperlinks created within or across documents. The main goal of this study is to compare the performance of different multimedia features for automatic hyperlink generation. We construct multimedia hyperlinks by indexing and searching textual and visual features extracted from the blip.tv dataset. We then propose a user-driven evaluation strategy based on the Amazon Mechanical Turk (AMT) crowdsourcing platform, since we believe that AMT workers are a good example of “real world” users. We conclude that textual features perform better than visual features for multimedia hyperlink construction. In general, a combination of automatic speech recognition (ASR) transcripts and metadata provides the best results.
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Chen, S., Eskevich, M., Jones, G.J.F., O’Connor, N.E. (2014). An Investigation into Feature Effectiveness for Multimedia Hyperlinking. In: Gurrin, C., Hopfgartner, F., Hurst, W., Johansen, H., Lee, H., O’Connor, N. (eds) MultiMedia Modeling. MMM 2014. Lecture Notes in Computer Science, vol 8326. Springer, Cham. https://doi.org/10.1007/978-3-319-04117-9_23
DOI: https://doi.org/10.1007/978-3-319-04117-9_23
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-04116-2
Online ISBN: 978-3-319-04117-9
eBook Packages: Computer Science (R0)