Mining the Web for Multimedia-Based Enriching
As the amount of social media shared on the Internet grows increasingly, it becomes possible to explore a topic with a novel, people based viewpoint. We aim at performing topic enriching using media items mined from social media sharing platforms. Nevertheless, such data collected from the Web is likely to contain noise, hence the need to further process collected documents to ensure relevance. To this end, we designed an approach to automatically propose a cleaned set of media items related to events mined from search trends. Events are described using word tags and a pool of videos is linked to each event in order to propose relevant content. This pool has previously been filtered out from non-relevant data using information retrieval techniques. We report the results of our approach by automatically illustrating the popular moments of four celebrities.
KeywordsOutlier Detection Query Expansion Relevant Content Media Item Outlier Score
Unable to display preview. Download preview PDF.
- 2.Breunig, M., Kriegel, H.-P., Ng, R.T., Sander, J.: LOF: Identifying Density-Based Local Outliers. In: Proceedings of the 2000 ACM SIGMOD International Confrence on Management of Data, pp. 93–104. ACM (2000)Google Scholar
- 4.Capra, R.G., Lee, C.A., Marchionini, G., Russell, T., Shah, C., Stutzman, F.: Selection and context scoping for digital video collections: an investigation of youtube and blogs. In: Proceedings of the 8th ACM/IEEE-CS Joint Conference on Digital Libraries, JCDL 2008, pp. 211–220. ACM, New York (2008)Google Scholar
- 5.Knorr, E.M., Ng, R.T.: Algorithms for Mining Distance-Based Outliers in Large Datasets, pp. 392–403 (1998)Google Scholar
- 6.Ponte, J.M., Croft, W.B.: A language modeling approach to information retrieval. In: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 1998, pp. 275–281. ACM, New York (1998)Google Scholar
- 8.Robertson, S., Walker, S., Jones, S., Hancock-Beaulieu, M., Gatford, M.: Okapi at TREC-3, pp. 109–126 (1996)Google Scholar
- 9.Sahuguet, M., Huet, B.: Socially Motivated Multimedia Topic Timeline Summarization. In: Proceedings of the 2013 International Workshop on Socially-aware Multimedia, SAM 2013. ACM (2013)Google Scholar
- 10.Shuyo, N.: Language Detection Library for Java (2010)Google Scholar
- 11.Xu, Y., Jones, G.J., Wang, B.: Query dependent pseudo-relevance feedback based on wikipedia. In: Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2009, pp. 59–66. ACM, New York (2009)Google Scholar
- 12.Zhai, C., Lafferty, J.: A study of smoothing methods for language models applied to Ad Hoc information retrieval. In: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2001, pp. 334–342. ACM, New York (2001)CrossRefGoogle Scholar