Advertisement

Mining the Web for Multimedia-Based Enriching

  • Mathilde Sahuguet
  • Benoit Huet
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8326)

Abstract

As the amount of social media shared on the Internet grows increasingly, it becomes possible to explore a topic with a novel, people based viewpoint. We aim at performing topic enriching using media items mined from social media sharing platforms. Nevertheless, such data collected from the Web is likely to contain noise, hence the need to further process collected documents to ensure relevance. To this end, we designed an approach to automatically propose a cleaned set of media items related to events mined from search trends. Events are described using word tags and a pool of videos is linked to each event in order to propose relevant content. This pool has previously been filtered out from non-relevant data using information retrieval techniques. We report the results of our approach by automatically illustrating the popular moments of four celebrities.

Keywords

Outlier Detection Query Expansion Relevant Content Media Item Outlier Score 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Angiulli, F., Pizzuti, C.: Fast Outlier Detection in High Dimensional Spaces. In: Elomaa, T., Mannila, H., Toivonen, H. (eds.) PKDD 2002. LNCS (LNAI), vol. 2431, pp. 15–26. Springer, Heidelberg (2002)CrossRefGoogle Scholar
  2. 2.
    Breunig, M., Kriegel, H.-P., Ng, R.T., Sander, J.: LOF: Identifying Density-Based Local Outliers. In: Proceedings of the 2000 ACM SIGMOD International Confrence on Management of Data, pp. 93–104. ACM (2000)Google Scholar
  3. 3.
    Cao, G., Nie, J.-Y., Gao, J., Robertson, S.: Selecting good expansion terms for pseudo-relevance feedback. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2008, pp. 243–250. ACM, New York (2008)CrossRefGoogle Scholar
  4. 4.
    Capra, R.G., Lee, C.A., Marchionini, G., Russell, T., Shah, C., Stutzman, F.: Selection and context scoping for digital video collections: an investigation of youtube and blogs. In: Proceedings of the 8th ACM/IEEE-CS Joint Conference on Digital Libraries, JCDL 2008, pp. 211–220. ACM, New York (2008)Google Scholar
  5. 5.
    Knorr, E.M., Ng, R.T.: Algorithms for Mining Distance-Based Outliers in Large Datasets, pp. 392–403 (1998)Google Scholar
  6. 6.
    Ponte, J.M., Croft, W.B.: A language modeling approach to information retrieval. In: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 1998, pp. 275–281. ACM, New York (1998)Google Scholar
  7. 7.
    Ramaswamy, S., Rastogi, R., Shim, K.: Efficient algorithms for mining outliers from large data sets. SIGMOD Rec. 29(2), 427–438 (2000)CrossRefGoogle Scholar
  8. 8.
    Robertson, S., Walker, S., Jones, S., Hancock-Beaulieu, M., Gatford, M.: Okapi at TREC-3, pp. 109–126 (1996)Google Scholar
  9. 9.
    Sahuguet, M., Huet, B.: Socially Motivated Multimedia Topic Timeline Summarization. In: Proceedings of the 2013 International Workshop on Socially-aware Multimedia, SAM 2013. ACM (2013)Google Scholar
  10. 10.
    Shuyo, N.: Language Detection Library for Java (2010)Google Scholar
  11. 11.
    Xu, Y., Jones, G.J., Wang, B.: Query dependent pseudo-relevance feedback based on wikipedia. In: Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2009, pp. 59–66. ACM, New York (2009)Google Scholar
  12. 12.
    Zhai, C., Lafferty, J.: A study of smoothing methods for language models applied to Ad Hoc information retrieval. In: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2001, pp. 334–342. ACM, New York (2001)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Mathilde Sahuguet
    • 1
  • Benoit Huet
    • 1
  1. 1.EurecomSophia-AntipolisFrance

Personalised recommendations