DCU at VideoClef 2008
We describe a baseline system for the VideoCLEF Vid2RSS task in which videos are to be classified into thematic categories based on their content. The system uses an off-the-shelf Information Retrieval system. Speech transcripts generated using automated speech recognition are indexed using default stemming and stopping methods. The categories are populated by using the category theme (or label) as a query on the collection, and assigning the retrieved items to that particular category. Run 4 of our system achieved the highest f-score in the task by maximising recall. We discuss this in terms of the primary aims of the task, i.e., automating video classification.
KeywordsClassification Information Retrieval Automatic Speech Recognition
Unable to display preview. Download preview PDF.
- 1.Apache Software Foundation. Lucene: Java-based Indexing and Searching technology, http://lucene.apache.org/
- 2.Larson, M., Newman, E., Jones, G.J.F.: Overview of VideoCLEF2008: Automatic Generation of Topic-based Feeds for Dual Language Audio-Visual Content. In: Peters, C., et al. (eds.) CLEF 2008. LNCS, vol. 5706, pp. 906–917. Springer, Heidelberg (2009)Google Scholar
- 3.Porter, M.: An algorithm for suffix stripping. Program (July 1980)Google Scholar