Annotation of Heterogeneous Multimedia Content Using Automatic Speech Recognition

  • Marijn Huijbregts
  • Roeland Ordelman
  • Franciska de Jong
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4816)

Abstract

This paper reports on the setup and evaluation of robust speech recognition system parts, geared towards transcript generation for heterogeneous, real-life media collections. The system is deployed for generating speech transcripts for the NIST/TRECVID-2007 test collection, part of a Dutch real-life archive of news-related genres. Performance figures for this type of content are compared to figures for broadcast news test data.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2007

Authors and Affiliations

  • Marijn Huijbregts
    • 1
  • Roeland Ordelman
    • 1
  • Franciska de Jong
    • 1
  1. 1.University of Twente, Dept. of Electrical Engineering, Mathematics and Computer Science, P.O. Box 217, 7500 AE, EnschedeThe Netherlands

Personalised recommendations