The ISL RT-06S Speech-to-Text System

  • Christian Fügen
  • Shajith Ikbal
  • Florian Kraft
  • Kenichi Kumatani
  • Kornel Laskowski
  • John W. McDonough
  • Mari Ostendorf
  • Sebastian Stüker
  • Matthias Wölfel
Conference paper

DOI: 10.1007/11965152_36

Part of the Lecture Notes in Computer Science book series (LNCS, volume 4299)
Cite this paper as:
Fügen C. et al. (2006) The ISL RT-06S Speech-to-Text System. In: Renals S., Bengio S., Fiscus J.G. (eds) Machine Learning for Multimodal Interaction. MLMI 2006. Lecture Notes in Computer Science, vol 4299. Springer, Berlin, Heidelberg

Abstract

This paper describes the 2006 lecture and conference meeting speech-to-text system developed at the Interactive Systems Laboratories (ISL), for the individual head-mounted microphone (IHM), single distant microphone (SDM), and multiple distant microphone (MDM) conditions, which was evaluated in the RT-06S Rich Transcription Meeting Evaluation sponsored by the US National Institute of Standards and Technologies (NIST). We describe the principal differences between our current system and those submitted in previous years, namely improved acoustic and language models, cross adaptation between systems with different front-ends and phoneme sets, and the use of various automatic speech segmentation algorithms.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Christian Fügen
    • 1
  • Shajith Ikbal
    • 1
  • Florian Kraft
    • 1
  • Kenichi Kumatani
    • 1
  • Kornel Laskowski
    • 1
  • John W. McDonough
    • 1
  • Mari Ostendorf
    • 1
    • 2
  • Sebastian Stüker
    • 1
  • Matthias Wölfel
    • 1
  1. 1.Interactive Systems LaboratoriesUniversität Karlsruhe (TH)KarlsruheGermany
  2. 2.Dept. of Electrical EngineeringUniversity of WashingtonSeattleUSA

Personalised recommendations