Skip to main content
  • Conference proceedings
  • © 2005

Machine Learning for Multimodal Interaction

First International Workshop, MLMI 2004, Martigny, Switzerland, June 21-23, 2004, Revised Selected Papers

Conference proceedings info: MLMI 2004.

Buying options

eBook USD 39.99
Price excludes VAT (Canada)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 54.99
Price excludes VAT (Canada)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

This is a preview of subscription content, access via your institution.

Table of contents (30 papers)

  1. Front Matter

  2. MLMI 2004

    1. HCI and Applications

      1. Browsing Recorded Meetings with Ferret
        • Pierre Wellner, Mike Flynn, Maël Guillemot
        Pages 12-21
      2. Meeting Modelling in the Context of Multimodal Research
        • Dennis Reidsma, Rutger Rienks, Natas̃a Jovanović
        Pages 22-35
      3. Artificial Companions
        • Yorick Wilks
        Pages 36-45
    2. Structuring and Interaction

      1. Towards Computer Understanding of Human Interactions
        • Iain McCowan, Daniel Gatica-Perez, Samy Bengio, Darren Moore, Hervé Bourlard
        Pages 56-75
      2. Multistream Dynamic Bayesian Network for Meeting Segmentation
        • Alfred Dielmann, Steve Renals
        Pages 76-86
      3. Using Static Documents as Structured and Thematic Interfaces to Multimedia Meeting Archives
        • Denis Lalanne, Rolf Ingold, Didier von Rotz, Ardhendu Behera, Dalila Mekhaldi, Andrei Popescu-Belis
        Pages 87-100
      4. An Integrated Framework for the Management of Video Collection
        • Nicolas Moënne-Loccoz, Bruno Janvier, Stéphane Marchand-Maillet, Eric Bruno
        Pages 101-110
    3. Multimodal Processing

      1. Mapping from Speech to Images Using Continuous State Space Models
        • Tue Lehn-Schiøler, Lars Kai Hansen, Jan Larsen
        Pages 136-145
      2. An Online Algorithm for Hierarchical Phoneme Classification
        • Ofer Dekel, Joseph Keshet, Yoram Singer
        Pages 146-158
      3. Mixture of SVMs for Face Class Modeling
        • Julien Meynet, Vlad Popovici, Jean-Philippe Thiran
        Pages 173-181
      4. AV16.3: An Audio-Visual Corpus for Speaker Localization and Tracking
        • Guillaume Lathoud, Jean-Marc Odobez, Daniel Gatica-Perez
        Pages 182-195
    4. Speech Processing

      1. The 2004 ICSI-SRI-UW Meeting Recognition System
        • Chuck Wooters, Nikki Mirghafori, Andreas Stolcke, Tuomo Pirinen, Ivan Bulyko, Dave Gelbart et al.
        Pages 196-208
      2. On the Adequacy of Baseform Pronunciations and Pronunciation Variants
        • Mathew Magimai-Doss, Hervé Bourlard
        Pages 209-222
      3. Tandem Connectionist Feature Extraction for Conversational Speech Recognition
        • Qifeng Zhu, Barry Chen, Nelson Morgan, Andreas Stolcke
        Pages 223-231

Other Volumes

  1. Machine Learning for Multimodal Interaction

Keywords

  • Augmented Reality
  • Multimedia
  • classification
  • cognition
  • emotion analysis
  • face recognition
  • human-computer interaction
  • human-computer interaction (HCI)
  • intelligent user interfaces
  • learning
  • machine learning
  • multimodal meetings
  • neural learning
  • speech processing
  • speech recognition

Editors and Affiliations

  • IDIAP Research Institute, Martigny, Switzerland

    Samy Bengio, Hervé Bourlard

Bibliographic Information

Buying options

eBook USD 39.99
Price excludes VAT (Canada)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 54.99
Price excludes VAT (Canada)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions