Multimodal Technologies for Perception of Humans

International Evaluation Workshops CLEAR 2007 and RT 2007, Baltimore, MD, USA, May 8-11, 2007, Revised Selected Papers

  • Editors
  • Rainer Stiefelhagen
  • Rachel Bowers
  • Jonathan Fiscus
Conference proceedings RT 2007, CLEAR 2007

Part of the Lecture Notes in Computer Science book series (LNCS, volume 4625)

Table of contents

  1. Front Matter
  2. CLEAR 2007

    1. Front Matter
      Pages 1-1
    2. Rainer Stiefelhagen, Keni Bernardin, Rachel Bowers, R. Travis Rose, Martial Michel, John Garofolo
      Pages 3-34
    3. 3D Person Tracking

      1. Nikos Katsarakis, Fotios Talantzis, Aristodemos Pnevmatikakis, Lazaros Polymenakos
        Pages 35-46
      2. Alessio Brutti
        Pages 47-56
      3. Oswald Lanz, Paul Chippendale, Roberto Brunelli
        Pages 57-69
      4. Keni Bernardin, Tobias Gehrig, Rainer Stiefelhagen
        Pages 70-81
      5. C. Segura, A. Abad, J. Hernando, C. Nadeu
        Pages 82-90
      6. C. Canton-Ferrer, J. Salvador, J. R. Casas, M. Pardàs
        Pages 91-103
      7. Teemu Korhonen, Pasi Pertilä
        Pages 104-112
    4. 2D Face Detection and Tracking

      1. Andreas Stergiou, Ghassan Karame, Aristodemos Pnevmatikakis, Lazaros Polymenakos
        Pages 113-125
      2. Michael C. Nechyba, Louis Brandy, Henry Schneiderman
        Pages 126-137
      3. Yuan Li, Chang Huang, Haizhou Ai
        Pages 138-147
    5. Person and Vehicle Tracking on Surveillance Data

      1. Andreas Stergiou, Aristodemos Pnevmatikakis, Lazaros Polymenakos
        Pages 148-159
      2. Murtaza Taj, Emilio Maggio, Andrea Cavallaro
        Pages 160-173
      3. Andrew Miller, Arslan Basharat, Brandyn White, Jingen Liu, Mubarak Shah
        Pages 174-178
      4. Son Tran, Zhe Lin, David Harwood, Larry Davis
        Pages 179-190
      5. B. Wu, V. K. Singh, C. -H. Kuo, L. Zhang, S. C. Lee, R. Nevatia
        Pages 191-196
      6. Sung Chun Lee, Ram Nevatia
        Pages 197-202
    6. Vehicle and Person Tracking Aerial Videos

      1. Jiangjian Xiao, Changjiang Yang, Feng Han, Hui Cheng
        Pages 203-214
      2. Andrew Miller, Pavel Babenko, Min Hu, Mubarak Shah
        Pages 215-220
    7. Person Identification

      1. Andreas Stergiou, Aristodemos Pnevmatikakis, Lazaros Polymenakos
        Pages 221-232
      2. Claude Barras, Xuan Zhu, Cheung-Chi Leung, Jean-Luc Gauvain, Lori Lamel
        Pages 233-239
      3. Ming Liu, Yanxiang Chen, Xi Zhou, Xiaodan Zhuang, Mark Hasegawa-Johnson, Thomas Huang
        Pages 248-255
      4. Hazım Kemal Ekenel, Qin Jin, Mika Fischer, Rainer Stiefelhagen
        Pages 256-265
    8. Head Pose Estimation

      1. Shuicheng Yan, Zhenqiu Zhang, Yun Fu, Yuxiao Hu, Jilin Tu, Thomas Huang
        Pages 297-306
      2. C. Canton-Ferrer, J. R. Casas, M. Pardàs
        Pages 317-327
    9. Acoustic Event Detection

      1. C. Boukis, L. C. Polymenakos
        Pages 328-337
      2. Christian Zieger
        Pages 338-344
      3. Xi Zhou, Xiaodan Zhuang, Ming Liu, Hao Tang, Mark Hasegawa-Johnson, Thomas Huang
        Pages 345-353
      4. Andrey Temko, Climent Nadeu, Joan-Isaac Biel
        Pages 354-363
      5. Toni Heittola, Anssi Klapuri
        Pages 364-370
  3. RT 2007

    1. Front Matter
      Pages 371-371
    2. Jonathan G. Fiscus, Jerome Ajot, John S. Garofolo
      Pages 373-389
    3. Susanne Burger
      Pages 390-400
    4. Meghan Lammie Glenn, Stephanie Strassel
      Pages 401-413
    5. Speech-to-Text

      1. Thomas Hain, Lukas Burget, John Dines, Giulia Garau, Martin Karafiat, David van Leeuwen et al.
        Pages 414-428
      2. Jing Huang, Etienne Marcheret, Karthik Visweswariah, Vit Libal, Gerasimos Potamianos
        Pages 429-441
      3. L. Lamel, E. Bilinski, J. L. Gauvain, G. Adda, C. Barras, X. Zhu
        Pages 442-449
      4. Andreas Stolcke, Xavier Anguera, Kofi Boakye, Özgür Çetin, Adam Janin, Mathew Magimai-Doss et al.
        Pages 450-463
      5. Matthias Wölfel, Sebastian Stüker, Florian Kraft
        Pages 464-474
    6. Speaker Diarization

      1. David A. van Leeuwen, Matej Konečný
        Pages 475-483
      2. Eugene Chin Wei Koh, Hanwu Sun, Tin Lay Nwe, Trung Hieu Nguyen, Bin Ma, Eng-Siong Chng et al.
        Pages 484-496
      3. Jing Huang, Etienne Marcheret, Karthik Visweswariah, Gerasimos Potamianos
        Pages 497-508
      4. Chuck Wooters, Marijn Huijbregts
        Pages 509-519

About these proceedings


This book constitutes the thoroughly refereed joint post-workshop proceedings of two co-located events: the Second International Workshop on Classification of Events, Activities and Relationships, CLEAR 2007, and the 5th Rich Transcription 2007 Meeting Recognition evaluation, RT 2007, held in succession in Baltimore, MD, USA, in May 2007.

The workshops had complementary evaluation efforts; CLEAR for the evaluation of human activities, events, and relationships in multiple multimodal data domains; and RT for the evaluation of speech transcription-related technologies from meeting room audio collections. The 35 revised full papers presented from CLEAR 2007 cover 3D person tracking, 2D face detection and tracking, person and vehicle tracking on surveillance data, vehicle and person tracking aerial videos, person identification, head pose estimation, and acoustic event detection. The 15 revised full papers presented from RT 2007 are organized in topical sections on speech-to-text, and speaker diarization.


2D/3D vision acoustic scene analysis classification computer vision face authentication face recognition face tracking gesture recognition human tracking kernel methods motion analysis motion estimation multimodal biometrics object tracking tar

Bibliographic information

  • DOI
  • Copyright Information Springer-Verlag Berlin Heidelberg 2008
  • Publisher Name Springer, Berlin, Heidelberg
  • eBook Packages Computer Science
  • Print ISBN 978-3-540-68584-5
  • Online ISBN 978-3-540-68585-2
  • Series Print ISSN 0302-9743
  • Series Online ISSN 1611-3349
  • Buy this book on publisher's site