Abstract
At the Universitat Politècnica de Catalunya (UPC), a Smart Room has been equipped with 85 microphones and 8 cameras. This paper describes the sensor setup, gives an overview of the underlying hardware and software infrastructure, and indicates possibilities for high- and low-level multi-modal interaction. One use of the information collected from the distributed sensor network is explained in detail: the system supports a group of students who have to solve a problem related to a lab assignment.
Keywords
- Linear Discriminant Analysis
- Gaussian Mixture Model
- Gesture Recognition
- Automatic Speech Recognition
- Acoustic Event
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
Copyright information
© 2006 International Federation for Information Processing
About this paper
Cite this paper
Neumann, J., Casas, J.R., Macho, D., Hidalgo, J.R. (2006). Multimodal Integration of Sensor Network. In: Maglogiannis, I., Karpouzis, K., Bramer, M. (eds) Artificial Intelligence Applications and Innovations. AIAI 2006. IFIP International Federation for Information Processing, vol 204. Springer, Boston, MA. https://doi.org/10.1007/0-387-34224-9_36
DOI: https://doi.org/10.1007/0-387-34224-9_36
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-34223-8
Online ISBN: 978-0-387-34224-5
eBook Packages: Computer Science, Computer Science (R0)