TUT Acoustic Source Tracking System 2007
This paper is a documentation of the acoustic person tracking system developed by TUT. The system performance was evaluated in the CLEAR 2007 evaluation. The proposed system is designed to track a speaker position in a meeting room domain using only audio data. Audio data provided for the evaluation consists of recordings from multiple microphone arrays. The meeting rooms are equipped with three to seven arrays.
Speaker localization is performed by mapping pairwise cross-correlations of microphone signals into a three dimensional likelihood field. The resulting likelihood is used as source evidence for a particle filtering algorithm. A point estimate for the speaker position for each time frame is derived from the resulting sequential process. Results indicate an 85% success rate of localization with 15 cm average precision.
Unable to display preview. Download preview PDF.
- 1.Bernardin, K.: Clear 2007 evaluation plan v.1.0 (2007), http://isl.ira.uka.de/clear07/downloads/?download=CLEAR07-3DPT-2007-03-09.pdf
- 2.Mostefa, D., Moreau, N., Choukri, K., Potamianos, G., Chu, S.M., Tyagi, A., Casas, J.R., Turmo, J., Christoforetti, L., Tobia, F., Pnevmatikakis, A., Mylonakis, V., Talantzis, F., Burger, S., Stiefelhagen, R., Bernardin, K., Rochet, C.: The CHIL audiovisual corpus for lecture and meeting analysis inside smart rooms (accepted for publication, Kluwer Academic publishers). Journal of Language Resources and Evaluation (2007)Google Scholar
- 5.DiBiase, J., Silverman, H., Brandstein, M.: Microphone Arrays, ch. 8. Springer, Heidelberg (2001)Google Scholar