
Multicamera human detection and tracking supporting natural interaction with large-scale displays

  • Original Paper, Machine Vision and Applications

Abstract

This paper presents a computer vision system that supports non-instrumented, location-based interaction of multiple users with digital representations of large-scale artifacts. The proposed system is based on a camera network that observes multiple humans in front of a very large display. The acquired views are used to volumetrically reconstruct and track the humans robustly and in real time, even in crowded scenes and challenging human configurations. Given the frequent and accurate monitoring of humans in space and time, a dynamic and personalized textual/graphical annotation of the display can be achieved based on the location and the walk-through trajectory of each visitor. The proposed system has been successfully deployed in an archaeological museum, offering its visitors the capability to interact with and explore a digital representation of an ancient wall painting. This installation permits an extensive evaluation of the proposed system in terms of tracking robustness, computational performance and usability. Furthermore, it proves that computer vision technology can be effectively used to support non-instrumented interaction of humans with their environments in realistic settings.
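
The volumetric reconstruction mentioned above is typically obtained by testing voxel centers against the foreground silhouettes seen by each camera, a visual-hull-style occupancy test. The sketch below illustrates this idea only; it is not the authors' implementation, and the function name, the projection matrices, the binary mask format and the `min_views` voting threshold are illustrative assumptions.

```python
"""Minimal sketch of silhouette-based voxel occupancy (visual hull).
Illustrative only: camera models and the voting rule are assumptions,
not the system described in the paper."""
import numpy as np

def voxel_occupancy(voxels, projections, silhouettes, min_views=4):
    """Mark a voxel occupied if it projects inside the foreground
    silhouette of at least `min_views` cameras.

    voxels      : (N, 3) voxel centers in world coordinates
    projections : list of (3, 4) camera projection matrices P = K [R|t]
    silhouettes : list of (H, W) binary foreground masks, one per camera
    """
    votes = np.zeros(len(voxels), dtype=int)
    hom = np.hstack([voxels, np.ones((len(voxels), 1))])  # homogeneous coords

    for P, mask in zip(projections, silhouettes):
        uvw = hom @ P.T                      # project voxel centers to image
        z = uvw[:, 2]
        valid = z > 1e-6                     # keep points in front of the camera
        u = np.zeros(len(voxels), dtype=int)
        v = np.zeros(len(voxels), dtype=int)
        u[valid] = np.round(uvw[valid, 0] / z[valid]).astype(int)
        v[valid] = np.round(uvw[valid, 1] / z[valid]).astype(int)
        h, w = mask.shape
        inside = valid & (u >= 0) & (u < w) & (v >= 0) & (v < h)
        hit = np.zeros(len(voxels), dtype=bool)
        hit[inside] = mask[v[inside], u[inside]] > 0   # foreground test
        votes += hit

    return votes >= min_views                # boolean occupancy per voxel
```

In a full pipeline of this kind, the occupied voxels would then be clustered into per-person blobs whose centroids are tracked over time and mapped to floor positions in front of the display, which is what enables the location-based annotation described in the abstract.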



Author information

Corresponding author: Xenophon Zabulis.

Cite this article

Zabulis, X., Grammenos, D., Sarmis, T. et al. Multicamera human detection and tracking supporting natural interaction with large-scale displays. Machine Vision and Applications 24, 319–336 (2013). https://doi.org/10.1007/s00138-012-0408-6
