Abstract
Detecting, locating, and tracking people in a dynamic environment is important in many applications, ranging from security and environmental surveillance to assistance to people in domestic environments, to the analysis of human activities. To this end, several methods for tracking people have been developed in the field of Computer Vision using different settings, such as monocular cameras, stereo sensors, multiple cameras.
In this article we describe a method for People Localization and Tracking (PLT) based on a calibrated fixed stereo vision sensor, its implementation and experimental results. The system analyzes three components of the stereo data (the left intensity image, the disparity image, and the 3-D world locations of measured points) to dynamically update a model of the background; extract foreground objects, such as people and rearranged furniture; track their positions in the world.
The system is mostly suitable for indoor medium size environments. It can reliably detect and track people moving in an medium size area (a room or a corridor) in front of the sensor with high reliability and good precision.
Similar content being viewed by others
References
Bahadori S, Cesta A, Iocchi L, Leone GR, Nardi D, Pecora F, Rasconi R, Scozzafava L (2004) Towards ambient intelligence for the domestic care of the elderly. In: Remagnino P, Foresti GL, Ellis T (eds), Ambient intelligence: a novel paradigm. Springer
Bahadori S, Grisetti G, Iocchi L, Leone GR, Nardi D (2005) Real-time tracking of multiple people through stereo vision. In: Proc. of IEE international workshop on intelligent environments
Beymer D, Konolige K (1999) Real-time tracking of multiple people using stereo. In: Proc. of IEEE frame rate workshop
Brown MZ, Burschka D, Hager GD (2003) Advances in computational stereo. PAMI 25(8):993–1008
Conte D, Foggia P, Petretta M, Tufano F, Vento M (2005) Evaluation and improvements of a real-time background subtraction method. In: Proc. of international conference on image analysis and recognition (ICIAR)
Cucchiara R, Grana C, Piccardi M, Prati A (2003) Detecting moving objects, ghosts, and shadows in video streams. IEEE Trans Pattern Anal Mach Intell 25(10):1337–1342
Cucchiara R, Grana C, Piccardi M, Prati A, Sirotti S (2001) Improving shadow suppression in moving object detection with hsv color information. In: Proceeding of IEEE conf. on intelligent transportation systems, 2001, pp 334–339
Cucchiara R, Grana C, Tardini G, Vezzani R (2004) Probabilistic people tracking for occlusion handling. In: Proc. of 17th int. conf. on pattern recognition (ICPR’04)
Darrell T, Demirdjian D, Checka N, Felzenszwalb PF (2001) Plan-view trajectory estimation with dense stereo background models. In: Proc. of 8th int. conf. on computer vision (ICCV’01), pp 628–635
Darrell T, Gordon G, Harville M, Woodfill J (2000) Integrated person tracking using stereo, color, and pattern detection. Int J Comput Vis 37(2):175–185
Elgammal AM, Harwood D, Davis LS (2000) Non-parametric model for background subtraction. In: Proc. of the 6th European Conference on Computer (ECCV). Springer-Verlag, London, UK, pp 751–767
Focken D, Stiefelhagen R (2002) Towards vision-based 3-d people tracking in a smart room. In: Proc. 4th IEEE int. conf. on multimodal interfaces (ICMI’02)
Gupte S, Masoud O, Martin RFK, Papanikolopoulos NP (2002) Detection and classification of vehicles. IEEE Trans Intell Transp Sys 3(1):37–47
Haritaoglu I, Harwood D, Davis LS (1998) W4S: A real-time system detecting and tracking people in 2 1/2D. In: Proceedings of the 5th european conference on computer vision. Springer-Verlag, pp 877–892
Haritaoglu I, Harwood D, Davis LS (2000) An appearance-based body model for multiple people tracking. In: Proc. of 15th int. conf. on pattern recognition (ICPR’00)
Haritaoglu I, Harwood D, Davis LS (2000) W4: Real-time surveillance of people and their activities. IEEE Trans Pattern Anal Mach Intell 22(8)
Harville M, Gordon G, Woodfill J (2001) Foreground segmentation using adaptive mixture models in color and depth. In: Proc. of IEEE workshop on detection and recognition of events in video, pp 3–11
Iocchi L, Bolles RC (2005) Integrating plan-view tracking and color-based person models for multiple people tracking. In: Proc. of IEEE international conference on image processing (ICIP’05)
Jabri S, Duric Z, Wechsler H, Rosenfeld A (2000) Detection and location of people in video images using adaptive fusion of color and edge information. In: Proc. of 15th international conference on pattern recognition (ICPR’00), 4:4627
Kang J, Cohen I, Medioni G (2004) Object reacquisition using invariant appearance model. In: Proc. of 17th int. conf. on pattern recognition (ICPR’04)
Konolige K (1997) Small vision systems: hardware and implementation. In: Proc. of 8th international symposium on robotics research
Krumm J, Harris S, Meyers B, Brumitt B, Hale M, Shafer S (2000) Multi-camera multi-person tracking for easyliving. In: Proc. of int. workshop on visual surveillance
Lenz R, Tsai R (1988) Techniques for calibration of the scale factor and image center for high accuracy 3-d machine vision metrology. IEEE Trans Pattern Anal Mach Intell 10(5)
Li J, Chua CS, Ho YK (2002) Color based multiple people tracking. In: Proc. of 7th int. conf. on control, automation, robotics and vision
Lipton AJ, Haering N (2002) Commode: an algorithm for video background modeling and object segmentation. In: 7th International Conference on Control, Automation, Robotics and Vision, 2002. ICARCV 2002, vol(3), pp 1603–1608
Matthews KE, Namazi NM (1998) A bayes decision test for detecting uncovered-background and moving pixels in image sequences. IEEE Trans Image Proc 7:720–728
Mittal A, Davis LS (2002) M2Tracker: A multi-view approach to segmenting and tracking people in a cluttered scene using region-based stereo. In: Proc. of the 7th european conf. on computer vision (ECCV’02). Springer-Verlag, pp 18–36
Nadimi S, Bhanu B (2004) Physical models for moving shadow and object detection in video. IEEE Trans Pattern Anal Mach Intell 26:1079–1087
Ohta N (2001) A statistical approach to background subtraction for surveillance systems. In Proc. of eighth IEEE international conference on computer vision, vol(2), pp 481–486
Robocare project. http://robocare.istc.cnr.it
Roh K, Kang S, Lee SW (2000) Multiple people tracking using an appearance model based on temporal color. In: Proc. of 15th int. conf. on pattern recognition (ICPR’00)
Seki M, Fujiwara H, Sumi K (2000) A robust background subtraction method for changing background. In: Proc. of fifth IEEE workshop on applications of computer vision, pp 207–213
Senior AW (2002) Tracking with probabilistic appearance models. In: Proc, of ECCV workshop on performance evaluation of tracking and surveillance systems (PETS’02), pp 48–55
Stauffer C, Grimson WEL (2000) Learning patterns of activity using real-time tracking. IEEE Trans Pattern Anal Mach Intell 22(8):747–757
Tai J-C, Song K-T (2004) Background segmentation and its application to traffic monitoring using modified histogram. IEEE Int Conf Netw, Sensing Control 1:13–18
Wren CR, Azarbayejani A, Darrell T, Pentland A (1997) Pfinder: real-time tracking of the human body. IEEE Trans Pattern Anal Mach Intell 19(7):780–785
Yang MT, Shih YC, Wang SC (2004) People tracking by integrating multiple features. In: Proc. of 17th int. conf. on pattern recognition (ICPR’04), pp 929–932
Zhao T, Nevatia R (2004) Tracking multiple humans in crowded environment. In: IEEE conf. on computer vision and pattern recognition (CVPR’04)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Bahadori, S., Iocchi, L., Leone, G.R. et al. Real-time people localization and tracking through fixed stereo vision. Appl Intell 26, 83–97 (2007). https://doi.org/10.1007/s10489-006-0013-3
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-006-0013-3