Object Detection, Classification, and Tracking

Rezaei, Mahdi; Klette, Reinhard

doi:10.1007/978-3-319-50551-0_4

Mahdi Rezaei¹⁴ &
Reinhard Klette¹⁵

Part of the book series: Computational Imaging and Vision ((CIVI,volume 45))

1989 Accesses
2 Citations

Abstract

In this chapter we outline object detection and object recognition techniques which are of relevance for the remainder of the book. We focus on supervised and unsupervised learning approaches. The chapter provides technical details for each method, discussions on the strengths and weaknesses of each method, and gives examples and various applications for each method. Material is provided to support a decision for an appropriate object detection technique for computer vision applications, including driver-assistance systems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Notes

1.
Named after the Hungarian mathematician Alfréd Haar (1885–1933).
2.
A region of interest is commonly a rectangular sub-image, say of size k × l, with \(\;k \ll N_{rows}\;\) and \(\;l \ll N_{cols}\;\), in which a weak classifier searches for an object.
3.
The sliding window, which defines the ROI, is a moving window starting from the top-left corner of an image, which moves over the image, from left to right, and top to bottom, in order to find feature matches in the query image. The sliding window starts with a small size (e.g. k = l = 20), and its size increases in each search iteration (up to the Max size of N _cols × N _rows). The aim is to find feature matches for different window sizes, so ultimately detect any existing object, in any size, that falls in the region of a sliding window (or ROI).
4.
Different to a shadow, which is a silhouette cast by an object that blocks the source of either an indoor (e.g. candle) or outdoor light (e.g. sun), a shade is “darkness” that only applies to outdoor applications such as the shade underneath a tree, or the shade underneath a car. Of course, a car itself can also have a shadow. In our application, there are many challenges for driver’s face or vehicle detection, due to existing outdoor shades.

Bibliography

C.M. Bishop, Pattern Recognition and Machine Learning (Springer, New York, 2006)
MATH Google Scholar
G.R. Bradski, Computer vision face tracking for use in a perceptual user interface. Intel Technology J. 2nd Quarter (1998)
Google Scholar
G. Bradski, A. Kaehler, Learning OpenCV (O’Reilly Media, Beijing, 2008)
Google Scholar
R.G. Brown, P.Y.C. Hwang, Introduction to Random Signals and Applied Kalman Filtering: With MATLAB Exercises and Solutions, 2nd edn. (Wiley Publishing, New York, 1991)
MATH Google Scholar
Y. Cheng, Mean shift, mode seeking, and clustering. IEEE Trans. Pattern Anal. Mach. Intell. 17, 790–799 (1998)
Article Google Scholar
D. Comaniciu, V. Ramesh, P. Meer, Real-time tracking of non-rigid objects using mean shift, in Proceedings of the IEEE Computer Vision Pattern Recognition (2000), pp. 673–678
Google Scholar
C. Cortes, V. Vapnik, Support-vector networks. Mach. learn. 20, 273–297 (1995)
MATH Google Scholar
N. Dalal, B. Triggs, Histograms of oriented gradients for human detection, in Proceedings of the IEEE Computer Vision Pattern Recognition (2005), pp. 886–893
Google Scholar
A.P. Dempster, N.M. Laird, D.B. Rubin, Maximum likelihood from incomplete data via the EM algorithm. J. R. Stat. Soc. 39, 1–38 (1997)
MathSciNet MATH Google Scholar
E.W. Forgy, Cluster analysis of multivariate data: efficiency versus interpretability of classifications. Biometrics 21, 768–769 (1965)
Google Scholar
Y. Freund, R.E. Schapire, A decision-theoretic generalization of on-line learning and an application to boosting, in Proceedings of the European Conference on Computational Learning Theory (1995), pp. 23–37
Google Scholar
INRIA Person Data Set, lear.inrialpes.fr/data (2005)
Google Scholar
O.L.R. Jacobs, Introduction to Control Theory, 2nd edn. (Oxford University Press, Oxford/New York, 1993)
MATH Google Scholar
R. Jain, K. Rangachar, G.S. Brian, Machine Vision (McGraw-Hill, New York, 1995)
Google Scholar
R.E. Kalman, A new approach to linear filtering and prediction problems. J. Basic Eng. 82, 35–45 (1960)
Article Google Scholar
L.W. Kheng, Mean Shift Tracking. Department of Computer Science, National University of Singapore (2012), pp. 1–28
Google Scholar
R. Klette, Concise Computer Vision: An Introduction into Theory and Algorithms (Springer, London, 2014)
Book MATH Google Scholar
S.P. Lloyd, Least square quantization in PCM. Bell Telephone Laboratories Paper (1982)
Google Scholar
B.S. Lucas, T. Kanade, An iterative image registration technique with an application to stereo vision, in Proceedings of International Joint Conference on Artificial Intelligence, vol. 2 (1981), pp. 674–679
Google Scholar
G.J. McLachlan, T. Krishnan, The EM algorithm and its extensions. J. Am. Stat. Assoc. 93, 403–405 (1997)
MATH Google Scholar
C. Papageorgiou, T. Poggio, A trainable system for object detection. Int. J. Comput. Vis. 38, 15–33 (2000)
Article MATH Google Scholar
J. Shi, C. Tomasi, Good features to track, in Proceedings of the IEEE Computer Vision Pattern Recognition (1994), pp. 593–600
Google Scholar
K. Teknomo, k-means clustering tutorials. Medicine 100, 1–12 (2005)
Google Scholar
C. Tomasi, T. Kanade, Detection and tracking of point features. Technical report CMU-CS-91-132. Int. J. Comput. Vis. (1991)
Google Scholar
P. Viola, M. Jones, Rapid object detection using a boosted cascade of simple features, in Proceedings of the IEEE Computer Vision Pattern Recognition, vol. 1 (2001), pp. 511–518
Google Scholar
Wikipedia, k-means clustering (2016), en.wikipedia.org/wiki/K-means_clustering
O.R. Zaiane, Principals of knowledge discovery in data. University of Alberta (2016)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Engineering, Qazvin Islamic Azad University, Qazvin, Iran
Mahdi Rezaei
Department of Electrical and Electronic Engineering, Auckland University of Technology, Auckland, New Zealand
Reinhard Klette

Authors

Mahdi Rezaei
View author publications
You can also search for this author in PubMed Google Scholar
Reinhard Klette
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Rezaei, M., Klette, R. (2017). Object Detection, Classification, and Tracking. In: Computer Vision for Driver Assistance. Computational Imaging and Vision, vol 45. Springer, Cham. https://doi.org/10.1007/978-3-319-50551-0_4

Download citation

DOI: https://doi.org/10.1007/978-3-319-50551-0_4
Published: 07 February 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-50549-7
Online ISBN: 978-3-319-50551-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics