Skip to main content

Object Detection, Classification, and Tracking

  • Chapter
  • First Online:
Computer Vision for Driver Assistance

Part of the book series: Computational Imaging and Vision ((CIVI,volume 45))

Abstract

In this chapter we outline object detection and object recognition techniques which are of relevance for the remainder of the book. We focus on supervised and unsupervised learning approaches. The chapter provides technical details for each method, discussions on the strengths and weaknesses of each method, and gives examples and various applications for each method. Material is provided to support a decision for an appropriate object detection technique for computer vision applications, including driver-assistance systems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Notes

  1. 1.

    Named after the Hungarian mathematician Alfréd Haar (1885–1933).

  2. 2.

    A region of interest is commonly a rectangular sub-image, say of size k × l, with \(\;k \ll N_{rows}\;\) and \(\;l \ll N_{cols}\;\), in which a weak classifier searches for an object.

  3. 3.

    The sliding window, which defines the ROI, is a moving window starting from the top-left corner of an image, which moves over the image, from left to right, and top to bottom, in order to find feature matches in the query image. The sliding window starts with a small size (e.g. k = l = 20), and its size increases in each search iteration (up to the Max size of N cols × N rows ). The aim is to find feature matches for different window sizes, so ultimately detect any existing object, in any size, that falls in the region of a sliding window (or ROI).

  4. 4.

    Different to a shadow, which is a silhouette cast by an object that blocks the source of either an indoor (e.g. candle) or outdoor light (e.g. sun), a shade is “darkness” that only applies to outdoor applications such as the shade underneath a tree, or the shade underneath a car. Of course, a car itself can also have a shadow. In our application, there are many challenges for driver’s face or vehicle detection, due to existing outdoor shades.

Bibliography

  1. C.M. Bishop, Pattern Recognition and Machine Learning (Springer, New York, 2006)

    MATH  Google Scholar 

  2. G.R. Bradski, Computer vision face tracking for use in a perceptual user interface. Intel Technology J. 2nd Quarter (1998)

    Google Scholar 

  3. G. Bradski, A. Kaehler, Learning OpenCV (O’Reilly Media, Beijing, 2008)

    Google Scholar 

  4. R.G. Brown, P.Y.C. Hwang, Introduction to Random Signals and Applied Kalman Filtering: With MATLAB Exercises and Solutions, 2nd edn. (Wiley Publishing, New York, 1991)

    MATH  Google Scholar 

  5. Y. Cheng, Mean shift, mode seeking, and clustering. IEEE Trans. Pattern Anal. Mach. Intell. 17, 790–799 (1998)

    Article  Google Scholar 

  6. D. Comaniciu, V. Ramesh, P. Meer, Real-time tracking of non-rigid objects using mean shift, in Proceedings of the IEEE Computer Vision Pattern Recognition (2000), pp. 673–678

    Google Scholar 

  7. C. Cortes, V. Vapnik, Support-vector networks. Mach. learn. 20, 273–297 (1995)

    MATH  Google Scholar 

  8. N. Dalal, B. Triggs, Histograms of oriented gradients for human detection, in Proceedings of the IEEE Computer Vision Pattern Recognition (2005), pp. 886–893

    Google Scholar 

  9. A.P. Dempster, N.M. Laird, D.B. Rubin, Maximum likelihood from incomplete data via the EM algorithm. J. R. Stat. Soc. 39, 1–38 (1997)

    MathSciNet  MATH  Google Scholar 

  10. E.W. Forgy, Cluster analysis of multivariate data: efficiency versus interpretability of classifications. Biometrics 21, 768–769 (1965)

    Google Scholar 

  11. Y. Freund, R.E. Schapire, A decision-theoretic generalization of on-line learning and an application to boosting, in Proceedings of the European Conference on Computational Learning Theory (1995), pp. 23–37

    Google Scholar 

  12. INRIA Person Data Set, lear.inrialpes.fr/data (2005)

    Google Scholar 

  13. O.L.R. Jacobs, Introduction to Control Theory, 2nd edn. (Oxford University Press, Oxford/New York, 1993)

    MATH  Google Scholar 

  14. R. Jain, K. Rangachar, G.S. Brian, Machine Vision (McGraw-Hill, New York, 1995)

    Google Scholar 

  15. R.E. Kalman, A new approach to linear filtering and prediction problems. J. Basic Eng. 82, 35–45 (1960)

    Article  Google Scholar 

  16. L.W. Kheng, Mean Shift Tracking. Department of Computer Science, National University of Singapore (2012), pp. 1–28

    Google Scholar 

  17. R. Klette, Concise Computer Vision: An Introduction into Theory and Algorithms (Springer, London, 2014)

    Book  MATH  Google Scholar 

  18. S.P. Lloyd, Least square quantization in PCM. Bell Telephone Laboratories Paper (1982)

    Google Scholar 

  19. B.S. Lucas, T. Kanade, An iterative image registration technique with an application to stereo vision, in Proceedings of International Joint Conference on Artificial Intelligence, vol. 2 (1981), pp. 674–679

    Google Scholar 

  20. G.J. McLachlan, T. Krishnan, The EM algorithm and its extensions. J. Am. Stat. Assoc. 93, 403–405 (1997)

    MATH  Google Scholar 

  21. C. Papageorgiou, T. Poggio, A trainable system for object detection. Int. J. Comput. Vis. 38, 15–33 (2000)

    Article  MATH  Google Scholar 

  22. J. Shi, C. Tomasi, Good features to track, in Proceedings of the IEEE Computer Vision Pattern Recognition (1994), pp. 593–600

    Google Scholar 

  23. K. Teknomo, k-means clustering tutorials. Medicine 100, 1–12 (2005)

    Google Scholar 

  24. C. Tomasi, T. Kanade, Detection and tracking of point features. Technical report CMU-CS-91-132. Int. J. Comput. Vis. (1991)

    Google Scholar 

  25. P. Viola, M. Jones, Rapid object detection using a boosted cascade of simple features, in Proceedings of the IEEE Computer Vision Pattern Recognition, vol. 1 (2001), pp. 511–518

    Google Scholar 

  26. Wikipedia, k-means clustering (2016), en.wikipedia.org/wiki/K-means_clustering

  27. O.R. Zaiane, Principals of knowledge discovery in data. University of Alberta (2016)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this chapter

Cite this chapter

Rezaei, M., Klette, R. (2017). Object Detection, Classification, and Tracking. In: Computer Vision for Driver Assistance. Computational Imaging and Vision, vol 45. Springer, Cham. https://doi.org/10.1007/978-3-319-50551-0_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-50551-0_4

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-50549-7

  • Online ISBN: 978-3-319-50551-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics