Skip to main content

Part of the book series: Advances in Computer Vision and Pattern Recognition ((ACVPR))

Abstract

We present a method for the detection of instances of an object class, such as cars or pedestrians, in natural images. Similarly to some previous work, this is accomplished via the generalized Hough transform, where the detections of individual object parts cast probabilistic votes for possible locations of the centroid of the whole object; the detection hypotheses then correspond to the maxima of the Hough image that accumulates the votes from all parts. However, whereas previous methods detect object parts using generative codebooks of part appearances, we take a more discriminative approach to object part detection. Towards this end, we train a class-specific Hough forest, which is a decision forest that directly maps the image patch appearance to the probabilistic vote about the possible location of the object centroid. We demonstrate that Hough forests improve the results of the Hough-transform object detection significantly and achieve state-of-the-art performance for several classes and datasets. Parts of this chapter are reprinted, with permission, from Gall and Lempitsky, Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVRP) (2009), © 2012 IEEE.

This chapter is based on the CVPR’09 conference paper [118].

Parts of this chapter are reprinted, with permission, from [118], © 2012 IEEE.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Agarwal S, Awan A, Roth D (2004) Learning to detect objects in images via a sparse, part-based representation. IEEE Trans Pattern Anal Mach Intell 26(11)

    Google Scholar 

  2. Amit Y, Geman D, Wilder K (1997) Joint induction of shape features and tree classifiers. IEEE Trans Pattern Anal Mach Intell 19

    Google Scholar 

  3. Andriluka M, Roth S, Schiele B (2008) People-tracking-by-detection and people-detection-by-tracking. In: Proc IEEE conf computer vision and pattern recognition (CVPR)

    Google Scholar 

  4. Ballard DH (1981) Generalizing the Hough transform to detect arbitrary shapes. Pattern Recognit 13(2)

    Google Scholar 

  5. Barinova O, Lempitsky VS, Kohli P (2010) On detection of multiple object instances using Hough transforms. In: Proc IEEE conf computer vision and pattern recognition (CVPR)

    Google Scholar 

  6. Barinova O, Lempitsky VS, Kohli P (2012) On detection of multiple object instances using Hough transforms. IEEE Trans Pattern Anal Mach Intell

    Google Scholar 

  7. Borenstein E, Ullman S (2002) Class-specific, top-down segmentation. In: Proc European conf on computer vision (ECCV). LNCS, vol 2351. Springer, Berlin

    Google Scholar 

  8. Breiman L (2001) Random forests. Mach Learn 45(1)

    Google Scholar 

  9. Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Proc IEEE conf computer vision and pattern recognition (CVPR), June 2005, vol 2

    Google Scholar 

  10. Fanelli G, Gall J (2011) Real time head pose estimation with random regression forests. In: Proc IEEE conf computer vision and pattern recognition (CVPR)

    Google Scholar 

  11. Gall J, Lempitsky VS (2009) Class-specific Hough forests for object detection. In: Proc IEEE conf computer vision and pattern recognition (CVPR)

    Google Scholar 

  12. Gall J, Yao A, Razavi N, van Gool LJ, Lempitsky VS (2011) Hough forests for object detection, tracking, and action recognition. IEEE Trans Pattern Anal Mach Intell 33(11)

    Google Scholar 

  13. Girshick R, Shotton J, Kohli P, Criminisi A, Fitzgibbon A (2011) Efficient regression of general-activity human poses from depth images. In: Proc IEEE intl conf on computer vision (ICCV)

    Google Scholar 

  14. Glocker B, Pauly O, Konukoglu E, Criminisi A (2012) Joint classification-regression forests for spatially structured multi-object segmentation. In: Proc European conf on computer vision (ECCV). Springer, Berlin

    Google Scholar 

  15. Godec M, Roth PM, Bischof H (2011) Hough-based tracking of non-rigid objects. In: Proc IEEE intl conf on computer vision (ICCV)

    Google Scholar 

  16. Lampert C, Blaschko M, Hofmann T (2008) Beyond sliding windows: object localization by efficient subwindow search. In: Proc IEEE conf computer vision and pattern recognition (CVPR)

    Google Scholar 

  17. Leibe B, Schiele B (2003) Interleaved object categorization and segmentation. In: Proc British machine vision conference (BMVC), vol II

    Google Scholar 

  18. Leibe B, Leonardis A, Schiele B (2008) Robust object detection with interleaved categorization and segmentation. Int J Comput Vis 77(1–3)

    Google Scholar 

  19. Lepetit V, Lagger P, Fua P (2005) Randomized trees for real-time keypoint recognition. In: Proc IEEE conf computer vision and pattern recognition (CVPR)

    Google Scholar 

  20. Marée R, Geurts P, Piater J, Wehenkel L (2005) Random subwindows for robust image classification. In: Proc IEEE conf computer vision and pattern recognition (CVPR), vol 1. IEEE, New York

    Google Scholar 

  21. Moosmann F, Triggs B, Jurie F (2006) Fast discriminative visual codebooks using randomized clustering forests. In: Advances in neural information processing systems (NIPS)

    Google Scholar 

  22. Mutch J, Lowe DG (2006) Multiclass object recognition with sparse, localized features. In: Proc IEEE conf computer vision and pattern recognition (CVPR)

    Google Scholar 

  23. Nowak E, Jurie F, Triggs B (2006) Sampling strategies for bag-of-features image classification. In: Proc European conf on computer vision (ECCV). Springer, Berlin

    Google Scholar 

  24. Okada R (2009) Discriminative generalized Hough transform for object detection. In: Proc IEEE intl conf on computer vision (ICCV)

    Google Scholar 

  25. Opelt A, Pinz A, Zisserman A (2008) Learning an alphabet of shape and appearance for multi-class object detection. Int J Comput Vis

    Google Scholar 

  26. Pang J, Huang Q, Jiang S (2008) Multiple instance boost using graph embedding based decision stump for pedestrian detection. In: Proc European conf on computer vision (ECCV). Springer, Berlin

    Google Scholar 

  27. Razavi N, Gall J, Van Gool L (2010) Backprojection revisited: scalable multi-view object detection and similarity metrics for detections. In: Proc European conf on computer vision (ECCV). Springer, Berlin

    Google Scholar 

  28. Schroff F, Criminisi A, Zisserman A (2008) Object class segmentation using random forests. In: Proc British machine vision conference (BMVC)

    Google Scholar 

  29. Seemann E, Schiele B (2006) Cross-articulation learning for robust detection of pedestrians. In: Proc annual symposium of the German association for pattern recognition (DAGM)

    Google Scholar 

  30. Sharp T (2008) Implementing decision trees and forests on a GPU. In: Proc European conf on computer vision (ECCV). Springer, Berlin

    Google Scholar 

  31. Shotton J, Blake A, Cipolla R (2008) Efficiently combining contour and texture cues for object recognition. In: Proc British machine vision conference (BMVC)

    Google Scholar 

  32. Shotton J, Blake A, Cipolla R (2008) Multiscale categorical object recognition using contour fragments. IEEE Trans Pattern Anal Mach Intell 30(7)

    Google Scholar 

  33. Shotton J, Johnson M, Cipolla R (2008) Semantic texton forests for image categorization and segmentation. In: Proc IEEE conf computer vision and pattern recognition (CVPR)

    Google Scholar 

  34. Torralba A, Murphy KP, Freeman WT, Rubin MA (2003) Context-based vision system for place and object recognition. In: Proc IEEE intl conf on computer vision (ICCV), Nice, France, October 2003, vol 2

    Google Scholar 

  35. Winn J, Shotton J (2006) The layout consistent random field for recognizing and segmenting partially occluded objects. In: Proc IEEE conf computer vision and pattern recognition (CVPR)

    Google Scholar 

  36. Yao A, Gall J, van Gool L (2010) A Hough transform-based voting framework for action recognition. In: Proc IEEE conf computer vision and pattern recognition (CVPR)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag London

About this chapter

Cite this chapter

Gall, J., Lempitsky, V. (2013). Class-Specific Hough Forests for Object Detection. In: Criminisi, A., Shotton, J. (eds) Decision Forests for Computer Vision and Medical Image Analysis. Advances in Computer Vision and Pattern Recognition. Springer, London. https://doi.org/10.1007/978-1-4471-4929-3_11

Download citation

  • DOI: https://doi.org/10.1007/978-1-4471-4929-3_11

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-4471-4928-6

  • Online ISBN: 978-1-4471-4929-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics