Abstract
Object detection has over the past few years converged on using linear SVMs over HOG features. Training linear SVMs however is quite expensive, and can become intractable as the number of categories increase. In this work we revisit a much older technique, viz. Linear Discriminant Analysis, and show that LDA models can be trained almost trivially, and with little or no loss in performance. The covariance matrices we estimate capture properties of natural images. Whitening HOG features with these covariances thus removes naturally occuring correlations between the HOG features. We show that these whitened features (which we call WHO) are considerably better than the original HOG features for computing similarities, and prove their usefulness in clustering. Finally, we use our findings to produce an object detection system that is competitive on PASCAL VOC 2007 while being considerably easier to train and test.
This work was funded by ONR-MURI Grant N00014-10-1-0933 and NSF Grant 0954083.
Chapter PDF
References
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: CVPR (2005)
Felzenszwalb, P., Girshick, R., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. TPAMI 32 (2010)
Bourdev, L., Malik, J.: Poselets: Body part detectors trained using 3D human pose annotations. In: ICCV (2009)
Malisiewicz, T., Gupta, A., Efros, A.A.: Ensemble of exemplar-svms for object detection and beyond. In: ICCV (2011)
Fisher, R.: The use of multiple measurements in taxonomic problems. Annals of Human Genetics (1936)
Hastie, T., Tibshirani, R., Friedman, J.J.H.: The elements of statistical learning. Springer (2009)
Duda, R., Hart, P.: Pattern recognition and scene analysis (1973)
Belhumeur, P., Hespanha, J., Kriegman, D.: Eigenfaces vs. fisherfaces: Recognition using class specific linear projection. TPAMI 19 (1997)
Turk, M., Pentland, A.: Eigenfaces for recognition. Journal of Cognitive Neuroscience (1991)
Murase, H., Nayar, S.: Visual learning and recognition of 3-D objects from appearance. IJCV 14 (1995)
Ke, Y., Sukthankar, R.: Pca-sift: A more distinctive representation for local image descriptors. In: CVPR (2004)
Schwartz, W., Kembhavi, A., Harwood, D., Davis, L.: Human detection using partial least squares analysis. In: ICCV (2009)
Hyvärinen, A., Hurri, J., Hoyer, P.: Natural Image Statistics: A probabilistic approach to early computational vision (2009)
Rue, H., Held, L.: Gaussian Markov random fields: theory and applications (2005)
Marlin, B., Schmidt, M., Murphy, K.: Group sparse priors for covariance estimation. In: UAI (2009)
Vedaldi, A., Zisserman, A.: Structured output regression for detection with partial truncation. In: NIPS (2009)
Gao, T., Packer, B., Koller, D.: A segmentation-aware object detection model with occlusion handling. In: CVPR (2011)
Dalal, N.: Finding people in Images and Videos. PhD thesis, INRIA (2006)
Everingham, M., Van Gool, L., Williams, C.K.I., Winn, J., Zisserman, A.: The PASCAL Visual Object Classes Challenge 2007 (VOC2007) Results, http://www.pascal-network.org/challenges/VOC/voc2007/workshop/index.html
Shi, J., Malik, J.: Normalized cuts and image segmentation. TPAMI 22 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hariharan, B., Malik, J., Ramanan, D. (2012). Discriminative Decorrelation for Clustering and Classification. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds) Computer Vision – ECCV 2012. ECCV 2012. Lecture Notes in Computer Science, vol 7575. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33765-9_33
Download citation
DOI: https://doi.org/10.1007/978-3-642-33765-9_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33764-2
Online ISBN: 978-3-642-33765-9
eBook Packages: Computer ScienceComputer Science (R0)