Abstract
Object classification and detection are two fundamental problems in computer vision and pattern recognition. In this paper, we discuss these two research topics, including their backgrounds, challenges, recent progress and our solutions which achieve excellent performance in PASCAL VOC competitions on object classification and detection. Moreover, potential directions are outlined for future research.
Chapter PDF
Similar content being viewed by others
References
http://pascallin.ecs.soton.ac.uk/challenges/VOC/voc2010/index.html
http://pascallin.ecs.soton.ac.uk/challenges/VOC/voc2011/index.html
Barla, A., Odone, F., Verri, A.: Histogram intersection kernel for image classification. In: Proc. IEEE Inter. Conf. Image. Process. (2003)
Bengio, Y., Courville, A., Vincent, P.: Representation learning: A review and new perspectives. IEEE T-PAMI 35(8), 1798–1828 (2013)
Bradley, D.M., Bagnell, J.A.: Differential sparse coding. In: Proc. Neu. Inf. Process. Sys. (2008)
Crandall, D., Felzenszwalb, P., Huttenlocher, D.: Spatial priors for part-based recognition using statistical models. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (2005)
Csurka, G., Bray, C., Dance, C., Fan, L.: Visual categorization with bags of keypoints. In: Proc. Eur. Conf. Comput. Vis. (2004)
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (2005)
Felzenszwalb, P., Girshick, R., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE T-PAMI 32(9), 1627–1645 (2010)
Ferrari, V., Fevrier, L., Jurie, F., Schmid, C.: Groups of adjacent contour segments for object detection. IEEE T-PAMI 30(1), 36–51 (2008)
Fischler, M., Elschlager, R.: The representation and matching of pictorial structures. IEEE Trans. Comput. C-22(1), 67–92 (1973)
van Gemert, J.C., Geusebroek, J.-M., Veenman, C.J., Smeulders, A.W.M.: Kernel codebooks for scene categorization. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part III. LNCS, vol. 5304, pp. 696–709. Springer, Heidelberg (2008)
Huang, Y., Huang, K., Tao, D., Tan, T., Li, X.: Enhanced biological inspired model for object recognition. IEEE T-SMC-Part B 41(6), 1668–1680 (2011)
Huang, Y., Huang, K., Tao, D., Wang, L., Tan, T., Li, X.: Enhanced biological inspired model. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (2008)
Huang, Y., Huang, K., Yu, Y., Tan, T.: Salient coding for image classification. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (2011)
Huang, Y., Wu, Z., Wang, L., Tan, T.: Feature coding in image classification: A comprehensive study. IEEE T-PAMI (accepted, 2013)
Lampert, C.H., Blaschko, M.B., Hofmann, T.: Beyond sliding windows: Object localization by efficient subwindow search. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (2008)
Lee, T.S.: Image representation using 2D gabor wavelets. IEEE T-PAMI 18, 959–971 (1996)
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60, 91–110 (2004)
Mairal, J., Bach, F., Ponce, J., Sapiro, G.: Supervised dictionary learning. In: NIPS (2008)
Mark, E., Gool, L., Williams, C.K., Winn, J., Zisserman, A.: The Pascal Visual Object Classes (VOC) Challenge. IJCV 88(2), 303–338 (2010)
Ojala, T., Petikainen, M., Harwood, D.: Performance evaluation of texture measures with classification based on kullback discrimination of distributions. In: Proc. IAPR Inter. Conf. Pattern Recognit. (1994)
Perronnin, F., Dance, C.: Fisher kernels on visual vocabularies for image categorization. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (2007)
Perronnin, F., Sánchez, J., Mensink, T.: Improving the Fisher kernel for large-scale image classification. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part IV. LNCS, vol. 6314, pp. 143–156. Springer, Heidelberg (2010)
Sande, K., Gevers, T., Snoek, C.: Evaluation of color descriptors for object and scene recognition. IEEE T-PAMI 32(9), 1582–1596 (1998)
Schnitzspan, P., Roth, S., Schiele, B.: Automatic discovery of meaningful object parts with latent crfs. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (2010)
Serre, T., Wolf, L., Bileschi, S., Riesenhuber, M., Poggio, T.: Robust object recognition with cortex-like mechanisms. IEEE T-TPAMI 29(3), 411–426 (2007)
Vedaldi, A., Gulshan, V., Varma, M., Zisserman, A.: Multiple kernels for object detection. In: Proc. IEEE Inter. Conf. Comput. Vis. (2009)
Wang, J., Yang, J., Yu, K., Lv, F., Huang, T., Gong, Y.: Locality-constrained linear coding for image classification. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (2010)
Wold, H.: Partial least squares. In: Encyclopedia of Statistical Sciences (2004)
Wu, Z., Huang, Y., Wang, L., Tan, T.: Group encoding of local features in image classification. In: Proc. IAPR Inter. Conf. Pattern Recognit. (2012)
Yang, J., Yu, K., Gong, Y., Huang, T.: Linear spatial pyramid matching using sparse coding for image classification. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (2009)
Yu, K., Zhang, T.: Improved local coordinate coding using local tangents. In: Proc. Int. Conf. Mach. Learning (2010)
Yu, Y., Zhang, J., Huang, Y., Zheng, S., Ren, W., Wang, C., Huang, K., Tan, T.: Object detection by context and boosted hog-lbp. In: ECCV workshop on PASCAL VOC (2010)
Zhang, J., Huang, K., Yu, Y., Tan, T.: Boosted Local Structured HOG-LBP for Object Localization. In: Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (2011)
Zhang, J., Huang, Y., Huang, K., Wu, Z., Tan, T.: Data decomposition and spatial mixture modeling for part based model. In: Proc. Asi. Conf. Compt. Vis. (2013)
Zhang, J., Yu, Y., Huang, Y., Wang, C., Ren, W., Wu, J., Huang, K., Tan, T.: Object detection based on data decomposition, spatial mixture modeling and context. In: International Conference on Computer Vision Workshop on Visual Object Classes Challenge (2011)
Zhou, X., Yu, K., Zhang, T., Huang, T.S.: Image classification using super-vector coding of local image descriptors. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part V. LNCS, vol. 6315, pp. 141–154. Springer, Heidelberg (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tan, T., Huang, Y., Zhang, J. (2013). Recent Progress on Object Classification and Detection. In: Ruiz-Shulcloper, J., Sanniti di Baja, G. (eds) Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. CIARP 2013. Lecture Notes in Computer Science, vol 8259. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41827-3_1
Download citation
DOI: https://doi.org/10.1007/978-3-642-41827-3_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41826-6
Online ISBN: 978-3-642-41827-3
eBook Packages: Computer ScienceComputer Science (R0)