Skip to main content
Log in

Visual attribute detction for pedestrian detection

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Attributes are expected to narrow down the semantic gap between low-level visual features and high-level semantic meanings. Such superiority motivates us to explore pedestrian attributes which has became a critical problem to boost image understanding and improve the performance of pedestrian detection, retrieval, re-identification, etc. Based on the PETA dateset, we manually relabel two subset VIPeR and PRID as our experimental dataset. Moreover, we proposed an evaluation protocol for researchers to evaluate pedestrian attribute classification algorithms. In this paper, we utilized two baseline methods to to demonstrate the performance of the attribute in pedestrian detection. The first one directly uses color and texture features to train Support Vector Machine (SVM) classification while the other one uses DSIFT (Dense SIFT) with Bag-of-Visual-Words (BoVW) to train SVM classification. Finally, we report and discuss the baseline performance on the database following the proposed evaluation protocol.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15

Similar content being viewed by others

Notes

  1. The Y channel of YCbCr is selected as the luminance channel.

References

  1. Bosch A, Zisserman A, Muoz X (2008) Scene classification using a hybrid generative/discriminative approach. Pattern Anal Machine Intelligence:712–727

  2. Bossard L, Dantone M, Leistner C, Wengert C, Quack T, Van Gool L (2013) Apparel classification with style. In: Asian conference on computer vision, pp 321–335

  3. Bourdev L, Malik J (2009) Poselets: body part detectors trained using 3d human pose annotations. In: Computer vision international conference, pp 1365–1372

  4. Bourdev L, Maji S, Brox T, Malik J (2010) Detecting people using mutually consistent poselet activations. In: European conference on computer vision, pp 168–181

  5. Bourdev L, Maji S, Malik J (2011) Describing people: a poselet-based approach to attribute classification. In: IEEE international conference on computer vision, pp 1543–1550

  6. Chen H, Gallagher A, Girod B (2012) Describing clothing by semantic attributes. In: European conference on computer vision, pp 609–623

  7. Chen B, Shu H, Coatrieux G, Chen G, Sun X, Coatrieux J (2015) Color image analysis by quaternion-type moments. J Math Imaging and Vision:124–144

  8. Csurka G, Dance C, Fan L, Willamowski J, Bray C (2004) Visual categorization with bags of keypoints. In: Workshop on statistical learning in computer vision, European conference on computer vision, pp 1–22

  9. Deng Y, Luo P, Loy CC, Tang X (2014) Pedestrian attribute recognition at far distance

  10. Dorko G, Schmid C (2005) Object class recognition using discriminative local features

  11. Duan K, Parikh D, Crandall D, Grauman K (2012) Discovering localized attributes for fine-grained recognition. In: Computer vision and pattern recognition, pp 3474–3481

  12. Ferrari V, Zisserman A (2007) Learning visual attributes, in advances in neural information processing systems:433–440

  13. Fogel I, Sagi D (1989) Gabor filters as texture discriminator. Biol Cybern:103–113

  14. Fu Y, Hospedales TM, Xiang T, Gong S (2012) Attribute learning for understanding unstructured social activity. In: European conference on computer vision, pp 521–534

  15. Gu B, Sheng VS (2016) A robust regularization path algorithm for -support vector classification. IEEE Trans Neural Networks Learning Systems

  16. Jingyuan C, Xuemeng S, Liqiang N, Xiang W, Hanwang Z, Tat-Seng C (2016) Micro tells macro: predicting the popularity of micro-videos via a transductive model. In: Proceedings of the 2016 ACM Conference on Multimedia Conference, MM 2016, Amsterdam, pp 898–907

  17. Layne R, Hospedales TM, Gong S, Mary Q (2012) Person re-identification by attributes. In: British Machine Vision Conference

  18. Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: Computer vision and pattern recognition, pp 2169– 2178

  19. Li A, Liu L, Wang K, Liu S, Yan S (2014) Clothing attributes assisted person re-identification, pp 869–878

    Google Scholar 

  20. Liu J, Kuipers B, Savarese S (2011) Recognizing human actions by attributes. In: Computer vision and pattern recognition, pp 3337–3344

  21. Liu Z, Huang H, He Q, Chiew K, Gao Y (2015) Rare category exploration on linear time complexity. In: Database systems for advanced applications - 20th International Conference, DASFAA 2015, Hanoi, Vietnam, April 20-23, 2015, Proceedings, Part II, pp 37–54

  22. Lowe DG (1999) Object recognition from local scale-invariant features. In: The proceedings of the IEEE international conference, pp 1150–1157

  23. Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis:91–110

  24. Luming Z, Yahong H, Yi Y, Mingli S, Shuicheng Y, Qi T (2013) Discovering discriminative graphlets for aerial image categories recognition. IEEE Trans Image Process:5071–5084

  25. Luming Z, Meng W, Richang H, Bao-Cai Y, Xuelong L (2016) Large-scale aerial image categorization using a multitask topological codebook. IEEE Trans Cybernetics:535–545

  26. Luming Z, Yang Y, Meng W, Richang H, Liqiang N, Xuelong L (2016) Detecting densely distributed graph patterns for fine-grained image categorization. IEEE Trans Image Processing:553–565

  27. Maji S, Berg AC, Malik J (2008) Classification using intersection kernel support vector machines is efficient. In: Computer vision and pattern recognition, pp 1–8

  28. Ojala T, Pietikäinen M, Harwood D (1996) A comparative study of texture measures with classification based on featured distributions. Pattern Recog:51–59

  29. Schmid C (2001) Constructing models for content-based image retrieval. In: Computer vision and pattern recognition, p 39

  30. Siddiquie B, Feris RS, Davis LS (2011) Image ranking and retrieval based on multi-attribute queries. In: Computer vision and pattern recognition, pp 801–808

  31. Sivic J, Zisserman A (2003) Video google: a text retrieval approach to object matching in videos. In: Proceedings ninth IEEE international conference on computer vision, p 1470

  32. Song X, Nie L, Zhang L, Liu M, Chua T (2015) Interest inference via structure-constrained multi-source multi-task learning. In: Proceedings of the twenty-fourth international joint conference on artificial intelligence. IJCAI, Buenos Aires, pp 2371–2377

  33. Vaquero D, Feris RS, Tran D, Brown L, Hampapur A, Turk M, et al. (2009) Attribute-based people search in surveillance environments. In: Workshop on applications of computer vision, pp 1–8

  34. Wang S, Joo J, Wang Y, Zhu S-C (2013) Weakly supervised learning for attribute localization in outdoor scenes. In: Computer vision and pattern recognition, pp 3111–3118

  35. Wang W, Yan Y, Zhang L, Hong R, Sebe N (2016) Collaborative sparse coding for multiview action recognition, pp 80–87

    Google Scholar 

  36. Wen X, Shao L, Xue Y, Fang W (2015) A rapid learning algorithm for vehicle classification. Inf Sci:395–406

  37. Xuemeng S, Zhaoyan M, Liqiang N, Yi-Liang Z, Tat-Seng C (2016) Volunteerism tendency prediction via harvesting multiple social networks. ACM Trans Inf Syst:10

  38. Yang M, Yu K (2011) Real-time clothing recognition in surveillance videos. In: IEEE international conference image processing, pp 2937–2940

  39. Zhang L, Song M, Sun L, Liu X, Wang Y, Tao D, Bu J, Chen C (2012) Spatial graphlet matching kernel for recognizing aerial image categories. In: International Conference on Pattern Recognition, pp 2813–2816

  40. Zhang L, Song M, Liu X, Bu J, Chen C (2013) Fast multi-view segment graph kernel for object classification. Signal Process:1597–1607

  41. Zhang L, Gao Y, Xia Y, Dai Q, Li X (2015) A fine-grained image categorization system by cellet-encoded spatial pyramid modeling. IEEE Trans Ind Electron:564–571

  42. Zheng Y, Jeon B, Xu D, Wu QMJ, Zhang H (2015) Image segmentation by generalized hierarchical fuzzy c-means algorithm. J Intell Fuzzy Syst:961–973

  43. Zhenguang L, Kevin C, Qinming H, Hao H, Butian H (2014) Prior-free rare category detection: more effective and efficient solutions. Expert Syst Appl:7691–7706

Download references

Acknowledgments

This work was supported in part by the National Natural Science Foundation of China (61502337, 61472275, 61170239, 61303208), the Tianjin Research Program of Application Foundation and Advanced Technology (15JCYBJC162000), and the grant of Elite Scholar Program of Tianjin University (2014XRG-0046).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wei-Zhi Nie.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, J., Li, FW., Nie, WZ. et al. Visual attribute detction for pedestrian detection. Multimed Tools Appl 78, 26833–26850 (2019). https://doi.org/10.1007/s11042-016-4258-5

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-016-4258-5

Keywords

Navigation