A Part-Based and Feature Fusion Method for Clothing Classification

  • Pan Huo
  • Yunhong Wang
  • Qingjie Liu
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9916)


Clothing recognition and parsing have attracted substantial attention in the computer vision community because they contribute to applications such as scene recognition, event recognition, and e-commerce. In this work, we propose a part-based and feature fusion method for classifying clothing in natural scenes. First, clothing is described with a part-based model, in which a Deformable Part-based Model (DPM) and a key point regression method are used to locate the head-shoulder region and the human torso. Then, a novel Distinctive Efficient Robust Feature (DERF) and four other low-level features are extracted to represent human clothing. Finally, a feature fusion strategy is used to improve classification performance. Experiments are conducted on a new, well-labeled image dataset, and the results show the efficiency of the proposed method.
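The abstract does not say which fusion strategy is used, so the following is only a minimal sketch of one common option, feature-level fusion by weighted concatenation. The descriptor names ("derf", "color"), the weights, and the helper functions are hypothetical and chosen purely for illustration; they are not taken from the paper.

```python
import math
from typing import Dict, List, Sequence


def l2_normalize(vec: Sequence[float]) -> List[float]:
    """Scale a vector to unit L2 norm; an all-zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def fuse_features(features: Dict[str, Sequence[float]],
                  weights: Dict[str, float]) -> List[float]:
    """Weighted concatenation of per-descriptor feature vectors.

    Assumption: this is a generic fusion scheme, not the paper's own strategy.
    Descriptors are visited in sorted-name order so the fused layout is
    deterministic; a descriptor without an explicit weight gets weight 1.0.
    """
    fused: List[float] = []
    for name in sorted(features):
        w = weights.get(name, 1.0)
        fused.extend(w * x for x in l2_normalize(features[name]))
    return fused


# Hypothetical descriptors extracted from one torso region.
example = fuse_features(
    {"derf": [3.0, 4.0], "color": [0.0, 2.0]},
    {"derf": 1.0, "color": 0.5},
)
```

Normalizing each descriptor before concatenation keeps a high-dimensional or large-magnitude feature from dominating the fused vector, and the weights let a classifier-independent prior express how much each descriptor is trusted.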


Keywords: Image analysis, Clothing classification, Part-based model, Feature fusion



The work is supported by the Hong Kong, Macao and Taiwan Science and Technology Cooperation Program of China (No. L2015TGA9004).



Copyright information

© Springer International Publishing AG 2016

Authors and Affiliations

  1. State Key Laboratory of Virtual Reality Technology and System, Beihang University, Beijing, China
