Visual Attribute Extraction Using Human Pose Estimation

Nodari, Angelo; Vanetti, Marco; Gallo, Ignazio

doi:10.1007/978-3-642-38628-2_41

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 7887)

Included in the following conference series:

Iberian Conference on Pattern Recognition and Image Analysis

Abstract

We propose a method for describing how a person is dressed, based on a novel way of extracting visual information that exploits Human Pose Estimation. Given the lack of algorithms in this field, we aim to pave the way by providing a baseline and publishing a detailed dataset for future comparisons. In particular, we show how Human Pose Estimation lets us extract the essential features for describing Visual Attributes. Furthermore, the proposed method handles the problems highlighted in the literature regarding feature extraction from images of people, which arise from their articulated poses. For this reason, we also propose a formalization of how to describe people's clothing, in order to provide a starting point and to facilitate the analysis and the Visual Attribute extraction phase. Moreover, we show how the use of Deformable Structures lets us extract Visual Attributes without using segmentation algorithms.
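
The idea of reading attributes directly from pose-localized body parts, rather than from a segmentation mask, can be illustrated with a minimal sketch. This is not the authors' implementation: the part boxes are assumed to come from some pose estimator that is not shown, and the palette, the box format, and the function names `nearest_color` and `part_attributes` are hypothetical choices made only for illustration.

```python
from collections import Counter

# Hypothetical coarse palette; the abstract does not specify the paper's
# actual attribute vocabulary.
PALETTE = {
    "black": (0, 0, 0),
    "white": (255, 255, 255),
    "red": (255, 0, 0),
    "green": (0, 255, 0),
    "blue": (0, 0, 255),
}


def nearest_color(pixel):
    """Return the palette name closest to an (r, g, b) pixel (squared RGB distance)."""
    return min(
        PALETTE,
        key=lambda name: sum((p - c) ** 2 for p, c in zip(pixel, PALETTE[name])),
    )


def part_attributes(image, part_boxes):
    """Map each body part to its most frequent palette color.

    image: list of rows, each row a list of (r, g, b) tuples.
    part_boxes: dict of part name -> (top, left, bottom, right), with bottom and
        right exclusive. Assumed to be produced by a pose estimator (not shown).
    """
    attributes = {}
    for part, (top, left, bottom, right) in part_boxes.items():
        # Every pixel inside the part box votes for its nearest palette color;
        # no foreground/background segmentation is performed.
        votes = Counter(
            nearest_color(image[y][x])
            for y in range(top, bottom)
            for x in range(left, right)
        )
        attributes[part] = votes.most_common(1)[0][0] if votes else None
    return attributes


# Toy 4x4 image: the top half is reddish (torso), the bottom half bluish (legs).
image = [[(250, 10, 10)] * 4] * 2 + [[(10, 10, 240)] * 4] * 2
boxes = {"torso": (0, 0, 2, 4), "legs": (2, 0, 4, 4)}
print(part_attributes(image, boxes))
```

In this sketch each body part is described independently, so an articulated pose only changes where the boxes fall, not how the attribute is computed.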

Author information

Authors and Affiliations

Dipartimento di Scienze Teoriche e Applicate, Università dell’Insubria, via Mazzini 5, 21100, Varese, Italy
Angelo Nodari, Marco Vanetti & Ignazio Gallo

Editor information

Editors and Affiliations

Institute for Systems and Robotics, Instituto Superior Técnico, Portugal
João M. Sanches
University of Alicante, Spain
Luisa Micó
INESC and University of Porto, Porto, Portugal
Jaime S. Cardoso

About this paper

Cite this paper

Nodari, A., Vanetti, M., Gallo, I. (2013). Visual Attribute Extraction Using Human Pose Estimation. In: Sanches, J.M., Micó, L., Cardoso, J.S. (eds) Pattern Recognition and Image Analysis. IbPRIA 2013. Lecture Notes in Computer Science, vol 7887. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38628-2_41

DOI: https://doi.org/10.1007/978-3-642-38628-2_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38627-5
Online ISBN: 978-3-642-38628-2
eBook Packages: Computer Science, Computer Science (R0)
