Fast Cross-Scenario Clothing Retrieval Based on Indexing Deep Features

Li, Zongmin; Li, Yante; Gao, Yongbiao; Liu, Yujie

doi:10.1007/978-3-319-48890-5_11

Zongmin Li¹⁶,
Yante Li¹⁶,
Yongbiao Gao¹⁶ &
…
Yujie Liu¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9916))

Included in the following conference series:

Pacific Rim Conference on Multimedia

2450 Accesses
3 Citations

Abstract

In this paper, we propose a new approach for large scale daily clothing retrieval. Fast clothing image search in cross scenarios is a challenging task due to the large amount of clothing images on the internet and visual differences between street photos (pictures of people wearing clothing taken in our daily life with complex background) and online shop photos (pictures of clothing items on people, captured by professionals in more controlled settings). We tackle the problem of cross-scenario clothing retrieval through clothing segmentation based on coarse-fine hierarchical superpixel segmentation and pose estimation to remove the background of clothing image and employ deep features representing the clothing item aimed at describing various clothing effectively. In addition, in order to speed up the retrieval process for large scale online clothing images, we adopt inverted indexing on deep feature by regarding deep features as Bag-of-Word model. In this way, we obtain similar clothing items far faster. Experiments demonstrate that our method significantly outperforms state-of-the-art approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Deng, J., Dong, W., Socher, R, Li, L.-J., Li, K., Fei-Fei, L.: ImageNet: a large-scale hierarchical image database. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 248–255 (2009)
Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)
Article Google Scholar
Sivic, J., Zisserman, A.: Video Google: a text retrieval approach to object matching in videos. In: International Conference on Computer Vision, pp. 1470–1477 (2003)
Google Scholar
Yamaguchi, K., Kiapour, M.H., Ortiz, L.E., Berg, T.L.: Parsing clothing in fashion photographs. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3570–3577 (2012)
Google Scholar
Rother, C., Kolmogorov, V., Blake, A.: Grabcut - interactive foreground extraction using iterated graph cuts. ACM Trans. Graph. (TOG) 23, 309–314 (2004)
Article Google Scholar
Liu, S., Song, Z., Liu, G.: Street-to-shop: cross-scenario clothing retrieval via parts alignment and auxiliary set. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3330–3337 (2012)
Google Scholar
Yang, Y., Ramanan, D.: Articulated pose estimation with flexible mixtures-of-parts. In: IEEE Conference Computer Vision and Pattern Recognition, pp. 1385–1392 (2011)
Google Scholar
Liu, S., et al.: Hi, magic closet, tell me what to wear! In: Proceedings of the 20th ACM International Conference on Multimedia, pp. 619–628 (2012)
Google Scholar
Fu, J., Wang, J., Li, Z., et al.: Efficient clothing retrieval with semantic preserving visual phrases. In: Proceedings of 11th Asian Conference on Computer Vision, pp. 420–431 (2013)
Google Scholar
Di, W., Wah, C., Bhardwaj, A., Piramuthu, R., Sundaresan, N.: Style finder: fine-grained clothing style recognition and retrieval. In: Computer Vision and Pattern Recognition Workshops, pp. 8–13 (2013)
Google Scholar
Chen, H. Gallagher, A. Girod, B.: Describing clothing by semantic attributes. In: Proceedings of the 12th European Conference on Computer Vision, pp. 609–623 (2012)
Google Scholar
Kalantidis, Y., Kennedy, L., Li, L.J.: Getting the look: clothing recognition and segmentation for automatic product suggestions in everyday photos. In: The 3rd ACM Conference on International Conference on Multimedia Retrieval, pp. 105–112 (2013)
Google Scholar
Malisiewicz, T., Gupta, A., Efros, A.A.: A. Ensemble of exemplar-SVMs for object detection. In: International Conference on Computer Vision, pp. 89–96 (2011)
Google Scholar
Sutskever, I., Krizhevsky, A., Hinton, G.: Imagenet classification with deep convolutional neural networks. In: Neural Information Processing Systems, pp. 1097–1105 (2012)
Google Scholar
Babenko, A., Lempitsky, V.: The inverted multi-index. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3069–3076 (2012)
Google Scholar
Jurie, F., Nowak, E., Triggs, B.: Sampling strategies for bag of features image classification. In: European Conference on Computer Vision, pp. 490–503 (2006)
Google Scholar
Kiapour, M.H., Lazebnik, S., Han, X.: Where to buy it: matching street clothing photos in online shops. In: IEEE International Conference on Computer Vision, pp. 3343–3351 (2015)
Google Scholar
Girshick, R., Donahue, J., Darrell, T.: Region based convolutional networks for accurate object detection and semantic segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 38, 142–158 (2015)
Article Google Scholar
Kuang, Z., Li, Z., Lv, Q.: Modal function transformation for isometric 3D shape representation. Comput. Graph. 46, 209–220 (2015)
Article Google Scholar
Liu, R., Zhao, Y., Wei, S., Zhu, Z., Liao, L., Qiu, S.: Indexing of CNN features for large scale image search. CoRR, abs/1508.00217 (2015)
Google Scholar
Uijlings, J., van Sande, K.E.A.: Selective search for object recognition. Int. J. Comput. Vis. 104, 154–171 (2013)
Article Google Scholar
Avrithis, Y., Kalantidis, Y.: Approximate Gaussian mixtures for large scale vocabularies. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012. LNCS, vol. 7574, pp. 15–28. Springer, Heidelberg (2012). doi:10.1007/978-3-642-33712-3_2
Chapter Google Scholar

Download references

Acknowledgment

The authors would like to thank the support of National Natural Science Foundation of China, the Scientific Research Foundation for the Excellent Middle-Aged and Youth Scientists of Shandong Province of China.

Author information

Authors and Affiliations

College of Computer and Communication Engineering, China University of Petroleum (East China), Beijing, China
Zongmin Li, Yante Li, Yongbiao Gao & Yujie Liu

Authors

Zongmin Li
View author publications
You can also search for this author in PubMed Google Scholar
Yante Li
View author publications
You can also search for this author in PubMed Google Scholar
Yongbiao Gao
View author publications
You can also search for this author in PubMed Google Scholar
Yujie Liu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zongmin Li .

Editor information

Editors and Affiliations

Zhengzhou University, Zhengzhou, China
Enqing Chen
Jiaotong University, Xi’an, China
Yihong Gong
Zhengzhou University, Zhengzhou, China
Yun Tie

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, Z., Li, Y., Gao, Y., Liu, Y. (2016). Fast Cross-Scenario Clothing Retrieval Based on Indexing Deep Features. In: Chen, E., Gong, Y., Tie, Y. (eds) Advances in Multimedia Information Processing - PCM 2016. PCM 2016. Lecture Notes in Computer Science(), vol 9916. Springer, Cham. https://doi.org/10.1007/978-3-319-48890-5_11

Download citation

DOI: https://doi.org/10.1007/978-3-319-48890-5_11
Published: 27 November 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48889-9
Online ISBN: 978-3-319-48890-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics