Multi-level feature learning with attention for person re-identification

Xiang, Suncheng; Fu, Yuzhuo; Chen, Hao; Ran, Wei; Liu, Ting

doi:10.1007/s11042-020-09569-z

Multi-level feature learning with attention for person re-identification

Published: 25 August 2020

Volume 79, pages 32079–32093, (2020)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Suncheng Xiang ORCID: orcid.org/0000-0002-9141-6460¹,
Yuzhuo Fu¹,
Hao Chen¹,
Wei Ran¹ &
…
Ting Liu¹

632 Accesses
8 Citations
Explore all metrics

Abstract

Person re-identification (re-ID) aims to match a specific person in a large gallery with different cameras and locations. Previous part-based methods mainly focus on part-level features with uniform partition, which increases learning ability for discriminative feature but not efficient or robust to scenarios with large variances. To address this problem, in this paper, we propose a novel feature fusion strategy based on traditional convolutional neural network. Then, a multi-branch deeper feature fusion network architecture is designed to perform discriminative learning for three semantically aligned region. Based on it, a novel self-attention mechanism is employed to softly assign corresponding weights to the semantic aligned feature during back-propagation. Comprehensive experiments have been conducted on several large-scale benchmark datasets, which demonstrates that proposed approach yields consistent and competitive re-ID accuracy compared with current single-domain re-ID methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Learning convolutional multi-level transformers for image-based person re-identification

Article Open access 13 October 2023

Cascaded attention-guided multi-granularity feature learning for person re-identification

Article 18 November 2022

A part-based attention network for person re-identification

Article 25 May 2020

References

Chang X, Hospedales TM, Xiang T (2018) Multi-level factorisation net for person re-identification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 2109–2118
Chen LC, Yang Y, Wang J, Xu W, Yuille AL (2016) Attention to scale: Scale-aware semantic image segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 3640–3649
Chen Y, Zhu X, Gong S (2017) Person re-identification by deep learning multi-scale representations. In: IEEE International Conference on Computer Vision Workshops, pp 2590–2600
Deng J, Dong W, Socher R, Li LJ, Li K, Fei-Fei L (2009) Imagenet: A large-scale hierarchical image database. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 248–255
Fang HS, Xie S, Tai YW, Lu C (2017) Rmpe: Regional multi-person pose estimation. In: IEEE International Conference on Computer Vision, pp 2334–2343
Fu Y, Wei Y, Wang G, Zhou Y, Shi H, Huang TS (2019) Self-similarity grouping: A simple unsupervised cross domain adaptation approach for person re-identification. In: Proceedings of the IEEE International Conference on Computer Vision, pp 6112–6121
Gao M, Ai H, Bai B (2016) A feature fusion strategy for person re-identification. In: IEEE international conference on image processing, pp 4274–4278
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 770–778
Hermans A, Beyer L, Leibe B (2017) In defense of the triplet loss for person re-identification. arXiv preprint arXiv:170307737
Huang P, Huang R, Huang J, Yangchen R, He Z, Li X, Chen J (2019) Deep feature fusion with multiple granularity for vehicle re-identification. In: IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp 80–88
Li D, Chen X, Zhang Z, Huang K (2017) Learning deep context-aware features over body and latent parts for person re-identification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 384–393
Li W, Zhao R, Xiao T, Wang X (2014) Deepreid: Deep filter pairing neural network for person re-identification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 152–159
Li W, Zhu X, Gong S (2018) Harmonious attention network for person re-identification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 2285–2294
Liao S, Hu Y, Zhu X, Li SZ (2015) Person re-identification by local maximal occurrence representation and metric learning. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 2197–2206
Lin TY, Maire M, Belongie S, Hays J, Perona P, Ramanan D, Dollár P, Zitnick CL (2014) Microsoft coco: Common objects in context. In: European Conference on Computer Vision, pp 740–755
Navaneet K, Sarvadevabhatla R K, Shekhar S, Babu R V, Chakraborty A (2019) Operator-in-the-loop deep sequential multi-camera feature fusion for person re-identification. IEEE Transactions on Information Forensics and Security 15:2375–2385
Article Google Scholar
Paszke A, Gross S, Massa F, Lerer A, Bradbury J, Chanan G, Killeen T, Lin Z, Gimelshein N, Antiga L, et al. (2019) Pytorch: An imperative style, high-performance deep learning library. In: Advances in Neural Information Processing Systems, pp 8024–8035
Ristani E, Solera F, Zou R, Cucchiara R, Tomasi C (2016) Performance measures and a data set for multi-target, multi-camera tracking. In: European Conference on Computer Vision, pp 17–35
Rui Z, Ouyang W, Wang X (2014) Learning mid-level filters for person re-identification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 144–151
Shen Y, Li H, Xiao T, Yi S, Chen D, Wang X (2018) Deep group-shuffling random walk for person re-identification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 2265–2274
Sun Y, Zheng L, Deng W, Wang S (2017) Svdnet for pedestrian retrieval. In: IEEE International Conference on Computer Vision, pp 3800–3808
Sun Y, Zheng L, Yang Y, Tian Q, Wang S (2018) Beyond part models: Person retrieval with refined part pooling (and a strong convolutional baseline). In: European Conference on Computer Vision, pp 480–496
Varior RR, Haloi M, Wang G (2016) Gated siamese convolutional neural network architecture for human re-identification. In: European Conference on Computer Vision, pp 791–808
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser Ł, Polosukhin I (2017) Attention is all you need. In: Advances in Neural Information Processing Systems, pp 5998–6008
Wang G, Yuan Y, Chen X, Li J, Zhou X (2018) Learning discriminative features with multiple granularities for person re-identification. In: ACM International Conference on Multimedia, pp 274–282
Wang Y, Wang L, You Y, Zou X, Chen V, Li S, Huang G, Hariharan B, Weinberger KQ (2018) Resource aware person re-identification across multiple resolutions. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 8042–8051
Wei L, Zhang S, Yao H, Gao W, Tian Q (2017) Glad: Global-local-alignment descriptor for pedestrian retrieval. In: Proceedings of the 25th ACM international conference on Multimedia, pp 420–428
Xiang S, Fu Y, Xie M, Yu z, Liu T (2020) Unsupervised person re-identification by hierarchical cluster and domain transfer. MULTIMEDIA TOOLS AND APPLICATIONS
Yi D, Lei Z, Liao S, Li SZ (2014) Deep metric learning for person re-identification. In: International Conference on Pattern Recognition, pp 34–39
Zhang X, Luo H, Fan X, Xiang W, Sun Y, Xiao Q, Jiang W, Zhang C, Sun J (2017) Alignedreid: Surpassing human-level performance in person re-identification. arXiv preprint arXiv:171108184
Zhao L, Li X, Zhuang Y, Wang J (2017) Deeply-learned part-aligned representations for person re-identification. In: IEEE International Conference on Computer Vision, pp 3219–3228
Zheng L, Shen L, Tian L, Wang S, Wang J, Tian Q (2015) Scalable person re-identification: A benchmark. In: IEEE International Conference on Computer Vision, pp 1116–1124
Zheng L, Yang Y, Hauptmann AG (2016) Person re-identification: Past, present and future. arXiv preprint arXiv:161002984
Zheng M, Karanam S, Wu Z, Radke RJ (2019) Re-identification with consistent attentive siamese networks. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 5735–5744
Zheng Z, Zheng L, Yang Y (2017) Unlabeled samples generated by gan improve the person re-identification baseline in vitro. In: Proceedings of the IEEE International Conference on Computer Vision, pp 3754–3762
Zheng Z, Zheng L, Yang Y (2018) Pedestrian alignment network for large-scale person re-identification. IEEE Transactions on Circuits and Systems for Video Technology 29(10):3037–3045
Article Google Scholar
Zhong Z, Zheng L, Cao D, Li S (2017) Re-ranking person re-identification with k-reciprocal encoding. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 3652–3661
Xiang S, Fu Y, You G, Liu T (2020) Unsupervised domain adaptation through synthesis for person re-identification. In: IEEE International Conference on Multimedia and Expo, pp 16

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China under Project(Grant No.61977045), the National Defense Pre-Research Foundation of China(Grant No.513110501). The authors would like to thank the anonymous reviewers for their valuable suggestions and constructive criticism.

Author information

Authors and Affiliations

School of Electronic Information and Electrical Engineering, Shanghai Jiao Tong University, Shanghai, China
Suncheng Xiang, Yuzhuo Fu, Hao Chen, Wei Ran & Ting Liu

Authors

Suncheng Xiang
View author publications
You can also search for this author in PubMed Google Scholar
Yuzhuo Fu
View author publications
You can also search for this author in PubMed Google Scholar
Hao Chen
View author publications
You can also search for this author in PubMed Google Scholar
Wei Ran
View author publications
You can also search for this author in PubMed Google Scholar
Ting Liu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Suncheng Xiang.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Xiang, S., Fu, Y., Chen, H. et al. Multi-level feature learning with attention for person re-identification. Multimed Tools Appl 79, 32079–32093 (2020). https://doi.org/10.1007/s11042-020-09569-z

Download citation

Received: 22 March 2020
Revised: 10 July 2020
Accepted: 06 August 2020
Published: 25 August 2020
Issue Date: November 2020
DOI: https://doi.org/10.1007/s11042-020-09569-z

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Multi-level feature learning with attention for person re-identification

Abstract

Access this article

Similar content being viewed by others

Learning convolutional multi-level transformers for image-based person re-identification

Cascaded attention-guided multi-granularity feature learning for person re-identification

A part-based attention network for person re-identification

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Multi-level feature learning with attention for person re-identification

Abstract

Access this article

Similar content being viewed by others

Learning convolutional multi-level transformers for image-based person re-identification

Cascaded attention-guided multi-granularity feature learning for person re-identification

A part-based attention network for person re-identification

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation