Semi-supervised learning for person re-identification based on style-transfer-generated data by CycleGANs

Zhu, Shangdong; Zhang, Yunzhou; Coleman, Sonya; Wang, Song; Li, Ruilong; Liu, Shuangwei

doi:10.1007/s00138-021-01239-w

Semi-supervised learning for person re-identification based on style-transfer-generated data by CycleGANs

Original Paper
Published: 01 October 2021

Volume 32, article number 122, (2021)
Cite this article

Machine Vision and Applications Aims and scope Submit manuscript

Shangdong Zhu¹,
Yunzhou Zhang ORCID: orcid.org/0000-0003-0610-3732²,
Sonya Coleman³,
Song Wang¹,
Ruilong Li² &
…
Shuangwei Liu²

700 Accesses
7 Citations
1 Altmetric
Explore all metrics

Abstract

Person re-identification (re-ID) is an exceedingly significant branch in the field of computer vision, especially for video surveillance. It is still a challenge to obtain more labeled training data and use them reasonably for more precise matching, though the person re-ID performance has been improved significantly. In order to solve this challenge, this study proposes a semi-supervised learning algorithm for data augmentation, the style-transfer-generated data as an extra class (STGDEC), which is aided by the Cycle-Consistent Adversarial Networks (CycleGANs) in generating extra unlabeled training data. Specifically, the algorithm firstly trains the CycleGANs and Deep Convolutional Generative Adversarial Networks so as to generate large amounts of unlabeled data. Secondly, we propose an adaptive receptive field module to expand the size of receptive fields and select the appropriate receptive field features dynamically in order to learn more contextual information and discriminative feature representation and embed the module in the backbone network easily. Thirdly, we use the combination of label smoothing regularization for outliers and an extra class loss to regularize the generated data and encourage the network not to be too confident to the ground-truth. Finally, this paper proposes three training strategies for the combination of standard dataset and generated samples. Comprehensive experiments based on the STGDEC are conducted, and these results show that the proposed algorithm gains a significant improvement over the baseline, the Basel. + LSRO and state-of-the-art approaches of person re-ID in many cases.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

SSD: Single Shot MultiBox Detector

End-to-End Object Detection with Transformers

A survey on Image Data Augmentation for Deep Learning

Article Open access 06 July 2019

References

Zheng, L., Yang, Y., Hauptmann, A.G.: Person re-identification: past, present and future. arXiv:1610.02984 (2016)
Zhong, Z., Zheng, L., Zheng, Z., Li, S., Yang, Y.: Camera style adaptation for person re-identification. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5157–5166 (2018)
Ahmed, E., Jones, M., Marks, T.K.: An improved deep learning architecture for person re-identification. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3908–3916 (2015)
Chen, S., Guo, C., Lai, J.: Deep ranking for person re-identification via joint representation learning. IEEE Trans. Image Process. (TIP) 25(5), 2353–2367 (2016)
Article MathSciNet Google Scholar
Liao, S., Hu, Y., Zhu, X., Li, S.Z.: Person re-identification by local maximal occurrence representation and metric learning. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2197–2206 (2015)
Paisitkriangkrai, S., Shen, C., van den Hengel, A.: Learning to rank in person re-identification with metric ensembles. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1846–1855 (2015)
Tao, D., Guo, Y., Song, M., Li, Y., Yu, Z., Tang, Y.Y.: Person re-identification by dual-regularized KISS metric learning. IEEE Trans. Image Process. (TIP) 25(6), 2726–2738 (2016)
Article MathSciNet Google Scholar
Wu, L., Shen, C., van den Hengel, A.: Deep linear discriminant analysis on fisher networks: a hybrid architecture for person re-identification. Pattern Recognit. (PR) 65, 238–250 (2017)
Article Google Scholar
Zheng, Z., Zheng, L., Yang, Y.: Unlabeled samples generated by gan improve the person re-identification baseline in vitro. In: IEEE International Conference on Computer Vision (ICCV), pp. 3774–3782 (2017)
Huang, Y., Xu, J., Wu, Q., Zheng, Z., Zhang, Z., Zhang, J.: Multi-pseudo regularized label for generated data in person re-identification. IEEE Trans. Image Process. (TIP) 28(3), 1391–1403 (2019)
Article MathSciNet Google Scholar
Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., Bengio, Y.: Generative adversarial nets. In: Neural Information Processing Systems (NIPS), pp. 2672–2680 (2014)
Xin, X., Wang, J., Xie, R., Zhou, S., Huang, W., Zheng, N.: Semi-supervised person re-identification using multi-view clustering. Pattern Recognit. (PR) 88, 285–297 (2019)
Article Google Scholar
Liang, C., Huang, B., Hu, R., Zhang, C., Jing, X., Xiao, J.: A unsupervised person re-identification method using model based representation and ranking. In: Proceedings of the 23rd ACM International Conference on Multimedia, pp. 771–774 (2015)
Peng, P., Xiang, T., Wang, Y., Pontil, M., Gong, S., Huang, T., Tian, Y.: Unsupervised cross-dataset transfer learning for person re-identification. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1306–1315 (2016)
Zhu. J., Park, T., Isola, P., Efros, A.A.: Unpaired image-to-image translation using cycle-consistent adversarial networks. In: IEEE International Conference on Computer Vision (ICCV), pp. 2242–2251 (2017)
Radford, A., Metz, L., Chintala, S.: Unsupervised representation learning with deep convolutional generative adversarial networks. arxiv:1511.06434 (2016)
Zheng, L., Shen, L., Tian, L., Wang, S., Wang, J., Tian, Q.: Scalable person re-identification: a benchmark. In: IEEE International Conference on Computer Vision (ICCV), pp. 1116–1124 (2015)
Gatys, L.A., Ecker, A.S., Bethge, M.: Image style transfer using convolutional neural networks. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2414–2423 (2016)
Isola, P., Zhu, J.-Y., Zhou, T., Efros, A.A.: Image-to-image translation with conditional adversarial networks. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1125–1134 (2017)
Liu, M.-Y., Tuzel, O.: Coupled generative adversarial networks. In: Neural Information Processing Systems (NIPS), pp. 469–477 (2016)
Taigman, Y., Polyak, A., Wolf, L.: Unsupervised cross-domain image generation. arxiv:1611.02200 (2016)
Zhao, L., Li, X., Zhuang, Y., Wang, J.: Deeply-learned part-aligned representations for person re-identification. In: IEEE International Conference on Computer Vision (ICCV), pp. 3219–3228 (2017)
Lin, Y., Zheng, L., Zheng, Z., Wu, Y., Hu, Z., Yan, C., Yang, Y.: Improving person re-identification by attribute and identity learning. Pattern Recognit. (PR) 95, 151–161 (2019)
Article Google Scholar
He, L., Liang, J., Li, H., Sun, Z.: Deep spatial feature reconstruction for partial person re-identification: alignment-free approach. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7073–7082 (2018)
Zheng, L., Huang, Y., Lu, H., Yang, Y.: Pose-invariant embedding for deep person re-identification. IEEE Trans. Image Process. (TIP) 28(9), 4500–4509 (2019)
Article MathSciNet Google Scholar
Yi, D., Lei, Z., Liao, S., Li, S.Z.: Deep metric learning for person re-identification. In: 2014 22nd International Conference on Pattern Recognition (ICPR), pp. 34–39 (2014)
Li, W., Zhao, R., Xiao, T., Wang, X.: Deepreid: deep filter pairing neural network for person re-identification. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 152–159 (2014)
Figueira, D., Bazzani, L., Minh, H.Q., Cristani, M., Bernardino, A., Murino, V.: Semi-supervised multi-feature learning for person re-identification. In: IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), pp. 111–116 (2013)
Liu, X., Song, M., Tao, D., Zhou. X., Chen, C., Bu, J.: Semi-supervised coupled dictionary learning for person re-identification. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3550–3557 (2014)
Yang, X., Wang, M., Hong, R., Tian, Q., Rui, Y.: Enhancing person re-identification in a self-trained subspace. ACM Trans. Multimed. Comput. Commun. Appl. (TOMCCAP) 13(3), 27:1-27:23 (2017)
Google Scholar
Liu, Y., Song, G., Shao, J., Jin, X., Wang, X.: Transductive centroid projection for semi-supervised large-scale recognition. In: European Conference on Computer Vision (ECCV), pp. 70–86 (2018)
Salimans, T., Goodfellow, I., Zaremba, W., Cheung, V., Radford, A., Chen, X.: Improved techniques for training gans. In: Neural Information Processing Systems (NIPS), pp. 2234–2242 (2016)
Ding, G., Zhang, S., Khan, S., Tang, Z., Zhang, J., Porikli, F.: Feature affinity based pseudo labeling for semi-supervised person re-identification. IEEE Trans. Multimed. (TOM) 21(11), 2891–2902 (2019)
Article Google Scholar
Sun, Y., Zheng, L., Yang, Y., Tian, Q., Wang, S.: Beyond part models: person retrieval with refined part pooling (and a strong convolutional baseline). In: The European Conference on Computer Vision (ECCV), pp. 501–518 (2018)
Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1–9 (2015)
Hu, J., Shen, L., Sun, G.: Squeeze-and-Excitation Networks. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7132–7141 (2018)
Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., Wojna, Z.: Rethinking the inception architecture for computer vision. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2818–2826 (2016)
Felzenszwalb, P.F., Girshick, R.B., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Anal. Mach. Intell. (TPAMI) 32(9), 1627–1645 (2010)
Article Google Scholar
Ristani, E., Solera, F., Zou, R., Cucchiara, R., Tomasi, C.: Performance measures and a data set for multi-target, multi-camera tracking. In: European Conference on Computer Vision (ECCV), pp. 17–35 (2016)
Lee, D.-H.: Pseudo-label: the simple and efficient semi-supervised learning method for deep neural networks. In: Workshop on Challenges in Representation Learning, ICML, pp. 2 (2013)
Vedaldi, A., Lenc, K.: Matconvnet: convolutional neural networks for matlab. In: Proceedings of the 23rd ACM International Conference on Multimedia, pp. 689–692 (2015)
Luo, H., Gu, Y., Liao, X., Lai, S., Jiang, W.: Bags of tricks and a strong baseline for deep person re-identification. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, pp. 0 (2019)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv:1412.6980 (2014)
Ustinova, E., Ganin, Y., Lempitsky, V.: Multi bilinear convolutional neural networks for person re-identification. arXiv:1512.05300 (2015)
Chen, D., Yuan, Z., Chen, B., Zheng, N.: Similarity learning with spatial constraints for person re-identification. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1268–1277 (2016)
Zhang, L., Xiang, T., Gong, S.: Learning a discriminative null space for person re-identification. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1239–1248 (2016)
Varior, R.R., Haloi, M., Wang, G.: Gated siamese convolutional neural network architecture for human re-identification. In: The European Conference on Computer Vision (ECCV), pp. 791–808 (2016)
Barbosa, I.B., Cristani, M., Caputo, B., Rognhaugen, A., Theoharis, T.: Looking beyond appearances: synthetic training data for deep CNNs in re-identification. Comput. Vis. Image Underst. (CVIU) 167, 50–62 (2018)
Article Google Scholar
Zhao, H., Tian, M., Sun, S., Shao, J., Yan, J., Yi, S., Wang, X., Tang, X.: Spindle net: person re-identification with human body region guided feature decomposition and fusion. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 907–915 (2017)
Li, W., Zhu, X., Gong, S.: Harmonious attention network for person re-identification. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2285–2294 (2018)

Download references

Acknowledgements

This work is partly supported by National Natural Science Foundation of China (No. 61973066), Distinguished Creative Talent Program of Liaoning Colleges and Universities (LR2019027), and Fundamental Research Funds for the Central Universities (N182608004, N2004022).

Author information

Authors and Affiliations

Faculty of Robot Science and Engineering, Northeastern University, Shenyang, 110819, China
Shangdong Zhu & Song Wang
College of Information Science and Engineering, Northeastern University, Shenyang, 110819, China
Yunzhou Zhang, Ruilong Li & Shuangwei Liu
Intelligent Systems Research Centre, University of Ulster, Derry, BT52 1SA, UK
Sonya Coleman

Authors

Shangdong Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Yunzhou Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Sonya Coleman
View author publications
You can also search for this author in PubMed Google Scholar
Song Wang
View author publications
You can also search for this author in PubMed Google Scholar
Ruilong Li
View author publications
You can also search for this author in PubMed Google Scholar
Shuangwei Liu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yunzhou Zhang.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhu, S., Zhang, Y., Coleman, S. et al. Semi-supervised learning for person re-identification based on style-transfer-generated data by CycleGANs. Machine Vision and Applications 32, 122 (2021). https://doi.org/10.1007/s00138-021-01239-w

Download citation

Received: 11 October 2019
Revised: 29 January 2021
Accepted: 09 August 2021
Published: 01 October 2021
DOI: https://doi.org/10.1007/s00138-021-01239-w

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Semi-supervised learning for person re-identification based on style-transfer-generated data by CycleGANs

Abstract

Access this article

Similar content being viewed by others

SSD: Single Shot MultiBox Detector

End-to-End Object Detection with Transformers

A survey on Image Data Augmentation for Deep Learning

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Semi-supervised learning for person re-identification based on style-transfer-generated data by CycleGANs

Abstract

Access this article

Similar content being viewed by others

SSD: Single Shot MultiBox Detector

End-to-End Object Detection with Transformers

A survey on Image Data Augmentation for Deep Learning

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation