Skip to main content
Log in

DSF-net: occluded person re-identification based on dual structure features

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

In many person re-identification application scenarios, such as supermarkets, subway stations, and streets, it is necessary to solve an occlusion problem. We propose a dual branch named Dual structural Features Network to solve the problem. Our method obtains features embedding from two types of structural data, that is, Euclidean structural data and non-Euclidean structural data. We argue that the features of these two structures are equally important to ease the occlusion problem. In our first branch, we introduce a Position Attention Drop Block to extract the Euclidean structural feature. This branch focuses on the information that pixels can represent within a certain receptive field. In order to better target our network at the occlusion problem, we propose a drop method based on the pixel attention score of the person image, in which the area with the highest score is lost. In this way, we design a network that pays more attention to other more detailed information. In the other branch, a new U-shaped Residual Graph Convolutional Network is used to extract the features from non-Euclidean structural data, which is an effective multilayer graph convolution. We argue that good non-Euclidean structural data can express more topological correlation information, thereby reducing the interference of the occluded part. From the experimental results, we have achieved competitive performance with state-of-the-art methods, and our method is especially effective for person re-identification with occluded body parts.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

Data availability

The datasets used in this paper are public datasets. The data that support the fundings of the study are available from the corresponding author upon reasonable request.

References

  1. Zhu J, Zeng H, Liao S et al (2018) Deep hybrid similarity learning for person re-identification. IEEE Trans Circuits Syst Video Technol 28(11):3183–3193. https://doi.org/10.1109/TCSVT.2017.2734740

    Article  Google Scholar 

  2. Li W, Zhu X, Gong S (2018b) Harmonious attention network for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2285–2294

  3. Ahmed E, Jones M, Marks TK (2015) An improved deep learning architecture for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR)

  4. Hermans A, Beyer L, Leibe B (2017) In defense of the triplet loss for person re-identification. arXiv preprint arXiv:1703.07737

  5. Luo H, Gu Y, Liao X, et al (2019) Bag of tricks and a strong baseline for deep person re-identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops, pp. 0–0

  6. Zhong Z, Zheng L, Cao D, et al (2017) Re-ranking person re-identification with k-reciprocal encoding. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 1318–1327

  7. Ristani E, Solera F, Zou R, et al (2016a) Performance measures and a data set for multi-target, multi-camera tracking. In: European conference on computer vision. Springer, pp. 17–35

  8. Zheng L, Shen L, Tian L, et al (2015a) Scalable person re-identification: a benchmark. In: Proceedings of the IEEE international conference on computer vision, pp. 1116–1124

  9. Huang H, Yang W, Chen X, et al (2018b) Eanet: enhancing alignment for cross-domain person re-identification. arXiv preprint arXiv:1812.11369

  10. Fu Y, Wei Y, Wang G, et al (2019) Self-similarity grouping: A simple unsupervised cross domain adaptation approach for person re-identification. In: Proceedings of the IEEE/CVF international conference on computer vision, pp. 6112–6121

  11. Chen Y, Zhu X, Gong S (2019) Instance-guided context rendering for cross-domain person re-identification. In: Proceedings of the IEEE/CVF international conference on computer vision, pp. 232–242

  12. Huang Z, Wang Z, Tsai CC et al (2021) Dotscn: group re-identification via domain-transferred single and couple representation learning. IEEE Trans Circuits Syst Video Technol 31(7):2739–2750. https://doi.org/10.1109/TCSVT.2020.3031303

    Article  Google Scholar 

  13. Zhang X, Luo H, Fan X, et al (2017) Alignedreid: Surpassing human-level performance in person re-identification. arXiv preprint arXiv:1711.08184

  14. Yang Q, Wu A, Zheng WS (2019) Person re-identification by contour sketch under moderate clothing change. In: IEEE transactions on pattern analysis and machine intelligence

  15. Miao J, Wu Y, Liu P, et al (2019) Pose-guided feature alignment for occluded person re-identification. In: Proceedings of the IEEE/CVF international conference on computer vision, pp. 542–551

  16. Zhuo J, Chen Z, Lai J, et al (2018) Occluded person re-identification. In: 2018 IEEE international conference on multimedia and expo (ICME). IEEE, pp. 1–6

  17. Luo H, Jiang W, Gu Y et al (2020) A strong baseline and batch normalization neck for deep person re-identification. IEEE Trans Multimed 22(10):2597–2609. https://doi.org/10.1109/TMM.2019.2958756

    Article  Google Scholar 

  18. Wang G, Yuan Y, Chen X, et al (2018) Learning discriminative features with multiple granularities for person re-identification. In: Proceedings of the 26th ACM international conference on Multimedia

  19. Gong X, Yao Z, Li X, et al (2021) Lag-net: Multi-granularity network for person re-identification via local attention system. In: IEEE transactions on multimedia pp 1–1. https://doi.org/10.1109/TMM.2021.3050082

  20. Sun Y, Zheng L, Yang Y, et al (2018) Beyond part models: person retrieval with refined part pooling (and a strong convolutional baseline). In: Proceedings of the European conference on computer vision (ECCV), pp. 480–496

  21. Gao S, Wang J, Lu H, et al (2020) Pose-guided visible part matching for occluded person reid. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 11744–11752

  22. Yang J, Zhang C, Tang Y et al (2022) Pafm: pose-drive attention fusion mechanism for occluded person re-identification. Neural Comput Appl 34(10):8241–8252

    Article  Google Scholar 

  23. Zhang L, Jiang N, Diao Q, et al (2022) Person re-identification with pose variation aware data augmentation. In: Neural computing and applications, pp. 1–14

  24. Wang G, Yang S, Liu H, et al (2020) High-order information matters: Learning relation and topology for occluded person re-identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 6449–6458

  25. Pan H, Bai Y, He Z et al (2022) Aagcn: adjacency-aware graph convolutional network for person re-identification. Knowl Based Syst 236(107):300

    Google Scholar 

  26. Zheng L, Yang Y, Hauptmann AG (2016) Person re-identification: past, present and future. arXiv preprint arXiv:1610.02984

  27. Alemu LT, Pelillo M, Shah M (2019) Deep constrained dominant sets for person re-identification. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 9855–9864

  28. Chen H, Wang Y, Shi Y, et al (2018) Deep transfer learning for person re-identification. In: 2018 IEEE fourth international conference on multimedia big data (BigMM), IEEE, pp 1–5

  29. Zhang S, Wen L, Bian X, et al (2018) Occlusion-aware r-cnn: Detecting pedestrians in a crowd. In: Proceedings of the European conference on computer vision (ECCV), pp. 637–653

  30. Pang Y, Xie J, Khan MH, et al (2019) Mask-guided attention network for occluded pedestrian detection. In: Proceedings of the IEEE/CVF international conference on computer vision, pp. 4967–4975

  31. Liu S, Huang D, Wang Y (2019) Adaptive nms: Refining pedestrian detection in a crowd. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 6459–6468

  32. Zheng WS, Li X, Xiang T, et al (2015b) Partial person re-identification. In: Proceedings of the IEEE international conference on computer vision, pp. 4678–4686

  33. He L, Liang J, Li H, et al (2018a) Deep spatial feature reconstruction for partial person re-identification: Alignment-free approach. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 7073–7082

  34. Fan X, Luo H, Zhang X, et al (2018) Scpnet: Spatial-channel parallelism network for joint holistic and partial person re-identification. In: Asian conference on computer vision, Springer, pp. 19–34

  35. Sun Y, Xu Q, Li Y, et al (2019) Perceive where to focus: learning visibility-aware part-level features for partial person re-identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 393–402

  36. Zhao Y, Zhu S, Wang D, et al (2022) Short range correlation transformer for occluded person re-identification. In: Neural computing and applications, pp. 1–13

  37. Niepert M, Ahmed M, Kutzkov K (2016) Learning convolutional neural networks for graphs. In: International conference on machine learning, PMLR, pp. 2014–2023

  38. Bronstein MM, Bruna J, LeCun Y et al (2017) Geometric deep learning: going beyond euclidean data. IEEE Signal Process Mag 34(4):18–42

    Article  Google Scholar 

  39. Kipf TN, Welling M (2016) Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907

  40. Srivastava N, Hinton G, Krizhevsky A et al (2014) Dropout: a simple way to prevent neural networks from overfitting. J Mach Learn Res 15(1):1929–1958

    MATH  Google Scholar 

  41. Ghiasi G, Lin TY, Le QV (2018) Dropblock: A regularization method for convolutional networks. arXiv preprint arXiv:1810.12890

  42. Dai Z, Chen M, Gu X, et al (2019) Batch dropblock network for person re-identification and beyond. In: Proceedings of the IEEE/CVF international conference on computer vision, pp. 3691–3701

  43. Fu J, Liu J, Tian H et al (2019). Dual attention network for scene segmentation. https://doi.org/10.1109/CVPR.2019.00326

  44. Li S, Bak S, Carr P, et al (2018a) Diversity regularized spatiotemporal attention for video-based person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 369–378

  45. Wu D, Ye M, Lin G, et al (2021) Person re-identification by context-aware part attention and multi-head collaborative learning. In: IEEE transactions on information forensics and security

  46. Chen P, Liu W, Dai P, et al (2021) Occlude them all: Occlusion-aware attention network for occluded person re-id. In: Proceedings of the IEEE/CVF international conference on computer vision, pp. 11833–11842

  47. Tompson J, Goroshin R, Jain A, et al (2015) Efficient object localization using convolutional networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 648–656

  48. Szegedy C, Vanhoucke V, Ioffe S, et al (2016) Rethinking the inception architecture for computer vision. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2818–2826

  49. Wen Y, Zhang K, Li Z, et al (2016) A discriminative feature learning approach for deep face recognition. In: European conference on computer vision. Springer, pp. 499–515

  50. Cao Z, Hidalgo G, Simon T et al (2019) Openpose: realtime multi-person 2d pose estimation using part affinity fields. IEEE Trans Pattern Anal Mach Intell 43(1):172–186

    Article  Google Scholar 

  51. Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 7132–7141

  52. Zhong Z, Zheng L, Kang G, et al (2020) Random erasing data augmentation. In: Proceedings of the AAAI conference on artificial intelligence, pp. 13001–13008

  53. He K, Zhang X, Ren S, et al (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 770–778

  54. Zheng Z, Zheng L, Yang Y (2017) Unlabeled samples generated by gan improve the person re-identification baseline in vitro. In: Proceedings of the IEEE international conference on computer vision, pp. 3754–3762

  55. Ristani E, Solera F, Zou R, et al (2016b) Performance measures and a data set for multi-target, multi-camera tracking. In: European conference on computer vision workshop on benchmarking multi-target tracking

  56. Li W, Zhao R, Xiao T, et al (2014) Deepreid: Deep filter pairing neural network for person re-identification. In: CVPR

  57. Zhao L, Li X, Zhuang Y, et al (2017) Deeply-learned part-aligned representations for person re-identification. In: Proceedings of the IEEE international conference on computer vision, pp. 3219–3228

  58. He L, Wang Y, Liu W, et al (2019) Foreground-aware pyramid reconstruction for alignment-free occluded person re-identification. In: Proceedings of the IEEE/CVF international conference on computer vision, pp. 8450–8459

  59. He L, Sun Z, Zhu Y, et al (2018b) Recognizing partial biometric patterns. arXiv preprint arXiv:1810.07399

  60. Huang H, Li D, Zhang Z, et al (2018a) Adversarially occluded samples for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 5098–5107

  61. Zhong Z, Zheng L, Cao D, et al (2017) Re-ranking person re-identification with k-reciprocal encoding. In: 2017 IEEE conference on computer vision and pattern recognition (CVPR), pp. 3652–3661, https://doi.org/10.1109/CVPR.2017.389

  62. Yang J, Zhang J, Yu F, et al (2021) Learning to know where to see: a visibility-aware approach for occluded person re-identification. In: Proceedings of the IEEE/CVF international conference on computer vision, pp. 11885–11894

  63. Wang P, Ding C, Shao Z, et al (2022) Quality-aware part models for occluded person re-identification. arXiv preprint arXiv:2201.00107

  64. Zheng F, Deng C, Sun X, et al (2019) Pyramidal person re-identification via multi-loss dynamic training. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp. 8514–8522

  65. Wang G, Lai J, Huang PY, et al (2019) Spatial-temporal person re-identification. ArXiv abs/1812.03282

  66. Wieczorek M, Rychalska B, Dabrowski J (2021) On the unreasonable effectiveness of centroids in image retrieval. In: ICONIP

  67. Qi L, Huo J, Wang L, et al (2018a) Maskreid: a mask based deep ranking neural network for person re-identification. ArXiv abs/1804.03864

  68. Yin J, Xie J, Ma Z, Guo J (2022) Mpccl: multiview predictive coding with contrastive learning for person re-identification. Pattern Recognit 129:108710

    Article  Google Scholar 

  69. Qi L, Huo J, Wang L, et al (2018b) Maskreid: a mask based deep ranking neural network for person re-identification. arXiv preprint arXiv:1804.03864

  70. Wattenberg M, Viégas F, Johnson I (2016) How to use t-sne effectively. Distill https://doi.org/10.23915/distill.00002, http://distill.pub/2016/misread-tsne

Download references

Acknowledgements

This work is partly supported by The National Natural Science Foundation of China (No. 61876158) and the Fundamental Research Funds for the Central Universities (2682021ZTPY030).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xun Gong.

Ethics declarations

Conflict of interest

The research described in the article was performed with funding from The National Natural Science Foundation of China. Dr. Gong is a principal of the Foundation. However, none of these foundation is described or presented in this article. The foundation is used to promote the development of basic research in the natural sciences. The Institute receives research grants from The National Natural Science Foundation of China. However, there are no other conflicts of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Fan, Y., Gong, X. & He, Y. DSF-net: occluded person re-identification based on dual structure features. Neural Comput & Applic 35, 3537–3550 (2023). https://doi.org/10.1007/s00521-022-07927-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-022-07927-6

Keywords

Navigation