Abstract
Learning with Deep Neural Networks has recently reached state-of-the-art outcomes for Person Re-Identification. Effective learning can be accomplished only with efficient features robust to illumination and viewpoint changes. This paper proposes a new feature representation method called PoolNet Deep Feature (PNDF) for person re-identification with Convolution Neural Networks. The proposed CNN architecture called PoolNet consists of two Pool Added Blocks (PAB) and a Pool Concatenated Block (PCB) to extract the more sophisticated dominant and precise features for better learning towards a person’s re-identification. The efficiency of the proposed method is demonstrated in terms of re-identification accuracy by implementing it on the challenging small scale & large-scale person re-identification datasets such as VIPeR, Market1501, CUHK03, GRID, and LaST.
Similar content being viewed by others
Data availability
The datasets which are analysed during the current study are available in the open repositories such as https://paperswithcode.com/dataset/market-1501, https://paperswithcode.com/dataset/viper, https://github.com/manideep2510/CUHK03_dataset, https://paperswithcode.com/dataset/grid, https://github.com/shuxjweb/last.git
References
Ahmed E, Jones M, Marks TK (2015) An improved deep learning architecture for person re-identification. In: Proceedings of the IEEE conference on computer vision and Pattern Recognition (CVPR). 3908-3916.
Ainam JP, Qin K, Liu G, Luo G (2019) Sparse label smoothing regularization for person re-identification. IEEE Access Translations and content mining:27899–27910
Chen J, Wang Y, Qin J, Liu L, Shao L (2017) Fast person re-identification via cross-camera semantic binary transformation. IEEE Conference on Computer Vision and Pattern Recognition, pp:3873–3882
Chen B, Deng W, Hu J (2019) Mixed high-order attention network for person re-identification. In IEEE/CVF international conference on computer vision (ICCV), pp 371–381.
Chen T, Shaojin D, Jingyi X, Ye Y, Wuyang C, Yang Y, Zhou R, Zhangyang W (2019) Abd-net: attentive but diverse person re-identification. In IEEE/CVF international conference on computer vision (ICCV), pp 8350–8360.
Dai Z, Chen M, Gu X, Zhu S, Tan P (2019) Batch drop block network for person re-identification and beyond. In IEEE/CVF international conference on computer vision (ICCV), pp 3690–3700.
Fu Y, Wei Y, Zhou Y, Shi H, Huang G, Wang X, Yao Z, Huang T (2019) Horizontal pyramid matching for person re-identification. Proceedings of the AAAI Conference on Artificial Intelligence. https://doi.org/10.1609/aaai.v33i01.33018295
Gray D, Tao H (2008) Viewpoint invariant pedestrian recognition with an ensemble of localized features. In: European Conference on Computer Vision. 262–275.
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In proceedings of the IEEE conference on computer vision and pattern recognition, pp 770-778
Hermans A, Beyer L, Leibe B (2017) In defense of the triplet loss for person re-identification. Computer vision and pattern recognition. arXiv preprint arXiv:1703.07737
Hinton GE, Salakhutdinov RR (2006) Reducing the dimensionality of data with neural networks. Science:504–507
Jayapriya K, Jacob IJ, Mary N (2020) Person re-identification using prioritized chromatic texture (PCT) with deep learning. Multimed Tools Appl:29399–29410
Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. In: Proceedings of Advances in Neural Information Processing Systems. 1097–1105
Kumar M, Jindal MK, Kumar M (2021) A novel attack on monochrome and greyscale Devanagari CAPTCHAs. ACM Transactions on Asian and Low-Resource Language Information Processing. https://doi.org/10.1145/3439798
Lee H, Pham P, Largman Y, Andrew NG (2009) Unsupervised feature learning for audio classification using convolutional deep belief networks. In proceedings of the conference on neural information processing systems. 1096-1104.
Li W, Zhao R, Xiao T, Wang X (2014) Deepreid: Deep filter pairing neural network for person re-identification. In: Proceedings of the IEEE conference on computer vision and Pattern Recognition (CVPR). 152–159
Li W, Zhu X, Gong S (2017) Person Re-Identification by Deep Joint Learning of Multi-Loss Classification. Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence (IJCAI-17). https://doi.org/10.48550/arXiv.1705.04724
Li W, Zhu X, Gong S (2018) Harmonious attention network for person re-identification. In proceedings of the IEEE conference on computer vision and pattern recognition, pp 2285-2294.
Li M, Zhu X, Gong S (2020) Unsupervised Tracklet person re-identification. IEEE Transactions on Pattern Analysis and Machine Intelligence. https://doi.org/10.1109/TPAMI.2019.2903058
Liao S, Shao L (2020) Interpretable and generalizable person re-identification with query-adaptive convolution and temporal lifting. In Proceedings of the European Conference on Computer Vision (ECCV), pp 456–474
Liao S, Hu Y, Zhu X, Li SZ (2015) Person re-identification by local maximal occurrence representation and metric learning. In proceedings of the conference on computer vision and pattern recognition, pp 2197-2206.
Lin Y, Zheng L, Zheng Z, Wu Y, Hu Z, Yan C, Yang Y (2017) Improving person re-identification by attribute and identity learning. arXiv preprint arXiv:1703. 07220
Liu F, Zhang L (2019) View confusion feature learning for person re-identification. Proceedings of the IEEE/CVF international conference on computer vision (ICCV), pp 6639-6648.
Liu X, Zhao H, Tian M, Sheng L, Shao J, Shuai Y, Yan J, Wang X (2017) HydraPlus-net: attentive deep features for pedestrian analysis. Proceedings of the IEEE international conference on computer vision, pp 350-359.
Loy CC, Xiang T, Gong S (2010) Time-delayed correlation analysis for multi-camera activity understanding. Int J Comput Vis:106–129
Lu X, Ni B, Ma C, Yang X (2019) Learning transform-aware attentive network for object tracking. Elsevier, Neuro computing, pp 133–144
Lu X, Wang W, Ma C, Shen J, Shao L, Porikli F (2019) See more, know more: unsupervised video object segmentation with co-attention Siamese networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 3623-3632
Luo C, Chen Y, Wang N, Zhang Z (2019) Spectral feature transformation for person re-identification. In IEEE/CVF international conference on computer vision (ICCV), pp 4975–4984.
Luo H, Gu Y, Liao X, Lai S, Jiang W (2019) Bag of tricks and a strong baseline for deep person reidentification. In IEEE/CVF conference on computer vision and pattern recognition workshops (CVPRW), pp 1487–1495.
Martinel N, Foresti GL, Micheloni C (2019) Aggregating deep pyramidal representations for person reidentification. In IEEE/CVF conference on computer vision and pattern recognition workshops (CVPRW), pp 1544–1554.
Opitz M, Waltner G, Poier G, Possegger H, Bischof H (2016) Grid loss: detecting occluded faces. In: Proceedings of the European Conference on Computer Vision. 386-402
Ouyang W, Wang X (2013) Joint deep learning for pedestrian detection. In proceedings of the IEEE conference on computer vision. 2056-2063
Quispe R, Pedrini H (2021) Top-db-net: top drop block for activation enhancement in person re-identification. In international conference on pattern recognition (ICPR). https://doi.org/10.1109/ICPR48806.2021.9412017
Rani J, Augasta MG (2020) A deep learning model for human re-identification with Split LOMO and deep features using XQDA. Test Engineering & Management:13776–13786
Rani J, Augasta MG (2022) Human re-identification by local maximal occurrence of color and scale Invariant Channel integrated statistical pattern. Multimedia Tools and Applications https://doi.org/10.1007/s11042-022-12888-y
Ren QQ, Tian WD, Zhao ZQ (2019) Person re-identification based on feature fusion. Lecture notes in computer science, pp 65–73.
Rena CX, Xub XL, Leic Z (2019) A deep and structured metric learning method for robust person re-identification. Pattern Recogn. https://doi.org/10.1016/j.patcog.2019.106995
Sarfraz MS, Schumann A, Eberle A, Stiefelhagen R (2018) A pose-sensitive embedding for person re-identification with expanded cross neighborhood re-ranking. In proceedings of the IEEE conference on computer vision and pattern recognition, pp 420-429.
Shi H, Zhu X, Liao S, Lei Z, Yang Y, Li SZ (2015) Constrained deep metric learning for person re-identification. Comput Vision Patt Recogni. https://doi.org/10.48550/arXiv.1511.07545
Shu X, Wang X, Zang X, Zhang S, Chen Y, Li G, Qi T (2022) Large-scale Spatio-temporal person re-identification: algorithm and benchmark. IEEE Transactions on Circuits and Systems for Video Technology:4390–4403
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. Comput Vision Patt Recogn. https://doi.org/10.48550/arXiv.1409.1556
Song M, Gong S, Liu C, Ji Y, Dong H (2015) Person re- identification by improved local maximal occurrence with color names. 8th international congress on image and signal processing (CISP). https://doi.org/10.1109/CISP.2015.7407963
Song J, Yang Y, Song YZ, Xiang T (2019) Generalizable Person Re-identification by Domain-Invariant Mapping Network. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp 719–728
Stamatios L (2018) Universal Denoising networks: a novel CNN architecture for image Denoising. Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3204-3213.
Su C, Zhang S, Xing J, Gao W, Tian Q (2016) Deep attributes driven multi-camera person re-identification. European Conference on Computer Vision pp:475–491
Sun Y, Zheng L, Yang Y, Tian Q, Wang S (2018) Beyond part models: person retrieval with refined part pooling (and a strong convolutional baseline). In proceedings of the European conference on computer vision (ECCV), pp 501–518.
Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition. 1-9.
Tang Z, Huang J (2022) Harmonious multi-branch network for person re-identification with harder triplet loss. ACM Trans Multimed Comput Commun Appl. https://doi.org/10.1145/3501405
Tao D, Guo Y, Yu B, Pang J, Yu Z (2017) Deep multi-view feature learning for person re-identification. In IEEE Transactions on Circuits and Systems for Video Technology https://doi. https://doi.org/10.1109/TCSVT.2017.2726580
Wang N, Yeung DY (2013) Learning a deep compact image representation for visual tracking. In: Proceedings of Advances in Neural Information Processing Systems
Wang G, Yuan Y, Chen X, Li J, Zhou X (2018) Learning discriminative features with multiple granularities for person re-identification. In ACM Multimedia, pp:274–282
Wang G, Yang S, Liu H, Wang Z, Yang Y, Wang S, Yu G, Zhou E, Sun J (2020) High-order information matters: learning relation and topology for occluded person re-identification. In proceedings of the IEEE/CVF conference on computer vision and pattern recognition (CVPR), pp 6448–6457.
Wang G, Gong S, Cheng J, Hou Z (2020) Faster person re-identification. In Proceedings of the European Conference on Computer Vision (ECCV), pp 275–292
Wu L, Shen C, Hengel A (2016) PersonNet: person re-identification with deep convolutional neural networks. Comput Vision Pattern Recogn. https://doi.org/10.48550/arXiv.1601.07255
Wu S, Chen YC, Li X, Wu AC, You JJ, Zhen WS (2016) An enhanced deep feature representation for person re-identification. IEEE winter conference on applications of computer vision, pp 1-8
Wu D, Zheng SJ, Bao WZ, Zhang XP, Yuan CA, Huang DS (2019) A novel deep model with multi-loss and efficient training for person re-identification. Neurocomputing:69–75
Yang F, Yan K, Lu S, Jia H, Xie X, Gao W (2019) Attention driven person re-identification. Patt Recog:143–155
Yi D, Lei Z, Liao S, Li SZ (2014) Deep metric learning for person re-identification. In proceedings of the International Conference on Pattern Recognition. 34-39
Yu HX, Zheng WS, Wu A, Guo X, Gong S, Lai JH (2019) Unsupervised person re-identification by soft multilabel learning. In proceedings of the IEEE international conference on computer vision and pattern recognition (CVPR), pp 2148-2157.
Yu HX, Wu A, Zheng WS (2020) Unsupervised person re-identification by deep asymmetric metric embedding. IEEE Transactions on Pattern Analysis and Machine Intelligence https://doi.org/10.1109/TPAMI.2018.2886878
Zhang X, Luo H, Fan X, Xiang W, Sun Y, Xiao Q, Jiang W, Zhang C, Sun J (2017) AlignedReID: surpassing human-level performance in person re-identification. arXiv preprint arXiv:1711.08184
Zhang J, Yuan Y, Wang Q (2019) Night Person Re-Identification and a Benchmark. https://doi.org/10.1109/ACCESS.2019.2929854
Zhao H, Tian M, Sun S, Shao J, Yan J, Yi S, Wang X, Tang X (2017) Spindle net: person re-identification with human body region guided feature decomposition and fusion. In proceedings of the IEEE conference on computer vision and pattern recognition, pp 1077-1085
Zhao L, Li X, Zhuang Y, Wang J (2017) Deeply-learned part-aligned representations for person re-identification. In IEEE international conference on computer vision, pp 3219–3228.
Zheng X, Ouyang W, Wang X (2013) Multi-stage contextual deep learning for pedestrian detection. In proceedings of the IEEE conference on computer vision. 121-128
Zheng L, Shen L, Tian L, Wang S, Wang J, Tian Qi (2015) Scalable person re-identification: A benchmark. In Proceedings of the IEEE International Conference on Computer Vision, pp 1116–1124
Zhong Z, Zheng L, Cao D, Li S (2017) Re-ranking person re-identification with k-reciprocal encoding. IEEE Conference on Computer Vision and Pattern Recognition, pp:1318–1327
Zhou B, Agata L, Xiao J, Antonio T, Aude O (2015) Learning deep features for scene recognition using places database. In: Proceedings of the Conference on neural information processing systems. 487-495
Zhou S, Wang J, Meng D, Liang Y, Gong Y, Zheng N (2019) Discriminative feature learning with foreground attention for person re-identification. IEEE Trans Image Process:4671–4684
Zhou K, Yang Y, Cavallaro A, Xiang T (2021) Learning Generalisable Omni-scale representations for person re-identification. IEEE Trans Pattern Anal Mach Intell:5056–5069
Acknowledgements
The authors are very grateful to the editors and reviewers of MTAP for their valuable and prolific suggestions to refine the quality of the paper.
Funding
No funds, grants, or other support was received for the preparation of this manuscript.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interest
The authors declare that they have no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Rani, J.S.J., Augasta, M.G. PoolNet deep feature based person re-identification. Multimed Tools Appl 82, 24967–24989 (2023). https://doi.org/10.1007/s11042-023-14364-7
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-023-14364-7