Advertisement

Deep Convolutional Neural Network for Person Re-identification: A Comprehensive Review

  • Harendra Chahar
  • Neeta Nain
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 703)

Abstract

In video surveillance, person re-identification (re-id) is a popular technique to automatically finding whether a person has been already seen in a group of cameras. In the recent years, availability of large-scale datasets, the deep learning-based approaches have made significant improvement in the accuracy over the years as compared to hand-crafted approaches. In this paper, we have distinguished the person re-id approaches into two categories, i.e., image-based and video-based approaches; deep learning approaches are reviewed in both categories. This paper contains the brief survey of deep learning approaches on both image and video person re-id datasets. We have also presented the current ongoing works, issues, and future directions in large-scale datasets.

Keywords

Person re-identification Convolutional neural network Open-world person re-identification 

References

  1. 1.
    Plantinga, A.: Things and persons. The Review of Metaphysics, pp. 493–519 (1961)Google Scholar
  2. 2.
    D’Orazio, T., Grazia C.: People re-identification and tracking from multiple cameras: A review., In 19th IEEE International Conference on Image Processing (ICIP), pp. 1601–1604 (2012)Google Scholar
  3. 3.
    Bedagkar, G., Apurva, Shishir K.S.: A survey of approaches and trends in person re-identification, Image and Vision Computing, Vol. 32 no. 4, pp. 270–286 (2014)Google Scholar
  4. 4.
    Gong, S., Cristani, M., Yan, S., Loy, C. C.: Person re-identification, Springer, Vol. 1 (2014)Google Scholar
  5. 5.
    Satta, R.: Appearance descriptors for person re-identification: a comprehensive review, arXiv preprint arXiv:1307.5748 (2013)
  6. 6.
    Wang, X.: Intelligent multi-camera video surveillance: A review, Pattern recognition letters, Vol. 34 no. 1, pp. 3–19 (2013)Google Scholar
  7. 7.
    Huang, T., Russell, S.: Object identification in a bayesian context, In IJCAI, Vol. 97, pp. 1276–1282 (1997)Google Scholar
  8. 8.
    Zajdel, W., Zivkovic, Z., Krose, B. J. A.: Keeping track of humans: Have I seen this person before?, In Proceedings of the IEEE International Conference on Robotics and Automation, pp. 2081–2086, IEEE (2005)Google Scholar
  9. 9.
    Bazzani, L., Cristani, M., Perina, A., Farenzena, M., Murino, V.: Multiple-shot person re-identification by hpe signature, In 20th International Conference on Pattern Recognition (ICPR), pp. 1413–1416, IEEE (2010)Google Scholar
  10. 10.
    Farenzena, M., Bazzani, L., Perina, A., Murino, V., Cristani, M.: Person re-identification by symmetry-driven accumulation of local features, In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2360–2367, IEEE (2010)Google Scholar
  11. 11.
    Yi, D., Lei, Z., Liao, S., Li, S. Z.: Deep metric learning for person re-identification, In 22nd International Conference on Pattern Recognition (ICPR), pp. 34–39, IEEE (2014)Google Scholar
  12. 12.
    Li, W., Zhao, R., Xiao, T., Wang, X.: Deepreid: Deep filter pairing neural network for person re-identification, In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 152–159, (2014)Google Scholar
  13. 13.
    Gheissari, N., Sebastian, T. B., Hartley, R.: Person reidentification using spatiotemporal appearance. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Vol. 2, pp. 1528–1535, IEEE (2006)Google Scholar
  14. 14.
    Krizhevsky, A., Sutskever, I., Hinton, G. E.: Imagenet classification with deep convolutional neural networks, In Advances in neural information processing systems, pp. 1097–1105 (2012)Google Scholar
  15. 15.
    Girshick, R., Donahue, J., Darrell, T., Malik, J.: Rich feature hierarchies for accurate object detection and semantic segmentation, In Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 580–587 (2014)Google Scholar
  16. 16.
    Ahmed, E., Jones, M., Marks, T. K.: An improved deep learning architecture for person re-identification, In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3908–3916 (2015)Google Scholar
  17. 17.
    Varior, R. R., Shuai, B., Lu, J., Xu, D., Wang, G.: A siamese long short-term memory architecture for human re-identification, In European Conference on Computer Vision, Springer International Publishing, pp. 135–153 (2016)Google Scholar
  18. 18.
    Varior, R. R., Haloi, M., Wang, G.: Gated siamese convolutional neural network architecture for human re-identification, In European Conference on Computer Vision, Springer International Publishing, pp. 791–808 (2016)Google Scholar
  19. 19.
    Liu, H., Feng, J., Qi, M., Jiang, J., Yan, S.: End-to-end comparative attention networks for person re-identification, arXiv preprint arXiv:1606.04404 (2016)
  20. 20.
    Cheng, D., Gong, Y., Zhou, S., Wang, J., Zheng, N.: Person re-identification by multi-channel parts-based CNN with improved triplet loss function, In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1335–1344 (2016)Google Scholar
  21. 21.
    Su, C., Zhang, S., Xing, J., Gao, W., Tian, Q.: Deep attributes driven multi-camera person re-identification, In European Conference on Computer Vision, Springer International Publishing, pp. 475–491 (2016)Google Scholar
  22. 22.
    Xiao, T., Li, H., Ouyang, W., Wang, X.: Learning deep feature representations with domain guided dropout for person re-identification, In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1249–1258 (2016)Google Scholar
  23. 23.
    Zheng, L., Zhang, H., Sun, S., Chandraker, M., Tian, Q.: Person re-identification in the wild, arXiv preprint arXiv:1604.02531 (2016)
  24. 24.
    Zheng, L., Bie, Z., Sun, Y., Wang, J., Su, C., Wang, S., Tian, Q.: Mars: A video benchmark for large-scale person re-identification, In European Conference on Computer Vision, Springer International Publishing, pp. 868–884 (2016)Google Scholar
  25. 25.
    Wu, L., Shen, C., van den Hengel, A.: Deep linear discriminant analysis on fisher networks: A hybrid architecture for person re-identification, Pattern Recognition (2016)Google Scholar
  26. 26.
    Perronnin, F., Snchez, J., Mensink, T.: Improving the fisher kernel for large-scale image classification, In European conference on computer vision, Springer Berlin Heidelberg, pp. 143–156 (2010)Google Scholar
  27. 27.
    Gray, D., Tao, H.: Viewpoint invariant pedestrian recognition with an ensemble of localized features, In European conference on computer vision, Springer Berlin Heidelberg, pp. 262–275 (2008)Google Scholar
  28. 28.
    Wei-Shi, Z., Shaogang, G., Tao, X.,: Associating groups of people, In Proceedings of the British Machine Vision Conference, pp. 23.1–23.11 (2009)Google Scholar
  29. 29.
    Loy, C. C., Xiang, T., Gong, S.: Multi-camera activity correlation analysis, In IEEE Conference on Computer Vision and Pattern Recognition, pp. 1988–1995, IEEE (2009)Google Scholar
  30. 30.
    Li, W., Zhao, R., Wang, X.: Human reidentification with transferred metric learning, In Asian Conference on Computer Vision, Springer Berlin Heidelberg, pp. 31–44 (2012)Google Scholar
  31. 31.
    Li, W., Wang, X.: Locally aligned feature transforms across views, In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3594–3601 (2013)Google Scholar
  32. 32.
    Roth, P. M., Hirzer, M., Kstinger, M., Beleznai, C., Bischof, H.: Mahalanobis distance learning for person re-identification, In Person Re-Identification, pp. 247–267, Springer (2014)Google Scholar
  33. 33.
    Zheng, L., Shen, L., Tian, L., Wang, S., Wang, J., Tian, Q.: Scalable person re-identification: A benchmark, In Proceedings of the IEEE International Conference on Computer Vision, pp. 1116–1124 (2015)Google Scholar
  34. 34.
    Yi, D., Lei, Z., Liao, S., Li, S. Z.: Deep metric learning for person re-identification, in Proceedings of International Conference on Pattern Recognition, pp. 2666–2672 (2014)Google Scholar
  35. 35.
    Chen, S. Z., Guo, C. C., Lai, J. H.: Deep ranking for person re-identification via joint representation learning, IEEE Transactions on Image Processing, Vol. 25 no.5, pp. 2353–2367 (2016)Google Scholar
  36. 36.
    Wu, L., Shen, C., Hengel, A. V. D.: Personnet: person re-identification with deep convolutional neural networks. arXiv preprint arXiv:1601.07255 (2016)
  37. 37.
    Wang, F., Zuo, W., Lin, L., Zhang, D., Zhang, L.: Joint learning of single-image and cross-image representations for person re-identification, In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1288–1296 (2016)Google Scholar
  38. 38.
    McLaughlin, N., Martinez del Rincon, J., Miller, P.: Recurrent convolutional network for video-based person re-identification, In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1325–1334 (2016)Google Scholar
  39. 39.
    Yan, Y., Ni, B., Song, Z., Ma, C., Yan, Y., Yang, X.: Person re-identification via recurrent feature aggregation, In European Conference on Computer Vision, Springer International Publishing, pp. 701–716 (2016)Google Scholar
  40. 40.
    Fernando, B., Gavves, E., Oramas, J., Ghodrati, A., Tuytelaars, T.: Rank pooling for action recognition, IEEE transactions on pattern analysis and machine intelligence (2016)Google Scholar
  41. 41.
    Wang, P., Cao, Y., Shen, C., Liu, L., Shen, H. T.: Temporal pyramid pooling based convolutional neural networks for action recognition, arXiv preprint arXiv:1503.0122 (2015)
  42. 42.
    Wu, L., Shen, C., Hengel, A. V. D.: Deep recurrent convolutional networks for video-based person re-identification: An end-to-end approach, arXiv preprint arXiv:1606.01609 (2016)
  43. 43.
    Wu, Z., Wang, X., Jiang, Y. G., Ye, H., Xue, X.: Modeling spatial-temporal clues in a hybrid deep learning framework for video classification, In Proceedings of the 23rd ACM international conference on Multimedia, pp. 461–470 (2015)Google Scholar
  44. 44.
    Ess, A., Leibe, B., Van Gool, L.: Depth and appearance for mobile scene analysis, In IEEE 11th International Conference on Computer Vision, pp. 1–8 (2007)Google Scholar
  45. 45.
    Baltieri, D., Vezzani, R., Cucchiara, R.: 3dpes: 3d people dataset for surveillance and forensics, In Proceedings of the 2011 joint ACM workshop on Human gesture and behavior understanding, pp. 59–64 (2011)Google Scholar
  46. 46.
    Hirzer, M., Beleznai, C., Roth, P. M., Bischof, H.: Person re-identification by descriptive and discriminative classification, In Scandinavian conference on Image analysis, Springer Berlin Heidelberg, pp. 91–102 (2011)Google Scholar
  47. 47.
    Wang, T., Gong, S., Zhu, X., Wang, S.: Person re-identification by video ranking, In European Conference on Computer Vision, Springer International Publishing, pp. 688–703 (2014)Google Scholar
  48. 48.
    Zheng, W. S., Gong, S., Xiang, T.: Towards open-world person re-identification by one-shot group-based verification, IEEE transactions on pattern analysis and machine intelligence, Vol. 38 no. 3, pp. 591–606 (2016)Google Scholar
  49. 49.
    Liao, S., Mo, Z., Zhu, J., Hu, Y., Li, S. Z.: Open-set person re-identification, arXiv preprint arXiv:1408.0872 (2014)

Copyright information

© Springer Nature Singapore Pte Ltd. 2018

Authors and Affiliations

  1. 1.Department of Computer Science and EngineeringMalaviya National Institute of TechnologyJaipurIndia

Personalised recommendations