Skip to main content

Person Re-identification: System Design and Evaluation Overview

  • Chapter
  • First Online:
Person Re-Identification

Part of the book series: Advances in Computer Vision and Pattern Recognition ((ACVPR))

Abstract

Person re-identification has important applications in video surveillance. It is particularly challenging because observed pedestrians undergo significant variations across camera views, and there are a large number of pedestrians to be distinguished given small pedestrian images from surveillance videos. This chapter discusses different approaches of improving the key components of a person re-identification system, including feature design, feature learning, and metric learning, as well as their strength and weakness. It provides an overview of various person re-identification systems and their evaluation on benchmark datasets. Multiple benchmark datasets for person re-identification are summarized and discussed. The performance of some state-of-the-art person identification approaches on benchmark datasets is compared and analyzed. It also discusses a few future research directions on improving benchmark datasets, evaluation methodology, and system design.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Abdel-Hakim, A.E., Farag, A.A.: Csift: A sift descriptor with color invariant characteristics. In: Proceedings of European Conference Computer Vision, (2006)

    Google Scholar 

  2. Baltieri, D., Vezzani, R., Cucchiara, R.: 3dpes: 3d people dataset for surveillance and forensics. In: Proceedings of the 1st International ACM Workshop on Multimedia Access to 3D Human Objects (2011)

    Google Scholar 

  3. Barbosa, B.I., Cristani, M., Del Bue, A., Bazzani, L., Murino, V.: Re-identification with rgb-d sensors. In: First International Workshop on Re-Identification, (2012)

    Google Scholar 

  4. Bazzani, L., Cristani, M., Murino, V.: Symmetry-driven accumulation of local features for human characterization and re-identification. Comput. Vis. Image Underst. 117(2), 130–144 (2013)

    Google Scholar 

  5. Belongie, S., Malik, J., Puzicha, J.: Shape matching and object recognition using shape contexts. IEEE Trans. Pattern Anal. Mach. Intell. 24, 509–512 (2002)

    Google Scholar 

  6. Bengio, Y.: Learning Deep Architectures for AI. Now Publishers, Hanover (2009)

    Google Scholar 

  7. Bialkowski, A., Denman, S., Sridharan, S., Fookes, C., Lucey, P.: A database for person re-identification in multi-camera surveillance networks. In Proceedings of International Conference on Digital Image Computing-Techniques and Applications, (2012)

    Google Scholar 

  8. Bo, Y., Fowlkes, C.C.: Shape-based pedestrian parsing. In: Proceedings of the IEEE International Conference on Computer Vision and, Pattern Recognition, (2011)

    Google Scholar 

  9. Cai, Y., Chen, W., Huang, K., Tan, T.: Continuously tracking objects across multiple widely separated cameras. In: Asian Conference on Computer Vision, (2007)

    Google Scholar 

  10. Cheng, D., Cristani, M., Stoppa, M., Bazzani, L., Murino, V.: Custom pictorial structures for re-identification. In: Proceedings of the British Machine Vision Conference, (2011)

    Google Scholar 

  11. Cheng, E.D., Piccardi, M.: Matching of objects moving across disjoint cameras. In: Proceedings of the IEEE International Conference on Image Processing (2006)

    Google Scholar 

  12. Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, (2005)

    Google Scholar 

  13. Daugman, J.G.: Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters. J. Opt. Soc. Am. A 2, 1160–1169 (1985)

    Article  Google Scholar 

  14. Davis, J., Kulis, B., Jain, P., Sra, S., Dhillon, I.: Information theoretic metric learning. In: Proceedings of the International Conference on Machine Learning, (2007)

    Google Scholar 

  15. Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: Imagenet: A large-scale hierarchical image database. In: Proceedings of IEEE International Conference on Computer Vision and, Pattern Recognition, (2009)

    Google Scholar 

  16. Dikmen, M., Akbas, E., Huang, T.S., Ahuja, N.: Pedestrian recognition with a learned metric. In: Asian Conference on Computer Vision, (2010)

    Google Scholar 

  17. Eslami, S.M.A., Williams, C.K.I.: A generative model for parts-based object segmentation. In: Proceedings of the Neural Information Processing Systems, (2012)

    Google Scholar 

  18. Farhadi, A., Endres, I., Hoiem, D., Forsyth, D.: Describing objects by their attributes. In: Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition, (2009)

    Google Scholar 

  19. Felzenszwalb, P., McAllester, D., Ramanan, D.: A discriminatively trained, multiscale, deformable part model. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, (2008)

    Google Scholar 

  20. Gheissari, N., Sebastian, T.B., Rittscher, J., Hartley, R.: Person reidentification using spatiotemporal appearance. In: Proceedings of IEEE International Conference on Computer Vision and, Pattern Recognition, (2006)

    Google Scholar 

  21. Gray, D., Brennan, S., Tao, H.: Evaluating appearance models for recognition, reacquisition, and tracking. In: Proceedings of the IEEE International Workshop on Performance Evaluation of Tracking and Surveillance, (2007)

    Google Scholar 

  22. Gray, D., Tao, H.: Viewpoint invariant pedestrian recognition with an ensemble of localized features. In: Proceedings of the European Conference on Computer Vision, (2008)

    Google Scholar 

  23. Guo, Y., Rao, C., Samarasekera, S., Kim, J., Kumar, R., Sawhney, H.: Matching vehicles under large pose transformations using approximate 3d models and piecewise mrf model. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, (2008)

    Google Scholar 

  24. Hamdoun, O., Moutarde, F., Stanciulescu, B., Steux, B.: Person re-identification in multi-camera system by signature based on interest point descriptors collected on short video sequences. In: Proceedings of IEEE Conference on Distributed Smart Cameras, (2008)

    Google Scholar 

  25. Hirzer, M., Beleznai, C., Roth, P.M., Bischof, H.: Person re-identification by descriptive and discriminative classification. In: Proceedings of the Scandinavian Conference on Image, Analysis, (2011)

    Google Scholar 

  26. Hirzer, M., M., R.P., Kostinger, M., Bischof: Relaxed pairwise learned metric for person re-identification. In: Proceedings of the European Conference on Computer Vision, (2012)

    Google Scholar 

  27. Huang, G., Ramesh, M., Berg, T., Learned-Miller, E.: Labeled faces in the wild: A database for studying face recognition in unconstrained environments. University of Massachusetts, Amherst, Tech. rep. (2007)

    Google Scholar 

  28. Huang, J., Kumar, S.R., Mitra, M., Zhu, M., Zabih, R.: Image indexing using color correlograms. In: Proceedings of the IEEE International Conference on Computer Vision and, Pattern Recognition, (1997)

    Google Scholar 

  29. Javed, O., Rasheed, Z., Shafique, K., Shah, M.: Tracking across multiple cameras with disjoint views. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, (2003)

    Google Scholar 

  30. Javed, O., Shafique, K., Shah, M.: Appearance modeling for tracking in multiple non-overlapping cameras. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, (2005)

    Google Scholar 

  31. Jurie, F., Mignon, A.: Pcca: a new approach for distance learning from sparse pairwise constraints. In: Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition, (2012)

    Google Scholar 

  32. Khan, S., Shah, M.: Consistent labeling of tracked objects in multiple cameras with overlapping fields of view. IIEEE Trans. Pattern Anal. Mach. Intell. 25, 1355–1360 (2003)

    Google Scholar 

  33. Kostinger, M., Hirzer, M., Wohlhart, P., Roth, P., Bischof, H.: Large scale metric learning from equivalence constraints. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, (2011)

    Google Scholar 

  34. Krizhevsky, A., Sutskever, I., Hinton, G.: Imagenet classification with deep convolutional neural networks. In: Proceedings of the Neural Information Processing Systems, (2012)

    Google Scholar 

  35. Layne, R., Hospedales, T., Gong, S.: Person re-identification by attributes. In: Proceedings of the British Machine Vision Conference, (2012)

    Google Scholar 

  36. Li, W., Wang, X.: Locally aligned feature transforms across views. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, (2013)

    Google Scholar 

  37. Li, W., Zhao, R., Wang, X.: Human reidentification with transferred metric learning. In: Asian Conference on Computer Vision, (2012)

    Google Scholar 

  38. Lin, Z., Davis, L.: Learning pairwise dissimilarity profiles for appearance recognition in visual surveillance. In: Proceedings of the International Symposium on Advances in Visual Computing, (2008)

    Google Scholar 

  39. Liu, C., Gong, S., Loy, C.C., Lin, X.: Person re-identification: What features are important? In: Proceedings of the First International Workshop on Re-Identification (2012)

    Google Scholar 

  40. Lowe, D.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vision 60(2), 91–110 (2004)

    Article  Google Scholar 

  41. Loy, C.C., Xiang, T., Gong, S.: Multi-camera activity correlation analysis. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, (2009)

    Google Scholar 

  42. Loy, C.C., Xiang, T., Gong, S.: Multi-camera activity correlation analysis. Int. J. Comput. Vision 90, 106–129 (2010)

    Article  Google Scholar 

  43. Ma, B., Su, Y., Jurie, F.: Bicov: a novel image representation for person re-identification and face verification. In: Proceedings of the British Machine Vision Conference, (2012)

    Google Scholar 

  44. Ma, B., Su, Y., Jurie, F.: Local descriptors encoded by fisher vectors for person re-identification. In: Proceedings of the First International Workshop on Re-identification, (2012)

    Google Scholar 

  45. Mignon, A., Jurie, F.: Pcca: A new approach for distance learning from sparse pairwise constraints. In: Proceedings of the IEEE Internatonal Conference on Computer Vision and Pattern Recognition, (2012)

    Google Scholar 

  46. Mittal, A., Davis, L.S.: M2tracker: a multi-view approach to segmenting and tracking people in a cluttered scene. Int. J. Comput. Vision 51, 189–203 (2003)

    Article  Google Scholar 

  47. Nakajima, C., Pontil, M., Heisele, B., Poggio, T.: Full-body recognition system. Pattern Recognit. 36, 1977–2006 (2003)

    Google Scholar 

  48. Ojala, T., Pietikäinen, M., Mäenpää, T.: Multiresolution gray-scale and rotation invariant texture classification with local binary patterns, IEEE Trans. Pattern Anal. Mach. Intell. 24(7):971–987 (2002)

    Google Scholar 

  49. Orwell, J., Remagnino, P., Jones, G.A.: Multiple camera color tracking. In: Proceedings of the IEEE Workshop on Visual Surveillance, (1999)

    Google Scholar 

  50. Ouyang, W., Wang, X.: A discriminative deep model for pedestrian detection with occlusion handling. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, (2012)

    Google Scholar 

  51. Park, U., Jain, A., Kitahara, I., Kogure, K., Hagita, N.: Vise: Visual search engine using multiple networked cameras. In: Proceedings of the IEEE International Conference on Pattern Recognition, (2006)

    Google Scholar 

  52. Porikli, F.: Inter-camera color calibration by correlation model function. In: Proceedings of the IEEE International Conference on Image Processing, (2003)

    Google Scholar 

  53. Prosser, B., Gong, S., Xiang, T.: Multi-camera matching using bi-directional cumulative brightness transfer function. In: Proceedings of the British Machine Vision Conference, (2008)

    Google Scholar 

  54. Prosser, B., Zheng, W., Gong, S., Xiang, T.: Person re-identification by support vector ranking. In: Proceedings of the British Machine Vision Confernce, (2010)

    Google Scholar 

  55. Rauschert, I., Collins, R.T.: A generative model for simultaneous estimation of human body shape and pixel-level segmentation. In: Proceedings of the European Conference on Computer Vision (2012)

    Google Scholar 

  56. Savarese, S., Winn, J., Criminisi, A.: Discriminative object class models of appearance and shape by correlatons. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, (2006)

    Google Scholar 

  57. Schwartz, W., Davis, L.: Learning discriminative appearance-based models using partial least sqaures. In: Proceedings of the XXII SIBGRAPI, (2009)

    Google Scholar 

  58. Shan, Y., Sawhney, H., Kumar, R.: Vehicle identification between non-overlapping cameras without direct feature matching. In: Proceedings of the IEEE International Conference on Computer Vision, (2005)

    Google Scholar 

  59. Slater, D., Healey, G.: The illumination-invariant recognition of 3d objects using local color invariants. IEEE Trans. Pattern Anal. Mach. Intell. 18:206–210 (1996)

    Google Scholar 

  60. Tian, Y., Zitnick, C.L., Narasimhan, S.G.: Exploring the spatial hierarchy of mixture models for human pose estimation. In: Proceedings of the European Conference on Computer Vision, (2012)

    Google Scholar 

  61. Tuzel, O., Porikli, F., Meer, P.: Region covariance: a fast descriptor for detection and classification. In: Proceedings of the European Conference on Computer Vision, (2006)

    Google Scholar 

  62. Varma, M., Zisserman, A.: A statistical approach to texture classification from single images. Int. J. Comput. Vision 62, 61–81 (2005)

    Google Scholar 

  63. Wang, M., Li, W., Wang, X.: Transferring a generic pedestrian detector towards specific scenes. In: Proceedings of the IEEE International Conference on Computer Vision and, Pattern Recognition, (2012)

    Google Scholar 

  64. Wang, X., Doretto, G., Sebastian, T., Rittscher, J., Tu, P.: Shape and appearance context modeling. In: Proceedings of the IEEE International Conference on Computer Vision, (2007)

    Google Scholar 

  65. Wang, X., Qiu, S., Liu, K., Tang, X.: Web image re-ranking using query-specific semantic signatures. IEEE Trans. Pattern Anal. Mach. Intell. 34(3):436–450 (2013)

    Google Scholar 

  66. Weijer, J., Schmid, C.: Coloring local feature extraction. In: Proceedings of the European Conference on Computer Vision, (2006)

    Google Scholar 

  67. Weinberger, K., Blitzer, J., Saul, L.: Distance metric learning for large margin nearest neighbor classification. In: Proceedings of the Neural Information Processing Systems, (2006)

    Google Scholar 

  68. Winn, J., Criminisi, A., Minka, T.: Object categorization by learned universal visual dictionary. In: Proceedings of the IEEE International Conference on Computer Vision, (2005)

    Google Scholar 

  69. Yang, Y., Ramanan, D.: Articulated pose estimation using flexible mixtures of parts. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, (2011)

    Google Scholar 

  70. Yilmaz, A., Javed, O., Shah, M.: Object tracking: a survey. ACM Comput. Surv. 38, 1–45 (2006)

    Article  Google Scholar 

  71. Yin, Q., Tang, X., Sun, J.: An associate-predict model for face recognition. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, (2011)

    Google Scholar 

  72. Zhao, R., Ouyang, W., Wang, X.: Unsupervised salience learning for person re-identification. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, (2013)

    Google Scholar 

  73. Zhao, T., Nevatia, R., Wu, B.: Segmentation and tracking of multiple humans in crowded environments. IEEE Trans. Pattern Anal. Mach. Intell. 30:1198–1211 (2008)

    Google Scholar 

  74. Zheng, W., Gong, S., Xiang, T.: Associating groups of people. In: Proceedings of the British Machine Vision Conference, (2009)

    Google Scholar 

  75. Zheng, W., Gong, S., Xiang, T.: Person re-identification by probabilistic relative distance comparison. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xiaogang Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer-Verlag London

About this chapter

Cite this chapter

Wang, X., Zhao, R. (2014). Person Re-identification: System Design and Evaluation Overview. In: Gong, S., Cristani, M., Yan, S., Loy, C. (eds) Person Re-Identification. Advances in Computer Vision and Pattern Recognition. Springer, London. https://doi.org/10.1007/978-1-4471-6296-4_17

Download citation

  • DOI: https://doi.org/10.1007/978-1-4471-6296-4_17

  • Published:

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-4471-6295-7

  • Online ISBN: 978-1-4471-6296-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics