Abstract
Unsupervised person re-identification has achieved great success through the self-improvement of individual neural networks. However, because the discriminative information available to it lacks diversity, a single network has difficulty learning sufficient discrimination ability by itself under unsupervised conditions. To address this limitation, we develop a population-based evolutionary gaming (PEG) framework in which a population of diverse neural networks is trained concurrently through iterative selection, reproduction, mutation, and population mutual learning. Specifically, the selection of networks to preserve is modeled as a cooperative game and solved by best-response dynamics; reproduction and mutation are implemented by cloning networks and perturbing their hyper-parameters to increase diversity; and population mutual learning improves the discrimination of networks through knowledge distillation from each other within the population. In addition, we propose a cross-reference scatter (CRS) to approximately evaluate re-ID models without labeled samples and adopt it as the criterion for network selection in PEG. CRS measures a model’s performance by indirectly estimating the accuracy of its predicted pseudo-labels according to the cohesion and separation of the feature space. Extensive experiments demonstrate that (1) CRS approximately measures the performance of models without labeled samples, and (2) PEG produces new state-of-the-art accuracy for person re-identification, indicating the great potential of population-based cooperative network training for unsupervised learning. The code is released at github.com/YunpengZhai/PEG.
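To make the idea of scoring a model from the cohesion and separation of its feature space concrete, the sketch below ranks candidate models without ground-truth labels. It is an illustrative stand-in only: it uses the classic silhouette coefficient, not the CRS criterion itself, whose definition is not given in the abstract. The function names, the `candidates` dictionary, and the toy feature vectors are all assumptions made for this example.

```python
import math
from collections import defaultdict


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def cohesion_separation_score(features, pseudo_labels):
    """Mean silhouette coefficient over all samples, in [-1, 1].

    Illustrative stand-in: this is the classic silhouette index,
    NOT the cross-reference scatter (CRS) proposed in the paper.
    """
    clusters = defaultdict(list)
    for idx, label in enumerate(pseudo_labels):
        clusters[label].append(idx)
    if len(clusters) < 2:
        raise ValueError("need at least two clusters")

    total = 0.0
    for i, label in enumerate(pseudo_labels):
        own = [j for j in clusters[label] if j != i]
        if not own:
            # Singleton cluster: its silhouette is defined as 0.
            continue
        # Cohesion: mean distance to the other members of the same cluster.
        a = sum(euclidean(features[i], features[j]) for j in own) / len(own)
        # Separation: mean distance to the nearest other cluster.
        b = min(
            sum(euclidean(features[i], features[j]) for j in members) / len(members)
            for other, members in clusters.items()
            if other != label
        )
        denom = max(a, b)
        total += (b - a) / denom if denom > 0 else 0.0
    return total / len(pseudo_labels)


def select_best(candidates):
    """Return the name of the candidate with the highest score.

    `candidates` maps a hypothetical model name to (features, pseudo_labels).
    """
    return max(candidates, key=lambda name: cohesion_separation_score(*candidates[name]))


# Toy data: the same four feature vectors under two pseudo-labelings.
features = [[0.0, 0.0], [0.0, 1.0], [10.0, 10.0], [10.0, 11.0]]
candidates = {
    "model_a": (features, [0, 0, 1, 1]),  # labels agree with the geometry
    "model_b": (features, [0, 1, 0, 1]),  # labels cut across the geometry
}
print(select_best(candidates))
```

In this toy setting the labeling that matches the feature geometry receives the higher score, which is the intuition behind using a label-free criterion for network selection. A population-based trainer would call such a score once per network per generation, so a practical version would be vectorized rather than written with nested Python loops.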
Data Availability
The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.
Acknowledgements
This work is partially supported by grants from the Key-Area Research and Development Program of Guangdong Province under contract No. 2019B010153002, and by grants from the National Natural Science Foundation of China under contract Nos. 61825101 and 62088102. The computing resources of Pengcheng Cloudbrain were used in this research.
Additional information
Communicated by Wenjun Kevin Zeng.
About this article
Cite this article
Zhai, Y., Peng, P., Jia, M. et al. Population-Based Evolutionary Gaming for Unsupervised Person Re-identification. Int J Comput Vis 131, 1–25 (2023). https://doi.org/10.1007/s11263-022-01693-7
DOI: https://doi.org/10.1007/s11263-022-01693-7