MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition

  • Yandong Guo
  • Lei Zhang
  • Yuxiao Hu
  • Xiaodong He
  • Jianfeng Gao
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9907)

Abstract

In this paper, we design a benchmark task and provide the associated datasets for recognizing face images and link them to corresponding entity keys in a knowledge base. More specifically, we propose a benchmark task to recognize one million celebrities from their face images, by using all the possibly collected face images of this individual on the web as training data. The rich information provided by the knowledge base helps to conduct disambiguation and improve the recognition accuracy, and contributes to various real-world applications, such as image captioning and news video analysis. Associated with this task, we design and provide concrete measurement set, evaluation protocol, as well as training data. We also present in details our experiment setup and report promising baseline results. Our benchmark task could lead to one of the largest classification problems in computer vision. To the best of our knowledge, our training dataset, which contains 10M images in version 1, is the largest publicly available one in the world.

Keywords

Face recognition Large scale Benchmark Training data Celebrity recognition Knowledge base 

References

  1. 1.
    Guo, Y., Zhang, L., Hu, Y., He, X., Gao, J.: MS-Celeb-1M: challenge of recognizing one million celebrities in the real world. In: IS&T International Symposium on Electronic Imaging (2016)Google Scholar
  2. 2.
    Taigman, Y., Yang, M., Ranzato, M., Wolf, L.: DeepFace: closing the gap to human-level performance in face verification. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), June 2014Google Scholar
  3. 3.
    Taigman, Y., Yang, M., Ranzato, M., Wolf, L.: Web-scale training for face identification. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2746–2754. IEEE (2015)Google Scholar
  4. 4.
    Schroff, F., Kalenichenko, D., Philbin, J.: FaceNet: a unified embedding for face recognition and clustering. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), June 2015Google Scholar
  5. 5.
    Google: Freebase data dumps (2015). https://developers.google.com/freebase/data
  6. 6.
    Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., Huang, Z., Karpathy, A., Khosla, A., Bernstein, M., Berg, A.C., Fei-Fei, L.: ImageNet large scale visual recognition challenge. Int. J. Comput. Vis. (IJCV) 115(3), 211–252 (2015)MathSciNetCrossRefGoogle Scholar
  7. 7.
    Huang, G.B., Ramesh, M., Berg, T., Learned-Miller, E.: Labeled faces in the wild: a database for studying face recognition in unconstrained environments. Technical report 07–49, University of Massachusetts, Amherst, October 2007Google Scholar
  8. 8.
    Huang, G.B., Learned-Miller, E.: Labeled faces in the wild: updates and new reporting procedures. Technical report UM-CS-2014-003, University of Massachusetts, Amherst, May 2014Google Scholar
  9. 9.
    Sun, Y., Wang, X., Tang, X.: DeepID3: face recognition with very deep neural networks. arXiv preprint arXiv:1502.00873 (2014)
  10. 10.
    Fan, H., Yang, M., Cao, Z., Jiang, Y., Yin, Q.: Learning compact face representation: packing a face into an int32. In: Proceedings of ACM International Conference on Multimedia, pp. 933–936. ACM (2014)Google Scholar
  11. 11.
    Kemelmacher-Shlizerman, I., Seitz, S., Miller, D., Brossard, E.: The MegaFace Benchmark: 1 Million Faces for Recognition at Scale. ArXiv e-prints (2015)Google Scholar
  12. 12.
    Ng, H.W., Winkler, S.: A data-driven approach to cleaning large face datasets. In: Proceedings of IEEE International Conference on Image Processing (ICIP), October 2014Google Scholar
  13. 13.
    Panis, G., Lanitis, A.: An overview of research activities in facial age estimation using the FG-NET aging database. In: Agapito, L., Bronstein, M.M., Rother, C. (eds.) ECCV 2014 Workshops. LNCS, vol. 8926, pp. 737–750. Springer, Heidelberg (2015)Google Scholar
  14. 14.
    Wolf, L., Hassner, T., Maoz, I.: Face recognition in unconstrained videos with matched background similarity. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR) (2011)Google Scholar
  15. 15.
    Sun, Y., Wang, X., Tang, X.: Deep learning face representation from predicting 10,000 classes. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), June 2014Google Scholar
  16. 16.
    Yi, D., Lei, Z., Liao, S., Li, S.Z.: Learning face representation from scratch. arXiv preprint arXiv:1411.7923 (2014)
  17. 17.
    Klare, B.F., Klein, B., Taborsky, E., Blanton, A., Cheney, J., Allen, K., Grother, P., Mah, A., Jain, A.K.: Pushing the frontiers of unconstrained face detection and recognition: IARPA Janus benchmark A. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), June 2015Google Scholar
  18. 18.
    Parkhi, O.M., Vedaldi, A., Zisserman, A.: Deep face recognition. In: Proceedings of the British Machine Vision Conference (BMVC) (2015)Google Scholar
  19. 19.
    Eastman, G.: Camera. US Patent 388850 A (1888)Google Scholar
  20. 20.
    Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems (NIPS), pp. 1097–1105. MIT Press (2012)Google Scholar
  21. 21.
    Ng, A.Y., Jordan, M.I., Weiss, Y.: On spectral clustering: analysis and an algorithm. In: Advances in Neural Information Processing Systems (NIPS), pp. 849–856. MIT Press (2001)Google Scholar
  22. 22.
    Belkin, M., Niyogi, P.: Semi-supervised learning on Riemannian manifolds. J. Mach. Learn. 56(1–3), 209–239 (2004)CrossRefMATHGoogle Scholar
  23. 23.
    Zhu, X., Ghahramani, Z., Lafferty, J.: Semi-supervised learning using gaussian fields and harmonic functions. In: Proceedings of International Conference on Machine Learning, pp. 912–919 (2003)Google Scholar
  24. 24.
    Zhou, D., Bousquet, O., Lal, T.N., Weston, J., Schlkopf, B.: Learning with local and global consistency. In: Advances in Neural Information Processing Systems (NIPS), pp. 321–328. MIT Press (2004)Google Scholar

Copyright information

© Springer International Publishing AG 2016

Authors and Affiliations

  • Yandong Guo
    • 1
  • Lei Zhang
    • 1
  • Yuxiao Hu
    • 1
  • Xiaodong He
    • 1
  • Jianfeng Gao
    • 1
  1. 1.Microsoft ResearchRedmondUSA

Personalised recommendations