Model-Driven Domain Adaptation on Product Manifolds for Unconstrained Face Recognition
- 784 Downloads
- 8 Citations
Abstract
Many classification algorithms see a reduction in performance when tested on data with properties different from that used for training. This problem arises very naturally in face recognition where images corresponding to the source domain (gallery, training data) and the target domain (probe, testing data) are acquired under varying degree of factors such as illumination, expression, blur and alignment. In this paper, we account for the domain shift by deriving a latent subspace or domain, which jointly characterizes the multifactor variations using appropriate image formation models for each factor. We formulate the latent domain as a product of Grassmann manifolds based on the underlying geometry of the tensor space, and perform recognition across domain shift using statistics consistent with the tensor geometry. More specifically, given a face image from the source or target domain, we first synthesize multiple images of that subject under different illuminations, blur conditions and 2D perturbations to form a tensor representation of the face. The orthogonal matrices obtained from the decomposition of this tensor, where each matrix corresponds to a factor variation, are used to characterize the subject as a point on a product of Grassmann manifolds. For cases with only one image per subject in the source domain, the identity of target domain faces is estimated using the geodesic distance on product manifolds. When multiple images per subject are available, an extension of kernel discriminant analysis is developed using a novel kernel based on the projection metric on product spaces. Furthermore, a probabilistic approach to the problem of classifying image sets on product manifolds is introduced. We demonstrate the effectiveness of our approach through comprehensive evaluations on constrained and unconstrained face datasets, including still images and videos.
Keywords
Domain adaptation Unconstrained face recognition Manifold learning Tensor computationReferences
- Arandjelović, O. (2009). Unfolding a face: From singular to manifold. In Proc. ACCV (pp. 203–213).Google Scholar
- Arandjelović, O., Shakhnarovich, G., Fisher, J., Cipolla, R., & Darrell, T. (2005). Face recognition with image sets using manifold density divergence. In Proc. CVPR (pp. 581–588).Google Scholar
- Basri, R., & Jacobs, D. W. (2003). Lambertian reflectance and linear subspaces. The IEEE Transactions on Pattern Analysis and Machine Intelligence, 25(2), 218–233.CrossRefGoogle Scholar
- Begelfor, E., & Werman, M. (2006). Affine invariance revisited. In Proc. CVPR (pp 2087–2094).Google Scholar
- Ben-David, S., Blitzer, J., Crammer, K., Kulesza, A., Pereira, F., & Vaughan, J. W. (2010). A theory of learning from different domains. Machine Learning, 79(1), 151–175.CrossRefMathSciNetGoogle Scholar
- Biswas, S., Aggarwal, G., & Chellappa, R. (2009). Robust estimation of albedo for illumination-invariant matching and shape recovery. The IEEE Transactions on Pattern Analysis and Machine Intelligence, 31(5), 884–899.CrossRefGoogle Scholar
- Björck, A., & Golub, G. H. (1973). Numerical methods for computing angles between linear subspaces. Mathematics of Computations, 27, 579–594.CrossRefMATHGoogle Scholar
- Blanz, V., & Vetter, T. (1999). A morphable model for the synthesis of 3D faces. In SIGGRAPH (pp. 187–194).Google Scholar
- Brooks, M. J., & Horn, B. K. P. (1985). Shape and source from shading. In Proc. IJAI (pp. 932–936).Google Scholar
- Cevikalp, H., & Triggs, B. (2010). Face recognition based on image sets. In Proc. CVPR (pp. 2567–2573).Google Scholar
- Chen, Y., Patel, V. M., Phillips, P. J., & Chellappa, R. (2012). Dictionary-based face recognition from video. In Proc. ECCV (pp. 766–779).Google Scholar
- Daumé, H, I. I. I., & Marcu, D. (2006). Domain adaptation for statistical classifiers. Journal of Artificial Intelligence Research, 26(1), 101–126.MATHMathSciNetGoogle Scholar
- Duan, L., Tsang, I., Xu, D., & Chua, T.-S. (2009). Domain adaptation from multiple sources via auxiliary classifiers. In Proc. ICML (pp. 289–296).Google Scholar
- Duan, L., Tsang, I., Xu, D., & Chua, T.-S. (2012). Domain transfer multiple Kernel learning. The IEEE Transactions on Pattern Analysis and Machine Intelligence, 34(3), 465–479.CrossRefGoogle Scholar
- Edelman, A., Arias, T. A., & Smith, S. T. (1999). The geometry of algorithms with orthogonality constraints. The SIAM Journal on Matrix Analysis and Applications, 20, 303–353.CrossRefMathSciNetGoogle Scholar
- Gong, B., Shi, Y., Sha, F., & Grauman, K. (2012). Geodesic flow Kernel for unsupervised domain adaptation. In Proc. CVPR (pp. 2066–2073).Google Scholar
- Gopalan, R., Li, R., & Chellappa, R. (2011). Domain adaptation for object recognition: An unsupervised approach. In Proc. ICCV (pp. 999–1006).Google Scholar
- Gopalan, R., Taheri, S., Turaga, P., & Chellappa, R. (2012). A blur-robust descriptor with applications to face recognition. The IEEE Transactions on Pattern Analysis and Machine Intelligence, 34(6), 1220–1226.CrossRefGoogle Scholar
- Hamm, J., & Lee, D. (2008). Grassmann discriminant analysis: A unifying view on subspace-based learning. In Proc. ICML (pp. 376–383).Google Scholar
- Hoffman, J., Kulis, B., Darrell, T., & Saenko, K. (2012). Discovering latent domains for multisource domain adaptation. In Proc. ECCV (pp. 702–715).Google Scholar
- Hu, Y., Mian, A. S., & Owens, R. (2011). Sparse approximated nearest points for image set classification. In Proc. CVPR (pp. 27–40).Google Scholar
- Huang, G. B., Jain, V., & Learned-Miller, E. (2007). Unsupervised joint alignment of complex images. In Proc. ICCV.Google Scholar
- Huang, G. B., Ramesh, M., Berg, T., & Learned-Miller, E. (2007). Labeled faces in the wild: A database for studying face recognition in unconstrained environments. Technical Report 07–49, University of Massachusetts, Amherst.Google Scholar
- Jhuo, I.-H., Liu, D., Lee, D., & Chang, S.-F. (2012). Robust visual domain adaptation with low-rank reconstruction. In Proc. CVPR (pp. 2168–2175).Google Scholar
- Jia, H., & Martinez, A. M. (2008). Face recognition with occlusions in the training and testing sets. In Proc. FG (pp. 1–6).Google Scholar
- Jia, H., & Martinez, A. M. (2009). Support vector machines in face recognition with occlusions. In Proc. CVPR (pp. 136–141).Google Scholar
- Joliffe, I. T. (1986). Principal component analysis. Berlin: Springer.CrossRefGoogle Scholar
- Kim, T. K., Kittler, J., & Cipolla, R. (2007). Discriminative learning and recognition of image set classes using canonical correlations. The IEEE Transactions on Pattern Analysis and Machine Intelligence, 29(6), 1005–1018.CrossRefGoogle Scholar
- Kulis, B., Saenko, K., & Darrell, T. (2011). What you saw is not what you get: Domain adaptation using asymmetric Kernel transforms. In Proc. CVPR (pp. 1785–1792).Google Scholar
- Lee, J. M. (2010). Introduction to topological manifolds (2nd ed.). Berlin: Springer.Google Scholar
- Lee, K. C., Ho, J., & Kriegman, D. J. (2005). Acquiring linear subspaces for face recognition under variable lighting. The IEEE Transactions on Pattern Analysis and Machine Intelligence, 27(5), 684–698.CrossRefGoogle Scholar
- Lee, K. C., Ho, J., Yang, M. H., & Kriegman, D. J. (2005). Visual tracking and recognition using probabilistic appearance manifolds. CVIU, 99(3), 303–331.Google Scholar
- Li, H., Hua, G., Lin, Z., Brandt, J., & Yang, J. (2013). Probabilistic elastic matching for pose invariant face recognition. In Proc. CVPR.Google Scholar
- Li, Y., Du, Y., & Lin, X. (2005). Kernel-based multifactor analysis for image synthesis and recognition. In Proc. ICCV (pp. 114–119).Google Scholar
- Liu, J., Chen, S., Zhou, Z., & Tan, X. (2007). Single image subspace for face recognition. In Proc. AMFG (pp. 205–219).Google Scholar
- Lowe, D. J. (2004). Distinctive image features from scale-invariant keypoints. IJCV, 60(2), 91–110.CrossRefGoogle Scholar
- Lu, J., Tan, Y.P., & Wang, G. (2011). Discriminative multi-manifold analysis for face recognition from a single training sample per person. In Proc. ICCV (pp. 1943–1950).Google Scholar
- Lui, Y. M. (2012). Advances in matrix manifolds for computer vision. Image and Vision Computing, 30, 380–388.CrossRefGoogle Scholar
- Lui, Y. M., & Beveridge, J. R. (2008). Grassmann registration manifolds for face recognition. In Proc. ECCV (vol. 2, pp. 44–57).Google Scholar
- Lui, Y. M., Beveridge, J. R., & Kirby, M. (2010). Action classifications on product manifolds. In Proc. CVPR (pp. 833–839).Google Scholar
- Martinez, A. M. (2009). Recognizing imprecisely localized, partially occluded, and expression variant faces from a single sample per class. The IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(6), 748–763.CrossRefGoogle Scholar
- Martinez, A. M., & Benavente, R. (1998). The AR Face Database. CVC Technical Report, 24.Google Scholar
- Ni, J., Qiu, Q., & Chellappa, R. (2013). Subspace interpolation via dictionary learning for unsupervised domain adaptation. In Proc. CVPR.Google Scholar
- Nishiyama, M., Hadid, A., Takeshima, H., Shotton, J., Kozakaya, T., & Yamaguchi, O. (2011). Facial deblur inference using subspace analysis for recognition of blurred faces. The IEEE Transactions on Pattern Analysis and Machine Intelligence, 33(4), 838–845.CrossRefGoogle Scholar
- Nowak, E., & Jurie, F. (2007). Learning visual similarities measures for comparing never seen objects. In Proc. CVPR.Google Scholar
- Ojala, T., Pietikäinen, M., & Mäenpää, T. (2002). Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. The IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(7), 971–987.CrossRefGoogle Scholar
- Ojansivu, V., & Heikkilä, J. (2008). Blur insensitive texture classification using local phase quantization. In Proc. ICISP (pp. 236–243).Google Scholar
- Park, S. W., & Savvides, M. (2010). An extension of multifactor analysis for face recognition based on submanifold learning. In Proc. CVPR (pp. 2645–2652).Google Scholar
- Park, S. W., & Savvides, M. (2011a). Multifactor analysis based on factor-dependent geometry. In Proc. CVPR (pp. 2817–2824).Google Scholar
- Park, S. W., & Savvides, M. (2011b). The multifactor extension of grassmann manifolds for face recognition. In Proc. FG (pp. 464–469). Google Scholar
- Pinto, N., Dicarlo, J. J., & Cox, D. D. (2009). How far can you get with a modern face recognition test set using only simple features. In Proc. CVPR.Google Scholar
- Qiu, Q., Patel, V., Turaga, P., & Chellappa, R. (2012). Domain adaptive dictionary learning. In Proc. ECCV (pp. 631–645).Google Scholar
- Roweis, S. T., & Saul, L. K. (2000). Nonlinear dimensionality reduction by locally linear embedding. Science, 290, 2323–2326.CrossRefGoogle Scholar
- Saenko, K., Kulis, B., Fritz, M., & Darrell, T. (2010). Adapting visual category models to new domains. In Proc. ECCV (pp. 213–226).Google Scholar
- Sanderson, C., & Lovell, B. C. (2009). Multi-region probabilistic histograms for robust and scalable identity inference. In Proc. ICB (pp. 199–208).Google Scholar
- Shakhnarovich, G., Fisher, J. W., & Darell, T. (2002). Face recognition from long term observations. In Proc. ECCV (pp. 851–868).Google Scholar
- Shekhar, S., Patel, V., Nguyen, H., & Chellappa, R. (2013). Generalized domain-adaptive dictionaries. In Proc. CVPR.Google Scholar
- Shi, Y., & Sha, F. (2012). Information-theoretical learning of discriminative clusters for unsupervised domain adaptation. In Proc. ICML.Google Scholar
- Sim, T., Baker, S., & Bsat, M. (2003). The CMU pose, illumination, and expression database. The IEEE Transactions on Pattern Analysis and Machine Intelligence, 25(12), 1615–1618.CrossRefGoogle Scholar
- Tenenbaum, J. B., de Silva, V., & Langford, J. C. (2000). A global geometric framework for non-linear dimensionality reduction. Science, 290, 2319–2323.CrossRefGoogle Scholar
- Vageeswaran, P., Mitra, K., & Chellappa, R. (2013). Blur and illumination robust face recognition via set-theoretic characterization. IEEE Transactions on Image Processing, 22(4), 1362–1372.CrossRefMathSciNetGoogle Scholar
- Vapnik, V. N. (1998). Statistical Learning Theory. New York: Wiley.MATHGoogle Scholar
- Vasilescu, M. A. O., & Terzopoulos, D. (2002). Multilinear analysis of image ensembles. In Proc. ECCV (pp. 447–460).Google Scholar
- Vasilescu, M. A. O., & Terzopoulos, D. (2007). Multilinear projection for appearance-based recognition in the tensor framework. In Proc. ICCV (pp. 1–8).Google Scholar
- Viola, P., & Jones, M. (2001). Rapid object detection using a boosted cascade of simple features. In Proc. CVPR (pp. 511–518).Google Scholar
- Wang, R., & Chen, X. (2009). Manifold discriminant analysis. In Proc. CVPR (pp. 429–436).Google Scholar
- Wolf, L., Hassner, T., & Taigman, Y. (2008). Descriptor based methods in the wild. In Faces in real-life images workshop in ECCV.Google Scholar
- Yang, J., Yan, R., & Hauptmann, A. (2007). Cross-domain video concept detection using adaptive SVMs. In Proc. ACM MM (pp. 188–197).Google Scholar
- Zhao, W., Chellappa, R., Phillips, P. J., & Rosenfeld, A. (2003). Face recognition: A literature survey. ACM Computing Surveys, 35(4), 399–458.CrossRefGoogle Scholar
- Zheng, J., Liu, M.-Y., Chellappa, R., & Phillips, J. (2012). A grassmann manifold-based domain adaptation approach. In Proc. ICPR (pp. 2095–2099).Google Scholar