Cost Sensitive Semi-Supervised Canonical Correlation Analysis for Multi-view Dimensionality Reduction

Wan, Jianwu; Wang, Hongyuan; Yang, Ming

doi:10.1007/s11063-016-9532-z

Cost Sensitive Semi-Supervised Canonical Correlation Analysis for Multi-view Dimensionality Reduction

Published: 23 June 2016

Volume 45, pages 411–430, (2017)
Cite this article

Neural Processing Letters Aims and scope Submit manuscript

Jianwu Wan¹,
Hongyuan Wang¹ &
Ming Yang²

756 Accesses
10 Citations
Explore all metrics

Abstract

To deal with the cost sensitive and semi-supervised learning problems in Multi-view Dimensionality Reduction (MDR), we propose a Cost Sensitive Semi-Supervised Canonical Correlation Analysis $(\hbox {CS}^{3}\hbox {CCA}). \hbox {CS}^{3}\hbox {CCA}$ first uses the $L_2$ norm approach to obtain the soft label for each unlabeled data, and then embed the misclassification cost into the framework of Canonical Correlation Analysis (CCA). Compared with existing CCA based methods, $\hbox {CS}^{3}\hbox {CCA}$ has the following advantages: (1) It uses the $L_2$ norm approach to infer the soft label for unlabeled data, which is computationally efficient and effective, especially for cost sensitive face recognition. (2) The objective function of $\hbox {CS}^{3}\hbox {CCA}$ not only maximizes the soft cost sensitive within-class correlations and minimizes the soft cost sensitive between-class correlations in the inter-view, but also considers the class imbalance problem simultaneously. With the discriminant projections learned by $\hbox {CS}^{3}\hbox {CCA}$, we employ it for cost sensitive face recognition. The experimental results on four well-known face data sets, including AR, Extended Yale B, PIE and ORL, demonstrate the effectiveness of $\hbox {CS}^{3}\hbox {CCA}$.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Manifold Regularized Discriminative Canonical Correlation Analysis for Semi-supervised Data

Sparse regularized discriminative canonical correlation analysis for multi-view semi-supervised learning

Article 07 June 2018

Semi-discriminative Multiview Canonical Correlation Analysis for Recognition

References

Sun SL (2013) A survey of multi-view machine learning. Neural Comput Appl 23(7):2031–2038
Article Google Scholar
Xu C, Tao DC, Xu C (2013) A survey on multi-view learning. arXiv preprint, arXiv:1304.5634
Yu J, Tao DC, Rui Y, Cheng J (2013) Pairwise constraints based multiview features fusion for scene classification. Pattern Recognit 46(2):483–496
Article MATH Google Scholar
Kan M, Shan SG, Zhang HH, Lao SH, Chen XL (2012) Multi-view Discriminant Analysis. In: proceedings of the 12th European Conference on Computer Vision, Florence, pp 808–821
Diethe T, Hardoon DR, Shawe-Taylor J (2008) Multiview fisher discriminant analysis. In: Proceedings of NIPS workshop on learning from multiple source with applications to robotics, Edinburgh, pp 976–983
Hou C, Zhang C, Wu Y, Nie F (2010) Multiple view semi-supervised dimensionality reduction. Pattern Recognit 43(3):720–730
Article MATH Google Scholar
Cheng XH, Chen SC, Xue H, Zhou XD (2012) A unified dimensionality reduction framework for semi-paired and semi-supervised multi-view data. Pattern Recognit 45(5):2005–2018
Article MATH Google Scholar
Hotelling H (1936) Relation between two sets of variables. Biometrica 28(3/4):322–3377
Article Google Scholar
Lai PL, Fyfe C (2010) Kernel and nonlinear canonical correlation analysis. Int J Neural Syst 10(5):365–377
Article Google Scholar
Sun TK, Chen SC (2007) Locality preserving CCA with applications to data visualization and pose estimation. Image Vis Comput 25(5):531–543
Article Google Scholar
Wang FS, Zhang DQ (2013) A new locality-preserving canonical correlation analysis Algorithm for multi-view dimensionality reduction. Neural Process Lett 37:135–146
Article Google Scholar
Hardoon DR, Shawe-Taylor J (2011) Sparse canonical correlation analysis. Mach Learn 83(3):331–353
Article MathSciNet MATH Google Scholar
Chu DL, Liao LZ, Ng MK, Zhang X (2013) Sparse canonical correlation analysis: new formulation and algorithm. IEEE Trans Pattern Anal Mach Intell 35(12):3050–3065
Article Google Scholar
Yuan YH, Sun QS, Ge HW (2014) Fractional-order embedding canonical correlation analysis and its applications to multi-view dimensionality reduction and recognition. Pattern Recognit 47:1411–1424
Article MATH Google Scholar
Sun TK, Chen SC, Yang JY, Shi PF (2008) A novel method of combined feature extraction for recognition. In: Proceedings of the IEEE international conference on data mining, Pisa, pp 1043–1048
Peng Y, Zhang DQ, Zhang JC (2010) A new canonical correlation analysis algorithm with local discrimination. Neural Process Lett 31:1–15
Article Google Scholar
Sun SL, Xie XJ, Yang M (2015) Multiview uncorrelated discriminant analysis. IEEE Trans Cybern 99:1–13
Google Scholar
Yang M, Sun SL (2014) Multi-view uncorrelated linear discriminant analysis with applications to handwritten digit recognition. International Joint Conference on Neural Networks. Beijing, pp 4175–4181
Sun L, Ji SW, Ye JP (2010) Canonical correlation analysis for multilabel classification: a least-squares formulation, extensions, and analysis. IEEE Trans Pattern Anal Mach Intell 33(1):194–200
Google Scholar
He ZY, Chen C, Bu JJ, Li P, Cai D (2015) Multi-view based multi-label propagation for image annotation. Neurocomputing 168:853–860
Article Google Scholar
Wang YQ, Li P, Yao C (2014) Hypergraph canonical correlation analysis for multi-label classification. Signal Process 105:258–267
Article Google Scholar
Zhen Y, Gao Y, Yeung DY, Zha HY, Li XL (2016) Spectral multimodal hashing and its application to multimedia retrieval. IEEE Trans Cybern 46(1):27–38
Article Google Scholar
Irie G, Arai H, Taniguchi Y (2015) Alternating co-quantization for cross-modal hashing. In: Proceedings of the IEEE international conference on computer vision. Santiago. pp 1886–1894
Shen XB, Sun QS (2014) A novel semi-supervised canonical correlation analysis and extensions for multi-view dimensionality reduction. J Vis Commun Image Represent 25:1894–1904
Article Google Scholar
Wright J, Yang A, Sastry S, Ma Y (2009) Robust face recognition via sparse representation. IEEE Trans Pattern Anal Mach Intell 31(2):210–227
Article Google Scholar
Shi QF, Eriksson A, Shen CH (2011) Is face recognition really a compressive sensing problem?. In: Proceedings of the IEEE international conference on computer vision and pattern recognition. Colorado Springs. pp 553–560
Wan JW, Yang M, Gao Y, Chen YJ (2014) Pairwise costs in semisupervised discriminant analysis for face recognition. IEEE Trans Inf Forensics Secur 9(10):1569–1580
Article Google Scholar
Lu JW, Tan YP (2010) Cost-sensitive subspace learning for face recognition. In: Proceedings of the IEEE international conference on computer vision and pattern recognition. San Francisco, pp 2661–2666
Lu JW, Zhou XZ, Tan YP, Shang YY, Zhou J (2012) Cost-sensitive semi-supervised discriminant analysis for face recognition. IEEE Trans Inf Forensics Secur 7(3):944–953
Article Google Scholar
Miao LS, Liu MX, Zhang DQ (2012) Cost-sensitive feature selection with application in software defect prediction. In: Proceedings of the IEEE 21th international conference on pattern recognition. Tsukuba, pp 976–970
Wan JW, Yang M, Chen YJ (2015) Discriminative cost sensitive Laplacian score for face recognition. Neurocomputing 152:333–344
Article Google Scholar
Martinez AM, Benavente R (1998) The AR face database. CVC Technical Report, 24
Georghiades AS, Belhumeur PN, Kriegman DJ (2001) From few to many: illumination cone models for face recognition under variable lighting and pose. IEEE Trans Pattern Anal Mach Intell 23(6):643–660
Article Google Scholar
Turk M, Pentland A (1991) Eigenfaces for recognition. J Cognit Neurosci 3(1):71–86
Article Google Scholar
Samaria F, Harter A (1994) Parameterisation of a stochastic model for human face identification. In: Proceedings of IEEE workshop applications computer vision. Sarasota, pp 138–142
Zhang Y, Zhou ZH (2010) Cost-sensitive face recognition. IEEE Trans Pattern Anal Mach Intell 32(10):1758–1769
Article Google Scholar
Rencher AC (2002) Methods of multivariate, 2nd edn. Wiley, New York
Book MATH Google Scholar
Ting KM (2002) An instance-weighting method to induce cost-sensitive trees. IEEE Trans Knowl Data Eng 14(3):659–665
Article Google Scholar
Liu XY, Zhou ZH (2006) The influence of class imbalance on cost-sensitive learning: an empirical study. In: Proceedings of the IEEE international conference on data mining. Hong Kong, pp 970–974
Ojala T, Pietikainen M, Harwood D (1996) A comparative study of texture measures with classification based on feature distributions. Pattern Recognit 29(1):51–59
Article Google Scholar
Fernandez-Delgado M, Cernadas E, Barro S, Amorim D (2014) Do we need hundreds of classifiers to solve real world classification problems? J Mach Learn Res 15:3133–3281
MathSciNet MATH Google Scholar

Download references

Acknowledgments

Jianwu Wan—This work was supported in part by Key Project of National Natural Science Foundation of China under Grant 61432008, National Natural Science Foundation of China under Grants 61272222, 61272367, 61502058, 61572085, 61201096, Natural Science Foundation of Educational Committee of Jiangsu Province under Grant 15KJB520002, Foundation of Changzhou University under Grant ZMF13020060. The authors would like to thank the anonymous referees and the editors for their helpful comments and suggestions.

Author information

Authors and Affiliations

School of Information Science and Engineering, Changzhou University, Changzhou, 213164, Jiangsu, People’s Republic of China
Jianwu Wan & Hongyuan Wang
School of Computer Science and Technology, Nanjing Normal University, Nanjing, 210023, Jiangsu, People’s Republic of China
Ming Yang

Authors

Jianwu Wan
View author publications
You can also search for this author in PubMed Google Scholar
Hongyuan Wang
View author publications
You can also search for this author in PubMed Google Scholar
Ming Yang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jianwu Wan.

Appendix 1

In this section, we introduce to use SVD to solve the optimal problems of CCA, LPbSCCA and our $\hbox {CS}^{3}\hbox {CCA}$, which are defined by Eqs. (1), (4) and (11), respectively. Note that, they can also be solved by computing a generalized eigenvalue decomposition problem, i.e., Eq. (2). As the form of their objective functions are similar, the only difference between them is the definitions of $\tilde{M}_{xy}, \tilde{M}_{xx}$ and $\tilde{M}_{yy}$, they can be unified written as follows:

$$\begin{aligned} \begin{aligned} \max ~&W_x^T\tilde{M}_{xy} W_y\\ s.t. ~&W_x^T\tilde{M}_{xx}W_x=1,~~W_y^T\tilde{M}_{yy}W_y=1. \end{aligned} \end{aligned}$$

(12)

Firstly, we define the Lagrange function of Eq. (12) as follows:

$$\begin{aligned} L(\lambda _1,\lambda _2,W_x,W_y)=W_x^T\tilde{M}_{xy} W_y-\frac{\lambda _1}{2}(W_x^T\tilde{M}_{xx} W_x-1)-\frac{\lambda _2}{2}(W_y^T\tilde{M}_{yy} W_y-1), \end{aligned}$$

(13)

and get its partial derivatives:

$$\begin{aligned} \left\{ \begin{aligned}&\partial L/\partial W_x=\tilde{M}_{xy}W_y-\lambda _1\tilde{M}_{xx}W_x=0,\\&\partial L/\partial W_y=\tilde{M}_{yx}W_x-\lambda _2\tilde{M}_{yy}W_y=0. \end{aligned}\right. \end{aligned}$$

(14)

By simple derivation, we can prove $\lambda _1=\lambda _2$ and get:

$$\begin{aligned} \left\{ \begin{aligned}&\tilde{M}_{xy}\tilde{M}_{yy}^{-1}\tilde{M}_{yx}W_x=\lambda ^2\tilde{M}_{xx}W_x,\\&\tilde{M}_{yx}\tilde{M}_{xx}^{-1}\tilde{M}_{xy}W_y=\lambda ^2\tilde{M}_{yy}W_y. \end{aligned}\right. \end{aligned}$$

(15)

In the following, we use the SVD to find the solution. Let $H=\tilde{M}_{xx}^{-1/2}\tilde{M}_{xy}\tilde{M}_{yy}^{-1/2}, U=\tilde{M}_{xx}^{1/2}W_x, V=\tilde{M}_{yy}^{1/2}W_y$, Eq. (15) becomes:

$$\begin{aligned} \left\{ \begin{aligned}&HH^TU=\lambda ^2U,\\&H^THV=\lambda ^2V. \end{aligned}\right. \end{aligned}$$

(16)

Observing Eq. (16), we discover that it only needs to do SVD of the matrix $H=UDV^T=\sum _{i=1}^d {\lambda _i u_iv_i^T}, U=[u_1,\ldots ,u_d], V=[v_1,\ldots ,v_d]$, and then the projection directions of $W_x$ and $W_y$ can be got respectively.

$$\begin{aligned} \left\{ \begin{aligned}&W_x=\tilde{M}_{xx}^{-1/2}U,\\&W_y=\tilde{M}_{yy}^{-1/2}V. \end{aligned}\right. \end{aligned}$$

(17)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wan, J., Wang, H. & Yang, M. Cost Sensitive Semi-Supervised Canonical Correlation Analysis for Multi-view Dimensionality Reduction. Neural Process Lett 45, 411–430 (2017). https://doi.org/10.1007/s11063-016-9532-z

Download citation

Published: 23 June 2016
Issue Date: April 2017
DOI: https://doi.org/10.1007/s11063-016-9532-z

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Cost Sensitive Semi-Supervised Canonical Correlation Analysis for Multi-view Dimensionality Reduction

Abstract

Access this article

Similar content being viewed by others

Manifold Regularized Discriminative Canonical Correlation Analysis for Semi-supervised Data

Sparse regularized discriminative canonical correlation analysis for multi-view semi-supervised learning

Semi-discriminative Multiview Canonical Correlation Analysis for Recognition

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendix 1

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Cost Sensitive Semi-Supervised Canonical Correlation Analysis for Multi-view Dimensionality Reduction

Abstract

Access this article

Similar content being viewed by others

Manifold Regularized Discriminative Canonical Correlation Analysis for Semi-supervised Data

Sparse regularized discriminative canonical correlation analysis for multi-view semi-supervised learning

Semi-discriminative Multiview Canonical Correlation Analysis for Recognition

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendix 1

Appendix 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation