Abstract
Recently, multi-view dictionary learning technique has attracted lots of research interest. Although some multi-view dictionary learning methods have been addressed, there exists much room for improvement. How to explore and utilize both the diversity and the useful correlation information of different views with dictionaries has not been well studied. In this paper, we propose a novel multi-view dictionary learning approach named multi-view discriminant dictionary learning via learning view-specific and shared structured dictionaries (MDVSD), which aims to learn a structured dictionary shared by all views and multiple view-specific structured dictionaries with each corresponding to a specific view. The shared dictionary is combined with each view-specific dictionary to represent data of the specific view. MDVSD makes the view-specific dictionaries corresponding to different views uncorrelated for effectively exploring the diversity of different views. Furthermore, we introduce structural uncorrelation into shared dictionary learning procedure, such that the useful correlation information of different views can be effectively exploited. Dictionary-atoms in shared and view-specific dictionaries have correspondence to class labels so that the learned dictionaries have favorable discriminant ability and the obtained reconstruction error is discriminative. Three widely used datasets are employed as test data. Experimental results demonstrate the effectiveness of the proposed approach.
Similar content being viewed by others
References
Memisevic R (2012) On multi-view feature learning. In: International conference on machine learning (ICML), pp 161–168
Kumar A, Daumé H (2011) A co-training approach for multi-view spectral clustering. In: International conference on machine learning (ICML), pp 393–400
Kumar A, Rai P, Daumé H (2011) Co-regularized multi-view spectral clustering. In: Advances in neural information processing systems (NIPS), pp 1413–1421
Gönen M, Alpaydın E (2011) Multiple kernel learning algorithms. J Mach Learn Res 12:2211–2268
Kloft M, Brefeld U, Sonnenburg S, Zien A (2011) Lp-norm multiple kernel learning. J Mach Learn Res 12:953–997
Sun S (2013) A survey of multi-view machine learning. Neural Comput Appl 23(7–8):2031–2038
Xu C, Tao D, Xu C (2013) A survey on multi-view learning. arXiv preprint arXiv:1304.5634
Gao L, Qi L, Chen E, Guan L (2012) Discriminative multiple canonical correlation analysis for multi-feature information fusion. In: IEEE international symposium on multimedia, pp 36–43
Shen X, Sun Q (2015) Orthogonal multiset canonical correlation analysis based on fractional-order and its application in multiple feature extraction and recognition. Neural Process Lett 42(2):301–316
Yuan YH, Sun QS, Ge HW (2014) Fractional-order embedding canonical correlation analysis and its applications to multi-view dimensionality reduction and recognition. Pattern Recogn 47(3):1411–1424
Li YO, Adali T, Wang W, Calhoun VD (2009) Joint blind source separation by multiset canonical correlation analysis. IEEE Trans Signal Process 57(10):3918–3929
Jing X, Hu R, Zhu Y, Wu S, Liang C, Yang J (2014). Intra-view and inter-view supervised correlation analysis for multi-view feature learning. In: AAAI conference on artificial intelligence (AAAI), pp 1882–1889
Sharma A, Kumar A, Daume H, Jacobs DW (2012) Generalized multiview analysis: a discriminative latent space. In IEEE conference on computer vision and pattern recognition (CVPR), pp 2160–2167
Diethe T, Hardoon DR, Shawe-Taylor J (2008) Multiview fisher discriminant analysis. In: NIPS workshop on learning from multiple sources
Kan M, Shan S, Zhang H, Lao S, Chen X (2012) Multi-view discriminant analysis. In European conference on computer vision (ECCV), pp 808–821
Sun S, Xie X, Yang M (2015) Multiview uncorrelated discriminant analysis. IEEE Trans Cybern (in press)
Shekhar S, Patel VM, Nasrabadi NM, Chellappa R (2014) Joint sparse representation for robust multimodal biometrics recognition. IEEE Trans Pattern Anal Mach Intell 36(1):113–126
Jia Y, Salzmann M, Darrell T (2010) Factorized latent spaces with structured sparsity. In: Advances in neural information processing systems (NIPS), pp 982–990
Zheng S, Xie B, Huang K, Tao D (2011) Multi-view pedestrian recognition using shared dictionary learning with group sparsity. In: International conference on neural information processing (ICONIP), pp 629–638
Zheng J, Jiang Z, Phillips PJ, Chellappa R (2012) Cross-view action recognition via a transferable dictionary pair. In: British machine vision conference (BMVC)
Gao Z, Zhang H, Xu GP, Xue YB, Hauptmann AG (2014) Multi-view discriminative and structured dictionary learning with group sparsity for human action recognition. Signal Process 112:83–97
Zhuang Y, Wang Y, Wu F, Zhang Y, Lu W (2013) Supervised coupled dictionary learning with group structures for multi-modal retrieval. In AAAI conference on artificial intelligence (AAAI), pp 1070–1076
Patel VM, Gopalan R, Li R, Chellappa R (2015) Visual domain adaptation: a survey of recent advances. IEEE Signal Process Mag 32(3):53–69
Shekhar S, Patel VM, Nguyen HV, Chellappa R (2013) Generalized domain-adaptive dictionaries. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 361–368
Zhang H, Nasrabadi NM, Zhang Y, Huang TS (2012) Joint dynamic sparse representation for multi-view face recognition. Pattern Recogn 45(4):1290–1298
Zheng J, Jiang Z (2013) Learning view-invariant sparse representations for cross-view action recognition. In: IEEE conference on computer vision (ICCV), pp 3176–3183
Shi Y, Gao Y, Yang Y, Zhang Y, Wang D (2013) Multi-modal sparse representation-based classification for lung needle biopsy images. IEEE Trans Biomed Eng 60(10):2675–2685
Jing X, Hu R, Wu F, Chen X, Liu Q, Yao Y (2014) Uncorrelated multi-view discrimination dictionary learning for recognition. In: AAAI conference on artificial intelligence (AAAI), pp 2787–2795
Gao S, Tsang IW, Ma Y (2014) Learning category-specific dictionary and shared dictionary for fine-grained image categorization. IEEE Trans Image Process 23(2):623–634
Wang D, Kong S (2014) A classification-oriented dictionary learning model: explicitly learning the particularity and commonality across categories. Pattern Recogn 47(2):885–898
Cai D, He X, Han J, Zhang HJ (2006) Orthogonal laplacianfaces for face recognition. IEEE Trans Image Process 15(11):3608–3614
Murase H, Nayar SK (1995) Visual learning and recognition of 3-D objects from appearance. Int J Comput Vis 14(1):5–24
LeCun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324
Chen CF, Wei CP, Wang YC (2012) Low-rank matrix recovery with structural incoherence for robust face recognition. In: IEEE conference on computer vision and pattern recognition (CVPR), pp 2618–2625
Rosasco L, Verri A, Santoro M, Mosci S, Villa S (2009) Iterative projection methods for structured sparsity regularization. MIT Technical Reports, MIT-CSAIL-TR-2009-050, CBCL-282, Massachusetts Institute of Technology
Wright J, Yang AY, Ganesh A, Sastry SS, Ma Y (2009) Robust face recognition via sparse representation. IEEE Trans Pattern Anal Mach Intell 31(2):210–227
Turk M, Pentland A (1991) Eigenfaces for recognition. J Cognit Neurosci 3(1):71–86
Grigorescu SE, Petkov N, Kruizinga P (2002) Comparison of texture features based on Gabor filters. IEEE Trans Image Process 11(10):1160–1167
Fukunaga K, Koontz WL (1970) Application of the Karhunen–Loeve expansion to feature selection and ordering. IEEE Trans Comput 19(4):311–318
Ahonen T, Hadid A, Pietikainen M (2006) Face description with local binary patterns: application to face recognition. IEEE Trans Pattern Anal Mach Intell 28(12):2037–2041
Draper BA, Yambor WS, Beveridge JR (2002) Analyzing PCA-based face recognition algorithms: eigenvector selection and distance measures. In: IEEE workshop empirical evaluation methods in computer vision, pp 1–15
Acknowledgments
The authors want to thank the anonymous reviewers for their constructive comments and suggestions. The work described in this paper was fully supported by the National Natural Science Foundation of China under Projects No. 61272273, No. 61502245, and No. 61533010.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Wu, F., Jing, XY. & Yue, D. Multi-view Discriminant Dictionary Learning via Learning View-specific and Shared Structured Dictionaries for Image Classification. Neural Process Lett 45, 649–666 (2017). https://doi.org/10.1007/s11063-016-9545-7
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11063-016-9545-7