Abstract
Support vector machine (SVM) is one of the highly efficient classification algorithms. Unfortunately, it is designed only for input samples expressed as vectors. In real life, most input samples are naturally in matrix form and include structural information, such as electroencephalogram (EEG) signals and gray images. Support matrix machine (SMM), which can capture the latent structure within input matrices by regularizing the regression matrix to be low rank, is more suitable for matrix-form data than the SVM. However, the SMM adopts hinge loss, which is easily sensitive to noise and unstable to re-sampling. In this paper, to tackle this issue, we propose a new SMM with pinball loss (Pin−SMM), which can simultaneously consider the intrinsic structural information of input matrices and noise insensitivity. Our Pin−SMM is defined as a spectral elastic net with pinball loss, penalizing the rightly classified points. The optimization problem of Pin−SMM is also convex, which motivates us to construct the fast alternating direction method of multipliers (Fast ADMM) to solve it. Comprehensive experiments on two popular image datasets and an EEG dataset with different noises are conducted, and the experimental results confirm the effectiveness of our presented algorithm.
Similar content being viewed by others
Notes
For the detailed proof, please see Appendix A.
References
Vapnik V (1995) The nature of statistical learning theory, 267–290. Springer, New York
Hua S, Sun Z (2001) Support vector machine approach for protein subcellular localization prediction. Bioinformatics 17:721–728
Zhang W, Yoshida T, Tang X (2008) Text classification based on multi-word with support vector machine. Knowl-Based Syst 21(8):879–886
Zhao Y, Huang G, Hu Q, Li B (2020) An improved weighted one class support vector machine for turboshaft engine fault detection. Eng Appl Artif Intell 94:103796
Deng N, Tian Y, Zhang C (2012) Support vector machines: optimization based theory, algorithms, and extensions, 41–60. CRC Press, Philadelphia, PA
Kotsia I, Patras I (2011) Support tucker machines. IEEE Conf Computer Vision Pattern Recogn (CVPR) 6:633–640
Wolf L, Jhuang H, Hazan T (2007) Modeling Appearances with Low-Rank SVM, in: IEEE Conference on Computer Vision and Pattern Recognition, pp 1−6
Pirsiavash H, Ramanan D, Fowlkes C.C (2009) Bilinear classifiers for visual recognition, in: Proceedings of the Advances in Neural Information Processing Systems, pp 1482-1490
You C, Palade V, Wu X (2019) Robust structure low-rank representation in latent space. Eng Appl Artif Intell 77:117–124
Luo L, Xie Y, Zhang Z, Li W. J (2015) Support matrix machines, in: Proceedings of the 32nd International Conference on Machine Learning, pp 333-342
Kobayashi T, Otsu N (2012) Efficient optimization for low-rank integrated bilinear classifiers, in: European Conference on Computer Vision, pp 474-487
Zheng Q, Zhu F, Qin J, Chen B, Heng P (2018) Sparse support matrix machine. Pattern Recogn 76:715–726
Zhu C, Wang Z (2017) Entropy-based matrix learning machine for imbalanced datasets. Pattern Recogn Lett 88:72–80
Bi J, Zhang T (2005) Support vector classification with input data uncertainty, in: Proceedings of the Advances in Neural Information Processing Systems, pp 161-168
Song Q, Hu W, Xie W (2002) Robust support vector machine with bullet hole image classification. IEEE Trans Syst, Man, Cybern Part C (Appl Rev) 32(4):440–448
Shivaswamy PK, Bhattacharyya C, Smola AJ (2006) Second order cone programming approaches for handling missing and uncertain data. J Mach Learn Res 7(7):1283–1314
Zhang J, Wang Y (2008) A rough margin based support vector machine. Inf Sci 178(9):2204–2214
Zhu W, Song Y, Xiao Y (2020) Support vector machine classifier with huberized pinball loss. Eng Appl Artif Intell 91:103635
Huang X, Shi L, Suykens JAK (2014) Support vector machine classifier with pinball loss. IEEE Trans Pattern Anal Mach Intell 36(5):984–997
Xu Y, Yang Z, Pan X (2017) A novel twin support vector machine with pinball loss. IEEE Trans Neural Netw Learn Syst 28(2):359–370
Shen X, Niu L, Qi Z, Tian Y (2017) Support vector machine classifier with truncated pinball loss. Pattern Recogn 68:199–210
Tanveer M, Sharma A, Suganthan PN (2019) General twin support vector machine with pinball loss function. Inf Sci 494:311–327
Goldstein T, O’Donoghue B, Setzer S (2014) Fast alternating direction optimization methods. Siam J Imag Sci 7(3):1588–1623
Glowinski R, Marrocco A (1975) Sur lápproximation, par éléments finis dórdre un, et. la résolution, par pénalisation-dualité, dúne classe de problèmes de Dirichlet non linéaires, Journal of Equine Veterinary Science, 2 (R2), 41-76
Cai JF, Candès EJ, Shen Z (2010) A singular value thresholding algorithm for matrix completion. SIAM J Optim 20(4):1956–1982
Watson GA (1991) Characterization of the subdifferential of some matrix norms. Linear Algebra Appl 170(6):33–45
Koenker R, Hallock KF (2001) Quantile regression. J Econ Perspect 15(4):143–156
Barat C, Ducottet C (2016) String representations and distances in deep convolutional neural networks for image classification. Pattern Recogn 54:104–115
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection, in: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp 886-893
Li F, Fergus R, Perona P (2006) One-shot learning of object categories. IEEE Trans Pattern Anal Mach Intell 28(4):561–594
Ang KK, Chin ZY, Wang C, Guan C, Zhang H (2012) Filter bank common spatial pattern algorithm on BCI competition IV datasets 2a and 2b. Front Neurosci 6:39
Nam CS, Jeon Y, Kim YJ, Lee I, Park K (2011) Movement imagery-related lateralization of event-related (de) synchronization (ERD/ERS): motor-imagery duration effects. Clin Neurophysiol Off J Int Feder Clin Neurophysiol 122(3):567–577
Basar E, Güntekin B (2013) Chapter 19-Review of delta, theta, alpha, beta, and gamma response oscillations in neuropsychiatric disorders, Supplements to Clinical Neurophysiology, Elsevier Health Sciences, 63, 303−341
Lotte F (2014) A tutorial on eeg signal-processing techniques for mental-state recognition in brain-computer interfaces, in: Guide to Brain-Computer Music Interfacing, Springer, pp 133-161
Pfurtscheller G, Neuper C, Flotzinger D, Pregenzer M (1997) EEG-based discrimination between imagination of right and left hand movement. Electroencephal Clin Neurophysiol 103(6):642
Vidaurre C, Krämer N, Blankertz B, Schlögl A (2009) Time domain parameters as a feature for EEG-based brain-computer interfaces. Neural Netw 22(9):1313–1319
Rieke F, Warland D, Steveninck RDRV, Bialek W (1999) Spikes: exploring the neural code, 15–25. MIT Press, Cambridge, Mass
Crone N.E, Miglioretti D.L, Gordon B, Lesser R.P (1998) Functional mapping of human sensorimotor cortex with electrocorticographic spectral analysis. II. Event-related synchronization in the gamma band, Brain 121 (12), 2301-2315
Crone NE, Miglioretti DL, Gordon B, Sieracki JM, Wilson MT, Uematsu S, Lesser RP (1998) Functional mapping of human sensorimotor cortex with electrocorticographic spectral analysis, I Alpha and beta event-related desynchronization. Brain 121(12):2271–2299
Miller K, Leuthardt E, Schalk G, Rao R, Anderson N, Moran D, Miller J, Ojemann J (2007) Spectral changes in cortical surface potentials during motor movement. J Neurosci 27(9):2424–2432
Demsšar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30
Salvador G, Alberto F, Julián L, Herrera F (2010) Advanced nonparametric tests for multiple comparisons in the design of experiments in computational intelligence and data mining: experimental analysis of power. Inf Sci 180(10):2044–2064
Fazel M, Hindi H, Boyd S. P (2001) A rank minimization heuristic with application to minimum order system approximation, Proceedings of the 2001 American Control Conference, 6, 4734−4739
Acknowledgements
The authors gratefully acknowledge the helpful comments and suggestions of the reviewers, which have improved the presentation. This work was supported in part by the National Natural Science Foundation of China (No. 12071475, 11671010) and Beijing Natural Science Foundation (No.4172035).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declared that they have no conflicts of interest to this work.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix A. Convexity of Pin−SMM
Appendix A. Convexity of Pin−SMM
In order to prove that problem (9) is convex, we first define
It is obvious that the function \(\varPhi _{1}({\mathbf {W}})\) is convex. The nuclear norm of a matrix is a convex alternative of the matrix rank [10], which has been proved in Appendix A of the literature [43]. Therefore, \(\varPhi _{2}({\mathbf {W}})\) is also convex. Next, we demonstrate that the \(\varPhi _{3}({\mathbf {W}}, b)\) is convex. The sum of the convex functions is a convex function. We only need to prove that the function \(L_{\tau }({\mathbf {X}}_{i},y_{i},f({\mathbf {X}}_{i}))\) is convex. Let \(y_{i}f({\mathbf {X}}_{i}) - 1 = \mu\), and the pinball loss can be rewritten as:
Then we have
where \(\tau > 0\). Thus the function \(L_{\tau }(\mu )\) is convex. Because \(f({\mathbf {X}}_{i}) = {\text {tr}}({\mathbf {W}}^{T}{\mathbf {X}}_{i})+b\), we have \(L_{\tau }({\mathbf {X}}_{i},y_{i},f({\mathbf {X}}_{i}))\) is convex. So \(\varPhi _{3}({\mathbf {W}}, b)\) is convex with respect to \({\mathbf {W}}\) and b, respectively.
Moreover, parameters \(\lambda\) and C are positive; therefore, \(\varPhi _{1}({\mathbf {W}}) + \lambda \varPhi _{2}({\mathbf {W}}) + C\varPhi _{3}({\mathbf {W}}, b)\) is convex. That is, the Pin−SMM is convex.
This completes the proof. \(\Box\).
Rights and permissions
About this article
Cite this article
Feng, R., Xu, Y. Support matrix machine with pinball loss for classification. Neural Comput & Applic 34, 18643–18661 (2022). https://doi.org/10.1007/s00521-022-07460-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-022-07460-6