DC programming and DCA for sparse Fisher linear discriminant analysis
We consider supervised pattern classification in the high-dimensional setting, in which the number of features is much larger than the number of observations. We present a novel approach to the sparse Fisher linear discriminant problem using the \(\ell _0\)-norm. The resulting optimization problem is nonconvex, discontinuous, and very hard to solve. We overcome the discontinuity by using appropriate approximations of the \(\ell _0\)-norm, so that the resulting problems can be formulated as difference of convex functions (DC) programs, for which DC programming and DC Algorithms (DCA) are investigated. Experimental results on both simulated and real datasets demonstrate the efficiency of the proposed algorithms compared with some state-of-the-art methods.
Keywords: Classification, Feature selection, Fisher linear discriminant analysis, DC programming, DCA
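The abstract does not state which approximations of the \(\ell _0\)-norm are used, so the following is only a minimal sketch of the general idea under stated assumptions. It assumes a well-known exponential concave surrogate, \(\sum_i (1 - e^{-\alpha |x_i|})\), and applies DCA to a toy separable denoising problem rather than to the sparse Fisher discriminant problem itself. The function names, the toy objective, and the parameters `lam` and `alpha` are all hypothetical and chosen for illustration.

```python
import numpy as np


def exp_l0_approx(x, alpha):
    """Assumed surrogate for the l0-norm: sum_i (1 - exp(-alpha * |x_i|))."""
    return np.sum(1.0 - np.exp(-alpha * np.abs(x)))


def objective(x, a, lam, alpha):
    """Toy objective (an assumption): 0.5*||x - a||^2 + lam * surrogate(x)."""
    return 0.5 * np.sum((x - a) ** 2) + lam * exp_l0_approx(x, alpha)


def soft_threshold(v, t):
    """Closed-form minimizer of 0.5*||x - v||^2 + t*||x||_1."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)


def dca_sparse_denoise(a, lam=0.5, alpha=5.0, n_iter=50, tol=1e-8):
    """DCA on the toy objective, written as f = g - h with both parts convex:

        g(x) = 0.5*||x - a||^2 + lam*alpha*||x||_1
        h(x) = lam * sum_i (alpha*|x_i| - 1 + exp(-alpha*|x_i|))
    """
    x = a.astype(float).copy()
    for _ in range(n_iter):
        # Step 1: y is the gradient of h at the current iterate
        # (h is differentiable, with zero gradient at x_i = 0).
        y = lam * alpha * np.sign(x) * (1.0 - np.exp(-alpha * np.abs(x)))
        # Step 2: minimize the convex majorant g(x) - <y, x>,
        # which here reduces to a soft-thresholding step.
        x_new = soft_threshold(a + y, lam * alpha)
        if np.linalg.norm(x_new - x) <= tol:
            x = x_new
            break
        x = x_new
    return x


if __name__ == "__main__":
    a = np.array([3.0, 0.05, -2.0, 0.1])
    x = dca_sparse_denoise(a)
    print("input   :", a)
    print("solution:", x)
    print("objective before/after:",
          objective(a, a, 0.5, 5.0), objective(x, a, 0.5, 5.0))
```

In this sketch the small entries of the input are driven exactly to zero while the large ones are left almost unchanged, which is the feature-selection behavior an \(\ell _0\)-type penalty is meant to produce. Each DCA iteration solves a convex subproblem, so the toy objective does not increase from one iterate to the next.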
This research is funded by the Foundation for Science and Technology Development of Ton Duc Thang University (FOSTECT), website: http://fostect.tdt.edu.vn, under Grant FOSTECT.2015.BR.15. The authors would like to thank the referees for their valuable comments, which helped to improve the manuscript.