Interval Data Classification under Partial Information: A Chance-Constraint Approach
This paper presents a Chance-constraint Programming approach for constructing maximum-margin classifiers which are robust to interval-valued uncertainty in training examples. The methodology ensures that uncertain examples are classified correctly with high probability by employing chance-constraints. The main contribution of the paper is to pose the resultant optimization problem as a Second Order Cone Program by using large deviation inequalities, due to Bernstein. Apart from support and mean of the uncertain examples these Bernstein based relaxations make no further assumptions on the underlying uncertainty. Classifiers built using the proposed approach are less conservative, yield higher margins and hence are expected to generalize better than existing methods. Experimental results on synthetic and real-world datasets show that the proposed classifiers are better equipped to handle interval-valued uncertainty than state-of-the-art.
KeywordsGaussian Mixture Model Partial Information Synthetic Dataset Interval Data Second Order Cone Program
Unable to display preview. Download preview PDF.
- 1.Natsoulis, G., Ghaoui, L.E., Lanckriet, G.R.G., Tolley, A.M., Leroy, F., Dunlea, S., Eynon, B.P., Pearson, C.I., Tugendreich, S., Jarnagin, K.: Classification of a Large Microarray Data Set: Algorithm Comparison and Analysis of Drug Signatures. Genome Research 15, 724–736 (2005)CrossRefGoogle Scholar
- 3.Ghaoui, L.E., Lanckriet, G.R.G., Natsoulis, G.: Robust Classification with Interval Data. Technical Report UCB/CSD-03-1279, Computer Science Division, University of California, Berkeley (2003)Google Scholar
- 7.Ben-Tal, A., Nemirovski, A.: Selected Topics in Robust Convex Optimization. Mathematical Programming 112(1) (2007)Google Scholar