Novel Methods for Feature Subset Selection with Respect to Problem Knowledge

* Final gross prices may vary according to local VAT.

Get Access

Abstract

Recent advances in the statistical methodology for selecting optimal subsets of features for data representation and classification are presented. This chapter attempts to provide a guideline of which approach to choose with respect to the extent of a priori knowledge of the problem. Two basic approaches are reviewed and the conditions under which they should be used are specified. One approach involves the use of the computationally effective Floating search methods. The alternative approach trades off the requirement for a priori information for the requirement of sufficient data to represent the distributions involved. Owing to its nature it is particularly suitable for cases when the underlying probability distributions are not unimodal. The approach attempts to achieve simultaneous feature selection and decision rule inference. According to the criterion adopted there are two variants allowing the selection of features either for optimal representation or discrimination.

Supported by the grants of Czech Ministry of Education MŠMT No.VS96063, Czech Acad.Sci. A2075608 and Grant Agency of the Czech Republic No.402/97/1242.