Flexible-Hybrid Sequential Floating Search in Statistical Feature Selection
Among recent topics studied in context of feature selection the hybrid algorithms seem to receive particular attention. In this paper we propose a new hybrid algorithm, the flexible hybrid floating sequential search algorithm, that combines both the filter and wrapper search principles. The main benefit of the proposed algorithm is its ability to deal flexibly with the quality-of-result versus computational time trade-off and to enable wrapper based feature selection in problems of higher dimensionality than before. We show that it is possible to trade significant reduction of search time for negligible decrease of the classification accuracy. Experimental results are reported on two data sets, WAVEFORM data from the UCI repository and SPEECH data from British Telecom.
KeywordsFeature Selection Hybrid Algorithm Feature Subset Subset Size Feature Subset Selection
Unable to display preview. Download preview PDF.
- 2.Yu, L., Liu, H.: Feature Selection for High-Dimensional Data: A Fast Correlation-Based Filter Solution. In: Proc. 20th Intl. Conf. Machine Learning, pp. 856–863 (2003)Google Scholar
- 3.Dash, M., Choi, K., Scheuermann, P., Liu, H.: Feature Selection for Clustering - a Filter Solution. In: Proc. Second Int. Conf. Data Mining, pp. 15–122 (2002)Google Scholar
- 5.Das, S.: Filters, Wrappers and a Boosting-Based Hybrid for Feature Selection. In: Proc. 18th Intl. Conf. Machine Learning, pp. 74–81 (2001)Google Scholar
- 8.Pudil, P., Novovicova, J., Somol, P.: Recent Feature Selection Methods in Statistical Pattern Recognition. In: Pattern Recognition and String Matching. Springer, Berlin (2003)Google Scholar
- 9.Jain, A.K., Zongker, D.: Feature selection: evaluation, application and small sample performance. IEEE Trans. PAMI 19, 153–158 (1997)Google Scholar
- 12.Murphy, P.M., Aha, D.W.: UCI Repository of Machine Learning Databases [Machine-readable data repository]. University of California, Department of Information and Computer Science, Irvine, CA (1994)Google Scholar