Abstract
A large package of algorithms for feature ranking and selection has been developed. Infosel++, Information Based Feature Selection C++ Library, is a collection of classes and utilities based on probability estimation that can help developers of machine learning methods in rapid interfacing of feature selection algorithms, aid users in selecting an appropriate algorithm for a given task (embed feature selection in machine learning task), and aid researchers in developing new algorithms, especially hybrid algorithms for feature selection. A few examples of such possibilities are presented.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Guyon, I., Gunn, S., Nikravesh, M., Zadeh, L.: Feature Extraction, Foundations and Applications. Studies in Fuzziness and Soft Computing Series. Springer, Heidelberg (2006)
Duch, W., Maszczyk, T.: Universal learning machines. In: Chan, J.H. (ed.) ICONIP 2009, Part II. LNCS, vol. 5864, pp. 206–215. Springer, Heidelberg (2009)
Saeys, Y., Inza, I., Larrańaga, P.: A review of feature selection techniques in bioinformatics. Bioinformatics 23(19), 2507–2517 (2007)
Liu, H., Motoda, M. (eds.): Computational Methods of Feature Selection. CRC Press, Boca Raton (2007)
Saeys, Y., Liu, H., Inza, I., Wehenkel, L., de Peer, Y.V.: New challenges for feature selection in data mining and knowledge discovery. In: JMLR Workshop and Conf. Proc. (2008)
Duch, W.: Filter methods. In: Guyon, I., Gunn, S., Nikravesh, M., Zadeh, L. (eds.) Feature extraction, foundations and applications, pp. 89–118. Springer, Heidelberg (2006)
Kohavi, R., Sommerfield, D., Dougherty, J.: Data mining using MLC++, a machine learning library in C++. Int. J. of Artificial Intelligence Tools 6(4), 537–566 (1997)
Witten, I., Frank, E.: Data Mining: Practical machine learning tools and techniques, 2nd edn. Morgan Kaufmann, San Francisco (2005)
Mierswa, I., Wurst, M., Klinkenberg, R., Scholz, M., Euler, T.: YALE: Rapid prototyping for complex data mining tasks. In: Proc. of the 12th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, KDD 2006 (2006)
Pudil, P., Novovicova, J., Somol, P.: Feature selection toolbox software package. Pattern Recognition Lettters 23(4), 487–492 (2002)
Su, Y., Murali, T., Pavlovic, V., Schaffer, M., Kasif, S.: Rankgene: identification of diagnostic genes based on expression data. Bioinformatics 19, 1578–1579 (2003)
Liu, H., Yu, L.: Toward integrating feature selection algorithms for classification and clustering. IEEE Trans. on Knowledge and Data Engineering 17(4), 491–502 (2005)
Press, W., Teukolsky, S., Vetterling, W., Flannery, G.: Numerical recipes in C. The art of scientific computing. Cambridge University Press, Cambridge (1988)
Vilmansen, T.: Feature evalution with measures of probabilistic dependence. IEEE Transaction on Computers 22(4), 381–388 (1973)
Golub, T., et al.: Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286, 531–537 (1999)
Ding, C., Peng, F.: Minimum redundancy feature selection from microarray gene expression data. Journal of Bioinformatics and Computational Biology 3(2), 185–205 (2004)
Battiti, R.: Using mutual information for selecting features in supervised neural net learning. IEEE Trans. Neural Networks 5(4) (July 1994)
Kwak, N., Choi, C.H.: Input feature selection for classification problems. IEEE Transactions on Evolutionary Computation 13(1), 143–159 (2002)
Tesmer, M., Este’vez, P.: AMIFS: Adaptive feature selection by using mutual information. In: Proc. of Int. Joint Conf. on Neural Networks, Budapeszt, pp. 1415–1420. IEEE Press, Los Alamitos (2004)
Yu, L., Liu, H.: Efficient feature selection via analysis of relevance and redundancy. Journal of Machine Learning Research, JMLR 5, 1205–1224 (2004)
Duch, W., Biesiada, J.: Feature Selection for High-Dimensional Data: A Kolmogorov-Smirnov Correlation-Based Filter Solution. In: Advances in Soft Computing, pp. 95–104. Springer, Heidelberg (2005)
Biesiada, J., Duch, W.: A Kolmogorov-Smirnov correlation-based filter solution for microarray gene expressions data. In: Ishikawa, M., Doya, K., Miyamoto, H., Yamakawa, T. (eds.) ICONIP 2007, Part II. LNCS, vol. 4985, pp. 285–294. Springer, Heidelberg (2008)
Blachnik, M., Duch, W., Kachel, A., Biesiada, J.: Feature Selection for Supervised Classification: A Kolmogorov-Smirnov Class Correlation-Based Filter. In: AIMeth, Symposium on Methods of Artificial Intelligence, Gliwice, Poland, November 10-19 (2009)
Koller, D., Sahami, M.: Toward optimal feature selection. In: Proc. of the 13th Int. Conf. on Machine Learning, pp. 284–292. Morgan Kaufmann, San Francisco (1996)
Xing, E., Jordan, M., Karp, R.: Feature selection for high-dimensional genomic microarray data. In: Proc. of the 8th Int. Conf. on Machine Learning (2001)
Lorenzo, J., Hermandez, M., Mendez, J.: GD: A Measure based on Information Theory for Attribute Selection. In: Coelho, H. (ed.) IBERAMIA 1998. LNCS (LNAI), vol. 1484, pp. 124–135. Springer, Heidelberg (1998)
Sridhar, D., Barlett, E., Seagrave, R.: Informatic theoretic susbset selection for neural networks models. Computers & Chemical Engineering 22(4), 613–626 (1998)
John, G., Kohavi, R., Pfleger, K.: Irrelevant features and the subset selection problem. In: Proc. Eleventh Inter. Conf. on Machine Learning, pp. 121–129. Morgan Kaufmann, San Francisco (1994)
Biesiada, J., Duch, W.: Feature Selection for High-Dimensional Data: A Pearson Redundancy Based Filter. In: Advances in Soft Computing, vol. 45, pp. 242–249. Springer, Heidelberg (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kachel, A., Biesiada, J., Blachnik, M., Duch, W. (2010). Infosel++: Information Based Feature Selection C++ Library. In: Rutkowski, L., Scherer, R., Tadeusiewicz, R., Zadeh, L.A., Zurada, J.M. (eds) Artificial Intelligence and Soft Computing. ICAISC 2010. Lecture Notes in Computer Science(), vol 6113. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13208-7_49
Download citation
DOI: https://doi.org/10.1007/978-3-642-13208-7_49
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-13207-0
Online ISBN: 978-3-642-13208-7
eBook Packages: Computer ScienceComputer Science (R0)