A Feature Selection Method Using Hierarchical Clustering
Feature selection refers to a problem to select a subset of features which are most optimal for intended tasks. As one of well-known feature selection methods, clustering features into several groups and picking one feature from each group have been used for unsupervised feature selection. Since the purpose of clustering in feature selection is to select a feature from each group, the quality of the feature to be selected should be considered in the clustering process. In this paper, we propose a feature selection method using hierarchical clustering. A new similarity measure between two feature groups is defined by directly using the representative feature in each group. Experimental results show that our method can select good features even for supervised learning.
KeywordsFeature selection Hierarchical clustering Ward method
Unable to display preview. Download preview PDF.
- 3.Krier, C., Francois, D., Rossi, F., Verleysen, M.: Feature clustering and mutual information for the selection of variables in spectral data. In: Proceedings of European Symposium on Artificial Neural Networks, Bruges, Belgium (2007)Google Scholar
- 4.Butterworth, R., Piatetsky-Shapiro, G.: On feature selection through clustering. In: Proceedings of the Fifth IEEE International Conference on Data Mining (2005)Google Scholar
- 5.He, X., Cai, D., Niyogi, P.: Laplacian score for feature selection. In: Proceedings of Advances in Neural Information Processing Systems, Vancouver, Canada (2005)Google Scholar