A Fractal Dimension Based Filter Algorithm to Select Features for Supervised Learning

  • Huei Diana Lee
  • Maria Carolina Monard
  • Feng Chung Wu
Conference paper

DOI: 10.1007/11874850_32

Part of the Lecture Notes in Computer Science book series (LNCS, volume 4140)
Cite this paper as:
Lee H.D., Monard M.C., Wu F.C. (2006) A Fractal Dimension Based Filter Algorithm to Select Features for Supervised Learning. In: Sichman J.S., Coelho H., Rezende S.O. (eds) Advances in Artificial Intelligence - IBERAMIA-SBIA 2006. Lecture Notes in Computer Science, vol 4140. Springer, Berlin, Heidelberg

Abstract

Feature selection plays an important role in machine learning and is often applied as a data pre-processing step. Its objective is to choose a subset from the original set of features that describes a data set, according to some importance criterion, by removing irrelevant and/or redundant features, as they may decrease data quality and reduce the comprehensibility of hypotheses induced by supervised learning algorithms. Most of the state-of-art feature selection algorithms mainly focus on finding relevant features. However, it has been shown that relevance alone is not sufficient to select important features. It is also important to deal with the problem of features’ redundancy. For the purpose of selecting features and discarding others, it is necessary to measure the features’ goodness (importance), and many importance measures have been proposed. This work proposes a filter algorithm that decouples relevance and redundancy analysis, and introduces the use of Fractal Dimension to deal with redundant features. Empirical results on several data sets show that Fractal Dimension is an appropriate criterion to filter out redundant features for supervised learning.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Huei Diana Lee
    • 1
    • 2
  • Maria Carolina Monard
    • 1
    • 2
  • Feng Chung Wu
    • 1
    • 2
  1. 1.Laboratory of Computational Intelligence – LABICUniversity of São Paulo – USP, Institute of Mathematics and Computer Science – ICMCSão CarlosBrazil
  2. 2.Bioinformatics Laboratory – LABIWest Paraná State University – UNIOESTE, Itaipu Technological Park – PTIFoz do IguaçuBrazil

Personalised recommendations