Soft Computing, Volume 21, Issue 8, pp 2069–2089

Multi-objective genetic programming for feature extraction and data visualization

  • Alberto Cano
  • Sebastián Ventura
  • Krzysztof J. Cios
Methodologies and Application

DOI: 10.1007/s00500-015-1907-y

Cite this article as:
Cano, A., Ventura, S. & Cios, K.J. Soft Comput (2017) 21: 2069. doi:10.1007/s00500-015-1907-y

Abstract

Feature extraction transforms high-dimensional data into a new subspace of lower dimensionality while keeping the classification accuracy. Traditional algorithms do not consider the multi-objective nature of this task. Data transformations should improve the classification performance on the new subspace as well as facilitate data visualization, which has attracted increasing attention in recent years. Moreover, new challenges arising in data mining, such as the need to deal with imbalanced data sets, call for new algorithms capable of handling this type of data. This paper presents a Pareto-based multi-objective genetic programming algorithm for feature extraction and data visualization. The algorithm is designed to obtain data transformations that optimize the classification and visualization performance on both balanced and imbalanced data. Six classification and visualization measures are identified as objectives to be optimized by the multi-objective algorithm. The algorithm is evaluated and compared to 11 well-known feature extraction methods, and to the performance on the original high-dimensional data. Experimental results on 22 balanced and 20 imbalanced data sets show that it performs very well on both types of data, which is a significant advantage over existing feature extraction algorithms.
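
To make the term "Pareto-based" concrete, the following is a minimal sketch of Pareto dominance and non-dominated filtering, the general idea behind multi-objective selection. It is not the authors' implementation: the function names and the two-objective scores (a classification score and a visualization score, both to be maximized) are hypothetical, and the paper itself optimizes six measures.

```python
from typing import List, Sequence


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """Return True if a Pareto-dominates b (all objectives maximized).

    a dominates b when a is at least as good on every objective
    and strictly better on at least one.
    """
    at_least_as_good = all(x >= y for x, y in zip(a, b))
    strictly_better = any(x > y for x, y in zip(a, b))
    return at_least_as_good and strictly_better


def pareto_front(scores: List[Sequence[float]]) -> List[int]:
    """Return the indices of candidates that no other candidate dominates."""
    return [
        i
        for i, s in enumerate(scores)
        if not any(dominates(t, s) for j, t in enumerate(scores) if j != i)
    ]


# Hypothetical (classification score, visualization score) pairs
# for four candidate data transformations; values are illustrative only.
candidates = [(0.90, 0.40), (0.85, 0.70), (0.80, 0.60), (0.90, 0.35)]
front = pareto_front(candidates)
print(front)
```

In this illustrative data, the third candidate is dominated by the second and the fourth by the first, so only the first two remain as trade-off solutions. Neither of those two is better than the other on both objectives at once.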

Keywords

Classification · Feature extraction · Visualization · Genetic programming

Copyright information

© Springer-Verlag Berlin Heidelberg 2015

Authors and Affiliations

  • Alberto Cano (1, 2)
  • Sebastián Ventura (1, 3)
  • Krzysztof J. Cios (2, 4)

  1. Department of Computer Science, Virginia Commonwealth University, Richmond, USA
  2. Department of Computer Science and Numerical Analysis, University of Córdoba, Córdoba, Spain
  3. Computer Sciences Department, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia
  4. IITiS Polish Academy of Sciences, Gliwice, Poland
