Genetic Algorithms for Exploratory Data Analysis

  • Alberto Perez-Jimenez
  • Juan-Carlos Perez-Cortes
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2396)


Data projection is a commonly used technique applied to analyse high dimensional data. In the present work, we propose a new data projection method that uses genetic algorithms to find linear projections, providing meaningful representations of the original data. The proposed technique is compared with well known methods as Principal Components Analysis (PCA) and neural networks for non-linear discriminant analysis (NDA). A comparative study of these methods with several data sets is presented.


Principal Component Analysis Genetic Algorithm Linear Discriminant Analysis Class Structure High Dimensional Data 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    C. L. Blake and C. J. Merz. UCI repository of machine learning databases, 1998., University of California, Irvine.Google Scholar
  2. 2.
    J. H. Friedman. Exploratory projection pursuit. Journal of the American Statistical Association, 82(397), 1987.Google Scholar
  3. 3.
    K. Fukunaga. Statistical Pattern Recognition. Academic Press, second edition edition, 1990.Google Scholar
  4. 4.
    D. E. Goldberg. Genetic Algorithms in Search, Optimization, and Machine Learning. Addison-Wesley, 1989.Google Scholar
  5. 5.
    J. H. Holland. Adaptation in Natural and Artificial Systems. Ann Tabor: The University of Michigan Press, 1975.Google Scholar
  6. 6.
    T. Kohonen. The self organizing map. Proceedings IEEE, pages 1464–1480, 1990.Google Scholar
  7. 7.
    B. Lerner, H. Guterman, M. Aladjem, I. Dinstein, and Y. Romen. On pattern classification with sammon’s nonlinear mapping (an experimental study). Pattern Recognition, 31(4):371–381, 1998.CrossRefGoogle Scholar
  8. 8.
    J. Mao and A. K. Jain. Artificial neural networks for feature extraction and multi-variate data projection. IEEE Transactions on Neural Networks, 6(2), 1995.Google Scholar
  9. 9.
    M. Mitchell. An Introduction to Genetic Algorithms. MIT Press, Cambridge, MA, 1996.Google Scholar
  10. 10.
    J. W. Sammon. A non-linear mapping for data structure analysis. IEEE Transactions on Computers, C-18(5), 1969.Google Scholar
  11. 11.
    L. Shyh-Chang, W. F. Punch III, and E. D. Goodman. Coarse-grain parallel genetic algorithms: Categorization and new approach. Parallel and Distributed Processing, 1994.Google Scholar
  12. 12.
    W. Siedlecki, K. Siedlecka, and J. Sklansky. An overview of mapping techniques for exploratory pattern analysis. Pattern Recognition, 21(5):411–429, 1988.zbMATHCrossRefMathSciNetGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2002

Authors and Affiliations

  • Alberto Perez-Jimenez
    • 1
  • Juan-Carlos Perez-Cortes
    • 1
  1. 1.Departamento de Informatica de Sistemas y ComputadoresUniversidad Politecnica de ValenciaValenciaSpain

Personalised recommendations