Projecting Financial Data Using Genetic Programming in Classification and Regression Tasks

  • César Estébanez
  • José M. Valls
  • Ricardo Aler
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3905)


The use of Constructive Induction (CI) methods for the generation of high-quality attributes is a very important issue in Machine Learning. In this paper, we present a CI method based in Genetic Programming (GP). This method is able to evolve projections that transform the dataset, constructing a new coordinates space in which the data can be more easily predicted. This coordinates space can be smaller than the original one, achieving two main goals at the same time: on one hand, improving classification tasks; on the other hand, reducing dimensionality of the problem. Also, our method can handle classification and regression problems. We have tested our approach in two financial prediction problems because their high dimensionality is very appropriate for our method. In the first one, GP is used to tackle prediction of bankruptcy of companies (classification problem). In the second one, an IPO Underpricing prediction domain (a classical regression problem) is confronted. Our method obtained in both cases competitive results and, in addition, it drastically reduced dimensionality of the problem.


Normalize Mean Square Error Linear Separation Regression Task Bankruptcy Prediction Classic Linear Regression 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Cristianini, N., Shawe-Taylor, J.: An introduction to Support Vector Machines (and other kernel-based learning methods). Cambridge University Press, Cambridge (2000)CrossRefMATHGoogle Scholar
  2. 2.
    Hastie, T., Tibshirani, R., Friedman, J.: The elements of statistical learning. Springer Series in Statistics. Springer, New York (2001)CrossRefMATHGoogle Scholar
  3. 3.
    Fawcett, T., Utgoff, P.: A hybrid method for feature generation. In: Proceedings of the Eighth International Workshop on Machine Learning, Evanston, IL, pp. 137–141 (1991)Google Scholar
  4. 4.
    Kramer, S.: Cn2-mci: A two-step method for constructive induction. In: Proceedings of ML-COLT 1994 (1994)Google Scholar
  5. 5.
    Pfahringer, B.: Cipf 2.0: A robust constructive induction system. In: Proceedings of ML-COLT 1994 (1994)Google Scholar
  6. 6.
    Koza, J.R.: Genetic Programming: On the Programming of Computers by Means of Natural Selection. MIT Press, Cambridge (1992)MATHGoogle Scholar
  7. 7.
    Quinlan, J.R.: C4.5 – Programs for Machine Learning. The Morgan Kaufmann series in machine learning. Morgan Kaufman Publishers, San Francisco (1993)Google Scholar
  8. 8.
    Douglas Zongker and Bill Punch. lil-gp 1.1 (September 1998),
  9. 9.
    Koza, J.R.: Genetic Programming II: Automatic Discovery of Reusable Programs. MIT Press, Cambridge Massachusetts (1994)MATHGoogle Scholar
  10. 10.
    Altman, E.I.: Business failure prediction models: An international survey. Journal of Banking Accounting and Finance 8, 171 (1984)CrossRefGoogle Scholar
  11. 11.
    Tam, K.Y., Kiang, M.Y.: Managerial applications of neural networks: the case bank failure predictions. Management Science 38, 926–947 (1992)CrossRefMATHGoogle Scholar
  12. 12.
    Han, I., Jo, H., Shin, K.S.: The hybrid systems for credit rating. Journal of the Korean Operations Research and Management Science Society 22(3), 163–173 (1997)Google Scholar
  13. 13.
    Fletcher, D., Goss, E.: Forecasting with neural networks: An application using bankruptcy data. Information and Management 24(3), 159–167 (1993)CrossRefGoogle Scholar
  14. 14.
    Vieira, A., Bas, N.: A training algorithm for classification of high-dimensional data. Neurocomputing 50, 461–472 (2003)CrossRefMATHGoogle Scholar
  15. 15.
    Ritter, J.R., Welch, I.: A review of ipo activity, pricing, and allocations. Journal of Finance 57, 1795–1828 (2002)CrossRefGoogle Scholar
  16. 16.
    Jain, B.A., Nag, B.N.: Artificial neural network models for pricing initial public offerings. Decision Sciences 26, 283–299 (1995)CrossRefGoogle Scholar
  17. 17.
    Vieira, A., Ribeiro, B., Neves, J.C.: A method to improve generalization of neural networks: Application to the problem of bankruptcy prediction. In: Springer Verlag series on Adaptative and Natural Computing Algorithms Proceeding of 7th International Conference on Adaptive and Natural, Computing Algorithms. ICANNGA 2005, vol. 1, p. 417 (2005)Google Scholar
  18. 18.
    Witten, I.H., Frank, E.: Data Mining: practical machine learning tools and techniques with java implementations. Morgan Kaufmann, San Francisco (2000)Google Scholar
  19. 19.
    Quintana, D., Luque, C., Isasi, P.: Evolutionary rule-based system for ipo underpricing prediction. In: Proceedings of the Genetic and Evolutionary Computation Conference GECO 2005, vol. 1 (2005)Google Scholar
  20. 20.
    Otero, F.E.B., Silva, M.M.S., Freitas, A.A., Nievola, J.C.: Genetic programming for attribute construction in data mining. In: Ryan, C., Soule, T., Keijzer, M., Tsang, E.P.K., Poli, R., Costa, E. (eds.) EuroGP 2003. LNCS, vol. 2610, pp. 389–398. Springer, Heidelberg (2003)CrossRefGoogle Scholar
  21. 21.
    Krawiec, K.: Genetic programming-based construction of features for machine learning and knowledge discovery tasks. Genetic Programming and Evolvable Machines 3(4), 329–343 (2002)CrossRefMATHGoogle Scholar
  22. 22.
    Shafti, L.S., P´erez, E.: Constructive induction and genetic algorithms for learning concepts with complex interaction. In: GECCO 2005, pp. 1811–1818 (2005)Google Scholar
  23. 23.
    Kuscu, I.: A genetic constructive induction model. In: 1999 Congress on Evolutionary Computation, pp. 212–217. IEEE Service Center, Piscataway (1999)Google Scholar
  24. 24.
    Hu, Y.-J.: A genetic programming approach to constructive induction. In: Koza, J.R., Banzhaf, W., Chellapilla, K., Deb, K., Dorigo, M., Fogel, D.B., Garzon, M.H., Goldberg, D.E., Iba, H., Riolo, R. (eds.) Genetic Programming 1998: Proceedings of the Third Annual Conference, University of Wisconsin, Madison, Wisconsin, USA, pp. 146–151. Morgan Kaufmann, San Francisco (1998)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • César Estébanez
    • 1
  • José M. Valls
    • 1
  • Ricardo Aler
    • 1
  1. 1.Universidad Carlos III de MadridLeganés (Madrid)Spain

Personalised recommendations