Skip to main content

Genetic Programming for Classification and Algorithm Design

  • Chapter
  • First Online:
Automating the Design of Data Mining Algorithms

Part of the book series: Natural Computing Series ((NCS))

  • 1023 Accesses

Abstract

Chapter 4 (entitled “Genetic Programming for Classification and Algorithm Design”), consists of two broad parts. The first part is about the classification task of data mining. In this first part, the chapter first explains important differences between classification models and classification algorithms - a crucial point to understand the contribution of this book, since the proposed genetic programming system produces a classification algorithm, rather than a classification model as usual in the literature. Then this chapter reviews genetic programming algorithms for evolving classification models, for evolving components of rule induction algorithms and for evolving classification systems as a whole (as a combination of datasets plus a classification algorithm). The second part of the chapter is about genetic programming for evolving the design of combinatorial optimization algorithms, rather than classification algorithms. Although combinatorial optimization is not the focus of this book, this topic was included in this chapter because the research on automatically evolving combinatorial optimization algorithms seems to be in a more advanced stage than the research on automatically evolving data mining algorithms, so that lessons from the former can be useful to the latter.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abe, H., Yamaguchi, T.: Comparing the parallel automatic composition of inductive applications with stacking methods. In: R. Camacho, A. Srinivasan (eds.) Proc. of the Workshop on Parallel and Distributed Computing for Machine Learning (ECML/PKDD-03), pp. 1–12. Cavtat-Dubrovnik, Croatia (2003)

    Google Scholar 

  2. Abraham, A.: Meta learning evolutionary artificial neural networks. Neurocomputing 56, 1–38 (2004)

    Article  Google Scholar 

  3. Almal, A., Mitra, A., Datar, R., Lenehan, P., Fry, D., Cote, R., Worzel, W.: Using genetic programming to classify node positive patients in bladder cancer. In: Proc. of Genetic and Evolutionary Computation Conf. (GECCO-06), pp. 239–246. Morgan Kaufmann (2006)

    Google Scholar 

  4. Archetti, F., Lanzani, S., Messina, E., Vanneschi, L.: Genetic programming for human oral bioavailability of drugs. In: Proc. of Genetic and Evolutionary Computation Conf. (GECCO-06), pp. 255–262. Morgan Kaufmann (2006)

    Google Scholar 

  5. Bader-El-Den, M., Poli, R.: Generating SAT local-search heuristics using a GP hyper-heuristic framework. In: Artificial Evolution (Proc. of 8th Int. Conf. on Evolution Artificielle), LNCS, vol. 4926, pp. 37–49. Springer-Verlag (2007)

    Google Scholar 

  6. Basgalupp, M., Barros, R., Carvalho, A., Freitas, A., Ruiz, D.: Legal-tree: a lexicographic multi-objective genetic algorithm for decision tree induction. In: Proc. of 24th Annual ACM Symposium on Applied Computing (SAC 2009), Hawaii, USA, pp. 1085–1090 (2009)

    Google Scholar 

  7. Blum, C., Roli, A.: Metaheuristics in combinatorial optimization: overview and conceptual comparison. ACM Computing Surveys 35(3), 268–308 (2003)

    Article  Google Scholar 

  8. Bojarczuk, C., Lopes, H., Freitas, A.: Discovering comprehensible classification rules using genetic programming: a case study in a medical domain. In: Proc. of the Genetic and Evolutionary Computation Conf. (GECCO-99), pp. 953–958. Morgan Kaufmann (1999)

    Google Scholar 

  9. Bojarczuk, C.C., Lopes, H.S., Freitas, A.A.: Genetic programming for knowledge discovery in chest pain diagnosis. IEEE Engineering in Medicine and Biology Magazine 19(4), 38–44 (2000)

    Article  Google Scholar 

  10. Bot, M., Langdon, W.: Application of genetic programming to induction of linear classification trees. In: Proc. of the 3rd European Conf. on Genetic Programming (EuroGP-00), LNCS, vol. 1802, pp. 247–258. Springer (2000)

    Google Scholar 

  11. Burke, E., Kendall, G., Newall, J., Hart, E., Ross, P., Schulenburg, S.: Hyper-heuristics: an emerging direction in modern search technology. In: F. Glover, G. Kochenberger (eds.) Handbook of Meta-Heuristics, pp. 457–474. Kluwer (2003)

    Google Scholar 

  12. Cantu-Paz, E., Kamath, C.: Using evolutionary algorithms to induce oblique decision trees. In: Proc. of the Genetic and Evolutionary Computation Conf. (GECCO-00), pp. 1053–1060. Morgan Kaufmann (2000)

    Google Scholar 

  13. Cavaretta, M.J., Chellapilla, K.: Data mining using genetic programming: the implications of parsimony on generalization error. In: P.J. Angeline, Z. Michalewicz, M. Schoenauer, X. Yao, A. Zalzala (eds.) Proc. of the Congress on Evolutionary Computation (CEC-99), vol. 2, pp. 1330–1337. IEEE Press (1999)

    Google Scholar 

  14. Eggermont, J., Eiben, A., van Hemert, J.: A comparison of genetic programming variants for data classification. In: Proc. of Conf. on Intelligent Data Analysis (EuroGP-99). Springer (1999)

    Google Scholar 

  15. Eiben, A.E., Smith, J.E.: Introduction to Evolutionary Computation. Springer-Verlag (2003)

    Google Scholar 

  16. Folino, G., Pizzyti, C., Spezzano, G.: Genetic programming and simulated annealing: a hybrid method to evolve decision trees. In: Proc. of the 3rd European Conf. on Genetic Programming (EuroGP-00), LNCS, vol. 1802, pp. 294–303. Springer (2000)

    Google Scholar 

  17. Freitas, A.A.: Understanding the crucial differences between classification and discovery of association rules: a position paper. ACM SIGKDD Explorations 2(1), 65–69 (2000)

    Article  Google Scholar 

  18. Freitas, A.A.: Data Mining and Knowledge Discovery with Evolutionary Algorithms. Springer-Verlag (2002)

    Google Scholar 

  19. Fu, Z.: An innovative ga-based decision tree classifier in large scale data mining. In: Proc. of the 3rd European Conf. on Principles and Practice of Knowledge Discovery in Databases (PKDD-99), pp. 348–353. Springer (1999)

    Google Scholar 

  20. Fukunaga, A.: Automated discovery of composite SAT variable-selection heuristics. In: Proc. of the National Conf. on Artificial Intelligence (AAAI-02), pp. 641–648. AAAI Press (2002)

    Google Scholar 

  21. Hirsch, L., Saeedi, M., Hirsch, R.: Evolving text classifiers with genetic programming. In: Proc. of the 7th European Conf. on Genetic Programming (EuroGP-04), LNCS, vol. 3003, pp. 309–317. Springer-Verlag (2004)

    Google Scholar 

  22. Hong, J., Cho, S.: Lymphoma cancer classification using genetic programming with SNR features. In: Proc. of the European Conf. on Genetic Programming (EuroGP-04), LNCS, vol. 3003, pp. 78–88. Springer-Verlag (2004)

    Google Scholar 

  23. Howley, T., Madden, M.G.: The genetic kernel support vector machine: description and evaluation. Artificial Intelligence Review 24(3-4), 379–395 (2005)

    Article  Google Scholar 

  24. Hu, Y.: A genetic programming approach to constructive induction. In: Proc. of the 3rd Annual Conf. on Genetic Programming (GP-98), pp. 146–151. Morgan Kaufmann (1998)

    Google Scholar 

  25. Kishore, J., Patnaik, L., Mani, V., Agrawal, V.: Application of genetic programming for multicategory pattern classification. IEEE Transactions on Evolutionary Computation 4(3), 242–258 (2000)

    Article  Google Scholar 

  26. Koza, J.R.: Genetic Programming: on the programming of computers by the means of natural selection. The MIT Press, Massachusetts (1992)

    MATH  Google Scholar 

  27. Mendes, R.R.F., Voznika, F.B., Freitas, A.A., Nievola, J.C.: Discovering fuzzy classification rules with genetic programming and co-evolution. In: Proc. of the European Conf. on Principles of Data Mining and Knowledge Discovery (PKDD-01), pp. 314–325. Springer Verlag (2001)

    Google Scholar 

  28. Montana, D.J.: Strongly typed genetic programming. Evolutionary Computation 3(2), 199–230 (1995)

    Article  Google Scholar 

  29. Oltean, M.: Evolving evolutionary algorithms using linear genetic programming. Evolutionary Computation 13(3), 387–410 (2005)

    Article  Google Scholar 

  30. Oltean, M., Dumitrescu, D.: Evolving TSP heuristics using multi expression programming. In: Proc. of the 4th Int. Conf. on Computational Science, LNCS, vol. 3037, pp. 670–673. Springer (2004)

    Google Scholar 

  31. Papagelis, A., Kalles, D.: Breeding decision trees using evolutionary techniques. In: Proc. of the 18th Int. Conf. on Machine Learning (ICML-01), pp. 393–400. Morgan Kaufmann Publishers, San Francisco, CA, USA (2001)

    Google Scholar 

  32. Quinlan, J.R.: Learning logical definitions from relations. Machine Learning 5, 239–266 (1990)

    Google Scholar 

  33. Quinlan, J.R.: C4.5: programs for machine learning. Morgan Kaufmann (1993)

    Google Scholar 

  34. Rivero, D., Dorado, J., Rabuñal, J.R., Pazos, A., Pereira, J.: Artificial neural network development by means of genetic programming with graph codification. Transactions on Engineering, Computing and Technology 16, 209–214 (2006)

    Google Scholar 

  35. Rouwhorst, S., Engelbrecht, A.: Searching the forest: using decision trees as building blocks for evolutionary search in classification databases. In: Proc. of Congress on Evolutionary Computation (CEC-00). IEEE Press (2000)

    Google Scholar 

  36. Ryan, M., Rayward-Smith, V.: The evolution of decision trees. In: Proc. of the 3rd Annual Conference on Genetic Programming(GP-98), pp. 350–358. Morgan Kaufmann (1998)

    Google Scholar 

  37. Smart, W., Zhang, M.: Using genetic programming for multiclass classification by simultaneously solving component binary classification problems. In: Proc. of the European Conf. on Genetic Programming (EuroGP-05), LNCS, vol. 3447, pp. 227–239. Springer-Verlag (2005)

    Google Scholar 

  38. Suyama, A., Negishi, N., Yamaguchi, T.: CAMLET: A platform for automatic composition of inductive learning systems using ontologies. In: Proc. of the Pacific Rim Int. Conf. on Artificial Intelligence, pp. 205–215 (1998)

    Google Scholar 

  39. Whitley, D.: Permutations. In: T. Back, D. Fogel, T. Michalewicz (eds.) Evolutionary Computation 1: Basic Algorithms and Operators, pp. 274–284. Institute of Physics Publishing (2000)

    Google Scholar 

  40. Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations, 2nd edn. Morgan Kaufmann (2005)

    Google Scholar 

  41. Wong, M.L.: An adaptive knowledge-acquisition system using generic genetic programming. Expert Systems with Applications 15(1), 47–58 (1998)

    Article  Google Scholar 

  42. Wong, M.L., Leung, K.S.: Data Mining Using Grammar-Based Genetic Programming and Applications. Kluwer, Norwell, MA, USA (2000)

    MATH  Google Scholar 

  43. Yao, X.: Evolving artificial neural networks. Proceedings of the IEEE 87(9), 1423–1447 (1999)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Gisele L. Pappa .

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Pappa, G.L., Freitas, A.A. (2010). Genetic Programming for Classification and Algorithm Design. In: Automating the Design of Data Mining Algorithms. Natural Computing Series. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02541-9_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-02541-9_4

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-02540-2

  • Online ISBN: 978-3-642-02541-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics