Abstract
Chapter 4 (entitled “Genetic Programming for Classification and Algorithm Design”), consists of two broad parts. The first part is about the classification task of data mining. In this first part, the chapter first explains important differences between classification models and classification algorithms - a crucial point to understand the contribution of this book, since the proposed genetic programming system produces a classification algorithm, rather than a classification model as usual in the literature. Then this chapter reviews genetic programming algorithms for evolving classification models, for evolving components of rule induction algorithms and for evolving classification systems as a whole (as a combination of datasets plus a classification algorithm). The second part of the chapter is about genetic programming for evolving the design of combinatorial optimization algorithms, rather than classification algorithms. Although combinatorial optimization is not the focus of this book, this topic was included in this chapter because the research on automatically evolving combinatorial optimization algorithms seems to be in a more advanced stage than the research on automatically evolving data mining algorithms, so that lessons from the former can be useful to the latter.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Abe, H., Yamaguchi, T.: Comparing the parallel automatic composition of inductive applications with stacking methods. In: R. Camacho, A. Srinivasan (eds.) Proc. of the Workshop on Parallel and Distributed Computing for Machine Learning (ECML/PKDD-03), pp. 1–12. Cavtat-Dubrovnik, Croatia (2003)
Abraham, A.: Meta learning evolutionary artificial neural networks. Neurocomputing 56, 1–38 (2004)
Almal, A., Mitra, A., Datar, R., Lenehan, P., Fry, D., Cote, R., Worzel, W.: Using genetic programming to classify node positive patients in bladder cancer. In: Proc. of Genetic and Evolutionary Computation Conf. (GECCO-06), pp. 239–246. Morgan Kaufmann (2006)
Archetti, F., Lanzani, S., Messina, E., Vanneschi, L.: Genetic programming for human oral bioavailability of drugs. In: Proc. of Genetic and Evolutionary Computation Conf. (GECCO-06), pp. 255–262. Morgan Kaufmann (2006)
Bader-El-Den, M., Poli, R.: Generating SAT local-search heuristics using a GP hyper-heuristic framework. In: Artificial Evolution (Proc. of 8th Int. Conf. on Evolution Artificielle), LNCS, vol. 4926, pp. 37–49. Springer-Verlag (2007)
Basgalupp, M., Barros, R., Carvalho, A., Freitas, A., Ruiz, D.: Legal-tree: a lexicographic multi-objective genetic algorithm for decision tree induction. In: Proc. of 24th Annual ACM Symposium on Applied Computing (SAC 2009), Hawaii, USA, pp. 1085–1090 (2009)
Blum, C., Roli, A.: Metaheuristics in combinatorial optimization: overview and conceptual comparison. ACM Computing Surveys 35(3), 268–308 (2003)
Bojarczuk, C., Lopes, H., Freitas, A.: Discovering comprehensible classification rules using genetic programming: a case study in a medical domain. In: Proc. of the Genetic and Evolutionary Computation Conf. (GECCO-99), pp. 953–958. Morgan Kaufmann (1999)
Bojarczuk, C.C., Lopes, H.S., Freitas, A.A.: Genetic programming for knowledge discovery in chest pain diagnosis. IEEE Engineering in Medicine and Biology Magazine 19(4), 38–44 (2000)
Bot, M., Langdon, W.: Application of genetic programming to induction of linear classification trees. In: Proc. of the 3rd European Conf. on Genetic Programming (EuroGP-00), LNCS, vol. 1802, pp. 247–258. Springer (2000)
Burke, E., Kendall, G., Newall, J., Hart, E., Ross, P., Schulenburg, S.: Hyper-heuristics: an emerging direction in modern search technology. In: F. Glover, G. Kochenberger (eds.) Handbook of Meta-Heuristics, pp. 457–474. Kluwer (2003)
Cantu-Paz, E., Kamath, C.: Using evolutionary algorithms to induce oblique decision trees. In: Proc. of the Genetic and Evolutionary Computation Conf. (GECCO-00), pp. 1053–1060. Morgan Kaufmann (2000)
Cavaretta, M.J., Chellapilla, K.: Data mining using genetic programming: the implications of parsimony on generalization error. In: P.J. Angeline, Z. Michalewicz, M. Schoenauer, X. Yao, A. Zalzala (eds.) Proc. of the Congress on Evolutionary Computation (CEC-99), vol. 2, pp. 1330–1337. IEEE Press (1999)
Eggermont, J., Eiben, A., van Hemert, J.: A comparison of genetic programming variants for data classification. In: Proc. of Conf. on Intelligent Data Analysis (EuroGP-99). Springer (1999)
Eiben, A.E., Smith, J.E.: Introduction to Evolutionary Computation. Springer-Verlag (2003)
Folino, G., Pizzyti, C., Spezzano, G.: Genetic programming and simulated annealing: a hybrid method to evolve decision trees. In: Proc. of the 3rd European Conf. on Genetic Programming (EuroGP-00), LNCS, vol. 1802, pp. 294–303. Springer (2000)
Freitas, A.A.: Understanding the crucial differences between classification and discovery of association rules: a position paper. ACM SIGKDD Explorations 2(1), 65–69 (2000)
Freitas, A.A.: Data Mining and Knowledge Discovery with Evolutionary Algorithms. Springer-Verlag (2002)
Fu, Z.: An innovative ga-based decision tree classifier in large scale data mining. In: Proc. of the 3rd European Conf. on Principles and Practice of Knowledge Discovery in Databases (PKDD-99), pp. 348–353. Springer (1999)
Fukunaga, A.: Automated discovery of composite SAT variable-selection heuristics. In: Proc. of the National Conf. on Artificial Intelligence (AAAI-02), pp. 641–648. AAAI Press (2002)
Hirsch, L., Saeedi, M., Hirsch, R.: Evolving text classifiers with genetic programming. In: Proc. of the 7th European Conf. on Genetic Programming (EuroGP-04), LNCS, vol. 3003, pp. 309–317. Springer-Verlag (2004)
Hong, J., Cho, S.: Lymphoma cancer classification using genetic programming with SNR features. In: Proc. of the European Conf. on Genetic Programming (EuroGP-04), LNCS, vol. 3003, pp. 78–88. Springer-Verlag (2004)
Howley, T., Madden, M.G.: The genetic kernel support vector machine: description and evaluation. Artificial Intelligence Review 24(3-4), 379–395 (2005)
Hu, Y.: A genetic programming approach to constructive induction. In: Proc. of the 3rd Annual Conf. on Genetic Programming (GP-98), pp. 146–151. Morgan Kaufmann (1998)
Kishore, J., Patnaik, L., Mani, V., Agrawal, V.: Application of genetic programming for multicategory pattern classification. IEEE Transactions on Evolutionary Computation 4(3), 242–258 (2000)
Koza, J.R.: Genetic Programming: on the programming of computers by the means of natural selection. The MIT Press, Massachusetts (1992)
Mendes, R.R.F., Voznika, F.B., Freitas, A.A., Nievola, J.C.: Discovering fuzzy classification rules with genetic programming and co-evolution. In: Proc. of the European Conf. on Principles of Data Mining and Knowledge Discovery (PKDD-01), pp. 314–325. Springer Verlag (2001)
Montana, D.J.: Strongly typed genetic programming. Evolutionary Computation 3(2), 199–230 (1995)
Oltean, M.: Evolving evolutionary algorithms using linear genetic programming. Evolutionary Computation 13(3), 387–410 (2005)
Oltean, M., Dumitrescu, D.: Evolving TSP heuristics using multi expression programming. In: Proc. of the 4th Int. Conf. on Computational Science, LNCS, vol. 3037, pp. 670–673. Springer (2004)
Papagelis, A., Kalles, D.: Breeding decision trees using evolutionary techniques. In: Proc. of the 18th Int. Conf. on Machine Learning (ICML-01), pp. 393–400. Morgan Kaufmann Publishers, San Francisco, CA, USA (2001)
Quinlan, J.R.: Learning logical definitions from relations. Machine Learning 5, 239–266 (1990)
Quinlan, J.R.: C4.5: programs for machine learning. Morgan Kaufmann (1993)
Rivero, D., Dorado, J., Rabuñal, J.R., Pazos, A., Pereira, J.: Artificial neural network development by means of genetic programming with graph codification. Transactions on Engineering, Computing and Technology 16, 209–214 (2006)
Rouwhorst, S., Engelbrecht, A.: Searching the forest: using decision trees as building blocks for evolutionary search in classification databases. In: Proc. of Congress on Evolutionary Computation (CEC-00). IEEE Press (2000)
Ryan, M., Rayward-Smith, V.: The evolution of decision trees. In: Proc. of the 3rd Annual Conference on Genetic Programming(GP-98), pp. 350–358. Morgan Kaufmann (1998)
Smart, W., Zhang, M.: Using genetic programming for multiclass classification by simultaneously solving component binary classification problems. In: Proc. of the European Conf. on Genetic Programming (EuroGP-05), LNCS, vol. 3447, pp. 227–239. Springer-Verlag (2005)
Suyama, A., Negishi, N., Yamaguchi, T.: CAMLET: A platform for automatic composition of inductive learning systems using ontologies. In: Proc. of the Pacific Rim Int. Conf. on Artificial Intelligence, pp. 205–215 (1998)
Whitley, D.: Permutations. In: T. Back, D. Fogel, T. Michalewicz (eds.) Evolutionary Computation 1: Basic Algorithms and Operators, pp. 274–284. Institute of Physics Publishing (2000)
Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations, 2nd edn. Morgan Kaufmann (2005)
Wong, M.L.: An adaptive knowledge-acquisition system using generic genetic programming. Expert Systems with Applications 15(1), 47–58 (1998)
Wong, M.L., Leung, K.S.: Data Mining Using Grammar-Based Genetic Programming and Applications. Kluwer, Norwell, MA, USA (2000)
Yao, X.: Evolving artificial neural networks. Proceedings of the IEEE 87(9), 1423–1447 (1999)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Pappa, G.L., Freitas, A.A. (2010). Genetic Programming for Classification and Algorithm Design. In: Automating the Design of Data Mining Algorithms. Natural Computing Series. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02541-9_4
Download citation
DOI: https://doi.org/10.1007/978-3-642-02541-9_4
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02540-2
Online ISBN: 978-3-642-02541-9
eBook Packages: Computer ScienceComputer Science (R0)