Abstract
Chapter 5 (entitled “Automating the Design of Rule Induction Algorithms”) describes in detail the main contribution of this book, which is a grammar-based genetic programming system for automatically evolving the design of rule induction algorithms. First, this chapter describes the grammar used by the system, which incorporates background knowledge about how human experts manually design a rule induction algorithm. Next, this chapter describes all other components of the system related to the genetic programming algorithm itself - i.e., individual representation, population initialization, individual evaluation (based on a single-objective or multiobjective fitness function) and finally crossover and mutation operations.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bhattacharyya, S.: Direct marketing response models using genetic algorithms. In: Proc. of 4th Int. Conf. on Knowledge Discovery and Data Mining (KDD-98), pp. 144–148 (1998)
Caruana, R., Niculescu-Mizil, A.: Data mining in metric space: an empirical analysis of supervised learning performance criteria. In: Proc. of the 10th Int. Conf. on Knowledge Discovery and Data Mining (KDD-04), pp. 69–78. ACM Press (2004)
Cavaretta, M.J., Chellapilla, K.: Data mining using genetic programming: the implications of parsimony on generalization error. In: P.J. Angeline, Z. Michalewicz, M. Schoenauer, X. Yao, A. Zalzala (eds.) Proc. of the Congress on Evolutionary Computation (CEC-99), vol. 2, pp. 1330–1337. IEEE Press (1999)
Cendrowska, J.: Prism: an algorithm for inducing modular rules. International Journal of Man-Machine Studies 27, 349–370 (1987)
Clark, P., Boswell, R.: Rule induction with CN2: some recent improvements. In: Y. Kodratoff (ed.) Proc. of the European Working Session on Learning on Machine Learning (EWSL-91), pp. 151–163. Springer-Verlag, New York, NY, USA (1991)
Clark, P., Niblett, T.: The CN2 induction algorithm. Machine Learning 3, 261–283 (1989)
Cohen, W.W.: Fast effective rule induction. In: A. Prieditis, S. Russell (eds.) Proc. of the 12th Int. Conf. on Machine Learning (ICML-95), pp. 115–123. Morgan Kaufmann, Tahoe City, CA (1995)
Fawcett, T.: ROC graphs: notes and practical considerations for data mining researchers. Tech. Rep. HPL-2003-4, HP Labs (2003)
Flach, P.: The geometry of ROC space: understanding machine learning metrics through ROC isometrics. In: Proc. 20th Int. Conf. on Machine Learning (ICML-03), pp. 194–201. AAAI Press (2003)
Freitas, A.A.: Data Mining and Knowledge Discovery with Evolutionary Algorithms. Springer-Verlag (2002)
Fürnkranz, J.: A pathology of bottom-up hill-climbing in inductive rule learning. In: Proc. of the 13th Int. Conf. on Algorithmic Learning Theory (ALT-02), pp. 263–277. Springer-Verlag, London, UK (2002)
Goldberg, D.E.: Genetic Algorithms in Search, Optimization, and Machine Learning. Addison-Wesley, Reading, MA (1989)
Hekanaho, J.: Background knowledge in GA-based concept learning. In: T. Fogarty, G. Venturini (eds.) 13th Int. Conf. on Machine Learning (ICML-96), pp. 234–242 (1996)
Newman, D.J., Hettich, S., Blake, C.L., Merz, C.J.: UCI Repository of machine learning databases. University of California, Irvine, http://www.ics.uci.edu/~mlearn/MLRepository.html (1998)
O’Neill, M., Brabazon, A., Ryan, C., Collins, J.: Evolving market index trading rules using grammatical evolution. In: E.J.W. Boers, S. Cagnoni, J. Gottlieb, E. Hart, P.L. Lanzi, G.R. Raidl, R.E. Smith, H. Tijink (eds.) Applications of Evolutionary Computing, LNCS, vol. 2037, pp. 343–352. Springer-Verlag (2001)
Pappa, G.L.: Automatically evolving rule induction algorithms with grammar-based genetic programming. Ph.D. thesis, Computing Laboratory, University of Kent, Canterbury, UK (2007)
Pappa, G.L., Freitas, A.A.: Towards a genetic programming algorithm for automatically evolving rule induction algorithms. In: J. Fürnkranz (ed.) Proc. of the ECML/PKDD-04 Workshop on Advances in Inductive Learning, pp. 93–108. Pisa (2004)
Pappa, G.L., Freitas, A.A.: Automatically evolving rule induction algorithms. In: J. Fürnkranz, T. Scheffer, M. Spiliopoulou (eds.) Proc. of the 17th European Conf. on Machine Learning (ECML-06), Lecture Notes in Computer Science, vol. 4212, pp. 341–352. Springer Berlin/Heidelberg (2006)
Pappa, G.L., Freitas, A.A.: Evolving rule induction algorithms with multi-objective grammar-based genetic programming. Knowledge and Information Systems 19(3), 283–309 (2009)
Pappa, G.L., Freitas, A.A., Kaestner, C.A.A.: Multi-objective algorithms for attribute selection in data mining. In: C.A.C. Coello, G. Lamont (eds.) Applications of Multi-Objective Evolutionary Algorithms, pp. 603–626. World Scientific (2004)
Provost, F., Fawcett, T., Kohavi, R.: The case against accuracy estimation for comparing induction algorithms. In: Proc. of the 15th Int. Conf. on Machine Learning (ICML-98), pp. 445–453. Morgan Kaufmann Publishers, San Francisco, CA, USA (1998)
Quinlan, J.R.: C4.5: programs for machine learning. Morgan Kaufmann (1993)
Tsakonas, A., Dounias, G., Jantzen, J., Axer, H., Bjerregaard, B., von Keyserlingk, D.G.: Evolving rule-based systems in two medical domains using genetic programming. Artificial Intelligence in Medicine 32(3), 195–216 (2004)
Webb, G.I., Brkic, N.: Learning decision lists by prepending inferred rules. In: Proc. of the AI-93 Workshop on Machine Learning and Hybrid Systems, pp. 6–10. World Scientific (1993)
Weiss, S.M., Indurkhya, N.: Optimized rule induction. IEEE Expert: Intelligent Systems and Their Applications 8(6), 61–69 (1993)
Whigham, P.A.: Grammatically-based genetic programming. In: J.P. Rosca (ed.) Proc. of the Workshop on Genetic Programming: From Theory to Real-World Applications, pp. 33–41. Tahoe City, California, USA (1995)
Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations, 2nd edn. Morgan Kaufmann (2005)
Zhang, J.: Selecting typical instances in instance-based learning. In: Proc. of the 9th Int. Workshop on Machine Learning (ML-92), pp. 470–479. Morgan Kaufmann, San Francisco, CA, USA (1992)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Pappa, G.L., Freitas, A.A. (2010). Automating the Design of Rule Induction Algorithms. In: Automating the Design of Data Mining Algorithms. Natural Computing Series. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02541-9_5
Download citation
DOI: https://doi.org/10.1007/978-3-642-02541-9_5
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02540-2
Online ISBN: 978-3-642-02541-9
eBook Packages: Computer ScienceComputer Science (R0)