Skip to main content

Automating the Design of Rule Induction Algorithms

  • Chapter
  • First Online:
Automating the Design of Data Mining Algorithms

Part of the book series: Natural Computing Series ((NCS))

Abstract

Chapter 5 (entitled “Automating the Design of Rule Induction Algorithms”) describes in detail the main contribution of this book, which is a grammar-based genetic programming system for automatically evolving the design of rule induction algorithms. First, this chapter describes the grammar used by the system, which incorporates background knowledge about how human experts manually design a rule induction algorithm. Next, this chapter describes all other components of the system related to the genetic programming algorithm itself - i.e., individual representation, population initialization, individual evaluation (based on a single-objective or multiobjective fitness function) and finally crossover and mutation operations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bhattacharyya, S.: Direct marketing response models using genetic algorithms. In: Proc. of 4th Int. Conf. on Knowledge Discovery and Data Mining (KDD-98), pp. 144–148 (1998)

    Google Scholar 

  2. Caruana, R., Niculescu-Mizil, A.: Data mining in metric space: an empirical analysis of supervised learning performance criteria. In: Proc. of the 10th Int. Conf. on Knowledge Discovery and Data Mining (KDD-04), pp. 69–78. ACM Press (2004)

    Google Scholar 

  3. Cavaretta, M.J., Chellapilla, K.: Data mining using genetic programming: the implications of parsimony on generalization error. In: P.J. Angeline, Z. Michalewicz, M. Schoenauer, X. Yao, A. Zalzala (eds.) Proc. of the Congress on Evolutionary Computation (CEC-99), vol. 2, pp. 1330–1337. IEEE Press (1999)

    Google Scholar 

  4. Cendrowska, J.: Prism: an algorithm for inducing modular rules. International Journal of Man-Machine Studies 27, 349–370 (1987)

    Article  MATH  Google Scholar 

  5. Clark, P., Boswell, R.: Rule induction with CN2: some recent improvements. In: Y. Kodratoff (ed.) Proc. of the European Working Session on Learning on Machine Learning (EWSL-91), pp. 151–163. Springer-Verlag, New York, NY, USA (1991)

    Google Scholar 

  6. Clark, P., Niblett, T.: The CN2 induction algorithm. Machine Learning 3, 261–283 (1989)

    Google Scholar 

  7. Cohen, W.W.: Fast effective rule induction. In: A. Prieditis, S. Russell (eds.) Proc. of the 12th Int. Conf. on Machine Learning (ICML-95), pp. 115–123. Morgan Kaufmann, Tahoe City, CA (1995)

    Google Scholar 

  8. Fawcett, T.: ROC graphs: notes and practical considerations for data mining researchers. Tech. Rep. HPL-2003-4, HP Labs (2003)

    Google Scholar 

  9. Flach, P.: The geometry of ROC space: understanding machine learning metrics through ROC isometrics. In: Proc. 20th Int. Conf. on Machine Learning (ICML-03), pp. 194–201. AAAI Press (2003)

    Google Scholar 

  10. Freitas, A.A.: Data Mining and Knowledge Discovery with Evolutionary Algorithms. Springer-Verlag (2002)

    Google Scholar 

  11. Fürnkranz, J.: A pathology of bottom-up hill-climbing in inductive rule learning. In: Proc. of the 13th Int. Conf. on Algorithmic Learning Theory (ALT-02), pp. 263–277. Springer-Verlag, London, UK (2002)

    Chapter  Google Scholar 

  12. Goldberg, D.E.: Genetic Algorithms in Search, Optimization, and Machine Learning. Addison-Wesley, Reading, MA (1989)

    MATH  Google Scholar 

  13. Hekanaho, J.: Background knowledge in GA-based concept learning. In: T. Fogarty, G. Venturini (eds.) 13th Int. Conf. on Machine Learning (ICML-96), pp. 234–242 (1996)

    Google Scholar 

  14. Newman, D.J., Hettich, S., Blake, C.L., Merz, C.J.: UCI Repository of machine learning databases. University of California, Irvine, http://www.ics.uci.edu/~mlearn/MLRepository.html (1998)

  15. O’Neill, M., Brabazon, A., Ryan, C., Collins, J.: Evolving market index trading rules using grammatical evolution. In: E.J.W. Boers, S. Cagnoni, J. Gottlieb, E. Hart, P.L. Lanzi, G.R. Raidl, R.E. Smith, H. Tijink (eds.) Applications of Evolutionary Computing, LNCS, vol. 2037, pp. 343–352. Springer-Verlag (2001)

    Google Scholar 

  16. Pappa, G.L.: Automatically evolving rule induction algorithms with grammar-based genetic programming. Ph.D. thesis, Computing Laboratory, University of Kent, Canterbury, UK (2007)

    Google Scholar 

  17. Pappa, G.L., Freitas, A.A.: Towards a genetic programming algorithm for automatically evolving rule induction algorithms. In: J. Fürnkranz (ed.) Proc. of the ECML/PKDD-04 Workshop on Advances in Inductive Learning, pp. 93–108. Pisa (2004)

    Google Scholar 

  18. Pappa, G.L., Freitas, A.A.: Automatically evolving rule induction algorithms. In: J. Fürnkranz, T. Scheffer, M. Spiliopoulou (eds.) Proc. of the 17th European Conf. on Machine Learning (ECML-06), Lecture Notes in Computer Science, vol. 4212, pp. 341–352. Springer Berlin/Heidelberg (2006)

    Google Scholar 

  19. Pappa, G.L., Freitas, A.A.: Evolving rule induction algorithms with multi-objective grammar-based genetic programming. Knowledge and Information Systems 19(3), 283–309 (2009)

    Article  Google Scholar 

  20. Pappa, G.L., Freitas, A.A., Kaestner, C.A.A.: Multi-objective algorithms for attribute selection in data mining. In: C.A.C. Coello, G. Lamont (eds.) Applications of Multi-Objective Evolutionary Algorithms, pp. 603–626. World Scientific (2004)

    Google Scholar 

  21. Provost, F., Fawcett, T., Kohavi, R.: The case against accuracy estimation for comparing induction algorithms. In: Proc. of the 15th Int. Conf. on Machine Learning (ICML-98), pp. 445–453. Morgan Kaufmann Publishers, San Francisco, CA, USA (1998)

    Google Scholar 

  22. Quinlan, J.R.: C4.5: programs for machine learning. Morgan Kaufmann (1993)

    Google Scholar 

  23. Tsakonas, A., Dounias, G., Jantzen, J., Axer, H., Bjerregaard, B., von Keyserlingk, D.G.: Evolving rule-based systems in two medical domains using genetic programming. Artificial Intelligence in Medicine 32(3), 195–216 (2004)

    Article  Google Scholar 

  24. Webb, G.I., Brkic, N.: Learning decision lists by prepending inferred rules. In: Proc. of the AI-93 Workshop on Machine Learning and Hybrid Systems, pp. 6–10. World Scientific (1993)

    Google Scholar 

  25. Weiss, S.M., Indurkhya, N.: Optimized rule induction. IEEE Expert: Intelligent Systems and Their Applications 8(6), 61–69 (1993)

    Google Scholar 

  26. Whigham, P.A.: Grammatically-based genetic programming. In: J.P. Rosca (ed.) Proc. of the Workshop on Genetic Programming: From Theory to Real-World Applications, pp. 33–41. Tahoe City, California, USA (1995)

    Google Scholar 

  27. Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations, 2nd edn. Morgan Kaufmann (2005)

    Google Scholar 

  28. Zhang, J.: Selecting typical instances in instance-based learning. In: Proc. of the 9th Int. Workshop on Machine Learning (ML-92), pp. 470–479. Morgan Kaufmann, San Francisco, CA, USA (1992)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Gisele L. Pappa .

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Pappa, G.L., Freitas, A.A. (2010). Automating the Design of Rule Induction Algorithms. In: Automating the Design of Data Mining Algorithms. Natural Computing Series. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02541-9_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-02541-9_5

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-02540-2

  • Online ISBN: 978-3-642-02541-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics