Simplifying decision trees by pruning and grafting: New results (Extended abstract)

  • Floriana Esposito
  • Donato Malerba
  • Giovanni Semeraro
Extended Abstracts
Part of the Lecture Notes in Computer Science book series (LNCS, volume 912)


This paper presents some empirical results on simplification methods of decision trees induced from data. We observe that those methods exploiting an independent pruning set do not perform uniformly better than the others. Furthermore, a clear definition of bias towards overpruning and underpruning is exploited in order to interpret empirical data concerning the size of the simplified trees.


  1. 1.
    Breiman, L., Friedman, J., Olshen, R., & C. Stone, Classification and regression trees, Belmont, CA: Wadsworth International, 1984.Google Scholar
  2. 2.
    Cestnik, B., & I. Bratko, On estimating probabilities in tree pruning. In Y. Kodratoff (Ed.), Machine Learning-EWSL-91, Lecture Notes in Artificial Intelligence, Berlin: Springer-Verlag, 138–150, 1991.Google Scholar
  3. 3.
    Esposito, F., Malerba, D., & G. Semeraro, Decision tree pruning as a search in the state space. In P. Brazdil (Ed.), Machine Learning: ECML-93, Lecture Notes in Artificial Intelligence, Berlin:Springer-Verlag, 165–184, 1993.Google Scholar
  4. 4.
    Holte, R.C., Very simple classification rules perform well on most commonly used datasets. Machine Learning, 11, 63–90, 1993.Google Scholar
  5. 5.
    Mingers, J., Expert systems-rule induction with statistical data. Journal of the Operational Research Society, 38, 39–47, 1987.Google Scholar
  6. 6.
    Mingers, J., An empirical comparison of pruning methods for decision tree induction. Machine Learning, 4, 227–243, 1989.Google Scholar
  7. 7.
    Niblett, T., & I. Bratko, Learning decision rules in noisy domains. Proceedings of Expert Systems 86, Cambridge: Cambridge University Press, 1986.Google Scholar
  8. 8.
    Quinlan, J.R., Simplifying decision trees. International Journal of Man-Machine Studies, 27, 221–234, 1987.Google Scholar
  9. 9.
    Quinlan, J.R., C4.5: Programs for machine learning. San Mateo, CA: Morgan Kaufmann, 1993.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 1995

Authors and Affiliations

  • Floriana Esposito
    • 1
  • Donato Malerba
    • 1
  • Giovanni Semeraro
    • 1
  1. 1.Dipartimento di InformaticaUniversità degli Studi di BariBariItaly

Personalised recommendations