A Distance-Based Attribute Selection Measure for Decision Tree Induction

De Mántaras, R. López

doi:10.1023/A:1022694001379

A Distance-Based Attribute Selection Measure for Decision Tree Induction

Published: January 1991

Volume 6, pages 81–92, (1991)
Cite this article

Download PDF

Machine Learning Aims and scope Submit manuscript

A Distance-Based Attribute Selection Measure for Decision Tree Induction

Download PDF

R. López De Mántaras¹

3455 Accesses
232 Citations
3 Altmetric
Explore all metrics

Abstract

This note introduces a new attribute selection measure for ID3-like inductive algorithms. This measure is based on a distance between partitions such that the selected attribute in a node induces the partition which is closest to the correct partition of the subset of training examples corresponding to this node. The relationship of this measure with Quinlan's information gain is also established. It is also formally proved that our distance is not biased towards attributes with large numbers of values. Experimental studies with this distance confirm previously reported results showing that the predictive accuracy of induced decision trees is not sensitive to the goodness of the attribute selection measure. However, this distance produces smaller trees than the gain ratio measure of Quinlan, especially in the case of data whose attributes have significantly different numbers of values.

References

Bratko, I., & Kononenko, I.(1986).Learning diagnostic rules from incomplete and noisy data.Seminar on AI Methods in Statistics.London.
Breiman, I., Friedman, J., Olshen, R., & Stone, C.(1984).Classification and regressing trees.Belmont, CA: Wadsworth International Group.
Google Scholar
Cestnik, B., Kononenko, I., & Bratko, I.(1987).ASSISTANT 86:A knowledge-elicitation tool for sophisticated users.In I. Bratko & N. Lavrac (Ed.), Progress in machine learning,Sigma Press.
Clark, P., & Niblett, T.(1987).Induction in noisy domains.In I. Bratko, & N. Lavrac (Eds.), Progress in machine learning,Sigma Press.
Hart, A.(1984).Experience in the use of an inductive system in knowledge engineering.In M. Bramer (Ed.), Research and developments in expert systems.Cambridge University Press.
Kononenko, I., Bratko, I., & Roskar, E.(1984).Experiments in automatic learning of medical diagnostic rules.(Technical Report)Ljubljana, Yugoslavia: Jozef Stefan Institute.
Google Scholar
Lopez de Mantaras, R.(1977).Autoapprentissage d 'une partition:Application au classement iteratif de donnees multidimensionelles.Ph.D.thesis.Paul Sabatier University, Toulouse (France).
Google Scholar
Mingers, J.(1989).An empirical comparison of selection measures for decision-tree induction.Machine learn-ing, 3, 319-342.
Google Scholar
Quinlan, J.R.(1979).Discovering rules by induction from large collections of examples.In D. Michie (Ed.), Expert systems in the microelectronic age.Edinburg University Press.
Quinlan, J.R.(1986).Induction of decision trees.Machine learning, 1, 81-106.
Google Scholar

Download references

Author information

Authors and Affiliations

Centre of Advanced Studies, CSIC, 17300 Blanes, Girona, Spain
R. López De Mántaras

Authors

R. López De Mántaras
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

De Mántaras, R.L. A Distance-Based Attribute Selection Measure for Decision Tree Induction. Machine Learning 6, 81–92 (1991). https://doi.org/10.1023/A:1022694001379

Download citation

Issue Date: January 1991
DOI: https://doi.org/10.1023/A:1022694001379

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

A Distance-Based Attribute Selection Measure for Decision Tree Induction

Abstract

Article PDF

Similar content being viewed by others

SPAARC: A Fast Decision Tree Algorithm

Improvement of Decision Tree ID3 Algorithm

More Interpretable Decision Trees

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Navigation

A Distance-Based Attribute Selection Measure for Decision Tree Induction

Abstract

Article PDF

Similar content being viewed by others

SPAARC: A Fast Decision Tree Algorithm

Improvement of Decision Tree ID3 Algorithm

More Interpretable Decision Trees

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation