Abstract
This paper presents a new tool for the study of relationships between total path length or average depth and number of nodes of decision trees. In addition to algorithm, the paper also presents the results of experiments with datasets from UCI ML Repository [1].
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Frank A, Asuncion A (2010) UCI Machine Learning Repository
Alkhalid A, Chikalov I, Moshkov M (2010) On algorithm for building of optimal \(\alpha \)-decision trees. In: Szczuka MS, Kryszkiewicz M, Ramanna S, Jensen R, Hu Q (eds) RSCTC. Springer, Heidelberg, pp 438–445
Alkhalid A, Chikalov I, Moshkov M (2010) A tool for study of optimal decision trees. In: Yu J, Greco S, Lingras P, Wang G, Skowron A (eds) RSKT. LNCS, vol 6401. Springer, Heidelberg, pp 353–360
Alkhalid A, Chikalov I, Hussain S, Moshkov M (2012) In: Extensions of dynamic programming as a new tool for decision tree optimization. SIST, vol 13. Springer, Heidelberg, pp 16–36
Alkhalid A, Amin T, Chikalov I, Hussain S, Moshkov M, Zielosko B (2011) Dagger: a tool for analysis and optimization of decision trees and rules. In: Francisco V. C. Ficarra (ed) Computational informatics, social factors and new information technologies: hypermedia perspectives and avant-garde experiencies in the Era of communicability expansion. Blue Herons, Bergamo, pp 29–39
Chikalov I, Hussain S, Moshkov M (2011) Relationships between depth and number of missclassifications for decision trees. In: Kuznetsov SO, Slezak D, Hepting DH, Mirkin B (eds) Thirteenth international conference on rough sets, fuzzy sets, data mining and granualr computing (RSFDGrC 2011). LNCS, vol 6743. Springer, Heidelberg, pp 286–292
Pawlak Z (1991) Theoretical aspects of reasoning about data. Kluwer Academic Publishers, Dordrecht
Skowron A, Rauszer C (1992) The discernibility matrices and functions in information systems. In: Slowinski R (ed) Intelligent decision support. Handbook of applications and advances of the rough set theory. Kluwer Academic Publishers, Dordrecht, pp 331–362
Nguyen HS (1998) From optimal hyperplanes to optimal decision trees. Fundam Inf 34(1–2):145–174
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendix: Transformation of Functions
Appendix: Transformation of Functions
Let \(f\) and \(g\) be two functions from a set \(A\) onto \(C_f\) and \(C_g\) respectively, where \(C_f\) and \(C_g\) are finite sets of nonnegative integers. Let \(B_f = \{m_f, m_f+1, \ldots , M_f\}\) and \(B_g = \{n_g, n_g+1, \ldots , N_g\}\) where \(m_f = \min \{m : m\in C_f\}\) and \(n_g = \min \{n: n\in C_g\}\). Furthermore, \(M_f\) and \(N_g\) are natural numbers such that \(m\le M_f\) and \(n\le N_g\) for any \(m\in C_f\) and \(n\in C_g\), respectively.
We define two functions \({\mathcal{F}} :B_g \rightarrow B_f \) and \({\mathcal{G}} :B_f \rightarrow B_g\) as follows:
It is clear that both \({\mathcal{F}} \) and \({\mathcal{G}}\) are nonincreasing functions.
The following proposition states that the functions \({\mathcal{F}} \) and \({\mathcal{G}}\) can be used interchangeably and we can evaluate \({\mathcal{F}} \) using \({\mathcal{G}}\) and vice versa, i.e., it is enough to know only one function to evaluate the other.
Proposition 3
For any \(n\in B_g\),
and for any \(m\in B_f\),
Proof
Let for some \(n\in B_g\)
Furthermore, we assume that
From (3) it follows that
-
(i)
there exists \(b\in A\) such that \(g(b) \le n\) and \(f(b) = m_0\);
-
(ii)
for any \(a\in A\) if \(g(a) \le n\) then \(f(a) \ge m_0\).
From (i) it follows that \({\mathcal{G}} (m_0) \le n\). This implies \(t\le m_0\). Let us assume that \(t< m_0\). In this case, there exits \(m_1<\,m_0\) for which \({\mathcal{G}} (m_1) \le n\). Therefore, there exists \(a\in A\) such that \(f(a) \le m_1\) and \(g(a) \le n\), but from (ii) it follows that \(f(a) \ge m_0\), which is impossible. So \(t=m_0\).
Similarly, we can prove the second part of the statement.
Proposition 3 allows us to transform the function \({\mathcal{G}}\) given by a tuple \(\left( {\mathcal{G}} (m_f), {\mathcal{G}} (m_f+1), \ldots , {\mathcal{G}} (M_f)\right) \) into the function \({\mathcal{F}} \) and vice versa. We know that \({\mathcal{G}} (m_f) \ge {\mathcal{G}} (m_f + 1) \ge \cdots \ge {\mathcal{G}} (M_f)\), to find the minimum \(m\in B_f\) such that \({\mathcal{G}} (m) \le m\) we can use binary search which requires \(O(\log |B_f|)\) comparisons of numbers. So to find the value \({\mathcal{F}} (n)\) for \(n\in B_g\) it is enough to make \(O(\log |B_f|)\) operations of comparison.
Rights and permissions
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chikalov, I., Hussain, S., Moshkov, M. (2014). Relationships Between Average Depth and Number of Nodes for Decision Trees. In: Sun, F., Li, T., Li, H. (eds) Knowledge Engineering and Management. Advances in Intelligent Systems and Computing, vol 214. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37832-4_47
Download citation
DOI: https://doi.org/10.1007/978-3-642-37832-4_47
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37831-7
Online ISBN: 978-3-642-37832-4
eBook Packages: EngineeringEngineering (R0)